Requirement You have two table named as A and B. and you want to perform all types of join in spark using python. It will help you to understand, how join works in pyspark. Solution Step 1: Input Files Download file  and  from here. And place them into a local directory.Read More →

Requirement Let’s take a scenario where we have already loaded data into an RDD/Dataframe. We got the rows data into columns and columns data into rows. The requirement is to transpose the data i.e. change rows into columns and columns into rows. Sample Data We will use below sample data.Read More →

Requirement Suppose there is a source data which is in JSON format. The requirement is to load JSON data in Hive non-partitioned table using Spark. Let’s break the requirement into two task: Load JSON data in spark data frame and read it Store it in a hive non-partition table ComponentsRead More →