Spark Quiz -2

0%

Spark – Quiz – 2

1 / 10

To Shuffle data for joins or aggregations,which of the following is the default number of partitions in spark?

2 / 10

Which of the following will return all the elements of the dataset as an array at the driver program?

3 / 10

How will you control the number of partitions in spark dataframe across the application ?

4 / 10

Which of the following will give max value of a column “id” in dataframe “df” ?

5 / 10

Which of the following will give the current date in data frame column ?

6 / 10

In Scala Which of the following will give the unixtimestamp in variable “a” (dataType:String) ?

7 / 10

Which of the following will persist the data in to disk only?

8 / 10

How would you set number of executors(say 5) of any spark application ?

9 / 10

Say you have dataframe “df”,how would you add a new column which will have current timestamp ?

10 / 10

Which of the following function will avoid full shuffle in spark if partitions are set to be decreased.

Don’t miss the tutorial on Top Big data courses on Udemy you should Buy

Sharing is caring!

Subscribe to our newsletter
Loading

Leave a Reply