Spark Interview More Questions With Answers
Spark Interview More Questions With Answers
2. When using Shuffle Sort Merge Join, does the shuffling occur on the driver node or the
executor node?
Shuffling occurs on the executor nodes. The driver node initiates the job, but the actual
shuffling and sorting of data happen on the executors.
10. How many partitions are created when we invoke a wide dependency transformation?
The number of partitions created during a wide dependency transformation is determined
by the spark.sql.shuffle.partitions configuration, which defaults to 200. However, it
can be adjusted based on the size of the data and the specific requirements of the job.