Spark Python Course APPLY Project Solution Guide Hints
Spark Python Course APPLY Project Solution Guide Hints
This document contains a proposed solution guide for the problems provided in the course APPLY
project. This is just a guide. You are free try out and solve the problem in your own way. We recommend
you do the following steps
In this part, you will clean and prepare the data for further analysis
2. Perform Analysis
1. Load the Data frame as a temp table /view
2. Query the temp table to solve PR#02
3. Query the temp table to solve PR#03
4. Query the temp table to solve PR#04
5. Perform correlation analysis
Compare your output with the solution provided. It is not necessary to match fully with the provided
solution. It’s just a guide.