Can you please provide details on what kind of questions are asked in the assessments?
Answer: You can expect questions like the ones below, though the assessment is not limited to these. Each exam attempt has different questions and answers, so again, complete all the questions and answers in this book as well as the practice material provided by HadoopExam.com.
- Load data from a file (most frequently Parquet or JSON, but possibly other formats such as text or CSV).
- Create a DataFrame and extract data from it by applying a projection or filter
- De-duplicate the data
- Find the distinct records from the DataFrame
- Transform the DataFrame by applying lambda functions.
- Finally, write the data to a file store in Parquet, JSON, or text format.
- Make yourself comfortable with the following file formats, in order of priority: