Question 59: You have setup an EMR Hadoop cluster with the 10 nodes. You have also installed Spark application for doing various types of data analytics like GraphX, SparkSQL etc. Now you need to do interactive

analysis of the data. Which of the following tool can be useful for this requirement?

1. You will be using Spark Streaming

2. You will be using Spark SQL

3. You will be using Spark GraphX

4. You will be using Spark ML

5. You will be using Zepplin

Correct Answer : 5 Exp : Use Apache Zeppelin as a notebook for interactive data exploration.Zeppelin is included in Amazon EMR release version 5.0.0 and later. Earlier release versions include Zeppelin as a sandbox

application.To access the Zeppelin web interface, set up an SSH tunnel to the master node and a proxy connection.

5