analysis of the data. Which of the following tool can be useful for this requirement?
1. You will be using Spark Streaming
2. You will be using Spark SQL
3. You will be using Spark GraphX
4. You will be using Spark ML
5. You will be using Zepplin
Correct Answer : 5 Exp : Use Apache Zeppelin as a notebook for interactive data exploration.Zeppelin is included in Amazon EMR release version 5.0.0 and later. Earlier release versions include Zeppelin as a sandbox
application.To access the Zeppelin web interface, set up an SSH tunnel to the master node and a proxy connection.
5
 
											