Question-11: Can you give example the kind of data stored by the Apache Ambari for HDP cluster?
Answer: As Ambari is a management tool for the Hadoop (HDP) cluster it needs to store various services data. Like for HDFS it needs to store NameNode, DataNode , Secondary NameNode, Oozie , YARN and Zookeeper data.
Question-12: Can I store this data on the /tmp directory?
Answer: No, you should not use the /tmp directory at all. Because if you use the /tmp directory then data would be deleted. And cluster would fail, hence you should must have persistent storage locations.
Question-13: What is the best way to install Ambari agent on all the 1000 host in HDP cluster?
Answer: As there are 1000 nodes in the cluster, we should not manually install the Ambari agent on all the nodes in the cluster. Rather we should have password less ssh connection setup done between Ambari host and all the nodes in the cluster. And Ambari Server hosts uses the SSH public key authentication to remotely access and install the Ambari Agent.
Question-14: What is the Service account in Ambari?
Answer: As Ambari manages the services like Spark, HDFS, HBase etc on the HDP cluster and to manage these services it requires that you have at least one service account created for each service.
Question-15: What is the default user name and password for Ambari administration?
Answer: The default username and password for the Ambari Administration is created while installing and that is admin/admin. And this user has the full access and control on the Ambari.