Question-131:  Why do you want to refresh the cluster?

Answer: We do cluster refresh action to bring configuration up to date without restarting all services.

 

Question-132:  When can I pause CDH cluster in AWS?

Answer:  If all of the data for a cluster stored in EBS volumes, then You can pause the cluster and stop your EC2 instances during periods when the Cluster will not be used.  When the cluster is paused it is not available and cannot be used to process the data, however it helps in reducing the cost of EC2 instances.  Whatever storage you are using from EBS volumes would incur the cost even when cluster is passed.  It is must to use EBS volume for your storage, whether it's management or Worker nodes.  because data stored on an ephemeral disk will be lost as soon as EC2 instances are stopped.

 

Cloudera Altus

 

Question-133: What is Cloudera Altus Data Engineering?

Answer: Using Cloudera Altus Data Engineering you can create a cluster, with different kind of distributed processing engine like Spark, Hive, Hive on Spark, & MapReduce2(MR2). Once the cluster is created you can run data engineering and Data science, Machine Learning jobs on it. 

 

Question-134: Where can i create cluster using Cloudera Altus?

Answer: Altus Data Engineering usage or access your AWS account or Azure subscription to create cluster in Cloud and run the job on the same cluster.

 

Question-135: What do you mean by Cross-Account access in case of AWS & Cloudera Altus?

Answer: If you are creating a Cluster using Cloudera Altus then AWS Administrator must setup a cross-account access role to provide altus access to your AWS account. If any of the Altus Data Engineering account holder create a Cluster in AWS that time Altus Data Engineering service uses the Altus Cross-Account access credentials to create the cluster in your AWS account.