Question-146: Using Altus can we create cluster across multiple AWS account or Azure subscriptions?

Answer: Yes, Using the Altus Environment which identifies the resources in your AWS account or Azure subscriptions which needs to be used for Cluster or Jobs. Even using Altus Environment, you can create Clusters in multiple AWS accounts or Azure subscription even from the single Altus account. 

 

Question-147: Can I create Cluster using Altus which support more than one Compute engine?

Answer: Yes, you can have one or more from the following compute engine

  • Hive
  • Spark
  • Hive on Spark
  • MapReduce2

 

Question-148: What is the Job Queue?

Answer: In the Altus Data Engineering service each cluster has a Job queue to manage the jobs that run on the cluster and supports a workflow with a single Pipeline. 

 

Question-149: You have huge volume of structured data stored in S3, and wanted to create a Data Warehouse solution on that using CDH cluster, how can you do?

Answer: You will be using Altus Data warehouse service for such requirement using that you will be provisioning cluster in your AWS account, so all your business users would have permission. Now configure a cluster with the Impala SQL engine to enable you to interactively access your data stored in your Cloud object storage for analysis and reporting. Exactly the same can be done using Azure subscriptions.

 

Question-150: What is the Altus SDX Namespace?

Answer: As CDH cluster access the data stored in the public Cloud, all the metadata for this stored data is also stored in a Database. Then Altus SDX namespace points to that database and provide a common and consistent view of the data to the clusters. This SDX namespace service would be shared across multiple Altus cluster which wanted to access the same data and provide the consistent to all the cluster for these data. Actual data would be stored either in AWS S3 or Azure ADLS.