Question-31: Can you give some examples of the kind of configuration that is done in the cassandra.yaml file?

Answer: Common settings in the cassandra.yaml file include initialization properties for the cluster (such as the cluster name and seed nodes), cache sizing parameters, resource-utilization parameters (such as concurrent reads and writes), security configuration (authentication and authorization), and timeout settings.
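
To make these categories concrete, here is a small Python sketch that holds an illustrative cassandra.yaml fragment and reads it back. The values are examples only, not recommendations. Setting names differ between Cassandra versions (some cache and timeout settings were renamed in later releases), so treat the names below as assumptions to check against the file shipped with your version. The `parse_flat_yaml` helper is hypothetical and only handles simple top-level `key: value` lines.

```python
# Illustrative cassandra.yaml fragment. Values are examples, not tuning advice.
# Setting names vary by Cassandra version; verify against your own file.
SAMPLE_CASSANDRA_YAML = """\
# cluster initialization
cluster_name: 'Test Cluster'
num_tokens: 16
# caching
key_cache_size_in_mb: 100
# resource utilization
concurrent_reads: 32
concurrent_writes: 32
# security
authenticator: PasswordAuthenticator
# timeouts
read_request_timeout_in_ms: 5000
"""


def parse_flat_yaml(text):
    """Parse top-level 'key: value' lines only (not a full YAML parser)."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, _, value = line.partition(":")
        settings[key.strip()] = value.strip().strip("'\"")
    return settings


settings = parse_flat_yaml(SAMPLE_CASSANDRA_YAML)
for name, value in settings.items():
    print(f"{name} = {value}")
```

In a real deployment you would edit the file directly and restart the node; the parsing here is only to show the shape of the settings.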

Question-32: What is the basic principle when designing a data model for a Cassandra database?

Answer: In Cassandra, as in most NoSQL databases, you should first think about the queries you will run against the database, and then design your tables around those queries. Do not start from the entity-relationship data modelling that is used for an RDBMS.
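
The query-first idea can be sketched without a running cluster. The Python example below uses plain dictionaries as stand-ins for Cassandra tables; the two queries, the table names, and the sample users are all hypothetical. Each query gets its own "table", keyed by the column that query filters on, and every write goes to both.

```python
from collections import defaultdict

# Two hypothetical queries the application needs:
#   Q1: fetch a user by user_id
#   Q2: list the users in a given city
# In a query-first design, each query gets its own table.
users_by_id = {}                    # stand-in for a table keyed by user_id
users_by_city = defaultdict(list)   # stand-in for a table keyed by city


def add_user(user_id, name, city):
    """Write the same row to every table that serves a query."""
    row = {"user_id": user_id, "name": name, "city": city}
    users_by_id[user_id] = row
    users_by_city[city].append(row)


add_user(1, "Asha", "Pune")
add_user(2, "Ben", "Leeds")
add_user(3, "Chen", "Pune")

# Q1: single lookup by key, no scan
print(users_by_id[2]["name"])
# Q2: single lookup by key, no join
print([u["name"] for u in users_by_city["Pune"]])
```

The data is duplicated on purpose: the extra write cost buys reads that touch exactly one key.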

Question-33: How do you define the keyspace?

Answer: In Cassandra, the keyspace is the outermost grouping of data; it is roughly equivalent to a schema in an RDBMS. Every table you create must belong to a keyspace, and replication is defined at the keyspace level.
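
As a rough illustration, the Python sketch below builds the CQL statements for a keyspace and a table inside it. The helper names, the `shop` keyspace, and the `orders` table are hypothetical; `SimpleStrategy` with a replication factor of 3 is only an example choice, and multi-datacenter clusters would normally use `NetworkTopologyStrategy` instead. The statements are printed rather than executed, so no driver or cluster is needed.

```python
def create_keyspace_cql(name, replication_factor):
    """Build a CREATE KEYSPACE statement; replication is set here."""
    return (
        f"CREATE KEYSPACE IF NOT EXISTS {name} "
        "WITH replication = "
        f"{{'class': 'SimpleStrategy', 'replication_factor': {replication_factor}}};"
    )


def create_table_cql(keyspace, table, columns, primary_key):
    """Build a CREATE TABLE statement; the table is qualified by its keyspace."""
    column_defs = ", ".join(f"{col} {cql_type}" for col, cql_type in columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {keyspace}.{table} "
        f"({column_defs}, PRIMARY KEY ({primary_key}));"
    )


keyspace_stmt = create_keyspace_cql("shop", 3)
table_stmt = create_table_cql(
    "shop", "orders", [("order_id", "uuid"), ("total", "decimal")], "order_id"
)
print(keyspace_stmt)
print(table_stmt)
```

The printed statements could be pasted into cqlsh or passed to a driver session.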

Question-34: If you want high cardinality in the data while doing data modelling, how can you achieve this?

Answer: In Cassandra data modelling you generally do not normalize the data; instead, you create tables that hold denormalized data. It is fine to have duplicate data across more than one table. If you still want a cardinality or relationship defined between two tables, consider using materialized views.
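
A materialized view lets Cassandra maintain a second copy of a table under a different primary key. The Python sketch below generates the CQL for such a view; the helper name and the `shop.users` table with its `user_id` and `email` columns are hypothetical. Note that materialized views are flagged as experimental in recent Cassandra releases and may have to be enabled in cassandra.yaml, so check your version before relying on them.

```python
def materialized_view_cql(keyspace, base_table, view_name,
                          view_partition_key, base_primary_key):
    """Build a CREATE MATERIALIZED VIEW statement.

    The view's primary key must include every primary-key column of the
    base table, and each of those columns must be restricted with
    IS NOT NULL in the WHERE clause.
    """
    key_columns = [view_partition_key] + [
        col for col in base_primary_key if col != view_partition_key
    ]
    not_null = " AND ".join(f"{col} IS NOT NULL" for col in key_columns)
    primary_key = ", ".join(key_columns)
    return (
        f"CREATE MATERIALIZED VIEW {keyspace}.{view_name} AS "
        f"SELECT * FROM {keyspace}.{base_table} "
        f"WHERE {not_null} "
        f"PRIMARY KEY ({primary_key});"
    )


# Base table is assumed to be shop.users with PRIMARY KEY (user_id).
view_stmt = materialized_view_cql(
    "shop", "users", "users_by_email", "email", ["user_id"]
)
print(view_stmt)
```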

Question-35: What is a partition key, and what is its purpose?

Answer: The partition key determines which node stores a given row, and it also divides the data into logical groups (partitions). When choosing a partition key, make sure the data is distributed as evenly as possible across the nodes in the cluster, and avoid a key that forces a query to read or write across multiple partitions, because such a query has higher latency.
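
The effect of the partition key on data distribution can be simulated in a few lines of Python. This is a simplified sketch, not how Cassandra places data internally: Cassandra's default partitioner uses Murmur3 hashing and token ranges, while the example below substitutes MD5 and a modulo over a hypothetical three-node cluster just to keep it deterministic and dependency-free. The node names and sample keys are made up.

```python
import hashlib
from collections import Counter

NODES = ["node-a", "node-b", "node-c"]  # hypothetical three-node cluster


def node_for(partition_key):
    """Map a partition key to a node by hashing it (simplified placement)."""
    digest = hashlib.md5(str(partition_key).encode("utf-8")).digest()
    token = int.from_bytes(digest[:8], "big")
    return NODES[token % len(NODES)]


# A key with many distinct values (one per user) spreads rows across nodes.
rows_by_user = Counter(node_for(f"user-{i}") for i in range(3000))

# A key with few distinct values (country) piles rows onto at most two nodes,
# and a skewed value ("IN" here) creates one very large partition.
countries = ["IN"] * 2700 + ["UK"] * 300
rows_by_country = Counter(node_for(country) for country in countries)

print("partitioned by user_id:", dict(rows_by_user))
print("partitioned by country:", dict(rows_by_country))
```

The same key always hashes to the same node, which is why a query that supplies the full partition key can be routed to one replica set instead of fanning out across the cluster.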