Subscribe for updated version
Mobile: +91-8879712614 Phone:022-42669636  | Email : hadoopexam@gmail.com admin@hadoopexam.com

Home About Us
All Products Spark IBM MapR Hortonworks Cloudera NiFi
Hadoop BigData Cloudera:CDH Admin Course-1 HBase NoSQL Spark(Scala) HandsON OOZie HandsOn Scala Programming Python Programming Java 1z0-808 Training AWS SA Associate SAS Base HandsOn NiFi Professional
Amazon AWS SAS
EMCDSA:E20-007 EMCDSA:E20-065 Cloudera DataScience Python Data Science Spark Data Science
HBase Cassandra
Azure:70-532 Azure:70-533
Salesforce Oracle Cloud & Java Android To Activate
FAQ Training FAQ Certification Simulator FAQ
Free Resources
Candidates Recruiter/Employer
Forum Subscribe Annual Subscription (50%+49% off) Author/Trainer
Hadoop Material Packages AWS Material Packages SAS Material Packages
For Business Blog

Green


    25000+ Learners upgraded/switched career  Testimonials

All Certifications preparation material is for renowned vendors like Cloudera, MapR, EMC, Databricks,SAS, Datastax, Oracle, NetApp etc , which has more value, reliability and consideration in industry other than any training institutional certifications.
Note : You can choose more than one product to have custome package created from below and send email to hadoopexam@gmail.com to get discount.Premium Trainings Courses :  HadoopExam focuses on in depth learning with the hands-on session setting up the environment than executing solution and doing hands on that. Below are the available trainings and we are keep adding new trainings. These trainings is being used and subscribed by Devloper, Tester, Administrator, Enterprise(to train their team) and Trainer globally. These trainings are well organized and step by step solutions to learning, and in lesser time as per your convenience you can complete these and even re-visit as required.

All Premium Training Access Annual Subscription (You will get early access to under development training and early edition books) : Used By More than 20000 subscribers

Access All Annual/Semi Annual/Quarterly Subscription from this Link
Spark Professional Training   Spark SQL Hands Training   PySpark : HandsOn Professional Training    PySpark Structured Streaming   Apache NiFi (Hortonworks DataFlow) Training   Hadoop Professional Training   Cloudera Hadoop Admin Training Course-1  HBase Professional Traininghttp  SAS Base Certification Hands On Training OOzie Professional Training   
AWS Solution Architect : Training Associate    AWS Exam Prepare : Kinesis Data Stream   Free Core Java 1Z0-808 Training   Scala Professional Training   Python Professional Training  Read Spark SQL Fundamental and Cookbookhttps://sites.google.com/training4exam.com/spark-sql-2-x-fundamentals/  Book : AWS Solution Architect Associate : Little Guide  NiFi CookBook By HadoopExam  AWS Security Specialization Certification: Little Guide SCS-C01   Spark Interview Questions
Databricks Spark 2.x Developer Certification   Databricks PySpark 2.x (Python Spark) Certification Exam     Oreilly Databricks Spark Certification     Hortonworks HDPCD Spark Certification     Cloudera CCA175 Hadoop and Spark Developer Certifications     MapR V2 Spark Developer Certification ExamCloudera CCA175 Hadoop and Spark Developer Certifications    Cloudera CC159 Hadoop Analytics Certification     Cloudera Hadoop Admin Certification     Cloudera Hadoop Data Engineer Certification    Hadoop Certification Package Deal


Previous   |   Next   | Full version of Cassandra Certification  | Sample Cassandra Certification Questions | Quickly go through Spark Training Python & Scala


Question-1: Which of the following statements are correct with respect to Apache Cassandra database?



A.It uses master slave architecture.
B.It uses peer-to-peer communication
C.It well integrates with Apache Solr for Analytics
D.It uses the De-normalization

Ans: B,D
Exp: Apache Cassandra is a NoSQL database with the following feature for fast and optimized query execution on the large volume of data.
It uses peer-to-peer communication. Hence, option-1 is out.
Yes, it integrates with the Apache Solr but not for the analytics. Its a tricky option, Solr is a Search solution. Hence, option-3 is not correct. If we want solution for analytics then it should use Apache Spark which also well integrate with the Cassandra.
Apache Cassandra does not follow the normalization. However, its Data modeling is similar to relational databases but differ in many key areas for providing blazingly fast interaction. For example RDBMS uses the joins between tables for relationships, whereas Cassandra uses denormalization to achieve more robust querying. Hence, option-4 is also correct.

Question-2: Which of the following is correct with regards to data read/write in/out Cassandra?



A.You can use COPY command to read csv data to Cassandra
B.You can use COPY command to write CSV data from Cassandra to a file system.
C.You can use DUMP command to write CSV data from Cassandra to a file system.
D.You can use sstableloader to bulk upload external data to Cassandra.
E.You can use COPY command to bulk upload external data to Cassandra.
F.You can use DUMP command to bulk upload external data to Cassandra.

Ans: A, B, D
Exp: We can use COPY command to read CSV data to DSE and write CSV data from DSE to a file system. Hence, option-1 and 2 both are correct.
Similarly sstableloader provides the ability to bulk load external data into Cassandra Cluster. Hence, option-4 is also correct.
We do not have DUMP command. Hence, option 3 and 5 are wrong.

Question-3: Which of the following feature helps in Apache Cassandra to help in preventing data loss?



A.Each node uses the peer-to-peer gossip communication protocol.
B.Commit log
C.Memtable
D.A,B
E.A,B,C

Ans: E
Exp: In Apache Cassandra Data loss can be prevented using various features which are below
1.Gossip protocol: Having peer-to-peer distributed system across homogeneous nodes where data is distributed among all nodes in the cluster. And each node frequently exchanges state information about itself and other nodes across the cluster using peer-to-peer gossip communication protocol.
2.Commit Log: A sequentially written commit log on each node captures write activity to ensure data durability.
3.Memtable: Data is indexed and written to an in-memory structure, called a memtable, which resembles a write back cache.

Question-4: Please map the following



A.SSTable
B.Compaction
C.Tombstone
1.Disk written on Disk
2.Marker for column deletion in a Row
3.Used for consolidating data on the disk
A.A-1, B-3, C-2
B.A-1, B-2, C-3
C.A-2, B-3, C-1
D.A-2, B-1, C-3
E.A-3, B-2, C-1
Ans : A
Exp : Each time memory structure is full, then data will be written to disk using SSTable data file. Once written all the data automatically partitioned and replicated throughout the cluster. Then Cassandra will periodically consolidates the SSTable using compaction, which discards the obsolete data which is marked for deletion using tombstone marker. Tombstone is a marker in a row which indicates that a column should be deleted.



Previous   |   Next   |  Full version of Cassandra Certification   | Sample Cassandra Certification Questions | Quickly go through Spark Training Python & Scala




       
      Hadoop Annual Subscription

      Do you know?
      • Training Access: No time constraint and Any future enhancements on same and subscribed training will be free.
      • Question Bank (Online Simulator): Now you can have free updates for additional or updated Questions till your subscription is active.
      • On Mobile/Tablet/Desktop : You know this particular exam you can access from your mobile, tablet or Desktop. You just need internet access and browser.
      • Training Institute : Do you know many of the training institutes subscribe this products from HadoopExam to train their students.

      Read all testimonials its learners voice :
      Testimonials
      Disclaimer :
      1. Hortonworks® is a registered trademark of Hortonworks.
      2. Cloudera® is a registered trademark of Cloudera Inc
      3. Azure® is aregistered trademark of Microsoft Inc.
      4. Oracle®, Java® are registered trademark of Oracle Inc
      5. SAS® is a registered trademark of SAS Inc
      6. IBM® is a registered trademark of IBM Inc
      7. DataStax ® is a registered trademark of DataStax
      8. MapR® is a registered trademark of MapR Inc.

WhatsApp Call Us Any Query Subscribe