HadoopExam.com

Hadoop, BigData,Cloud and Analytics Learning Resources

  • Increase font size
  • Default font size
  • Decrease font size

Apache Spark Professional Training with Hands On Lab : Lifetime Accessible and any future module free 

Complete entire training in Approx. 12 Hours and become Apache Spark Expert  : To the Point training and no lengthy session (Just focus on Apache Spark Learning and Hands-On session : Thats the reason HadoopExam's training are popular)

Training Key Features

1. 24/7 Course Access for lifetime
2. As interactive as Classroom training
3. Very Cost effective
4. No PPT's at all, advanced way of teaching
5. In depth explaination of each topic 6.Contains Hands on session
7. Any New module will be without additional charge(In this training list only)
8. No travel and Hassel
9. Learn as per your comfort 
10. Practice at Home using Virtual Machine

 

Regular Price: $180.00 
  Offer Price: $75.00 (Save Flat 50% + $15 off for next 3 days ) + 15%ST
Note: If having trouble while credit card payment then please create PayPal account and then pay.

ST : India Govt Service Tax
India Bank Transfer
Regular Price: 8000 INR
Offer Price: 3900 INR 3500INR (Save flat 50% + 400 0ff for Next 3 days ) + 15%ST =  4025INR
 
Click Below ICICI Bank Acct. Detail
 
Indian credit and Debit Card(PayuMoney)
Buy Now Spark Trainings

Click Here For Training FAQ          Check free demo below (Best Quality is Promise, watch it right now)

  • Now Includes  Apache Spark  (Beginner to Advanced Spark Training)
  • Learn Complete Spark(Basic to Advanced) Self-Paced Training
  •  Complete training on Cloudera QuickStart Virtual Machine (Includes Pre-installed all the required software ).
  •  Hence, you can practice on your Windows Machine Locally
  •  Its Better than many classroom Hadoop Trainings (Trainer has 12+ years of real Industry experience) Read Testimonials 
  •  To the point session, not lengthy session                                                                                                                                                                     
  •  After this training your Apache Spark knowledge will reach to expert Spark Professional
  •  Even you will use  Hadoop 2.0 and HDFS,  to learn Spark 
  • (Running offer for next 7 days actual cost 3900INR/$89 after discount it will cost you 3500INR/$75 )

Most popular certification for Apache Spark are below (This training will help you prepare for the same )

  1. CloudEra Hadoop CCA175 Certification (Preparation material available)
  2. Oreilly Apache Spark Certification (Preparation material available)
  3. Hortonworks Apache Spark Certification (In process)

Syllabus and Completed Hadoop Training is Below
Sample Demo Session from Actual Training


Module 1: Introduction to Apache Spark (Available Length 48 Minutes)

  1. Introduction to Apache Spark
  2. Features of Apache Spark
  3. Apache Spark Stack
  4. Introduction to RDD's
  5. RDD's Transformation
  6. What is good and bad In MapReduce?
  7. Why to use Apache Spark

Module 2: Cloudera QuickStart VM Installation (Hands-on Lab + PDF Download) (Available Length 34 Minutes)

  1. Include Hadoop
  2. Include Apache Spark
  3. Include Hive
  4. Include Sqoop
  5. Include Hue

Module 3: Deep Dive in HDFS: (Available Length 48 Minutes)

  1. HDFS Design
  2. Fundamental of HDFS (Blocks, NameNode, DataNode, Secondary Name Node)
  3. Rack Awareness
  4. Read/Write from HDFS
  5. HDFS Federation  and High Availability (Hadoop 2.x.x)
  6. HDFS Command Line Interface

Module 4: Spark Shell Hands On Using HDFS (Hands-on Lab + PDF Download) (Available Length 34 Minutes)

  1. Spark Shell Introduction
  2. Create file using Hue
  3. Spark Shell extracting file from HDFS
  4. Create RDD from HDFS file

Module 5: Programming with RDD Part-1 (Hands-on Lab + PDF Download) (Available Length 28 Minutes)

  1. Creating new RDD
  2. Transformations on RDD
  3. Lineage Graph
  4. Actions on RDD
  5. RDD Concepts on Persist and Cache
  6. Lazy evaluation of RDD

Module 6: Scala/Spark Functional Programming (Hands-on Lab+ PDF Download) (Available Length 28 Minutes)

  1. Using Function Literals
  2. Anonymous Functions
  3. Define a function which accepts another function

Module 7: RDD Transformation Programming in Depth (Hands-on Lab+ PDF Download) (Available Length 24 Minutes)

  1. Hands on and core concepts of map() transformation
  2. Hands on and core concepts of filter() transformation
  3. Hands on and core concepts of flatMap() transformation
  4. Compare map and flatMap transformation

Module 8: Apache Spark in Action Depth (Hands-on Lab+ PDF Download) (Available Length 36 Minutes)

  1. Hands on and core concepts of reduce() action
  2. Hands on and core concepts of fold() action
  3. Hands on and core concepts of aggregate() action
  4. Basics of Accumulator
  5. Hands on and core concepts of collect() action
  6. Hands on and core concepts of take() action
  7. Ordered access of RDD

Module 9: Apache Spark Execution Model (Includes PDF Download Available Length 35 Minutes)

  1. How Spark execute program
  2. Concepts of RDD partitioning
  3. RDD data shuffling and performance issue

Module 10: Apache Spark PairRDD (Include PDF Download Available  Length 45 Minutes)

  1. Core concepts of PairRDD
  2. Creation of PairRDD
  3. Aggregation in PairRDD
  4. Aggregation functions understanding in depth

a)    How reduceByKey() work conceptually?

b)    How foldByKey() work conceptually?

c)    How combineByKey()work conceptually?

 Module 11: Spark PairRDD HandsOn Lab (Hands-on Lab+ PDF Download) (Available  Length 12 Minutes)

  1. reduceByKey
  2. foldByKey
  3. combineByKey
  4. groupByKey
 Module 12 : Spark PairRDD Joining, Zipping and Grouping  (Hands-on Lab+ PDF Download(Available  Length 30 Minutes)
  1. reduceByKey versus  groupByKey performance issue
  2. cogroup
  3. zip
  4. joining (left, right, inner etc.)

 Module 13-A: Understanding Hadoop SequenceFile  (Available  Length 7 Minutes)

 Module 13-B: Creating Seqnce File and Processing using SPark (Hands on Lab)-Part-1  (Hands-on Lab+ PDF Download(Available  Length 23 Minutes)

  1. Creating SequenceFile using TSV  file 
  2. Loading Data in Apache Hive
  3. Processing SequnceFile as an RDD.
Module 14 : Spark Shared Variables   ( PDF Download)   (Available  Length 27 Minutes)
  1. Shared Variables: Broadcast Variables  (Available  Length 14 Minutes)
  2. Shared Variables: Accumulators  (Available  Length 13 Minutes) 

Module 15 : Spark Accumulator  (Hands-on Lab+ PDF Download(Available  Length 14 Minutes)

  1. Word count and Character Count  
  2. Counting Bad records in a file 
Module 16 : Spark BroadCast Variable  (Hands-on Lab+ PDF Download(Available  Length 12 Minutes)
  1. Joining two csv files one as a Broadcasted Lookup table 
Module 17 : Spark API : BroadCast Variable, Filter Functions and Saving File to HDFS  (Hands-on Lab+ PDF Download(Available  Length 13 Minutes)
  
Module 18 : Spark API :  Spark Join, GroupBy and Swap function  (Hands-on Lab+ PDF Download(Available  Length 12 Minutes)
  
Module 19 : Spark API : Remove Header from CSV file and Map Each column to Row Data  (Hands-on Lab+ PDF Download(Available  Length 10 Minutes)

Module 20 : Spark SQL  ( PDF Download)   (Available  Length 27 Minutes)

  1. HiveContext
  2. Schema RDD replaced by DataFrame API
  3. History of SparkSQL
  4. Catalyst Optimizer

Module 21 : SparkSQL HandsOn Sessions (Hands-on Lab+ PDF Download(Available  Length 20 Minutes)

  1. Hive Configuration
  2. Create Hive table using Spark
  3. Load Data in HIve table using Spark
  4. Create another table using DataFrame
Module 22 : Implementing Business Logic using SparkSQL  (Hands-on Lab+ PDF Download(Available  Length 25 Minutes)
  1. Loading CSV file
  2. Spark Case classes (To create schema for  csv file)
  3. Convert RDD to DataFrame using DataFrmae API for query data
  4. Using SQL query on DataFrame

Module 23 : Spark Streaming in Depth Part-1  (PDF Download(Available  Length 26 Minutes)

  1. Real/Near real time data processin
  2. Streaming Sources and Sinks
  3. DStream (Discretized Stream)
  4. Dtream Concepts
  5. Stock Visualization Example (How Streaming Helpful)
Module 24 : Spark Streaming in Depth Part-2 (PDF Download ) (Available Length : 22 Minutes)
  1. Execution of Spark Streming  
  2. Spark Streaming Transformation (Stateless and Stateful)
  3. Comining multiple DStream
  4. Understanding transform() operator
Module 25 :  SPARK STREAMING PART-3 STATEFULL (WINDOW) TRANSFORMATIONS (Available 20 Minutes)
  1.  Window Transformation
  2. Window Duration and Sliding Duration
  3. DStream Opeations
  4. WordCount in DStream
Module 26 : Basics of Machine Learning and Data Science (Available Length : 30 Minutes)
  1. Basics of ML and Data Science
  2. Example of Machine Learning
  3. Supervised and Unsupervised Learning
  4. Key terminology e.g. features, training and testing
  5. How to choose right algorithm
  6. Common steps of Machine Learning
      • Collect data
      • Prepare Input data
      • Analyze Input data
      • Train the algorithm
      • Test the algorithm
      • Use the Algorithm
Module 27 :  SPARK STREAMING: REAL TIME STOCK MARKET DATA PROCESSING (HANDS-ON LAB + PDF Download Available Length : 21 Minutes)  
  1. Problem Statement
  2. Data Format
  3. Writing Stream script to filter Bigger Volume data
  4. Write results back to HDFS file System

Module 28 : SPARK STREAMING: REAL TIME STOCK MARKET DATA MAVEN APPLICATION ( Hands-on Lab+ PDF Download) (Available  Length 37 Minutes)

  1. Understanding Maven pom.xml
  2. Importing Scala Application in eclipse
  3. Creating Application JAR file using eclipse and Maven
  4. Run Spark Streaming Application
  5. Process data using Spark Stream Application
Module 29 : SPARK STREAMING & SPARK SQL: REAL TIME MARKET DATA APPLICATION  (Hands-on Lab ) (Available  Length 18 Minutes)
  1. Create Spark Streaming Application
  2. Use SparkSQL in Spark Streaming Application
  3. Querying data

Module 30 : SPARK STREAMING WINDOW FUNCTION& SPARK SQL JOIN: REAL TIME MARKET DATA  APPLICATION  (Hands-on Lab)  (Available  Length 7 Minutes)

  1. Create Spark Streaming Application
  2. Use SparkSQL in Spark Streaming Application
  3. Joining data sets , with real-time streaming data
  4. Using Spark Streaming window function to calculate , running rum of trade volume. 
Module 31 : SPARK ADVANCED : DATA PARTITIONING ( PDF Download) (Available  Length 26 Minutes)
  1. What is Partitioning and why?
  2. Data Partitioning example using Join (Hash Partitioning)
  3. Understand Partitioning using Example for get Recommendations for Customer
  4. Understand Partitioning code using Spark-Scala
  5. Operations which create Partitioned RDD
  6. Operation which get benefit of Partitioning
  7. Operation that affect the partitioning
Module 32 : SPARK PAIR RDD FUNCTIONS : In Depth  (PDF Download
  1. reduceByKey() (Available  Length 17 Minutes)
  2. groupByKey() (Available  Length 14 Minutes)
  3. combineByKey() (Available  Length 13 Minutes)
  4. foldByKey() (Available  Length 15 Minutes)
  5. aggregateByKey() (Available  Length 11 Minutes)
  6. Comparision Between Function (Available  Length 11 Minutes)

 

Package offers on following trainings (50%+20% off)

hadoop_professional_training

 
Hadoop/BigData Professional Training  

$160/7000INR

50% off

$79/3500INR

 

HBase Professional Training

 HBase NoSQL Professional Training

$160/7000INR

50% off

$79/3500INR

Spark Professional Training

Apache Spark Professional Training: 

$180/8000INR

50% off

$89/3900INR

 

Total Cost after 50%

$79/3500INR+$79/3500+$89/3900INR == $247/10900INR

After additional 20% Discount

== $195/8900INR
contact us hadoopexam@gmail.com for package purchase
 

Spark Training + Databricks Certification Simulator offer

3900INR/$89 + 2900INR/$65

Total cost = 6800INR/$154

(Additional 20% off)

 5400INR/$123

contact us hadoopexam@gmail.com for package purchase

 

 

* Please read faq section carefully.





_______________________________________________________________________________________________________________________

Click to View What Learners Say about us : Testimonials

We have training subscriber from TCS, IBM, INFOSYS, ACCENTURE, APPLE, HEWITT, Oracle , NetApp , Capgemini etc.

Books on Spark or PDF to read : Machine Learning with Spark, Fast Data Processing with Spark (Second edition), Mastering Apache Spark, Learning Hadoop 2, Learning Real-time Processing with Spark Streaming, Apache Spark in Action, Apache Spark CookBook, Learning Spark, Advanced Analytics with Spark Download.

 
 

Contact Us

Phone : 022-42669636
Mobile : +91-8879712614
Hadoop Learning Resource
C/O ADITech Global Solutions
Mumbai House Goregaon-E
Mumbai 400063

Follow Us On Twitter
Ad as a Friend on Facebook
Please Join Us On LinkedIn  and G+ with hadoopexam@gmail.com
 

Testimonials

Yeah the material is helpful to test our knowledge. I passed the exam.
Thanks for your help! Read More
--Sudha Udatha, USA

"I wish I had bought your exam prep during my first attempt. More than anything else your tests made me feel confident to crack the CCD-410 exam. It would not have been possible without you. My best wishes to your team"
Read More
-- Sandeep Swami Banglore