Overview
Why Choose the Big Data Hadoop Course by Cambridge InfoTech, Bangalore?
There are a dozen institutes that offer the Big Data Hadoop course through eLearning. While many of these courses give students textbook learning and video tutorials, Cambridge InfoTech goes one step further and gives students additional case studies, interactive presentations, hands-on learning tools and live training. Our course gives students a much deeper perspective about Big Data Hadoop through real examples by trainers who are certified and have vast experience in the IT sector.
The courses we offer our students are designed in a systematic manner with each module broken down further to make learning fun and easy. The courses by Cambridge InfoTech help students gain on-job training experience as well.
What Are The Value-Added Benefits of Cambridge InfoTech Courses?
Before the final CCA175 Hadoop and Spark Developer Certification, the Cambridge InfoTech trainer will provide mock assignments in a guided manner and also help with resume building through extended support as well as interview preparation. Furthermore, once students are certified, Cambridge InfoTech will offer placement services to all the students that have passed the certification. Our students in the past have held a high score of 97% placements in some of the best firms in India.
What is the CCA175 Hadoop and Spark Developer Certification?
The CCA175 Hadoop and Spark Developer Certification is an examination that gives students the needed accreditation to get the highest-paying jobs in the Big Data sector. The certification is internationally accredited and the course by Cambridge InfoTech, Bangalore adequately prepares students to use various aspects of Big Data and Hadoop with ease.
Since the CCA175 certification is a remote proctored exam that can be can be taken from any location in the world, our trainers will help students ace it by giving them practice scenarios to solve based on examples of past certifications. Additional support is also given to our students through online resources that they can utilize later on to climb their respective career paths successfully, with ease and confidence.
What you'll learn?
- Students who have an interest to learn new concepts in Big Data Hadoop can take this course.
- There are no prerequisites to learning this course, but access to a computer and basic computer knowledge is preferred.
Course Modules
Module 1: Introduction to Bigdata and Hadoop
-
Introduction to Big Data and Hadoop
-
Introduction to Big Data
-
Big Data Analytics
-
What is Big Data?
-
Four vs of Big Data
-
Case Study Royal Bank of Scotland
-
Challenges of Traditional System
-
Distributed Systems
-
Introduction to Hadoop
-
Components of Hadoop Ecosystem Part One
-
Components of Hadoop Ecosystem Part Two
-
Components of Hadoop Ecosystem Part Three
-
Commercial Hadoop Distributions
-
Demo: Walk through of Simpli learn Cloud lab
-
Key Takeaways
-
Knowledge Check
Module 2: Hadoop Architecture Distributed Storage (HDFS) and YARN
-
Hadoop Architecture Distributed Storage (HDFS) and YARN
-
What is HDFS
-
Need for HDFS
-
Regular File System vs HDFS
-
Characteristics of HDFS
-
HDFS Architecture and Components
-
High Availability Cluster Implementations
-
HDFS Component File System Namespace
-
Data Block Split
-
Data Replication Topology
-
HDFS Command Line
-
Demo: Common HDFS Commands
-
Practice Project: HDFS Command Line
-
Yarn Introduction
-
Yarn Use Case
-
Yarn and its Architecture
-
Resource Manager
-
How Resource Manager Operates
-
Application Master
-
How Yarn Runs an Application
-
Tools for Yarn Developers
-
Demo: Walk through of Cluster Part One
-
Demo: Walk through of Cluster Part Two
-
Key Takeaways
-
Practice Project: Hadoop Architecture, distributed Storage (HDFS) and Yarn
Module 3: Data Ingestion into Big Data Systems and ETL
-
Data Ingestion Into Big Data Systems and Etl
-
Data Ingestion Overview Part One
-
Data Ingestion Overview Part Two
-
Apache Sqoop
-
Sqoop and Its Uses
-
Sqoop Processing
-
Sqoop Import Process
-
Sqoop Connectors
-
Demo: Importing and Exporting Data from MySQL to HDFS
-
Practice Project: Apache Sqoop
-
Apache Flume
-
Flume Model
-
Scalability in Flume
-
Components in Flume’s Architecture
-
Configuring Flume Components
-
Demo: Ingest Twitter Data
-
Apache Kafka
-
Aggregating User Activity Using Kafka
-
Kafka Data Model
-
Partitions
-
Apache Kafka Architecture
-
Demo: Setup Kafka Cluster
-
Producer Side API Example
-
Consumer Side API
-
Consumer Side API Example
-
Kafka Connect
-
Demo: Creating Sample Kafka Data Pipeline Using Producer and Consumer
-
Key Takeaways Knowledge Check Practice Project
-
Data Ingestion Into Big Data Systems and ETL
Module 4: Distributed Processing Map
-
Distributed Processing Mapreduce Framework and Pig
-
Distributed Processing in Mapreduce
-
Word Count Example
-
Map Execution Phases
-
Map Execution Distributed Two Node Environment
-
Mapreduce Jobs
-
Hadoop Mapreduce Job Work Interaction
-
Setting Up the Environment for Mapreduce Development
-
Set of Classes
-
Creating a New Project
-
Advanced Mapreduce
-
Data Types in Hadoop
-
Output formats in Mapreduce
-
Using Distributed Cache
-
Joins in Mapreduce
-
Replicated Join
-
Introduction to Pig
-
Components of Pig
-
Pig Data Model
-
Pig Interactive Modes
-
Pig Operations
-
Various Relations Performed by Developers
-
Demo: Analyzing Web Log Data Using Mapreduce
-
Demo: Analyzing Sales Data and Solving Kpis Using Pig
-
Practice Project: Apache Pig
-
Demo: Wordcount
-
Key Takeaways
-
Knowledge Check Practice Project: Distributed Processing – Mapreduce Framework and Pig
Module 5 : Apache Hive
Module 6: No SQL Databases HBase
Module 7: Basics of Functional Programming and Scala
Module 8 : Apache Spark Next-Generation Big Data Framework
Module 9 : Spark Core Processing RDD
Module 10 : Spark SQL Processing Data Frames
Module 11 : Spark M Lib Modelling Big Data with Spark
Module 12 : Stream Processing Frameworks and Spark Streaming
Module 13 : Spark Graph X
Student Ratings & Reviews

-
LevelIntermediate
-
Total Enrolled5
-
Duration30 hours
-
Last UpdatedJanuary 18, 2023
-
CertificateCertificate of completion