Course Provider
What will you learn in this course?
In this course, you will learn about data storage types, different types of data, analytical methods using a variety of tools, data warehousing, data manipulation, Hadoop for Big Data, HDFS, and MapReduce programs for processing data, giving you a clear understanding of Big Data Engineering.
Data Engineering
- Skill Type: Emerging Tech
- Domain: Big Data Analytics
- Course Category: Deepskilling Course
- Certificate Earned: Joint Co-Branded Participation Certificate
- Course Covered under GoI Incentive: Yes
- Course Price: INR 12,000
- Course Duration: 36 Hours
Why should you take this course?
- Introduction to Big Data technologies
- Challenges in real-world data
- How to store, analyze, and get insights from huge volumes of data
- How Hadoop distributed file storage works
- Various components of Hadoop
- MongoDB terminologies
- Architecture
- Key features and Indexing
- Schema design
- Challenges with RDBMS
- Types of NoSQL databases
- Get introduced to Spark concepts and set up the environment
- A good understanding of RDD and working knowledge of RDD operations
- An overview of Spark architecture
- Concepts of performance tuning
- Job submission and job management
Who should take this course?
IT professionals who want to switch to data engineering
Curriculum
- Hadoop: History and Architecture
- Hadoop Components
- HDFS Architecture
- HDFS Operations
- Hands-on Exercises on Jigsaw Lab
- MapReduce Concept
- MapReduce Architecture
- YARN
- MapReduce Internals
- Hands-on Exercises
- Introduction to Spark and Setup
- Spark Ecosystem and Abstractions
- Programming in Scala
- Spark Properties & Use Cases
- Basics of RDD Operations
- Transformations in RDD
- Actions in RDD
- Advanced RDD Operations
- RDD Persistence
- Overview of Shared Variables: Accumulators
- Job Submission & Execution
- Performance Tuning
- Job Scheduling and Management
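
To give a feel for the MapReduce concept listed in the curriculum, here is a minimal sketch in plain Python of the map, shuffle, and reduce phases applied to a word count. It is not Hadoop or course material: it runs on a single machine, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are illustrative assumptions rather than part of any framework API.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group values by key, the step a framework performs
    between the map and reduce phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: combine the list of values for each key into one count."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines):
    """Chain the three phases; a real cluster would distribute each one."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["big data needs big storage", "data engineering"]
    print(word_count(sample))
```

In a real Hadoop job the map and reduce functions would run in parallel across many nodes and the shuffle would move data over the network; the sketch only mirrors the logical flow.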
Tools you will learn in the course
- Big data
- Hadoop
- Data analytics
- Data engineers
- Hive
- SQL
- Data
- HDFS
- MapReduce
- YARN
- HBase
- Sqoop
- NoSQL
- Big data developer
- Data engineering
- Datatypes