HADOOP 6 Weeks

1 - Hadoop Fundamentals 8
1.1 What is Big Data? Why Big Data?
1.2 Hadoop Architecture & Components

2 - HDFS 4
2.1 HDFS Basics
2.2 File Storage
2.3 Fault Tolerance
2.4 Hadoop Commands

3 - Map Reduce 8
3.1 What Is MapReduce?
3.2 Basic MapReduce Concepts
3.3 Algo of Inputs and Output formats to MR Program

4 - Pig and Latin 12
4.1 Basics of Pig and Why Pig?
4.2 Grunt
4.3 Pig's Data Model
4.4 Writing Evaluation
4.5 Filter
4.6 Load & Store Functions
4.7 Benefits of Pig over SQL language
4.8 Input and Output formats to MR program.
4.9 Error Handling and scope of creating UDFs for Pig.

5 - Hive 16
5.1 What is Hive, why we need it and its importance in DWH?
5.2 How Hive is different from Traditional RDBMS
5.3 Modeling in Hive, creating Hive structures and data load process.
5.4 Concepts of Partitioning, Bucketing, Blocks, Hashing, External Tables etc.
5.5 Concepts of serialization, deserialization
5.6 Different Hive data storage formats including ORC, RC, and Parquet.
5.7 Introduction ton HiveQL and examples.
5.8 Hive as an ELT tool and difference between Pig and Hive
5.9 Performance tuning opportunities in Hive, learnings and Best Practices.
5.10 Writing and mastering Hive UDFs
5.11 Error Handling and scope of creating Hive UDFs.

6 - Sqoop 3
6.1 Sqoop Overview
6.2 Sqoop Exercises

7 - Spark 4
7.1 What Is Spark?
7.2 Basic Spark Concepts
7.3 How Spark differs from Map Reduce?

8 - OOZE
8.1 Introductions
8.2 Programs


- Projects
Send Your Query

  • Can't read the image? click here to refresh.
  • Design & Developed By : ETON SOLUTIONS