Session Topics Duration (Minutes)
#1 - Hadoop Concepts & Architecture
Big Data Definition 20
Structured versus Unstructured Data 20
IBM Definition of Big Data 15
Limitations of existing solutions 30
How Hadoop Addresses These Limitations 10
Hadoop Definition 10
Hadoop Ecosystem 30
Hadoop Components 30
Anatomy of file read and write 30
Rack Awareness 15
Hadoop Architecture 30
Total Time 240
#2 - Hadoop Concepts & Architecture
Revision of the previous session 15
Hadoop Cluster 30
Hadoop Typical Configuration 20
Hadoop Modes 30
Configuration Files 20
Master versus Slave Nodes 15
Hands On
Demo of Configuration Files on the Cluster 20
Running the Hadoop Word Count Example 30
Running the Temperature Example 20
Running HDFS Commands 40
Total Time 240
#3 - Map Reduce Explained
Revision of the previous session 15
Industries where Map Reduce / Hadoop is used 10
Traditional Way of distributed computation 20
Map Reduce Way 20
Advantages of Map Reduce 15
Splits & Blocks 20
Additional Supporting Concepts - Combiner 20
Additional Supporting Concepts - Partitioner 20
Hands On
Demo of Map Reduce with different input formats 60
Demo of Combiners 20
Demo of Partitioners 30
Total Time 250
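
The Map Reduce flow covered in this session (map, combiner, partitioner, reduce) can be sketched in plain Python. This is only an illustrative in-memory simulation, not Hadoop code: the function names, the first-letter partitioning rule, and the sample input are assumptions made for this sketch and are not taken from the course material.

```python
from collections import defaultdict

def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def combine(pairs):
    """Combiner: pre-aggregate counts per split before the shuffle."""
    local = defaultdict(int)
    for word, count in pairs:
        local[word] += count
    return list(local.items())

def partition(word, num_reducers):
    """Partitioner: choose a reducer for a key.
    Hypothetical rule for this sketch: route by the first character."""
    return ord(word[0]) % num_reducers

def reduce_phase(word, counts):
    """Reducer: sum all partial counts received for one word."""
    return word, sum(counts)

def word_count(splits, num_reducers=2):
    """Run map -> combine -> partition -> reduce over a list of splits.
    Each split is a list of text lines."""
    # One bucket per reducer, mapping word -> list of partial counts.
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for split in splits:
        pairs = []
        for line in split:
            pairs.extend(map_phase(line))
        for word, count in combine(pairs):
            buckets[partition(word, num_reducers)][word].append(count)
    result = {}
    for bucket in buckets:
        for word, counts in bucket.items():
            key, total = reduce_phase(word, counts)
            result[key] = total
    return result

if __name__ == "__main__":
    sample_splits = [["the cat sat", "the dog"], ["the cat ran"]]
    print(word_count(sample_splits))
```

Because the combiner runs once per split, the first split sends a single ("the", 2) pair to its reducer instead of two ("the", 1) pairs, which is the reduction in shuffle traffic that combiners are meant to provide.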
#4 - Advanced Map Reduce
Revision of the previous session 15
Joins 30
Sorting 30
Standard & Custom Input Formats 20
Counters 15
Distributed Cache 20
Sequence Files 20
Hands On
Demo of Map and Reduce Side Joins 45
Demo of Counters 15
Demo of Sequence Files 30
Total Time 240
#5 - Pig & Pig Latin
Revision of the previous session 15
Need for Pig 20
When and When Not to Use Pig 20
How it is used at Yahoo 15
Basic Structure 20
Data Model 20
Pig Operators 30
Hands On
Word Count in Pig 20
Max Temperature in Pig 20
Custom Functions in Pig 30
Movies Example 15
Joins Example in Pig 15
Total Time 240
#6 - Hive
Revision of the previous session 15
How Hive Came into the Picture 15
Hive Definition 15
Hive versus Pig 10
Hive Architecture and Components 20
Hive Limitations 15
Hive Versus RDBMS 15
Hive Data Model 20
Partition & Buckets 30
Hands On
Hive Commands 30
Table Joins 25
Data Uploads Using Sqoop & Flume 30
Total Time 240
#7 - NoSQL Databases & HBase
Revision of the previous session 15
Need for HBase 15
NoSQL World 20
HBase Defined 15
HBase History 10
HBase versus RDBMS 15
When and When Not to Use HBase 15
Understanding HBase Better 20
HBase Data Model 15
HBase Storage Architecture 15
Hands On
HBase Commands 30
Hive to HBase 15
Pig to HBase 15
Sqoop & Flume to HBase 25
Total Time 240
#8 - Apache Oozie & Zookeeper
Revision of the previous session 15
Zookeeper Definition 30
Example Configurations 30
Oozie Definition 30
Example Configuration 30
Hands On
Demo of Oozie configured with Pig, Hive and Map Reduce 45
Demo of Zookeeper configured with Pig, Hive and Map Reduce 45
Sqoop and Flume with Oozie 15
Total Time 240
#9 - Hadoop 2.0
Revision of the previous session 15
Hadoop 1.0 Challenges 20
Hadoop 2.0 New Features 20
Hadoop 2.0 High Availability 20
Hadoop 2.0 Federation 20
New Cluster Architecture 20
Hadoop 2.0 Components 20
Hadoop 2.0 Map Reduce Flow 30
Hands On
Map Reduce Example in 2.0 30
Pig Example in 2.0 15
Hive Example in 2.0 15
HBase Example in 2.0 15
Total Time 240
#10 - Visualization Tools - Part A
Revision of the previous session 15
Mahout Definition and Components 60
Pentaho Definition and Components 60
Hands On
Demo of Mahout 60
Demo of Pentaho 60
Total Time 255
#11 - Visualization Tools - Part B
Revision of the previous session 15
R as a Data Visualization Language 30
R Studio 15
Obtaining Data with R 60
Plotting Data with R 60
Hands On
Simple R Data Reading and Writing Examples 30
Examples of Data Plotting with R 30
Total Time 240
#12 - Project Work
Explaining the Use Cases Available 60
Explaining the Project Completion Process 60
Explaining a Suggested / Sample Implementation 60
Queries / Suggestions 60
Total Time 240