| Courses Software Training / Hardware Training / Application Programming | Locality Marathahalli |
1. Course Design and Duration
This training course is meant specifically for developers who are building solutions around Hadoop to solve their big data problems. In my experience, however, you can develop robust applications only if you know what is going on behind the scenes. It is therefore necessary to understand the architecture of Hadoop, how it is installed, and the various tasks that need to be done to run it.
For that reason, the course also covers the administration-related aspects of Hadoop that are required to understand the MapReduce framework more clearly.
The duration of the course is 35 hours, including 5 hours of basic Java and Linux training.
2. Course Contents
Introduction and Motivation of Hadoop
o What is Big Data
o Challenges in Big Data
o Challenges in Traditional Applications
o New Requirements
o What is Hadoop
o Brief history of Hadoop
o Features of Hadoop
o Hadoop vs. RDBMS
o Hadoop Ecosystem overview
o Overview of HDFS and MapReduce
Understanding Hadoop Distributed File System
o Understanding Configuration
o HDFS Concepts
  o Blocks
  o Replication
  o Version File
  o Safe mode
  o Namespace IDs
o Reading and Writing in HDFS
o Understanding NameNode
o Understanding DataNode
o Understanding Secondary NameNode
o Understanding JobTracker
o Understanding TaskTracker
HDFS Shell Commands
o Hands On Exercise
Accessing HDFS using API
o Understanding HDFS Java classes and methods
o Hands On Exercise
MapReduce Programming
o Understanding blocks and input splits
o Common Input and Output Formats
o MapReduce Data types
o Understanding Writable and WritableComparable (Introduction)
o Data Flow in MapReduce Application
o Understanding WordCount problem
o Writing MapReduce Application
  o Understanding Mapper function
  o Understanding Reducer Function
  o Understanding Driver
  o Understanding Tool Runner
  o Hands on Exercise
Solving common problems
o Average word length
o Inverted Index
o Word Co-Occurrence
o Searching
o Sorting
o Hands on exercise
Advanced MapReduce
o Using Combiner
o Using Distributed Cache
o Passing parameters to the mapper and reducer
o Hands On Exercise
o Writing Custom Keys and Values
o Writing Custom Partitioner
o Hands On Exercise
o Writing Custom Input Format
o Hands On
Secondary Sorting
o Motivation
o Understanding
o Hands On
Joins
o Map Side Join
o Reduce Side Join
o Hands On
Practical development
o Calculating Number of Reducers
o Map Only Jobs
o Compression
Introduction to Pig
o Terminology
o Understanding Pig Program, Structure and Execution
o Pig Data types
o Loading and Dumping Data
o Filtering
o Group and Co-Group
o Joins
  o Inner Join
  o Left Outer Join
  o Right Outer Join
  o Full Outer Join
o Hands On
Introduction to Hive
o Motivation and Understanding Hive
o Using Hive Command Line Interface
o Data types and File Formats
o Basic DDL operations
o Schema Design
Sqoop
o Introduction to Sqoop
o Architecture
o Installing Sqoop
o Sqoop Import
o Sqoop Export
o Sqoop Jobs
o Sqoop Codegen
o Hands On
Introduction to HBase
o Motivation and Understanding HBase
o CAP theorem
o Installing HBase
o Directory Structure of HBase
o Using HBase shell
o Storing Data/Creating Tables
o Data Reading/Writing into HBase Tables
o Hands On
Contact us for more details
ELEGANT IT SERVICES
9986541264
Email: [email protected]
Website: www.elegantitservices.com