Category: Networking / Telecoms Training / Software Training / Application Programming
Locality: Marathahalli
Best Hadoop Big Data Training in Bangalore - Getin IT Solutions
Hadoop training given by an expert with real-time project experience
Hadoop Big Data Training Course Content
Hadoop Architecture
MODULE 1 - BIG DATA, HADOOP, INTRODUCTION TO HADOOP ARCHITECTURE
Why did Big Data suddenly become so prominent
Limitations of traditional large-scale systems
Compare Hadoop architecture with traditional architecture
Core components of Hadoop
Understanding Hadoop Master-Slave Architecture
Understanding HDFS Architecture
Learn about NameNode, DataNode, Secondary NameNode
Learn about JobTracker, TaskTracker
Anatomy of Read and Write data on HDFS
MODULE 2 - INSTALLING AND SETTING UP A HADOOP CLUSTER
Hadoop deployment modes - Standalone, Pseudo-distributed (single node), Fully distributed (multinode)
Configuration files in a Hadoop Cluster
Important Web URLs for Hadoop
Run HDFS and Linux commands
Run a MapReduce example to get a high-level understanding
Manuals for installation of Hadoop 1.0 and Hadoop 2.0
Manual for Demo VM installation steps for Windows
MODULE 3 - UNDERSTANDING HADOOP MAPREDUCE FRAMEWORK
Overview of the MapReduce Framework
Use cases of MapReduce
MapReduce Architecture
Understand the concept of Mappers, Reducers
Anatomy of MapReduce Program
MapReduce Components - Mapper Class, Reducer Class, Driver code
Splits and Blocks
Understand Combiner and Partitioner
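As a rough illustration of how the Module 3 components fit together (Mapper, Reducer, Driver, Combiner, Partitioner), here is a minimal single-process word count in plain Python. It is only a sketch and does not use the Hadoop API: the function names (mapper, combiner, partitioner, reducer, run_job) are hypothetical, each input line stands in for one input split, and the CRC32-based partitioner is an assumption chosen to keep the example deterministic.

```python
from collections import defaultdict
import zlib


def mapper(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def combiner(pairs):
    """Combiner: pre-aggregate one mapper's output locally before the shuffle."""
    local = defaultdict(int)
    for word, count in pairs:
        local[word] += count
    return list(local.items())


def partitioner(key, num_reducers):
    """Partitioner: choose a reducer for a key (stable hash modulo reducer count)."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def reducer(word, counts):
    """Reduce step: sum all partial counts received for one word."""
    return word, sum(counts)


def run_job(lines, num_reducers=2):
    """Driver: wire map -> combine -> partition/shuffle -> reduce in one process."""
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    for line in lines:  # each line is treated as its own input split (assumption)
        for word, count in combiner(mapper(line)):
            partitions[partitioner(word, num_reducers)][word].append(count)
    result = {}
    for partition in partitions:
        for word in sorted(partition):  # each reducer sees its keys in sorted order
            key, total = reducer(word, partition[word])
            result[key] = total
    return result


if __name__ == "__main__":
    print(run_job(["big data big cluster", "data node data"]))
```

In a real cluster each stage runs as separate tasks on different nodes; the sketch only shows the order in which data passes through them.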
MODULE 4 - ADVANCED MAPREDUCE - PART 1
Write your own Partitioner
Writing Map and Reduce in Python
Map Side Join
Distributed Join
Distributed Cache
Reduce Side Join
Counters
Joining Multiple datasets in MapReduce
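The reduce-side join topic listed in Module 4 can be sketched in plain Python as follows. This is a hedged, in-memory illustration rather than Hadoop code: the two datasets (customers and orders), the tag strings, and all function names are hypothetical. The idea shown is that each mapper tags its records with their source, the shuffle groups them by join key, and the reducer pairs records from both sources (an inner join).

```python
from collections import defaultdict


def map_customers(record):
    """Tag a (customer_id, name) record with its source dataset."""
    customer_id, name = record
    return customer_id, ("customer", name)


def map_orders(record):
    """Tag a (customer_id, amount) record with its source dataset."""
    customer_id, amount = record
    return customer_id, ("order", amount)


def reduce_join(customer_id, tagged_values):
    """Pair every customer name with every order amount sharing the key."""
    names = [value for tag, value in tagged_values if tag == "customer"]
    amounts = [value for tag, value in tagged_values if tag == "order"]
    return [(customer_id, name, amount) for name in names for amount in amounts]


def reduce_side_join(customers, orders):
    """Simulate the shuffle by grouping tagged records on the join key."""
    shuffled = defaultdict(list)
    for key, value in map(map_customers, customers):
        shuffled[key].append(value)
    for key, value in map(map_orders, orders):
        shuffled[key].append(value)
    joined = []
    for key in sorted(shuffled):  # reducers receive keys in sorted order
        joined.extend(reduce_join(key, shuffled[key]))
    return joined


if __name__ == "__main__":
    customers = [(1, "asha"), (2, "ravi")]
    orders = [(1, 250), (1, 100), (3, 75)]
    print(reduce_side_join(customers, orders))
```

Keys that appear in only one dataset produce no output here; a map-side join would instead load the smaller dataset into memory (for example via the distributed cache) and skip the shuffle.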
MODULE 5 - ADVANCED MAPREDUCE - PART 2
MapReduce internals
Understanding Input Format
Custom Input Format
MapReduce API
Hadoop Data Types
Using Writable and WritableComparable
Understanding Output Format
Sequence Files
JUnit and MRUnit Testing Frameworks
MODULE 6 - APACHE PIG
Pig vs MapReduce
Pig components
Pig execution
Pig Data types
Pig Architecture
Pig Latin Relational Operators
Pig Latin Join and CoGroup
Pig Latin Group and Union
Describe, Explain, Illustrate
Pig Latin: File Loaders
Pig Latin: Creating UDFs
Data warehousing with Hive
MODULE 7 - APACHE HIVE AND HIVEQL
What is Hive
Hive DDL - Create/Show/Drop Database
Hive DDL - Create/Show/Drop Tables
Hive DML - Load Files into Tables
Hive DML - Inserting Data into Tables
Hive SQL - Select, Filter, Join, Group By
Hive Architecture & Components
Hive Data Model and Data Units
Difference between Hive and RDBMS
MODULE 8 - ADVANCED HIVEQL
Multi-Table Inserts
Joins
Grouping Sets, Cubes, Rollups
Custom Map and Reduce scripts
Hive SerDe
Hive UDF
Hive UDAF
Data Ingestion Tools
MODULE 9 - APACHE FLUME, APACHE SQOOP, APACHE OOZIE
Sqoop - How Sqoop works
Import/Export Data
Sqoop Architecture
Flume - How it works
Flume Complex Flow - Calculation/Multiplexing
Oozie - Simple/Complex Flow
Oozie - Components
Oozie Service/Scheduler
Example Workflow
Use Cases - Time and Data triggers
Running/Debugging a Coordinator Job
Bundle
MODULE 10 - NOSQL DATABASES
Introduction to NoSQL
CAP theorem
RDBMS vs NoSQL
Analytical (OLAP)
Key Value stores: Memcached, Riak
Key Value stores: Redis, DynamoDB
Column Family: Cassandra, HBase
Graph Store: Neo4j
Document Store: MarkLogic, MongoDB
Document Store: Couchbase, CouchDB, eXist-db
MODULE 11 - APACHE HBASE
When/Why to use HBase
HBase Architecture/Storage
HBase Features
HBase Data Model
HBase Families
Terms and Daemons
HBase Master
HBase vs RDBMS
Column Families
Access HBase Data
HBase API
Runtime modes
Running HBase
MODULE 12 - APACHE ZOOKEEPER
What is ZooKeeper
Who is using it
ZooKeeper Data Model
ZNode versions
ZooKeeper API
ZNode Types
Sequential ZNodes
Security
Standalone/Clustered mode
Installing and Configuring
Running ZooKeeper
ZooKeeper use cases
MODULE 13 - HADOOP 2.0, YARN, MRV2
Hadoop 1.0 Limitations
MapReduce Limitations
History of Hadoop 2.0
HDFS 2: Architecture
HDFS 2: Quorum based storage
HDFS 2: High availability
HDFS 2: Federation
YARN Architecture
Classic vs YARN
YARN App
Big Data in the Cloud
Amazon Web Services
Concepts: Pay-per-use model
Amazon S3, EC2, EMR
Google Cloud Platform
Google BigQuery
For more details and Hadoop Big Data documents, please visit
www.getinitsolutions.com