Hadoop Training in Chennai

Hadoop Training in Chennai

Hadoop is the data platform chosen by many because it provides high performance – especially if you replace MapReduce in the Hadoop stack with the Apache Spark data processing engine.

Hadoop has evolved into a user-friendly data management system. Different implementations have done their part to optimize Hadoop’s manageability through different administrative tools. Look for a distribution that has intuitive administrative tools that assist in management, troubleshooting, job placement and monitoring.

Hadoop Training in Chennai

Hadoop Training Institute in Chennai

Course Pre-Requisites:

This course targets administrators and operators with at least basic Linux system administration experience. Prior Hadoop experience is not required.

COURSE CONTENTS

 

1. INTRODUCTION

Introduction to Big Data and Hadoop
Getting Started with Hadoop

2. HDFS

HDFS – Hadoop Distributed File System
HDFS Architecture
HDFS Components – Namenode, Datanode, Jobtracker, Tasktracker & Secondary Namenode
Fault tolerance & High availability
Failure handling – FSImage, Edits, Backup nodes
HDFS Commands

3. HADOOP SETUP

Single Node setup
Multi Node setup
Scaling up/down Hadoop cluster
Replication distribution and automatic discovery

4. MAPREDUCE

Map Reduce Anatomy
Map Reduce Examples
Running MapReduce programs in Hadoop
YARN Introduction
Hadoop 2.x vs Hadoop 1.x

5. APACHE PIG

Apache Pig Introduction
Apache Pig Setup
Apache Pig Commands
Structured(including XML/JSON) data processing using Apache Pig
Unstructured data processing using Apache Pig
Best Practices for Pig

6. APACHE HIVE

Apache Hive – Introduction
Apache Hive – Setup
Managed tables & external tables
Apache Hive – Commands
Structured data processing using Apache Hive
PIG vs. HIVE
Partitioning in managed & external tables
Clustering in managed & external tables
Unstructured data processing using Apache Hive
Best Practices for Hive

7. SQOOP

Importing RDB data to HDFS
Importing RDB data to Hive
Importing RDB data to HBase
Exporting HDFS/Hive/HBase data to RDB

8. HBASE

NoSQL – Introduction
HBase – Architecture
ZooKeeper
Region servers, MemCache & WAL
HBase commands
HBase filters
Region splits
Compactions (Major & Minor)
Common issues and fixes in HBase + Best practices
HBase Connectors for Pig & Hive

9. FLUME

Flume with Local
Flume with HDFS
Flume with HBASE

10. CONCLUSION

Hadoop Best Practices and Use Cases
Project/POC

 

Hadoop Training in  Chennai

info@bigdatatraining.in

http://www.bigdatatraining.in/contact/

Call – +91 97899 68765 / 044 – 42645495