phone+91-87222 63165 / +1(510)-379-9024 contact@syncomint.com
Try Our Sample Training Videos

Course Info

Hadoop Admin

Hadoop is an open source implementation of MapReduce by the Apache group. Hadoop provides the capability to store and provide tailored processing for huge amounts of data. The Hadoop framework delivers high availability, fault tolerance and redundancy.

Benefits of the Program

This course provides the hands-on experience to install, configure and manage Hadoop platform and its associated ecosystem. You learn to monitor Hadoop using built-in functionality and associated tools.

Topic List

The course program at Syncomint provides the hands-on experience to install, configure and manage the Hadoop platform. Additionally, you learn how to backup and secure your cluster as well as to integrate associated applications and tools. Syncomint provides you Classroom Training as well as Live Virtual Training.

Course Training
[formac-acc title="Lesson 1: Introduction"]The Case for Apache Hadoop, A brief history of Hadoop, Core Hadoop components, Fundamental concepts[/formac-acc] [formac-acc title="Lesson 2: The Hadoop Distributed File System"]HDFS features, HDFS design assumptions, Overview of HDFS architecture, Writing and reading files, NameNode considerations, An overview of HDFS security, Hands-On Exercise[/formac-acc] [formac-acc title="Lesson 3: MapReduce"]What is MapReduce?, Features of MapReduce, Basic MapReduce concepts, Architectural overview, Failure recovery, Hands-On Exercise[/formac-acc] [formac-acc title="Lesson 4: An Overview of the Hadoop Ecosystem"]What is the Hadoop ecosystem?, Integration tools, Analysis tools, Data storage and retrieval tools[/formac-acc] [formac-acc title="Lesson 5: Planning your Hadoop Cluster"]General planning considerations, Choosing the right hardware, Network considerations, Configuring nodes[/formac-acc] [formac-acc title="Lesson 6: Hadoop Installation"]Installing Hadoop, Using Cloudera Manager for easy installation, Basic configuration parameters, Hands-On Exercise[/formac-acc] [formac-acc title="Lesson 7: Advanced Configuration"]Advanced parameters, Configuring rack awareness, Configuring Federation, Configuring High Availability[/formac-acc] [formac-acc title="Lesson 8: Managing and Scheduling Jobs"]Managing running jobs, Hands-On Exercise, The FIFO Scheduler, The FairScheduler, Configuring the FairScheduler, Hands-On Exercise[/formac-acc] [formac-acc title="Lesson 9: Cluster Maintenance"]Checking HDFS status, Hands-On Exercise, Copying data between clusters, Adding and removing cluster nodes, Rebalancing the cluster, Hands-On Exercise, NameNode Metadata backup, Cluster upgrading[/formac-acc] [formac-acc title="Lesson 10: Cluster Monitoring and Troubleshooting"]General system monitoring, Managing Hadoop's log files, Using the NameNode and JobTracker Web UIs, Hands-On Exercise, Cluster monitoring with Ganglia, Common troubleshooting issues, Benchmarking your cluster[/formac-acc] [formac-acc title="Lesson 11: Populating HDFS from External Sources"]An overview of Flume, Hands-On Exercise, An overview of Sqoop, Best practices for importing data[/formac-acc] [formac-acc title="Lesson 12: Installing and Managing Other Hadoop Projects"]Hive, Pig, HBase[/formac-acc]

ClassRoom Schedule

Classroom Training - 10 Days

Day 1

11AM-5PM
Introduction
  • The Case for Apache Hadoop
  • A brief history of Hadoop
  • Core Hadoop components
  • Fundamental concepts

Day 2

11AM-5PM
The Hadoop Distributed File System
  • HDFS features
  • HDFS design assumptions
  • Overview of HDFS architecture
  • Writing and reading files
  • NameNode considerations
  • An overview of HDFS security
  • Hands-On Exercise

Day 3

11AM-5PM
MapReduce
  • What is Map Reduce?
  • Governance vs management of IT
  • Features of Map Reduce
  • Basic Map Reduce concepts
  • Architectural overview
  • Failure recovery
  • Hands-On Exercise

Day 4

11AM-5PM
An Overview of the Hadoop Ecosystem
  • What is the Hadoop ecosystem?
  • Integration tools
  • Analysis tools
  • Data storage and retrieval tools
Planning your Hadoop Cluster
  • General planning considerations
  • Choosing the right hardware
  • Network considerations
  • Configuring nodes

Day 5

11AM-5PM
Hadoop Installation
  • Installing Hadoop
  • Using Cloudera Manager for easy installation
  • Basic configuration parameters
  • Hands-On Exercise

Day 6

11AM-5PM
Advanced Configuration
  • Advanced parameters
  • Configuring rack awareness
  • Configuring Federation/li>
  • Configuring High Availability
Managing and Scheduling Jobs
  • Managing running jobs
  • Hands-On Exercise
  • The FIFO Scheduler
  • The FairScheduler
  • Configuring the FairScheduler
  • Hands-On Exercise

Day 7

11AM-5PM
Cluster Maintenance
  • Checking HDFS status
  • Hands-On Exercise
  • Copying data between clusters
  • Adding and removing cluster nodes
  • Rebalancing the cluster
  • Hands-On Exercise
  • NameNode Metadata backup
  • Cluster upgrading

Day 8

11AM-5PM
Cluster Monitoring and Troubleshooting
  • General system monitoring
  • Managing Hadoop's log files
  • Using the NameNode and JobTracker Web UIs
  • Hands-On Exercise
  • Cluster monitoring with Ganglia
  • Common troubleshooting issues
  • Benchmarking your cluster

Day 9

11AM-5PM
Populating HDFS from External Sources
  • An overview of Flume
  • Hands-On Exercise
  • An overview of Sqoop
  • Best practices for importing data

Day 10

11AM-5PM
Installing and Managing Other Hadoop Projects
  • Hive
  • Pig
  • HBase

Live Virtual Class Schedule

Virtual Training - 2 Days

Day 1

Introduction
  • The Case for Apache Hadoop
  • A brief history of Hadoop
  • Core Hadoop components
  • Fundamental concepts
The Hadoop Distributed File System
  • HDFS features
  • HDFS design assumptions
  • Overview of HDFS architecture
  • Writing and reading files
  • Name Node considerations
  • An overview of HDFS security
  • Hands-On Exercise

Day 2

MapReduce
  • What is Map Reduce?
  • Governance vs management of IT
  • Features of Map Reduce
  • Basic Map Reduce concepts
  • Architectural overview
  • Failure recovery
  • Hands-On Exercise
An Overview of the Hadoop Ecosystem
  • What is the Hadoop ecosystem?
  • Integration tools
  • Analysis tools
  • Data storage and retrieval tools
Planning your Hadoop Cluster
  • General planning considerations
  • Choosing the right hardware
  • Network considerations
  • Configuring nodes

Day 3

Hadoop Installation
  • Installing Hadoop
  • Using Cloudera Manager for easy installation
  • Basic configuration parameters
  • Hands-On Exercise
Advanced Configuration
  • Advanced parameters
  • Configuring rack awareness
  • Configuring Federation/li>
  • Configuring High Availability
Managing and Scheduling Jobs
  • Managing running jobs
  • Hands-On Exercise
  • The FIFO Scheduler
  • The FairScheduler
  • Configuring the FairScheduler
  • Hands-On Exercise
 

Day 4

Cluster Maintenance
  • Checking HDFS status
  • Hands-On Exercise
  • Copying data between clusters
  • Adding and removing cluster nodes
  • Rebalancing the cluster
  • Hands-On Exercise
  • NameNode Metadata backup
  • Cluster upgrading
Cluster Monitoring and Troubleshooting
  • General system monitoring
  • Managing Hadoop's log files
  • Using the NameNode and JobTracker Web UIs
  • Hands-On Exercise
  • Cluster monitoring with Ganglia
  • Common troubleshooting issues
  • Benchmarking your cluster

Day 5

Populating HDFS from External Sources
  • An overview of Flume
  • Hands-On Exercise
  • An overview of Sqoop
  • Best practices for importing data
Installing and Managing Other Hadoop Projects
  • Hive
  • Pig
  • HBase
Connect With Us

Call: +91-87222 63165 (India)
Call: +1 510-379-9024 (USA)

Mail: contact@syncomint.com