Cloudera Administrator Training for Apache Hadoop

Training » Administrator Training » Apache Hadoop » Cloudera Administrator Training for Apache Hadoop

Course Summary

This three-day hands-on training course is for system administrators and others responsible for managing Apache Hadoop clusters in production or development environments.

Download the full agenda for Cloudera's Administrator Training for Apache Hadoop.

[top] Duration

3 days.

[top] You Will Learn

  • How the Hadoop Distributed File System and MapReduce work
  • What hardware configurations are optimal for Hadoop clusters
  • What network considerations to take into account when building out your cluster
  • How to configure Hadoop's options for best cluster performance
  • How to configure NameNode High Availability
  • How to configure NameNode Federation
  • How to configure the FairScheduler to provide service-level agreements for multiple users of a cluster
  • How to install and implement Kerberos-based security for your cluster
  • How to maintain and monitor your cluster
  • How to load data into the cluster from dynamically-generated files using Flume and from relational database management systems using Sqoop
  • What system administration issues exist with other Hadoop projects such as Hive, Pig and HBase

[top] Prerequisites

This course is appropriate for system administrators who will be setting up or maintaining a Hadoop cluster. Basic Linux system administration experience is a prerequisite for this training session. Prior knowledge of Hadoop is not required.

Hands-On Exercises

Throughout the course, hands-on labs help students build their knowledge and apply the concepts being discussed.

Certification Exam

Following successful completion of the training class, attendees will be given a voucher for one certification exam attempt. This voucher is non-transfearable and is given only to individuals who successfully complete the entire training class. Learn more about the CCAH certification exam.

[top] Outline

  • Introduction
  • The Case for Apache Hadoop
  • The Hadoop Distributed File System
  • MapReduce
  • An Overview of the Hadoop Ecosystem
  • Planning Your Hadoop Cluster
  • Hadoop Installation
  • Advanced Configuration
  • Hadoop Security
  • Managing and Scheduling Jobs
  • Cluster Maintenance
  • Cluster Monitoring and Troubleshooting
  • Populating HDFS from External Sources
  • Installing and Managing Other Hadoop Projects
  • Conclusion
  • Appendix: Kerberos Configuration

Training Schedule

United States May 2013 Jun 2013 Jul 2013 Aug 2013
Atlanta, GA Jun 19 - Jun 21
Charlotte, NC Jun 26 - Jun 28
Chicago, IL Jun 3 - Jun 5
Dallas, TX May 29 - May 31
Los Angeles, CA Jun 12 - Jun 14
New York, NY Metro Area Jun 24 - Jun 26
Jun 24 - Jun 26
Philadelphia, PA Jun 24 - Jun 26
Sacramento, CA Jul 16 - Jul 18
San Francisco Bay Area, CA Jun 19 - Jun 21
Jul 16 - Jul 18
Jul 16 - Jul 18
Washington, DC Metro Area May 29 - May 31
Jun 24 - Jun 26
Jun 12 - Jun 14
Online May 2013 Jun 2013 Jul 2013 Aug 2013
Virtual Classroom Jun 3 - Jun 5
Jun 24 - Jun 26
Jul 16 - Jul 18
International May 2013 Jun 2013 Jul 2013 Aug 2013
Bangalore, India   Jun 17 - Jun 19
   
Barcelona, Spain   Jun 25 - Jun 27
   
Hong Kong, China   Jun 19 - Jun 21
   
London, United Kingdom   Jun 19 - Jun 21
   
Madrid, Spain May 27 - May 29
     
Madrid, Spain   Jun 10 - Jun 12
   
Melbourne, Australia   Jun 10 - Jun 12
   
Mexico DF, Mexico   Jun 24 - Jun 26
   
Milan, Italy     Jul 8 - Jul 10
 
Montreal, Canada   Jun 24 - Jun 26
   
Ottawa, Canada   Jun 24 - Jun 26
   
Paris, France   Jun 5 - Jun 7
   
Paris, France   Jun 24 - Jun 26
   
Rome, Italy   Jun 17 - Jun 19
   
Shanghai, China   Jun 14 - Jun 16
   
Singapore, Singapore   Jun 19 - Jun 21
   
Sydney, Australia     Jul 1 - Jul 3
 
Toronto, Canada   Jun 24 - Jun 26
Jul 16 - Jul 18
 
Warsaw, Poland May 27 - May 29
  Jul 2 - Jul 4
 
Zurich, Switzerland   Jun 26 - Jun 28