Cloudera Training for Apache HBase

Course Summary

Cloudera's training for Apache HBase is designed for developers already familiar with Apache Hadoop. Participants should understand Hadoop's architecture and APIs and have experience writing basic applications. Cloudera's Hadoop Developer course provides the necessary background.

Duration

2 days.

You Will Learn

  • Understand the HBase architecture
  • Use the HBase shell to directly manipulate HBase tables
  • Design optimal HBase schemas for efficient data storage and retrieval
  • Connect to HBase using the Java API
  • Understand how to bulk-load data into HBase using MapReduce
  • Administer an HBase cluster
  • Identify and resolve performance bottlenecks

Additional Notes

Download the full agenda for Cloudera's Training for Apache HBase.

Hands-On Exercises

During this course, students will perform numerous hands-on exercises.

Outline

Introduction

Introduction to HBase

  • What is HBase
  • HDFS and HBase
  • Hands-On Exercise
  • HBase Usage Scenarios

The HBase Data Model

  • Overview of the HBase Data Model
  • Designing Tables
  • Column Families
  • Cells and Versions

The HBase Shell

  • Running the shell
  • Creating Tables
  • Accessing Data in Tables
  • Administration
  • Scripting
  • Hands-On Exercise

HBase Architecture

  • Components of an HBase Cluster
  • Regions
  • Flushes and Compactions
  • Hands-On Exercises
  • Reading and Writing to HBase
  • Bloom Filters and Block Cache
  • WAL and Crash Recovery

Schema Design

  • Schema Fundamentals
  • Designing for Locality
  • Timeseries Data
  • Reverse Timestamps
  • Secondary Indexes
  • Schema Design Examples
  • Hands-On Exercise

The HBase API

  • Administrative Classes
  • Basic Operations
  • Advanced Operations
  • Hands-On Exercise
  • Non-Java APIs

MapReduce and Bulk Loads

  • What is MapReduce?
  • HBase and MapReduce
  • Hands-On Exercise
  • Bulk Loading with MapReduce

HBase Configuration

  • Configuration Overview
  • Operating System Requirements
  • Hadoop Requirements
  • HBase Run Modes
  • ZooKeeper
  • Configuration Files

HBase Administration

  • Monitoring
  • Backup
  • Tools
  • Compression
  • Managed Operations
  • Hands-On Exercise
  • Capacity Planning

Performance Tuning

  • Operating System
  • Network Performance
  • Java and Garbage Collection
  • HBase Configurations
  • Client Configurations
  • Troubleshooting

Conclusion

Training Schedule

Location                      May 2012    Jun 2012           Jul 2012           Aug 2012
Seaport Conference Center                 Jun 27 - Jun 28
Bridge Education - Columbia                                  Jul 19 - Jul 20
Training Choice - Sydney                                                        Aug 2 - Aug 3
New Horizons - Berlin                     Jun 7 - Jun 8
Learning Tree - London                                       Jul 2 - Jul 3