Cloudera Essentials for Apache Hadoop
Explore the basics of Apache Hadoop, including the Hadoop Distributed File System (HDFS), MapReduce, and the anatomy of a Hadoop cluster. Learn how CDH (Cloudera's Distribution, including Apache Hadoop) addresses the limitations of traditional computing, helps businesses overcome real challenges, and powers new types of big data analytics. This series also introduces the rest of the Apache Hadoop ecosystem and outlines how to prepare the data center and manage Hadoop in production.
Chapter 1: The Motivation for Hadoop
This webinar explores traditional large-scale computing systems and their limitations, alternative approaches to analytics and how Apache Hadoop addresses big data issues.
Chapter 2: Dissecting the Apache Hadoop Stack
There are many components working together in an Apache Hadoop stack. By understanding how each functions, you gain more insight into Hadoop’s functionality in your own IT environment. This webinar goes beyond the motivation for Apache Hadoop and dissects the Hadoop Distributed File System (HDFS), MapReduce and the general topology of a Hadoop cluster.
Chapter 3: Solving Business Challenges with Apache Hadoop
Learn how Apache Hadoop is used in the real world. This webinar explores ways to use Apache Hadoop to harness big data and solve business problems in ways never before imaginable. Explore common business challenges that can be addressed using Hadoop, the origins of big data, types of analyses powered by Hadoop and real-world industry use cases for Hadoop.
Chapter 4: Getting to Know the Components of the Apache Hadoop Ecosystem
Various projects make up the Apache Hadoop ecosystem, and each improves data analysis in its own unique way. This webinar reviews Apache Hive, Pig, HBase, Flume, Sqoop and Oozie, how they function within the stack and how they help integrate Hadoop within the production environment.
Chapter 5: Preparing Your Data Center for Hadoop
It is critical to understand how Apache Hadoop will affect the current setup of the data center and to plan ahead. This webinar helps you seamlessly integrate the platform into your environment. Find out what resources are required to deploy Hadoop and how to plan for cluster capacity.
Chapter 6: Managing the Elephant in the Room
Once you have Hadoop implemented in your environment, what’s next? How do you get the most out of the technology while managing it on a daily basis? Thgis webinar explores the different resources and job skills necessary to manage Hadoop, as well as the different roles and how to overcome hiring and training challenges.
Next Steps
- Learn how Cloudera Manager can increase the performance and decrease the cost of your Apache Hadoop cluster in production.
- Watch webinars on the benefits of CDH for the federal government and bioinformatics industry.
- Explore how Cloudera Impala powers analytics at the speed of thought.
- If you're an administrator, developer, data analyst, HBase specialist, or aspiring data scientist, Cloudera offers training and certification to meet your needs.