This course is designed for administrators who will be installing, configuring and managing HBase clusters. It covers installation with Ambari, configuration, security and troubleshooting HBase implementations. The course includes an end-of-course project in which students work together to design and implement an HBase schema.
Who is the course for
Architects, software developers, and analysts responsible for implementing non-SQL databases in order to handle sparse data sets commonly found in big data use cases.
Students must have basic familiarity with data management systems. Familiarity with Hadoop or databases is helpful but not required. Students new to Hadoop are encouraged to take the HDP Overview: Apache Hadoop Essentials course.
What you will learn
- Hadoop Primer: Hadoop, Hortonworks, and Big Data; HDFS and YARN
- Discussion: Running Applications in the Cloud
- Apache HBase Overview
- Provisioning the Cluster
- Using the HBase Shell
- Ingesting Data
- Operational Management
- Backup and Recovery
- Monitoring HBase and Diagnosing Problems
- Installing and Configuring HBase with Ambari
- Manually Installing HBase (Optional)
- Using Shell Commands
- Ingesting Data using ImportTSV
- Enabling HBase High Availability
- Viewing Log Files
- Configuring and Enabling Snapshots
- Configuring Cluster Replication
- Enabling Authentication and Authorization
- Diagnosing and Resolving Hot Spotting
- Region Splitting
- Monitoring JVM Garbage Collection
- End-of-Course Project: Designing an HBase Schema
- 50% Lecture/Discussion
- 50% Hands-on Labs
Related Training Courses
HDP Developer: Apache Pig and Hive This 4-day hands-on training course teaches attendees how to develop applications and analyse big data stored in Apache Hadoop 2.x using Pig and Hive.
HDP Operations: Hadoop Administration 1 This 4-day course is designed for Hortonworks Data Platform administrators, and covers installation, configuration, maintenance, security and performance topics.
HDP Administrator: Security This 3-day course is designed for experienced administrators who will be implementing secure Hadoop clusters using authentication, authorisation, auditing and data protection strategies and tools.
HDP Analyst: Data Science This 3-day course provides instruction on the processes and practice of data science, including machine learning and natural language processing.
HDP Operations: Hortonworks Data Flow This 3-day course is designed for ‘Data Stewards’ or ‘Data Flow Managers’ who are looking forward to automate the flow of data between systems.
HDP Analyst: Apache HBase Essentials This 2-day workshop introduces HBase basics, structure and operations in an intensely hands-on experience.
HDP Developer: Enterprise Spark 1 This 4-day course is designed as an entry point for developers who need to create applications to analyse big data stored in Apache Hadoop using Spark.No Events on The List at This Time