HDP Analyst: Apache HBase Essentials

Book Now

Overview

This course is designed for big data analysts who want to use the HBase NoSQL database which runs on top of HDFS to provide real-time read/write access to sparse datasets. Topics include HBase architecture, services, installation and schema design.

Duration

2 days

Who is the course for

Attendees must have basic familiarity with data management systems. Familiarity with Hadoop or databases is helpful but not required. Attendees new to Hadoop are encouraged to attend the HDP Overview: Apache Hadoop Essentials course.

Prerequisites

Attendees must have basic familiarity with data management systems. Familiarity with Hadoop or databases is helpful but not required. Attendees new to Hadoop are encouraged to attend the HDP Overview: Apache Hadoop Essentials course.

What you will learn

  • How HBase integrates with Hadoop and HDFS
  • Architectural components and core concepts of HBase
  • HBase functionality
  • Installing and configuring HBase
  • HBase schema design
  • Importing and exporting data
  • Backup and recovery
  • Monitoring and managing HBase
  • How Apache Phoenix works with HBase
  • How HBase integrates with Apache ZooKeeper
  • HBase services and data operations
  • Optimizing HBase Access

Course Outline

Hands-on Labs

  • Using Hadoop and MapReduce
  • Using HBase
  • Importing Data from MySQL to HBase
  • Using Apache ZooKeeper
  • Examining Configuration Files
  • Using Backup and Snapshot
  • HBase Shell Operations
  • Creating Tables with Multiple Column Families
  • Exploring HBase Schema
  • Blocksize and Bloom filters
  • Exporting Data
  • Using Java Data Access Object Application to Interact with HBase

Format

  • 35% Lecture/Discussion
  • 65% Hands-on Labs

Related Training Courses

HDP Developer: Apache Pig and Hive This 4-day hands-on training course teaches attendees how to develop applications and analyse big data stored in Apache Hadoop 2.x using Pig and Hive.

HDP Developer: Java Applications This 4-day course provides Java programmers a deep-dive into Hadoop 2.x application development.

HDP Operations: Hadoop Administration 1 This 4-day course is designed for Hortonworks Data Platform administrators, and covers installation, configuration, maintenance, security and performance topics.

HDP Operations: Hadoop Administration 2 This 3-day course is designed for experienced administrators who manage Hortonworks Data Platform (HDP) 2.3 clusters with Ambari.

HDP Administrator: Security This 3-day course is designed for experienced administrators who will be implementing secure Hadoop clusters using authentication, authorisation, auditing and data protection strategies and tools.

HDP Analyst: Data Science This 3-day course provides instruction on the processes and practice of data science, including machine learning and natural language processing.

HDP Operations: Hortonworks Data Flow This 3-day course is designed for ‘Data Stewards’ or ‘Data Flow Managers’ who are looking forward to automate the flow of data between systems.

HDP Operations: Apache HBase Advanced Management This 4-day course is designed for administrators who will be installing, configuring and managing HBase clusters.

HDP Developer: Enterprise Spark 1 This 4-day course is designed as an entry point for developers who need to create applications to analyse Big Data stored in Apache Hadoop using Spark.

No Events on The List at This Time

X