Data Analysis with Apache Hive


About this Course

This course covers how to use Apache Hive to query structured data without writing MapReduce code. You will learn how Apache Hive fits in the Hadoop ecosystem, how to create and load tables in Hive, and how to query data using the Hive Query Language. Best for data analysts and developers interested in the data pipeline, and those familiar with SQL who want to use data on a distributed filesystem.

Duration : 1 day

What’s Covered in the Course

1: Apache Hive Essentials**
  • Describe Apache HiveExplain Apache Hive Use Cases
  • Describe how Apache Hive Fits in the Data Pipeline
  • Understand Data Types in Apache Hive
Lab Activities
    • Connect to the Hive CLI
    • Cast Data Types in Hive
2: Create and Load Data in Apache Hive**
  • Create Databases and Tables
  • Partition and Bucket DataLoad Tables with Data
  • Alter and Drop Tables
  • Examine Databases and Tables
Lab Activities
    • Create a Database
    • Create a Table
    • Create External and Partitioned Tables
    • Load Data into Tables
3: Query Data in Apache Hive**
  • Query Tables
  • Manipulate Tables
  • Combine and Store Tables
Lab Activities
    • Query Data with SELECT
    • Query Data with FunctionsCombine and Store Data