Analyzing Big Data with Hive
Offered By: LinkedIn Learning
Course Description
Overview
Learn how to use Hive to analyze large datasets and derive information from data stored in Hadoop. The course covers working with tables, structures, aggregations, clauses, functions, and more.
Syllabus
Introduction
- Welcome
- What you should know before watching this course
- Using the exercise files
- Why use Hive
- How Hive works
- Setting up our demo environment
- Understanding table structures in Hive
- Creating tables in Hive
- Handling CSV files in Hive
- Partitioning tables
- Simple SELECT statement
- Retrieving data from complex structures
- Simple aggregations
- Enhanced aggregations with grouping sets
- Using CUBE and ROLLUP
- Simple filter with the WHERE clause
- Filtering aggregates with HAVING clause
- Finding similar values with LIKE
- Combining tables with JOIN
- When to use SEMI JOIN
- Joining multiple tables together
- Types of data manipulation functions
- String functions
- Math functions
- Date functions
- Conditional functions
- Next steps
Taught by
Ben Sullins
Related Courses
- Excel 2010 (Miríadax)
- Intro to Data Science (Udacity)
- Data Manipulation at Scale: Systems and Algorithms (University of Washington via Coursera)
- Statistical Computing with R - a gentle introduction (University College London via Independent)
- Introducción a Data Science: Programación Estadística con R (Universidad Nacional Autónoma de México via Coursera)