Analyzing Big Data with Hive
Offered by: LinkedIn Learning
Course Description
Overview
Learn how to use Hive to analyze large datasets and derive information from data stored in Hadoop. Topics include tables, data structures, aggregations, filtering clauses, joins, functions, and more.
Syllabus
Introduction
- Welcome
- What you should know before watching this course
- Using the exercise files
- Why use Hive
- How Hive works
- Setting up our demo environment
- Understanding table structures in Hive
- Creating tables in Hive
- Handling CSV files in Hive
- Partitioning tables
- Simple SELECT statement
- Retrieving data from complex structures
- Simple aggregations
- Enhanced aggregations with grouping sets
- Using CUBE and ROLLUP
- Simple filter with the WHERE clause
- Filtering aggregates with HAVING clause
- Finding similar values with LIKE
- Combining tables with JOIN
- When to use SEMI JOIN
- Joining multiple tables together
- Types of data manipulation functions
- String functions
- Math functions
- Date functions
- Conditional functions
- Next steps
Taught by
Ben Sullins
Related Courses
- Big Data Analytics in Healthcare (Georgia Institute of Technology via Udacity)
- Mining Massive Datasets (Stanford University via edX)
- The Caltech-JPL Summer School on Big Data Analytics (California Institute of Technology via Coursera)
- Big Data Analytics for Healthcare (Georgia Institute of Technology via Coursera)
- Data Lakes for Big Data (EdCast)