Hadoop for Data Science Tips, Tricks, & Techniques
Offered By: LinkedIn Learning
Course Description
Overview
Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.
Syllabus
Introduction
- Welcome
- What you should know
- Exercise files
- Environment setup
- Organize files in HDFS
- Upload files to HDFS
- Move files in HDFS
- Remove files in HDFS
- Explore Hive through Beeline
- Access Hive from Python
- Create aggregates in Hive
- Select partitions in Hive
- Map data in Hive
- Arrays in Hive
- Structs in Hive
- Create flat tables for Impala
- Deconstruct Impala queries
- Next steps
Taught by
Ben Sullins
Related Courses
- Big Data Essentials: HDFS, MapReduce and Spark RDD (Yandex via Coursera)
- Créez votre Data Lake (CentraleSupélec via OpenClassrooms)
- Big data Internship Program - Foundation (Udemy)
- Learning Hadoop (LinkedIn Learning)
- Azure Synapse SQL Pool - Implement Polybase (Coursera Project Network via Coursera)