Hadoop for Data Science Tips, Tricks, & Techniques
Offered By: LinkedIn Learning
Course Description
Overview
Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.
Syllabus
Introduction
- Welcome
- What you should know
- Exercise files
- Environment setup
- Organize files in HDFS
- Upload files to HDFS
- Move files in HDFS
- Remove files in HDFS
- Explore Hive through Beeline
- Access Hive from Python
- Create aggregates in Hive
- Select partitions in Hive
- Map data in Hive
- Arrays in Hive
- Structs in Hive
- Create flat tables for Impala
- Deconstruct Impala queries
- Next steps
Taught by
Ben Sullins
Related Courses
Artificial Intelligence for RoboticsStanford University via Udacity Intro to Computer Science
University of Virginia via Udacity Design of Computer Programs
Stanford University via Udacity Web Development
Udacity Programming Languages
University of Virginia via Udacity