Hadoop for Data Science Tips, Tricks, & Techniques
Offered By: LinkedIn Learning
Course Description
Overview
Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.
Syllabus
Introduction
- Welcome
- What you should know
- Exercise files
- Environment setup
- Organize files in HDFS
- Upload files to HDFS
- Move files in HDFS
- Remove files in HDFS
- Explore Hive through Beeline
- Access Hive from Python
- Create aggregates in Hive
- Select partitions in Hive
- Map data in Hive
- Arrays in Hive
- Structs in Hive
- Create flat tables for Impala
- Deconstruct Impala queries
- Next steps
Taught by
Ben Sullins
Related Courses
Data AnalysisJohns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Scientific Computing
University of Washington via Coursera Introduction to Data Science
University of Washington via Coursera Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera