Hadoop for Data Science Tips, Tricks, & Techniques
Offered By: LinkedIn Learning
Course Description
Overview
Get up to speed with Hadoop. Learn tips and tricks for doing data science work on this popular big data platform.
Syllabus
Introduction
- Welcome
- What you should know
- Exercise files
- Environment setup
- Organize files in HDFS
- Upload files to HDFS
- Move files in HDFS
- Remove files in HDFS
- Explore Hive through Beeline
- Access Hive from Python
- Create aggregates in Hive
- Select partitions in Hive
- Map data in Hive
- Arrays in Hive
- Structs in Hive
- Create flat tables for Impala
- Deconstruct Impala queries
- Next steps
Taught by
Ben Sullins
Related Courses
- Intro to Hadoop and MapReduce (Cloudera via Udacity)
- Processing Big Data with Hadoop in Azure HDInsight (Microsoft via edX)
- Implementing Real-Time Analytics with Hadoop in Azure HDInsight (Microsoft via edX)
- Hadoop Platform and Application Framework (University of California, San Diego via Coursera)
- Data Manipulation at Scale: Systems and Algorithms (University of Washington via Coursera)