YoVDO

Hadoop for Data Science Tips, Tricks, & Techniques

Offered By: LinkedIn Learning

Tags

Hadoop Courses Data Science Courses Python Courses HDFS Courses

Course Description

Overview

Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.

Syllabus

Introduction
  • Welcome
  • What you should know
  • Exercise files
  • Environment setup
1. Working with Files
  • Organize files in HDFS
  • Upload files to HDFS
  • Move files in HDFS
  • Remove files in HDFS
2. Connecting to Hadoop
  • Explore Hive through Beeline
  • Access Hive from Python
  • Create aggregates in Hive
  • Select partitions in Hive
3. Complex Data Structures in Hive
  • Map data in Hive
  • Arrays in Hive
  • Structs in Hive
  • Create flat tables for Impala
  • Deconstruct Impala queries
Conclusion
  • Next steps

Taught by

Ben Sullins

Related Courses

Artificial Intelligence for Robotics
Stanford University via Udacity
Intro to Computer Science
University of Virginia via Udacity
Design of Computer Programs
Stanford University via Udacity
Web Development
Udacity
Programming Languages
University of Virginia via Udacity