YoVDO

Hadoop for Data Science Tips, Tricks, & Techniques

Offered By: LinkedIn Learning

Tags

Hadoop Courses Data Science Courses Python Courses HDFS Courses

Course Description

Overview

Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.

Syllabus

Introduction
  • Welcome
  • What you should know
  • Exercise files
  • Environment setup
1. Working with Files
  • Organize files in HDFS
  • Upload files to HDFS
  • Move files in HDFS
  • Remove files in HDFS
2. Connecting to Hadoop
  • Explore Hive through Beeline
  • Access Hive from Python
  • Create aggregates in Hive
  • Select partitions in Hive
3. Complex Data Structures in Hive
  • Map data in Hive
  • Arrays in Hive
  • Structs in Hive
  • Create flat tables for Impala
  • Deconstruct Impala queries
Conclusion
  • Next steps

Taught by

Ben Sullins

Related Courses

Data Analysis
Johns Hopkins University via Coursera
Computing for Data Analysis
Johns Hopkins University via Coursera
Scientific Computing
University of Washington via Coursera
Introduction to Data Science
University of Washington via Coursera
Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera