Hadoop for Data Science Tips, Tricks, & Techniques

Offered By: LinkedIn Learning

Tags

Hadoop Courses, Data Science Courses, Python Courses, HDFS Courses

Course Description

Overview

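Get up to speed with Hadoop. Learn tips and tricks for doing data science work on this popular big data platform: managing files in HDFS, querying Hive through Beeline and from Python, working with Hive's complex data types (maps, arrays, and structs), and creating flat tables for Impala.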

Syllabus

Introduction
  • Welcome
  • What you should know
  • Exercise files
  • Environment setup
1. Working with Files
  • Organize files in HDFS
  • Upload files to HDFS
  • Move files in HDFS
  • Remove files in HDFS
2. Connecting to Hadoop
  • Explore Hive through Beeline
  • Access Hive from Python
  • Create aggregates in Hive
  • Select partitions in Hive
3. Complex Data Structures in Hive
  • Map data in Hive
  • Arrays in Hive
  • Structs in Hive
  • Create flat tables for Impala
  • Deconstruct Impala queries
Conclusion
  • Next steps

Taught by

Ben Sullins

Related Courses

  • Azure Synapse SQL Pool - Implement Polybase (Coursera Project Network via Coursera)
  • Big Data Essentials: HDFS, MapReduce and Spark RDD (Yandex via Coursera)
  • Big Data, Hadoop, and Spark Basics (IBM via edX)
  • Big Data Hadoop Certification Training (Edureka)
  • Big Data Analytics with Hadoop and Apache Spark (LinkedIn Learning)