Hadoop for Data Science Tips, Tricks, & Techniques
Offered By: LinkedIn Learning
Course Description
Overview
Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.
Syllabus
Introduction
- Welcome
- What you should know
- Exercise files
- Environment setup
- Organize files in HDFS
- Upload files to HDFS
- Move files in HDFS
- Remove files in HDFS
- Explore Hive through Beeline
- Access Hive from Python
- Create aggregates in Hive
- Select partitions in Hive
- Map data in Hive
- Arrays in Hive
- Structs in Hive
- Create flat tables for Impala
- Deconstruct Impala queries
- Next steps
Taught by
Ben Sullins
Related Courses
Azure Synapse SQL Pool - Implement PolybaseCoursera Project Network via Coursera Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data, Hadoop, and Spark Basics
IBM via edX Big Data Hadoop Certification Training
Edureka Big Data Analytics with Hadoop and Apache Spark
LinkedIn Learning