YoVDO

Big Data Computing with Spark

Offered By: The Hong Kong University of Science and Technology via edX

Tags

Apache Spark Courses Hadoop Courses Algorithm Design Courses Spark Streaming Courses RDDs Courses

Course Description

Overview

Big data systems such as Hadoop and Spark emerge as enabling technologies in managing massive amounts of data across hundreds or even thousands of computing nodes. Meanwhile, cloud computing platforms have made these technologies easily accessible to individuals as well as large enterprises. This course is an online adaptation of the signature course MSBD 5003 Big Data Computing offered to our popular MSc Program in Big Data Technology. In addition to 20+ hours of lecture videos, the course contains 100+ multiple-choice questions and 20 coding questions, aimed at equipping learners with both the theory and practical skills of big data systems, using Spark as the exemplary platform.


Syllabus

  • Week 1: Overview, MapReduce, and Hadoop
  • Week 2-3: Spark Basics and RDD
  • Week 4: SparkSQL and MLib
  • Week 5: Spark internals
  • Week 6: Algorithm design for big data
  • Week 7: GraphX/GraphFrames
  • Week 8: Spark Streaming

Taught by

Ke YI

Tags

Related Courses

Big Data Essentials
A Cloud Guru
Big Data
University of Adelaide via edX
Advanced Data Science with IBM
IBM via Coursera
Amazon EMR Getting Started (Indonesian)
Amazon Web Services via AWS Skill Builder
Analisar e preparar dados com o Amazon SageMaker Data Wrangler e o Amazon EMR (Português (Brasil)) | Lab - Analyze and Prepare Data with Amazon SageMaker Data Wrangler and Amazon EMR (Portuguese (Brazil))
Amazon Web Services via AWS Skill Builder