YoVDO

Spark Overview for Scala Analytics

Offered By: Cognitive Class

Tags

Apache Spark Courses Data Science Courses Scala Courses Object-oriented programming Courses DataFrames Courses RDDs Courses

Course Description

Overview

The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.This course is meant to be an overview of Spark and its associated ecosystem.
There are 5 modules to this course.
1. What is Spark
2. Introduction to RDDs
3. Introduction to DataFrames
4. Advanced Spark Topics
5. Introduction to Spark MLlib

Syllabus

1. Experience with Java (preferred), Python, or another object oriented language
2. No previous Spark knowledge is required
3. No previous experience with Data Science concepts is required. These concepts will be explained as needed

Related Courses

Data Analysis
Johns Hopkins University via Coursera
Computing for Data Analysis
Johns Hopkins University via Coursera
Scientific Computing
University of Washington via Coursera
Introduction to Data Science
University of Washington via Coursera
Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera