YoVDO

Scala Essential Training for Data Science

Offered By: LinkedIn Learning

Tags

Scala Courses Data Science Courses Programming Courses Parallel Processing Courses DataFrames Courses RDDs Courses

Course Description

Overview

Use Scala in your data science work. Explore the Scala features most useful to data scientists, including custom functions, parallel processing, and programming Spark with Scala.

Syllabus

Introduction
  • Welcome
  • What you should know
  • Using the exercise files
1. Introduction to Scala
  • The advantages of Scala for data science
  • Installing Scala
  • Scala data types
  • Scala collections
  • Scala sets Scala arrays, vectors, and ranges
  • Scala maps
  • Scala expressions
  • Scala functions
  • Scala objects
2. Parallel Processing in Scala
  • Advantages of parallel collections
  • Creating parallel collections
  • Mapping functions over parallel collections
  • Filtering parallel collections
  • When and when not to use parallel collections
3. Using SQL in Scala
  • Installing PostgreSQL
  • Loading data into PostgreSQL
  • Connecting to PostgreSQL
  • Querying with SQL strings
  • Querying with prepared statements
  • Summary of SQL in Scala
4. Scala and Spark RDDs
  • Introduction to Spark
  • Installing Spark
  • Getting Started with Spark RDDs
  • Mapping Functions over RDDs
  • Statistics over RDDs
  • Summary of Scala and Spark RDDs
5. Scala and Spark DataFrames
  • Creating DataFrames
  • Grouping and filtering on DataFrames
  • Joining DataFrames
  • Working with JSON files
  • Summary of Scala and Spark DataFrames
Conclusion
  • Review of Scala for data science

Taught by

Dan Sullivan

Related Courses

Address Business Issues with Data Science
CertNexus via Coursera
Advanced Clinical Data Science
University of Colorado System via Coursera
Advanced Data Science Capstone
IBM via Coursera
Advanced Data Science with IBM
IBM via Coursera
Advanced Deep Learning Methods for Healthcare
University of Illinois at Urbana-Champaign via Coursera