Scala Essential Training for Data Science
Offered By: LinkedIn Learning
Course Description
Overview
Use Scala in your data science work. Explore the Scala features most useful to data scientists, including custom functions, parallel processing, and programming Spark with Scala.
Syllabus
Introduction
- Welcome
- What you should know
- Using the exercise files
- The advantages of Scala for data science
- Installing Scala
- Scala data types
- Scala collections
- Scala sets Scala arrays, vectors, and ranges
- Scala maps
- Scala expressions
- Scala functions
- Scala objects
- Advantages of parallel collections
- Creating parallel collections
- Mapping functions over parallel collections
- Filtering parallel collections
- When and when not to use parallel collections
- Installing PostgreSQL
- Loading data into PostgreSQL
- Connecting to PostgreSQL
- Querying with SQL strings
- Querying with prepared statements
- Summary of SQL in Scala
- Introduction to Spark
- Installing Spark
- Getting Started with Spark RDDs
- Mapping Functions over RDDs
- Statistics over RDDs
- Summary of Scala and Spark RDDs
- Creating DataFrames
- Grouping and filtering on DataFrames
- Joining DataFrames
- Working with JSON files
- Summary of Scala and Spark DataFrames
- Review of Scala for data science
Taught by
Dan Sullivan
Related Courses
Amazon Elastic File System (EFS) Performance (Traditional Chinese)Amazon Web Services via AWS Skill Builder Amazon Simple Storage Service (Amazon S3) Performance Optimization (Traditional Chinese)
Amazon Web Services via AWS Skill Builder Amazon Simple Storage Service (Amazon S3) Performance Optimization (Traditional Chinese)
Amazon Web Services via AWS Skill Builder Building with Amazon Aurora Databases (Simplified Chinese)
Amazon Web Services via AWS Skill Builder Advanced .NET with TPL & PLINQ: Conducting Performance Boost
Coursera Project Network via Coursera