YoVDO

Apache Spark (TM) SQL for Data Analysts

Offered By: Databricks via Coursera

Tags

Big Data Courses, Data Analysis Courses, SQL Courses, Apache Spark Courses, Big Data Analytics Courses, Complex Queries Courses, Delta Lake Courses

Course Description

Overview

Apache Spark is one of the most widely used technologies in big data analytics. In this course, you will learn how to leverage your existing SQL skills to start working with Spark immediately. You will also learn how to work with Delta Lake, a highly performant, open-source storage layer that brings reliability to data lakes. By the end of this course, you will be able to use Spark SQL and Delta Lake to ingest, transform, and query data to extract valuable insights that can be shared with your team.

Syllabus

  • Welcome to Apache Spark SQL for Data Analysts
    • An introduction to this course, including learning objectives, frequently asked questions, and a chance to get to know your classmates.
  • Spark makes big data easy
  • Using Spark SQL on Databricks
  • Spark Under the Hood
  • Complex Queries
  • Applied Spark SQL
  • Data Storage and Optimization
  • Delta Lake with Spark SQL
  • SQL Coding Challenges

Taught by

Kate Sullivan

Related Courses

  • CS115x: Advanced Apache Spark for Data Science and Data Engineering
    • University of California, Berkeley via edX
  • Big Data Analytics
    • University of Adelaide via edX
  • Big Data Essentials: HDFS, MapReduce and Spark RDD
    • Yandex via Coursera
  • Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
    • Yandex via Coursera
  • Introduction to Apache Spark and AWS
    • University of London International Programmes via Coursera