Azure Spark Databricks Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Learn best practices, patterns, and processes for developers and DevOps teams who want to design and implement data processing using Azure Databricks.
Syllabus
Introduction
- Optimize data pipelines
- What you should know
- About using cloud services
- Meet Databricks Apache Spark clusters
- Business scenarios for Spark
- Understand Spark key components
- Azure Databricks concepts
- Quick start: Use a notebook
- Review Databricks Azure cluster setup
- Use a Python notebook with dashboards
- Use an R notebook
- Use a Scala notebook for visualization
- Use a notebook with scikit-learn
- Use a Spark Streaming notebook
- Use an external Scala library: variant-spark
- Understand data engineering workload steps
- Understand cluster configurations
- Understand Spark job execution overhead
- Explore optimization control planes
- Optimize a cluster and job
- Run a production-size job
- Use Databricks jobs and role-based control
- Use Databricks Runtime ML
- Understand ML Pipelines API
- Use ML Pipelines API
- Use distributed ML training
- Understand Databricks Delta
- Use Databricks Delta
- Use Azure Blob storage
- Understand MLflow
- Azure Databricks pipeline considerations
- Azure Databricks for data warehousing
- Azure Databricks and machine learning
- Azure Databricks for churn analysis
- Azure Databricks for intrusion detection
- Next steps
Taught by
Lynn Langit
Related Courses
Coding the Matrix: Linear Algebra through Computer Science ApplicationsBrown University via Coursera كيف تفكر الآلات - مقدمة في تقنيات الحوسبة
King Fahd University of Petroleum and Minerals via Rwaq (رواق) Datascience et Analyse situationnelle : dans les coulisses du Big Data
IONIS via IONIS Data Lakes for Big Data
EdCast 統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco