Using Azure Databricks to Develop Scalable Data Solutions
Offered By: PASS Data Community Summit via YouTube
Course Description
Overview
Explore how to develop scalable data solutions using Azure Databricks in this comprehensive conference talk. Learn about connecting to various Azure data sources, processing large datasets with Apache Spark, and creating machine learning solutions. Discover the collaborative workspace features, integration with Power BI for visualization, and performance optimization techniques. Follow step-by-step demonstrations covering Databricks implementation, including resource creation, workspace navigation, cluster management, data analysis, job scheduling, and structured streaming. Gain insights into Databricks' history, pricing, and integration with tools like GitHub. Understand the benefits of using Databricks for processing data lakes, implementing machine learning workflows, and leveraging deep learning capabilities. By the end of this talk, acquire the knowledge needed to effectively utilize Azure Databricks for developing scalable and efficient data solutions.
Syllabus
Introduction
History of Databricks
Spark
Machine Learning
Machine Learning Monitoring
Analytics Tools
Managed Service
Processing
GitHub Integration
Managing Databricks
Pricing
Machine Learning Process
Databricks
Creating a Databricks resource
Databricks workspace
Creating a new cluster
Analyzing data
Using Databricks
Databricks Job
Databricks Notifications
Structured Streaming
Deep Learning
Jobs
Recap
Questions
Taught by
PASS Data Community Summit
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent