Streaming Data Pipelines From Supernovas to LLMs
Offered By: Databricks via YouTube
Course Description
Overview
Embark on a hands-on journey through streaming data pipelines in this 50-minute tutorial from Databricks. Dive into a comprehensive use case utilizing the Databricks Intelligence Platform, focusing on data engineering techniques. Learn to analyze real-time data from collapsing supernovas emitting gamma-ray bursts, provided by NASA's GCN project. Discover how to ingest data from message buses and choose between Delta Live Tables, DBSQL, or Databricks Workflows for stream processing. Master ETL pipeline coding in SQL, including Kafka ingestion. Explore Databricks Data Rooms for natural language analytics and compare it to notebook streaming data into a Vector Database for open-source LLMs with RAG. Ideal for data engineers, code-savvy data architects, genAI enthusiasts, and astronomy lovers, this session teaches when and how to use various Databricks products. Replicate the demo at home and gain valuable insights into streaming data pipelines from supernovas to large language models.
Syllabus
Streaming Data Pipelines From Supernovas to LLMs
Taught by
Databricks
Related Courses
Delta Live Tables: Modern Software Engineering for ETL PipelinesDatabricks via YouTube Databricks Concepts
DataCamp Databricks Certified Data Engineer Associate Cert Prep: 4 Production Pipelines
LinkedIn Learning Implement a data engineering solution with Azure Databricks
Microsoft via Microsoft Learn Data Engineering on the Data Intelligence Platform - A Comprehensive Guide
Databricks via YouTube