Streaming Data Pipelines From Supernovas to LLMs
Offered By: Databricks via YouTube
Course Description
Overview
Embark on a hands-on journey through streaming data pipelines in this 50-minute tutorial from Databricks. Dive into a comprehensive use case utilizing the Databricks Intelligence Platform, focusing on data engineering techniques. Learn to analyze real-time data from collapsing supernovas emitting gamma-ray bursts, provided by NASA's GCN project. Discover how to ingest data from message buses and choose between Delta Live Tables, DBSQL, or Databricks Workflows for stream processing. Master ETL pipeline coding in SQL, including Kafka ingestion. Explore Databricks Data Rooms for natural language analytics and compare it to notebook streaming data into a Vector Database for open-source LLMs with RAG. Ideal for data engineers, code-savvy data architects, genAI enthusiasts, and astronomy lovers, this session teaches when and how to use various Databricks products. Replicate the demo at home and gain valuable insights into streaming data pipelines from supernovas to large language models.
Syllabus
Streaming Data Pipelines From Supernovas to LLMs
Taught by
Databricks
Related Courses
Google Cloud Platform Big Data and Machine Learning Fundamentals em Português BrasileiroGoogle Cloud via Coursera Data Engineering on Google Cloud Platform em Português Brasileiro
Google Cloud via Coursera Handling Streaming Data with GCP Dataflow
Pluralsight Developing Microsoft Azure Intelligent Edge Solutions
Pluralsight Implementing an Azure Databricks Environment in Microsoft Azure
Pluralsight