Data Storage and Queries

Offered By: DeepLearning.AI via Coursera

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

In this course, you will learn about the raw ingredients and processes that are used to physically store data on disk and in memory. You’ll explore different storage systems, including object, block, and file storage, as well as databases, that are built on top of these raw ingredients. You’ll also get a chance to use the Cypher language to query a Neo4j graph database, and perform vector similarity search, a key feature behind generative AI and large language models. You will explore the evolution of data storage abstractions, from data warehouses, to data lakes, and data lakehouses, while comparing the advantages and drawbacks of each architectural paradigm. With hands-on practice, you will design a simple data lake using Amazon Glue, and build a data lakehouse using AWS LakeFormation and Apache Iceberg. In the last week of this course, you’ll see how queries work behind the scenes, practice writing more advanced SQL queries, compare the query performance in row vs column-oriented storage, and perform streaming queries using Apache Flink.

Syllabus

Storage Ingredients and Storage Systems
Storage Abstractions
Queries

Taught by

Joe Reis

Data Storage and Queries

Tags

Course Description

Overview

Syllabus

Taught by

Tags

Related Courses

Data Storage and Queries

Tags

Course Description

Overview

Syllabus

Taught by

Tags

Related Courses

Login to Continue