ETL and ELT in Python
Offered By: DataCamp
Course Description
Overview
Learn to build effective, performant, and reliable data pipelines using Extract, Transform, and Load principles.
Data pipelines are at the foundation of every strong data platform. Building these pipelines is an essential skill for data engineers, who provide incredible value to a business ready to step into a data-driven future. This introductory course will help you hone the skills to build effective, performant, and reliable data pipelines.
Data pipelines are at the foundation of every strong data platform. Building these pipelines is an essential skill for data engineers, who provide incredible value to a business ready to step into a data-driven future. This introductory course will help you hone the skills to build effective, performant, and reliable data pipelines.
Syllabus
- Introduction to Data Pipelines
- Get ready to discover how data is collected, processed, and moved using data pipelines. You will explore the qualities of the best data pipelines, and prepare to design and build your own.
- Building ETL Pipelines
- Dive into leveraging pandas to extract, transform, and load data as you build your first data pipelines. Learn how to make your ETL logic reusable, and apply logging and exception handling to your pipelines.
- Advanced ETL Techniques
- Supercharge your workflow with advanced data pipelining techniques, such as working with non-tabular data and persisting DataFrames to SQL databases. Discover tooling to tackle advanced transformations with pandas, and uncover best-practices for working with complex data.
- Deploying and Maintaining a Data Pipeline
- In this final chapter, you’ll create frameworks to validate and test data pipelines before shipping them into production. After you’ve tested your pipeline, you’ll explore techniques to run your data pipeline end-to-end, all while allowing for visibility into pipeline performance.
Taught by
Jake Roach
Related Courses
Julia Scientific ProgrammingUniversity of Cape Town via Coursera Spark
Udacity AI Workflow: Enterprise Model Deployment
IBM via Coursera Apache Spark with Scala - Hands On with Big Data!
Udemy Taming Big Data with Apache Spark and Python - Hands On!
Udemy