Web Applications and Command-Line Tools for Data Engineering
Offered By: Duke University via Coursera
Course Description
Overview
In this fourth course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will build upon the data engineering concepts introduced in the first three courses to apply Python, Bash and SQL techniques in tackling real-world problems. First, we will dive deeper into leveraging Jupyter notebooks to create and deploy models for machine learning tasks. Then, we will explore how to use Python microservices to break up your data warehouse into small, portable solutions that can scale. Finally, you will build a powerful command-line tool to automate testing and quality control for publishing and sharing your tool with a data registry.
Syllabus
- Jupyter Notebooks
- In this module, you will learn how to install and run Jupyter on your local machine. Additionally, you will explore strategies to use code and text cells in a Jupyter notebook.
- Cloud-Hosted Notebooks
- In this module, you will learn how to create and use a Cloud-based notebook in Google Colab and AWS Sagemaker.
- Python Microservices
- In this module, you will learn how to build a Python Microservice with FastAPI and deploy a containerized machine learning Microservice for data engineering.
- Python Packaging and Rust Command Line Tools
- In this module, you will learn how to organize a Python project so you can build a powerful command-line tool. You will use Click, a useful command-line tool framework to enhance your tool. Finally, you will automate testing and quality control for publishing and sharing your tool with a registry.
Taught by
Noah Gift, Kennedy Behrman and Alfredo Deza
Tags
Related Courses
内存数据库管理openHPI CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX Processing Big Data with Azure Data Lake Analytics
Microsoft via edX Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera