YoVDO

Web Applications and Command-Line Tools for Data Engineering

Offered By: Duke University via Coursera

Tags

Web Development Courses Machine Learning Courses Python Courses Cloud Computing Courses SQL Courses Bash Courses Microservices Courses Jupyter Notebooks Courses FastAPI Courses Data Engineering Courses

Course Description

Overview

In this fourth course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will build upon the data engineering concepts introduced in the first three courses to apply Python, Bash and SQL techniques in tackling real-world problems. First, we will dive deeper into leveraging Jupyter notebooks to create and deploy models for machine learning tasks. Then, we will explore how to use Python microservices to break up your data warehouse into small, portable solutions that can scale. Finally, you will build a powerful command-line tool to automate testing and quality control for publishing and sharing your tool with a data registry.

Syllabus

  • Jupyter Notebooks
    • In this module, you will learn how to install and run Jupyter on your local machine. Additionally, you will explore strategies to use code and text cells in a Jupyter notebook.
  • Cloud-Hosted Notebooks
    • In this module, you will learn how to create and use a Cloud-based notebook in Google Colab and AWS Sagemaker.
  • Python Microservices
    • In this module, you will learn how to build a Python Microservice with FastAPI and deploy a containerized machine learning Microservice for data engineering.
  • Python Packaging and Rust Command Line Tools
    • In this module, you will learn how to organize a Python project so you can build a powerful command-line tool. You will use Click, a useful command-line tool framework to enhance your tool. Finally, you will automate testing and quality control for publishing and sharing your tool with a registry.

Taught by

Noah Gift, Kennedy Behrman and Alfredo Deza

Tags

Related Courses

内存数据库管理
openHPI
CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Processing Big Data with Azure Data Lake Analytics
Microsoft via edX
Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera
Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera