Complete Guide to Python for Data Engineering: From Beginner to Advanced
Offered By: LinkedIn Learning
Course Description
Overview
Practice fundamental skills using Python for data engineering in this hands-on, interactive course with coding challenges in CoderPad.
Syllabus
Introduction
- Welcome to the course
- What you should know
- CoderPad tour
- Introduction to Python and data engineering
- Setting up your Python environment
- Explore the Google Colab worksheet
- Variables and data types
- Operators and expressions
- Control structures
- Functions
- Modules and packages
- String manipulation
- Error handling
- Solution: Conditions
- Collection overview
- Python collections: Tuples
- Python collections: Lists
- Python collections: Sets
- Python collections: Dictionaries
- Solution: Collections
- File I/O overview
- Working with CSV files
- Working with JSON files
- Solution: File handling
- Introduction to pandas
- Read files as DataFrames
- Data cleaning and preprocessing
- Data manipulation and aggregation
- Data visualization
- Write a DataFrame to a file
- Solution: pandas
- Introduction to NumPy
- Array creation and attributes
- Array operations
- Indexing and slicing
- Linear algebra and statistics
- Write a NumPy array to a file
- Solution: NumPy
- Understanding classes and objects
- Implementation: Classes and objects in Python
- Understand OOP features: Abstraction, inheritance, and more
- Solution: OOP
- Tips to write efficient Python code
- What is ETL in the data engineering world?
- Understand PySpark for data engineering
- What is Hadoop
- Importance of visualization tools in data engineering
- On-premises vs. cloud data engineering
- HTML basics
- HTML parents, children, and descendants
- Understand web scraping
- BeautifulSoup basics
- Installing BeautifulSoup
- Get HTML from a web page
- Scrape the web page
- Export data as a TXT file
- Generators in Python
- Python generator classes and iterators
- Iterables in Python
- filter() and map() functions
- any() and all() functions in Python
- What is logging?
- Custom logging
- Logging best practices
- Capstone Project: Retail sales analysis
- Solution: Capstone project
- Next steps
Taught by
Deepak Goyal
Related Courses
Build a Simple Dynamic Site with Node.js Course (How To)Treehouse Introducción a la programación en Python I: Aprendiendo a programar con Python
Pontificia Universidad Católica de Chile via Coursera Excel/VBA for Creative Problem Solving, Part 2
University of Colorado Boulder via Coursera Python Functions, Files, and Dictionaries
University of Michigan via Coursera Estructuras de datos en Python
Universidad Austral via Coursera