Complete Guide to Python for Data Engineering: From Beginner to Advanced
Offered By: LinkedIn Learning
Course Description
Overview
Practice fundamental skills using Python for data engineering in this hands-on, interactive course with coding challenges in CoderPad.
Syllabus
Introduction
- Welcome to the course
- What you should know
- CoderPad tour
- Introduction to Python and data engineering
- Setting up your Python environment
- Explore the Google Colab worksheet
- Variables and data types
- Operators and expressions
- Control structures
- Functions
- Modules and packages
- String manipulation
- Error handling
- Solution: Conditions
- Collection overview
- Python collections: Tuples
- Python collections: Lists
- Python collections: Sets
- Python collections: Dictionaries
- Solution: Collections
- File I/O overview
- Working with CSV files
- Working with JSON files
- Solution: File handling
- Introduction to pandas
- Read files as DataFrames
- Data cleaning and preprocessing
- Data manipulation and aggregation
- Data visualization
- Write a DataFrame to a file
- Solution: pandas
- Introduction to NumPy
- Array creation and attributes
- Array operations
- Indexing and slicing
- Linear algebra and statistics
- Write a NumPy array to a file
- Solution: NumPy
- Understanding classes and objects
- Implementation: Classes and objects in Python
- Understand OOP features: Abstraction, inheritance, and more
- Solution: OOP
- Tips to write efficient Python code
- What is ETL in the data engineering world?
- Understand PySpark for data engineering
- What is Hadoop
- Importance of visualization tools in data engineering
- On-premises vs. cloud data engineering
- HTML basics
- HTML parents, children, and descendants
- Understand web scraping
- BeautifulSoup basics
- Installing BeautifulSoup
- Get HTML from a web page
- Scrape the web page
- Export data as a TXT file
- Generators in Python
- Python generator classes and iterators
- Iterables in Python
- filter() and map() functions
- any() and all() functions in Python
- What is logging?
- Custom logging
- Logging best practices
- Capstone Project: Retail sales analysis
- Solution: Capstone project
- Next steps
Taught by
Deepak Goyal
Related Courses
Computational Investing, Part IGeorgia Institute of Technology via Coursera Введение в машинное обучение
Higher School of Economics via Coursera Математика и Python для анализа данных
Moscow Institute of Physics and Technology via Coursera Introduction to Python for Data Science
Microsoft via edX Python for Data Science
University of California, San Diego via edX