YoVDO

Scrapy Course – Python Web Scraping for Beginners

Offered By: freeCodeCamp

Tags

Scrapy Courses Python Courses Web Scraping Courses Data Cleaning Courses Cloud Deployment Courses

Course Description

Overview

Master web scraping with Python Scrapy in this comprehensive 4-5 hour course. Learn to create Scrapy spiders, crawl websites, extract data, and clean it using Items and Item Pipelines. Discover techniques for saving data to CSV files, MySQL, and Postgres databases. Explore advanced concepts like using fake user-agents and headers to avoid blocking, and implementing proxies for scalable scraping. Gain practical skills in deploying scrapers to the cloud and scheduling periodic runs. Access valuable resources including Scrapy documentation, course guides, and GitHub repositories. Progress through hands-on lessons covering project setup, spider creation, data extraction, cleaning, storage, and cloud deployment using various platforms like Scrapyd, ScrapeOps, and Scrapy Cloud. By the end, you'll have the knowledge to confidently build and deploy web scraping projects at scale.

Syllabus

⌨️ Part 1 - Scrapy & Course Introduction
⌨️ Part 2 - Setup Virtual Env & Scrapy
⌨️ Part 3 - Creating a Scrapy Project
⌨️ Part 4 - Build your First Scrapy Spider
⌨️ Part 5 - Build Discovery & Extraction Spider
⌨️ Part 6 - Cleaning Data with Item Pipelines
⌨️ Part 7 - Saving Data to Files & Databases
⌨️ Part 8 - Fake User-Agents & Browser Headers
⌨️ Part 9 - Rotating Proxies & Proxy APIs
⌨️ Part 10 - Run Spiders in Cloud with Scrapyd
⌨️ Part 11 - Run Spiders in Cloud with ScrapeOps
⌨️ Part 12 - Run Spiders in Cloud with Scrapy Cloud
⌨️ Part 13 - Conclusion & Next Steps


Taught by

freeCodeCamp.org

Related Courses

Programming Cloud Services for Android Handheld Systems
Vanderbilt University via Coursera
SAP S/4HANA in a Nutshell
SAP Learning
Transformation to Hybrid Landscapes
SAP Learning
Ruby on Rails: An Introduction
Johns Hopkins University via Coursera
Capstone: Photo Tourist Web Application
Johns Hopkins University via Coursera