YoVDO

Web Scraping with Python

Offered By: LinkedIn Learning

Tags

Scrapy Courses Python Courses Selenium Courses Web Scraping Courses Data Extraction Courses Browser Automation Courses

Course Description

Overview

This course dives into the process, strategies, and best practices of web scraping. Learn how to use the Python framework, Scrapy, to practice key techniques.

Syllabus

Introduction
  • How to learn to stop worrying and love the bot
  • What you should know
1. Basic Web Scraping
  • What is web scraping?
  • How the internet works: A brief summary
  • Hello world with Scrapy
  • Challenge: Scraping all data on a page
  • Solution: Scraping all data on a page
2. Learning to Crawl
  • Crawling a website
  • Recording data
  • Scrapy settings file
  • Structuring your scrapers for extensibility/reusability
  • Challenge: Scraping news sites
  • Solution: Scraping news sites
3. Advanced Techniques
  • Submitting a form
  • Finding and using hidden APIs
  • Sitemaps and robots.txt
  • Challenge: Using CNN's sitemap
  • Solution: Using CNN's sitemap
4. Acting Human
  • Logging in
  • Browser automation with Selenium
  • Interacting with a page
Conclusion
  • Next steps

Taught by

Ryan Mitchell

Related Courses

Web Scraping in Python
DataCamp
Crawling the Web with Python and Scrapy
Pluralsight
Exploring Web Scraping with Python
Pluralsight
Scraping Your First Web Page with Python
Pluralsight
Extracting Structured Data from the Web Using Scrapy
Pluralsight