Beyond Scraping
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore advanced web scraping techniques in this 42-minute EuroPython Conference talk. Learn how to create a simple, evolving client-server architecture combining ZeroMQ, Selenium, and BeautifulSoup to extract data from dynamic, JavaScript-driven websites like Sporcle and Khan Academy. Discover methods for implementing regular "downloads" without cluttering your desktop or headless server, and how to perform scraping anonymously. Gain insights into overcoming challenges posed by variable content and complex login processes, and understand how this setup can significantly reduce debugging time. Focus on writing robust code that withstands website design changes, enabling efficient data extraction from even the most complex web environments.
Syllabus
Anthon van der Neut - Beyond scraping
Taught by
EuroPython Conference
Related Courses
Programming LanguagesUniversity of Virginia via Udacity Building a Basic Website
University of Massachusetts Amherst via Independent iDESWEB, Introducción al desarrollo web
MirÃadax Web Engineering II: Developing Mobile HTML5 Apps
Technische Hochschule Mittelhessen via iversity Web Application Architectures
University of New Mexico via Coursera