Beyond Scraping
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore advanced web scraping techniques in this 42-minute EuroPython Conference talk. Learn how to create a simple, evolving client-server architecture combining ZeroMQ, Selenium, and BeautifulSoup to extract data from dynamic, JavaScript-driven websites like Sporcle and Khan Academy. Discover methods for implementing regular "downloads" without cluttering your desktop or headless server, and how to perform scraping anonymously. Gain insights into overcoming challenges posed by variable content and complex login processes, and understand how this setup can significantly reduce debugging time. Focus on writing robust code that withstands website design changes, enabling efficient data extraction from even the most complex web environments.
Syllabus
Anthon van der Neut - Beyond scraping
Taught by
EuroPython Conference
Related Courses
Developing Distributed Applications with C# and ZeroMQLinkedIn Learning ZeroMQ Crash Course
Hussein Nasser via YouTube In Curation We Trust - Generating Contextual and Actionable Threat Intelligence
BruCON Security Conference via YouTube Alda's Dynamic Relationship with Clojure
Strange Loop Conference via YouTube ZeroMQ Is the Answer
PHP UK Conference via YouTube