Beyond Scraping
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore advanced web scraping techniques in this 42-minute EuroPython Conference talk. Learn how to build a simple, evolving client-server architecture that combines ZeroMQ, Selenium, and BeautifulSoup to extract data from dynamic, JavaScript-driven websites such as Sporcle and Khan Academy. Discover how to run regular "downloads" without cluttering your desktop or headless server, and how to scrape anonymously. See how this setup copes with variable content and complex login processes, and how it can significantly reduce debugging time. The focus is on writing robust code that withstands website design changes, so that data can be extracted efficiently even from complex web environments.
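As a rough illustration of the kind of client-server split described above, the sketch below keeps one long-lived Selenium browser in a server process and lets short-lived clients drive it over a ZeroMQ REQ/REP socket, parsing the returned HTML with BeautifulSoup. It is not the speaker's code: the function names (`handle_request`, `serve`, `ask`, `link_texts`), the JSON command format, the Firefox driver, and the `tcp://127.0.0.1:5555` endpoint are all assumptions made for this example.

```python
import json


def handle_request(driver, request):
    """Apply one client command to the browser; always return a reply dict."""
    command = request.get("cmd")
    try:
        if command == "get":
            driver.get(request["url"])  # navigate the already-open browser
            return {"ok": True, "url": driver.current_url}
        if command == "source":
            # page_source reflects the DOM after JavaScript has run
            return {"ok": True, "html": driver.page_source}
    except Exception as exc:  # REP must answer every request, even on failure
        return {"ok": False, "error": str(exc)}
    return {"ok": False, "error": f"unknown command: {command!r}"}


def serve(endpoint="tcp://127.0.0.1:5555"):
    """Server: owns the single browser session (hypothetical endpoint)."""
    import zmq
    from selenium import webdriver

    driver = webdriver.Firefox()  # assumption: Firefox and its driver are installed
    socket = zmq.Context.instance().socket(zmq.REP)
    socket.bind(endpoint)
    try:
        while True:
            request = json.loads(socket.recv_string())
            if request.get("cmd") == "quit":
                socket.send_string(json.dumps({"ok": True}))
                break
            socket.send_string(json.dumps(handle_request(driver, request)))
    finally:
        driver.quit()
        socket.close()


def ask(request, endpoint="tcp://127.0.0.1:5555"):
    """Client: send one command to the server and wait for its reply."""
    import zmq

    socket = zmq.Context.instance().socket(zmq.REQ)
    socket.connect(endpoint)
    try:
        socket.send_string(json.dumps(request))
        return json.loads(socket.recv_string())
    finally:
        socket.close()


def link_texts(html):
    """Client-side parsing: collect the visible text of every link."""
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    return [a.get_text(strip=True) for a in soup.find_all("a")]
```

With `serve()` running in one terminal, a client script could call `ask({"cmd": "get", "url": ...})` followed by `link_texts(ask({"cmd": "source"})["html"])`. Because the browser (and any login it has completed) stays alive in the server, the parsing code on the client side can be edited and rerun without restarting the session, which is one plausible reading of how such a setup shortens debugging.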
Syllabus
Anthon van der Neut - Beyond scraping
Taught by
EuroPython Conference
Related Courses
Web and Mobile Testing with Selenium - University of Minnesota via Coursera
Create Your First Automation Script Using Selenium and Java - Coursera Project Network via Coursera
Automate an e-commerce web application using Selenium & Java - Coursera Project Network via Coursera
Advanced TestNG Framework and Integration with Selenium - Coursera Project Network via Coursera
Behavior Driven Development with Selenium and Cucumber - Coursera Project Network via Coursera