Intro to Web Scraping with Python and Beautiful Soup
Offered By: Data Science Dojo via YouTube
Course Description
Overview
Learn web scraping with Python and Beautiful Soup in this 34-minute tutorial. The video shows how to parse web pages into data files with Beautiful Soup, using graphics card listings on Newegg.com as the worked example. You set up Anaconda, install the required packages, then follow step-by-step instructions to build a scraper, test it, and write the results to a CSV file. Along the way you practice retrieving web pages, inspecting their HTML, converting listings into line items, and using functions such as "findAll". The tutorial is aimed at data professionals who want to add web scraping to their data collection toolkit.
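To give a feel for the parsing step described above, here is a minimal sketch and not the tutorial's own code. It assumes the `bs4` package is installed and runs against a small inline HTML sample; the sample markup, the class names (`item-container`, `item-title`, `price`), and the `parse_listings` helper are all hypothetical and do not reflect Newegg's real page structure. It calls `find_all`, the current name for the method that older Beautiful Soup code spells `findAll`.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for a product listing page. The class names are
# invented for illustration and are NOT taken from any real site.
SAMPLE_HTML = """
<div class="item-container">
  <a class="item-title">Example GPU A</a>
  <span class="price">$199.99</span>
</div>
<div class="item-container">
  <a class="item-title">Example GPU B</a>
  <span class="price">$349.00</span>
</div>
"""


def parse_listings(html):
    """Return one dict (a "line item") per listing container in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    # find_all is the current spelling; findAll is the older alias.
    for container in soup.find_all("div", {"class": "item-container"}):
        title = container.find("a", {"class": "item-title"})
        price = container.find("span", {"class": "price"})
        rows.append({
            # Fall back to an empty string if a tag is missing.
            "title": title.text.strip() if title else "",
            "price": price.text.strip() if price else "",
        })
    return rows


if __name__ == "__main__":
    for row in parse_listings(SAMPLE_HTML):
        print(row)
```

Working from an inline sample keeps the sketch runnable offline; on a real page the tag names and attributes would have to be read from the page's actual HTML first.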
Syllabus
- Introduction
- Setting up Anaconda
- Installing Beautiful Soup
- Setting up urllib
- Retrieving the Web Page
- Evaluating the Web Page
- Converting Listings into Line Items
- Using JS Beautifier
- Reading Raw HTML for Items to Scrape
- Building the Scraper
- Using the "findAll" Function
- Testing the Scraper
- Creating the .csv File
- End Result
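The retrieval and CSV steps in the syllabus can be sketched with the Python standard library alone. This is an illustrative outline under stated assumptions, not the code shown in the video: the helper names `fetch_html` and `write_csv`, the column names, and the sample row are all hypothetical. `fetch_html` needs network access and is defined but not called here.

```python
import csv
import io
from urllib.request import urlopen


def fetch_html(url):
    """Download a page and return its HTML as text (requires network access)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def write_csv(rows, out_file, fieldnames=("brand", "product_name", "shipping")):
    """Write a header line followed by one CSV line per row dict."""
    writer = csv.DictWriter(out_file, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    # Hypothetical scraped rows; a real scraper would build these from the HTML.
    rows = [
        {"brand": "ExampleBrand",
         "product_name": "Example GPU, 8GB",
         "shipping": "Free"},
    ]
    buffer = io.StringIO()  # in-memory file, so nothing is written to disk
    write_csv(rows, buffer)
    print(buffer.getvalue())
```

Using the `csv` module rather than joining strings by hand means a comma inside a product name is quoted automatically, so the columns stay aligned. To write a real file, pass `open("products.csv", "w", newline="", encoding="utf-8")` instead of the in-memory buffer.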
Taught by
Data Science Dojo
Related Courses
- Excel 2010 (Miríadax)
- Intro to Data Science (Udacity)
- Data Manipulation at Scale: Systems and Algorithms (University of Washington via Coursera)
- Statistical Computing with R - a gentle introduction (University College London via Independent)
- Introducción a Data Science: Programación Estadística con R (Universidad Nacional Autónoma de México via Coursera)