YoVDO

Comprehensive Python Beautiful Soup Web Scraping Tutorial - Find-Find All, CSS Select, Scrape Table

Offered By: Keith Galli via YouTube

Tags

Python Courses, Computer Science Courses, Regular Expressions Courses, pandas Courses, Web Scraping Courses, HTML Courses, Data Extraction Courses, Beautiful Soup Courses

Course Description

Overview

Learn web scraping in Python using the Beautiful Soup library in this comprehensive tutorial video. Start with an introduction to HTML, CSS, and web scraping basics before diving into the fundamentals of Beautiful Soup. Master essential commands like find and find_all, learn to extract strings from HTML elements, and explore CSS selectors. Practice your skills through hands-on exercises, including scraping links, extracting tables into Pandas DataFrames, and downloading web images. Gain practical experience in navigating HTML structure, working with regular expressions, and solving real-world web scraping challenges.
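
As a rough illustration of the commands named above, here is a minimal sketch that is not taken from the video. The HTML snippet, the URLs, and the variable names are assumptions made up for this example, and the built-in "html.parser" is used so that no extra parser needs to be installed.

```python
from bs4 import BeautifulSoup

# Hypothetical page content; the course works with its own example pages.
html = """
<html>
  <body>
    <h1 id="title">Sample Page</h1>
    <div class="links">
      <a href="https://example.com/a">First</a>
      <a href="https://example.com/b">Second</a>
    </div>
    <p class="note">A <b>bold</b> note.</p>
  </body>
</html>
"""

soup = BeautifulSoup(html, "html.parser")

# find returns the first matching element; find_all returns every match.
first_link = soup.find("a")
all_links = soup.find_all("a")

# select takes a CSS selector: here, <a> tags directly inside div.links.
css_links = soup.select("div.links > a")

# .string works when an element holds a single piece of text;
# get_text() also gathers text from nested tags such as <b>.
title_text = soup.find("h1").string
note_text = soup.find("p", class_="note").get_text()

# Attributes (href, id, ...) are read like dictionary keys.
hrefs = [a["href"] for a in all_links]
title_id = soup.find("h1")["id"]

print(title_text, note_text, hrefs, title_id)
```

For a live page, the HTML string would typically come from something like requests.get(url).text instead of an inline literal.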

Syllabus

- Intro & Video Overview
- What is web scraping?
- Introduction to HTML
Using the Beautiful Soup library
- Loading in a webpage (requests library)
- Starting to scrape
- find & find_all methods
- Finding specific text/strings in our HTML (regex)
- Select method (CSS path selections)
- Grabbing the string/text from an HTML element
- Getting a property of an HTML element (href, src, id, class, etc.)
- Code navigation (parents, children, siblings)
Let’s practice our skills!
- Exercise #1: Grab all social links on webpage in 3 different ways
- Exercise #2: Scrape an HTML table into a pandas DataFrame
- Exercise #3: Grab all fun facts that contain the word “is”
- Exercise #4: Use Beautiful Soup to help download an image from a webpage
- Exercise #5: Solve the mystery challenge!!!
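
The regex, navigation, and table topics in the syllabus above might look something like the following sketch. It is not the course's solution code: the table, the list of facts, and all names are hypothetical stand-ins for the pages used in the exercises.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the exercise pages.
html = """
<table id="scores">
  <tr><th>Name</th><th>Score</th></tr>
  <tr><td>Ada</td><td>90</td></tr>
  <tr><td>Linus</td><td>85</td></tr>
</table>
<ul id="facts">
  <li>Python is popular.</li>
  <li>Owls hunt at night.</li>
  <li>This list is short.</li>
</ul>
"""

soup = BeautifulSoup(html, "html.parser")

# Table scraping: read the header cells, then build one dict per data row.
table = soup.find("table", id="scores")
headers = [th.get_text(strip=True) for th in table.find_all("th")]
rows = []
for tr in table.find_all("tr")[1:]:  # skip the header row
    cells = [td.get_text(strip=True) for td in tr.find_all("td")]
    rows.append(dict(zip(headers, cells)))

# Regex search: keep <li> elements whose text contains the whole word "is"
# (the \b boundaries avoid matching the "is" inside "This" or "list").
is_word = re.compile(r"\bis\b")
facts = [li.get_text() for li in soup.find_all("li", string=is_word)]

# Navigation: move from an element to its parent and to its next sibling.
first_fact = soup.find("li")
parent_id = first_fact.parent["id"]
next_fact = first_fact.find_next_sibling("li").get_text()

print(rows, facts, parent_id, next_fact)
```

Passing rows to pandas.DataFrame(rows) would turn the list of dicts into a DataFrame with Name and Score columns; pandas is left out here only to keep the sketch dependency-light.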


Taught by

Keith Galli

Related Courses

Data Processing Using Python
Nanjing University via Coursera
Building Web Services in Python (Создание Web-сервисов на Python)
Moscow Institute of Physics and Technology via Coursera
100 Days of Code: The Complete Python Pro Bootcamp
Udemy
Exploring Web Scraping with Python
Pluralsight
Python Project for Data Science
IBM via Coursera