YoVDO

Text Preprocessing

Offered By: Codecademy

Tags

Data Science Courses, Python Courses, Regular Expressions Courses, Data Cleaning Courses, Natural Language Toolkit (NLTK) Courses, Text Preprocessing Courses

Course Description

Overview

Learn to clean text with Python 3 using regular expressions (regex) and NLTK.


Text is everywhere, and knowing how to clean it is a core data science skill. A common industry estimate holds that about 80% of data science work is data cleaning, which includes text preprocessing. Turning raw text into usable data requires specialized tools and techniques. This course introduces text cleaning with Python 3 using regular expressions (regex) and NLTK.


Syllabus

  • Introduction to Regular Expressions: Get a taste of regular expressions (regex), a powerful pattern language for quickly finding the text you're looking for.
    • Lesson: Introduction to Regular Expressions
    • Quiz: Introduction to Regular Expressions
  • Text Preprocessing: Before most natural language processing tasks, it's necessary to clean up the text data using text preprocessing techniques.
    • Lesson: Text Preprocessing
    • External Resource: Natural Language Processing with Python, Chapter 3
    • Article: NLTK with Python 3 for Natural Language Processing
    • Quiz: Text Preprocessing
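
As a rough illustration of the two syllabus topics, the sketch below uses regular expressions to clean a string and then applies simple preprocessing steps (lowercasing, tokenizing, stopword removal). It is not taken from the course materials: the sample text, the tiny stopword set, and the function names are all hypothetical. It relies only on Python's standard `re` module so that it runs without extra setup; NLTK provides fuller tools for the same steps, such as `nltk.word_tokenize` and the `nltk.corpus.stopwords` corpus, which require installing NLTK and downloading its data.

```python
import re
from typing import List

# Hypothetical sample text; not taken from the course materials.
RAW = "Contact us at Info@Example.com!!  Visit <b>our</b> site:   https://example.com/docs  "

# A tiny illustrative stopword set (an assumption; NLTK ships a fuller list).
STOPWORDS = {"a", "an", "at", "our", "the", "us"}


def clean_text(text: str) -> str:
    """Remove tags, URLs, and email-like tokens, then normalize whitespace and case."""
    text = re.sub(r"<[^>]+>", " ", text)        # drop HTML-style tags
    text = re.sub(r"https?://\S+", " ", text)   # drop URLs
    text = re.sub(r"\S+@\S+", " ", text)        # drop email-like tokens
    text = re.sub(r"[^A-Za-z\s]", " ", text)    # keep only letters and whitespace
    text = re.sub(r"\s+", " ", text).strip()    # collapse runs of whitespace
    return text.lower()


def tokenize(text: str) -> List[str]:
    """Split already-cleaned, lowercased text into alphabetic tokens."""
    return re.findall(r"[a-z]+", text)


def preprocess(text: str) -> List[str]:
    """Clean, tokenize, and filter out stopwords."""
    return [tok for tok in tokenize(clean_text(text)) if tok not in STOPWORDS]


if __name__ == "__main__":
    print(clean_text(RAW))   # contact us at visit our site
    print(preprocess(RAW))   # ['contact', 'visit', 'site']
```

The order of the substitutions matters in this sketch: URLs and email-like tokens are removed before punctuation is stripped, because stripping punctuation first would break them into ordinary-looking words.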

Taught by

Kenny Lin

Related Courses

  • Design of Computer Programs (Stanford University via Udacity)
  • Programming Languages (University of Virginia via Udacity)
  • Data Structures and Performance (University of California, San Diego via Coursera)
  • Introducción a Data Science: Programación Estadística con R (Universidad Nacional Autónoma de México via Coursera)
  • Applied Text Mining in Python (University of Michigan via Coursera)