Text Preprocessing
Offered By: Codecademy
Course Description
Overview
Learn to clean text with Python 3 using regular expressions (regex) and NLTK.
Text is everywhere, and knowing how to clean it will transform your data science skillset. Many in the industry estimate that 80% of data science is data cleaning, including text preprocessing. Transforming text into usable data requires specialized tools and techniques. This course introduces text cleaning with Python 3 using regular expressions (regex) and NLTK.
Syllabus
- Introduction to Regular Expressions: Get a taste of regular expressions (regex), a powerful search pattern language to quickly find the text you're looking for.
  - Lesson: Introduction to Regular Expressions
  - Quiz: Introduction to Regular Expressions
- Text Preprocessing: Before most natural language processing tasks, it's necessary to clean up the text data using text preprocessing techniques.
  - Lesson: Text Preprocessing
  - External Resource: Natural Language Processing with Python, Chapter 3
  - Article: NLTK with Python 3 for Natural Language Processing
  - Quiz: Text Preprocessing
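To give a feel for the two syllabus topics, here is a minimal sketch of a regex search followed by a basic cleaning pass. It is not taken from the course materials: the function names `clean_text` and `tokenize` and the sample sentence are illustrative assumptions, and it uses only Python's standard `re` module rather than NLTK, which the course itself covers for tokenization and related steps.

```python
import re


def clean_text(text):
    """Lowercase, strip HTML-like tags and punctuation, collapse whitespace.

    Hypothetical helper for illustration; not part of the course or of NLTK.
    """
    text = text.lower()
    text = re.sub(r"<[^>]+>", " ", text)      # drop tags such as <p> or </p>
    text = re.sub(r"[^a-z0-9\s]", " ", text)  # replace punctuation and symbols with spaces
    text = re.sub(r"\s+", " ", text)          # collapse runs of whitespace
    return text.strip()


def tokenize(text):
    """Split cleaned text into word tokens on whitespace (a naive tokenizer)."""
    return clean_text(text).split()


sample = "<p>Text is EVERYWHERE, and cleaning it matters!</p>"

# Regex as a search pattern: find every word that starts with "clean".
matches = re.findall(r"\bclean\w*", sample.lower())

# Regex as a cleaning tool: normalize the text, then split it into tokens.
tokens = tokenize(sample)

print(matches)
print(tokens)
```

Whitespace splitting is the simplest possible tokenizer; a library tokenizer such as the ones in NLTK would typically handle contractions and punctuation more carefully.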
Taught by
Kenny Lin
Related Courses
- Data Wrangling with MongoDB (MongoDB via Udacity)
- Getting and Cleaning Data (Johns Hopkins University via Coursera)
- 软件包在流行病学研究中的应用 / Using software apps in epidemiological research (Peking University via Coursera)
- Creating an Analytical Dataset (Udacity)
- Implementing ETL with SQL Server Integration Services (Microsoft via edX)