YoVDO

Processing Text with Python Essential Training

Offered By: LinkedIn Learning

Tags

Python Courses Data Science Courses Text Mining Courses TF-IDF Courses

Course Description

Overview

Learn the essential techniques for cleansing and processing text in Python. Discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The need for text mining skills in data science
1. Text Mining
  • Text mining today
  • Document concepts
  • Corpus concepts
  • Introduction to the NLTK library
  • Setting up the environment
2. Reading Text
  • Reading raw files
  • Reading files with corpus reader
  • Exploring the corpus
  • Analyzing the corpus
3. Text Cleansing and Extraction
  • Tokenization
  • Cleansing text
  • Stop word removal
  • Stemming
  • Lemmatization
4. Advanced Text Processing
  • Building n-grams
  • Tagging parts of speech
  • Term frequency-inverse document frequency (TF-IDF)
  • Building a TF-IDF matrix
5. Best Practices
  • Storing text
  • Processing text data
  • Scalable processing of text data
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Data Analysis
Johns Hopkins University via Coursera
Computing for Data Analysis
Johns Hopkins University via Coursera
Scientific Computing
University of Washington via Coursera
Introduction to Data Science
University of Washington via Coursera
Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera