YoVDO

Processing Text with Python Essential Training

Offered By: LinkedIn Learning

Tags

Python Courses Data Science Courses Text Mining Courses TF-IDF Courses

Course Description

Overview

Learn the essential techniques for cleansing and processing text in Python. Discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The need for text mining skills in data science
1. Text Mining
  • Text mining today
  • Document concepts
  • Corpus concepts
  • Introduction to the NLTK library
  • Setting up the environment
2. Reading Text
  • Reading raw files
  • Reading files with corpus reader
  • Exploring the corpus
  • Analyzing the corpus
3. Text Cleansing and Extraction
  • Tokenization
  • Cleansing text
  • Stop word removal
  • Stemming
  • Lemmatization
4. Advanced Text Processing
  • Building n-grams
  • Tagging parts of speech
  • Term frequency-inverse document frequency (TF-IDF)
  • Building a TF-IDF matrix
5. Best Practices
  • Storing text
  • Processing text data
  • Scalable processing of text data
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Design Computing: 3D Modeling in Rhinoceros with Python/Rhinoscript
University of Michigan via Coursera
A Practical Introduction to Test-Driven Development
LearnQuest via Coursera
FinTech for Finance and Business Leaders
ACCA via edX
Access Bioinformatics Databases with Biopython
Coursera Project Network via Coursera
Accounting Data Analytics
University of Illinois at Urbana-Champaign via Coursera