YoVDO

Processing Text with R Essential Training

Offered By: LinkedIn Learning

Tags

Data Analysis Courses R Programming Courses Text Mining Courses Predictive Modeling Courses Stemming Courses TF-IDF Courses

Course Description

Overview

Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The emergence of text analytics
1. Introduction to Text Mining
  • Purpose
  • Document
  • Corpus
  • R text processing libraries
  • Setting up the environment
2. Corpus in R
  • PCorpus and VCorpus
  • Reading files with CorpusReader
  • Exploring the corpus
  • Persisting the corpus
3. Text Cleansing and Extraction
  • Setup for processing
  • Cleansing text
  • Stop word removal
  • Stemming
  • Managing metadata
4. TF-IDF
  • Introduction to tf-idf
  • Generating term frequency matrix
  • Improving term frequency matrix
  • Plotting term frequency
  • Generating tf-idf
5. N-Grams
  • N-grams concepts
  • Using RWeka NGramTokenizer
  • Creating an n-gram text frequency matrix
  • Extracting n-gram pairs
6. Best Practices
  • Storing text
  • Processing text data
  • Scalability
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Creating a Wordcloud using NLP and TF-IDF in Python
Coursera Project Network via Coursera
Feature Engineering for NLP in Python
DataCamp
Advanced NLP with Python for Machine Learning
LinkedIn Learning
Processing Text with Python Essential Training
LinkedIn Learning
Natural Language Processing
YouTube