YoVDO

Processing Text with R Essential Training

Offered By: LinkedIn Learning

Tags

Data Analysis Courses R Programming Courses Text Mining Courses Predictive Modeling Courses Stemming Courses TF-IDF Courses

Course Description

Overview

Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The emergence of text analytics
1. Introduction to Text Mining
  • Purpose
  • Document
  • Corpus
  • R text processing libraries
  • Setting up the environment
2. Corpus in R
  • PCorpus and VCorpus
  • Reading files with CorpusReader
  • Exploring the corpus
  • Persisting the corpus
3. Text Cleansing and Extraction
  • Setup for processing
  • Cleansing text
  • Stop word removal
  • Stemming
  • Managing metadata
4. TF-IDF
  • Introduction to tf-idf
  • Generating term frequency matrix
  • Improving term frequency matrix
  • Plotting term frequency
  • Generating tf-idf
5. N-Grams
  • N-grams concepts
  • Using RWeka NGramTokenizer
  • Creating an n-gram text frequency matrix
  • Extracting n-gram pairs
6. Best Practices
  • Storing text
  • Processing text data
  • Scalability
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Text Mining and Analytics
University of Illinois at Urbana-Champaign via Coursera
Text Mining & Analytics
Delft University of Technology via edX
Text Analytics with SAP HANA Platform
SAP Learning
Applied Text Mining in Python
University of Michigan via Coursera
Hands-on Text Mining and Analytics
Yonsei University via Coursera