YoVDO

Processing Text with R Essential Training

Offered By: LinkedIn Learning

Tags

Data Analysis Courses R Programming Courses Text Mining Courses Predictive Modeling Courses Stemming Courses TF-IDF Courses

Course Description

Overview

Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The emergence of text analytics
1. Introduction to Text Mining
  • Purpose
  • Document
  • Corpus
  • R text processing libraries
  • Setting up the environment
2. Corpus in R
  • PCorpus and VCorpus
  • Reading files with CorpusReader
  • Exploring the corpus
  • Persisting the corpus
3. Text Cleansing and Extraction
  • Setup for processing
  • Cleansing text
  • Stop word removal
  • Stemming
  • Managing metadata
4. TF-IDF
  • Introduction to tf-idf
  • Generating term frequency matrix
  • Improving term frequency matrix
  • Plotting term frequency
  • Generating tf-idf
5. N-Grams
  • N-grams concepts
  • Using RWeka NGramTokenizer
  • Creating an n-gram text frequency matrix
  • Extracting n-gram pairs
6. Best Practices
  • Storing text
  • Processing text data
  • Scalability
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Big Data Analytics in Healthcare
Georgia Institute of Technology via Udacity
Model Building and Validation
AT&T via Udacity
Maths for Humans: Linear, Quadratic & Inverse Relations
University of New South Wales via FutureLearn
Regression Modeling in Practice
Wesleyan University via Coursera
Data Science at Scale - Capstone Project
University of Washington via Coursera