YoVDO

Processing Text with R Essential Training

Offered By: LinkedIn Learning

Tags

Data Analysis Courses R Programming Courses Text Mining Courses Predictive Modeling Courses Stemming Courses TF-IDF Courses

Course Description

Overview

Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The emergence of text analytics
1. Introduction to Text Mining
  • Purpose
  • Document
  • Corpus
  • R text processing libraries
  • Setting up the environment
2. Corpus in R
  • PCorpus and VCorpus
  • Reading files with CorpusReader
  • Exploring the corpus
  • Persisting the corpus
3. Text Cleansing and Extraction
  • Setup for processing
  • Cleansing text
  • Stop word removal
  • Stemming
  • Managing metadata
4. TF-IDF
  • Introduction to tf-idf
  • Generating term frequency matrix
  • Improving term frequency matrix
  • Plotting term frequency
  • Generating tf-idf
5. N-Grams
  • N-grams concepts
  • Using RWeka NGramTokenizer
  • Creating an n-gram text frequency matrix
  • Extracting n-gram pairs
6. Best Practices
  • Storing text
  • Processing text data
  • Scalability
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Related Courses

Social Network Analysis
University of Michigan via Coursera
Intro to Algorithms
Udacity
Data Analysis
Johns Hopkins University via Coursera
Computing for Data Analysis
Johns Hopkins University via Coursera
Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX