Processing Text with R Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.
Syllabus
Introduction
- The emergence of text analytics
- Purpose
- Document
- Corpus
- R text processing libraries
- Setting up the environment
- PCorpus and VCorpus
- Reading files with CorpusReader
- Exploring the corpus
- Persisting the corpus
- Setup for processing
- Cleansing text
- Stop word removal
- Stemming
- Managing metadata
- Introduction to tf-idf
- Generating term frequency matrix
- Improving term frequency matrix
- Plotting term frequency
- Generating tf-idf
- N-grams concepts
- Using RWeka NGramTokenizer
- Creating an n-gram text frequency matrix
- Extracting n-gram pairs
- Storing text
- Processing text data
- Scalability
- Next steps
Taught by
Kumaran Ponnambalam
Related Courses
Big Data Analytics in HealthcareGeorgia Institute of Technology via Udacity Model Building and Validation
AT&T via Udacity Maths for Humans: Linear, Quadratic & Inverse Relations
University of New South Wales via FutureLearn Regression Modeling in Practice
Wesleyan University via Coursera Data Science at Scale - Capstone Project
University of Washington via Coursera