Processing Text with R Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.
Syllabus
Introduction
- The emergence of text analytics
- Purpose
- Document
- Corpus
- R text processing libraries
- Setting up the environment
- PCorpus and VCorpus
- Reading files with CorpusReader
- Exploring the corpus
- Persisting the corpus
- Setup for processing
- Cleansing text
- Stop word removal
- Stemming
- Managing metadata
- Introduction to tf-idf
- Generating term frequency matrix
- Improving term frequency matrix
- Plotting term frequency
- Generating tf-idf
- N-grams concepts
- Using RWeka NGramTokenizer
- Creating an n-gram text frequency matrix
- Extracting n-gram pairs
- Storing text
- Processing text data
- Scalability
- Next steps
Taught by
Kumaran Ponnambalam
Related Courses
Social Network AnalysisUniversity of Michigan via Coursera Intro to Algorithms
Udacity Data Analysis
Johns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX