Data Pipelines - Introduction to Text Analytics with R Part 3

Offered By: Data Science Dojo via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Explore data pipelines in this third installment of the introduction to text analytics with R video. Dive into textual data exploration for pre-processing challenges, utilize the quanteda package for text analytics, and create a prototypical text analytics pre-processing pipeline. Learn about tokenization, lower casing, stop word removal, and stemming. Develop skills to create a document-frequency matrix used for training machine learning models. Access the Kaggle dataset and R code used in the series to practice hands-on. Gain valuable insights into text analytics techniques and their application in data science projects.

Syllabus

Intro
HTML Escapes
Quantium
Tokenization
Tokens
Stop Words
Quantity
Stem
DFM

Taught by

Data Science Dojo

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent