Language Classification with Naive Bayes in Python
Offered By: Coursera Project Network via Coursera
Course Description
Overview
In this one-hour project, you will learn how to clean and preprocess data for language classification. You will learn some of the theory behind Naive Bayes modeling and how class imbalance in the training data affects classification performance. You will also learn how to use subword units to further mitigate the negative effects of class imbalance and build an even better model.
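To make the Naive Bayes and class-imbalance ideas above concrete, here is a minimal sketch in plain Python. It is not the course's code: the toy sentences, the labels `en`, `cs`, and `sk`, the function names `train_naive_bayes` and `predict`, and the `uniform_prior` switch are all assumptions introduced for illustration. The sketch fits a multinomial Naive Bayes model with add-one smoothing over whitespace tokens. Replacing the empirical class prior with a uniform one is shown as one simple way to stop an over-represented language from dominating predictions.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(sentences, labels, alpha=1.0, uniform_prior=False):
    """Fit a multinomial Naive Bayes model on whitespace-tokenized sentences."""
    token_counts = defaultdict(Counter)  # label -> token -> count
    class_counts = Counter(labels)       # label -> number of sentences
    vocab = set()
    for sentence, label in zip(sentences, labels):
        tokens = sentence.lower().split()
        token_counts[label].update(tokens)
        vocab.update(tokens)

    log_prior, log_likelihood = {}, {}
    for label, n_label in class_counts.items():
        # Empirical prior reflects class imbalance; uniform prior ignores it.
        prior = 1.0 / len(class_counts) if uniform_prior else n_label / len(labels)
        log_prior[label] = math.log(prior)
        # Add-alpha (Laplace) smoothing so unseen tokens never get probability 0.
        total = sum(token_counts[label].values()) + alpha * len(vocab)
        log_likelihood[label] = {
            tok: math.log((token_counts[label][tok] + alpha) / total)
            for tok in vocab
        }
    return {"log_prior": log_prior, "log_likelihood": log_likelihood}


def predict(model, sentence):
    """Return the label with the highest log posterior score."""
    scores = {}
    for label, score in model["log_prior"].items():
        likelihoods = model["log_likelihood"][label]
        for tok in sentence.lower().split():
            if tok in likelihoods:  # tokens outside the vocabulary are skipped
                score += likelihoods[tok]
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical, deliberately imbalanced toy data (three English sentences,
# one Czech, one Slovak). Not taken from the course.
sentences = [
    "the cat is on the table",
    "i like to read books",
    "where is the station",
    "kočka je na stole",
    "mačka je na stole",
]
labels = ["en", "en", "en", "cs", "sk"]

model = train_naive_bayes(sentences, labels)
balanced_model = train_naive_bayes(sentences, labels, uniform_prior=True)

print(predict(model, "the cat is on the table"))
print(predict(balanced_model, "kočka je na stole"))
```

With realistic data, the effect of the prior is easiest to see on short or ambiguous sentences, where the likelihood terms are weak and the class prior can decide the outcome.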
Syllabus
- Language Classification with Naive Bayes in Python
- In this one-hour project, you will learn how to design an end-to-end model that classifies sentences as Slovak, Czech, or English. Along the way, you will implement the relevant preprocessing steps and address class imbalance in your training set, first by applying the theory of Naive Bayes models and then by implementing a more advanced technique: subword units.
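The listing does not say which subword method the project uses, so the sketch below swaps in character n-grams as a simple stand-in; byte-pair encoding is another common choice. The helper names `char_ngrams` and `subword_tokenize`, the `<` and `>` boundary markers, and the default `n=3` are assumptions made for illustration. The point is that related word forms share subword units, so a classifier can still find evidence in words it never saw whole during training.

```python
def char_ngrams(token, n=3):
    """Split one token into overlapping character n-grams.

    '<' and '>' mark the word boundaries so prefixes and suffixes
    stay distinguishable from word-internal sequences.
    """
    padded = f"<{token}>"
    if len(padded) <= n:
        return [padded]
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def subword_tokenize(sentence, n=3):
    """Lowercase a sentence and replace each word with its character n-grams."""
    units = []
    for token in sentence.lower().split():
        units.extend(char_ngrams(token, n))
    return units


# "stoly" is treated here as a word form missing from the training data;
# it still shares several units with the seen form "stole".
shared = set(char_ngrams("stole")) & set(char_ngrams("stoly"))
print(sorted(shared))
print(subword_tokenize("Na stole"))
```

Feeding these units into a Naive Bayes model in place of whole words gives a small training class more overlapping features to draw on, which is one plausible reason subword units help under class imbalance.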
Taught by
Ari Anastassiou
Related Courses
- Genomic Data Science and Clustering (Bioinformatics V), University of California, San Diego via Coursera
- 用Python玩转数据 Data Processing Using Python, Nanjing University via Coursera
- Data Mining Project, University of Illinois at Urbana-Champaign via Coursera
- Advanced Business Analytics Capstone, University of Colorado Boulder via Coursera
- Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法, Tsinghua University via edX