Predicting Wine Quality with Random Forest and Scikit-Learn
Offered By: Coursera Community Project Network via Coursera
Course Description
Overview
In real life we face various classification problems, such as predicting whether an email is spam or not, or whether a credit card transaction is fraudulent or not, or what label the mobile phone should assign to the image it focuses on, perhaps a flower, a dog, a person or something else. Fortunately, we have machine learning techniques to help us deal with this.
In this guided project, we will tackle the problem of predicting red wine quality using a Random Forest Classifier. Specifically, we will implement it by programming with Python and the classifier provided by the Scikit-Learn package.
You will learn to train the classifier, calibrate it, tune its hyperparameters and evaluate the accuracy of its predictions. You will also learn how to perform cluster analysis to handle collinearity and reduce the number of predictors without sacrificing model accuracy. In addition, you will draw various graphs to help you interpret the results.
This project is intended for beginners, so the prerequisites are basic knowledge of Python, Pandas, Numpy, Matplotlib, Seaborn, Scikit-Learn, Scipy and Random Forest algorithms.
Note: This course runs in Rhyme's virtual browser, which is Coursera's hands-on project platform. With this browser you will connect to Google Colaboratory to write and execute Python code in a Jupyter Notebook, without worrying about installing software. All you need is to have a Google account.
This Guided Project was created by a Coursera community member.
In this guided project, we will tackle the problem of predicting red wine quality using a Random Forest Classifier. Specifically, we will implement it by programming with Python and the classifier provided by the Scikit-Learn package.
You will learn to train the classifier, calibrate it, tune its hyperparameters and evaluate the accuracy of its predictions. You will also learn how to perform cluster analysis to handle collinearity and reduce the number of predictors without sacrificing model accuracy. In addition, you will draw various graphs to help you interpret the results.
This project is intended for beginners, so the prerequisites are basic knowledge of Python, Pandas, Numpy, Matplotlib, Seaborn, Scikit-Learn, Scipy and Random Forest algorithms.
Note: This course runs in Rhyme's virtual browser, which is Coursera's hands-on project platform. With this browser you will connect to Google Colaboratory to write and execute Python code in a Jupyter Notebook, without worrying about installing software. All you need is to have a Google account.
This Guided Project was created by a Coursera community member.
Taught by
Marcela Manzo
Related Courses
Computational Investing, Part IGeorgia Institute of Technology via Coursera Введение в машинное обучение
Higher School of Economics via Coursera Математика и Python для анализа данных
Moscow Institute of Physics and Technology via Coursera Introduction to Python for Data Science
Microsoft via edX Using Python for Research
Harvard University via edX