Snowflake for Data Science: Intro to Snowpark ML for Python
Offered By: Coursera Project Network via Coursera
Course Description
Overview
Picture this: a data-driven revolution where Snowflake's data cloud and Python converge to unveil new dimensions in data science. Get ready for a transformative learning experience with our Snowpark ML for Python guided-project!
This Guided Project was crafted to help data scientists leverage Snowpark ML for Python. By the end of this project, you will be equipped to seamlessly construct an end-to-end machine learning workflow – commencing from data preprocessing, advancing through model training, and culminating in the realms of inference and deployment.
Embark on this 2-hour journey where you will learn how to:
Load data into Snowflake and setup Python workspace.
Transform data using Snowpark ML API and build a preprocessing pipeline.
Train an XGBoost model using the Snowpark ML Model API and deploy the model as a UDF for inference.
To achieve the project's objectives, you will create a machine learning model (XGBoost) using Snowpark in Snowflake. The project scenario involves stepping into the role of a data scientist for a renowned Diamond retailer. The goal is to address challenges faced by salespeople in accurately estimating diamond prices. Leveraging a dataset with attributes like Carat, Weight, and Cut Quality stored in Snowflake, you will build a predictive model to recommend optimal diamond purchase prices.
This project uniquely integrates Snowflake's robust data capabilities with the flexibility of Python for machine learning. Success requires an intermediate level of Python and prior experience with Snowflake. Don't miss the chance to enhance your data science skills with Snowpark ML and Python!
Syllabus
- Project Overview
- This Guided Project was crafted to help data scientists and business analysts accomplish the mastery of leveraging Snowpark ML for Python. By the end of this project, you will be equipped to seamlessly construct an end-to-end machine learning workflow – commencing from data preprocessing, advancing through model training, and culminating in the realms of inference and deployment.
Taught by
Mohamed Touiti
Related Courses
Genomic Data Science and Clustering (Bioinformatics V)University of California, San Diego via Coursera 用Python玩转数据 Data Processing Using Python
Nanjing University via Coursera Data Mining Project
University of Illinois at Urbana-Champaign via Coursera Advanced Business Analytics Capstone
University of Colorado Boulder via Coursera Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法
Tsinghua University via edX