Clustering Geolocation Data Intelligently in Python
Offered By: Coursera Project Network via Coursera
Course Description
Overview
In this 1.5-hour long project, you will learn how to clean and preprocess geolocation data for clustering. You will learn how to export this data into an interactive file that can be better understood for the data. You will learn how to cluster initially with a K-Means approach, before using a more complicated density-based algorithm, DBSCAN. We will discuss how to evaluate these models, and offer improvements to DBSCAN with the introduction of HDBSCAN.
Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Syllabus
- Clustering Geolocation Data Intelligently in Python
- In this 1.5-hour long project, you will learn how to visualize geolocation data clearly and interactively using Python. You will then learn a simple but limited approach to clustering this data, using the K-Means algorithm. We will develop on this notion of clustering by moving to more advanced density-based methods, namely Density-Based Spatial Clustering of Applications with Noise, known as DBSCAN, and in order to address some of its shortcomings, the more advanced Hierarchical DBSCAN (HDBSCAN). You will also learn about a simple method of addressing outliers that may exist in a clustering problem.
Taught by
Ari Anastassiou
Related Courses
Macroeconometric ForecastingInternational Monetary Fund via edX Machine Learning With Big Data
University of California, San Diego via Coursera Data Science at Scale - Capstone Project
University of Washington via Coursera Structural Equation Model and its Applications | 结构方程模型及其应用 (粤语)
The Chinese University of Hong Kong via Coursera Data Science in Action - Building a Predictive Churn Model
SAP Learning