Repository Data Mining on GitHub
Offered By: WeAreDevelopers via YouTube
Course Description
Overview
Explore repository data mining techniques on GitHub in this conference talk from WeAreDevelopers Conference 2017. Dive into machine learning applications at GitHub, including text classification and convolutional networks. Learn about data preprocessing, distributional hypothesis, and the Stanza flow. Discover how GitHub leverages these technologies for improved collaboration and project management. Gain insights into the competition overview and architecture used for mining repository data. Understand the importance of GitHub in modern software development and how machine learning enhances its capabilities.
Syllabus
Introduction
GitHub
Why use GitHub
Machine Learning at GitHub
Airbnb
Topics
Competition
Overview
Data Preprocessing
Similar Words
Distributional Hypothesis
Machine Learning
StanzaFlow
Convolutional Networks
Classification
Text
Text Classification
Categories
Architecture
Conclusion
Collaboration with GitHub
Taught by
WeAreDevelopers
Related Courses
Applied Text Mining in PythonUniversity of Michigan via Coursera Natural Language Processing
Higher School of Economics via Coursera Exploitez des données textuelles
CentraleSupélec via OpenClassrooms Basic Sentiment Analysis with TensorFlow
Coursera Project Network via Coursera Build Multilayer Perceptron Models with Keras
Coursera Project Network via Coursera