Repository Data Mining on GitHub
Offered By: WeAreDevelopers via YouTube
Course Description
Overview
Explore repository data mining techniques on GitHub in this conference talk from WeAreDevelopers Conference 2017. Dive into machine learning applications at GitHub, including text classification and convolutional networks. Learn about data preprocessing, the distributional hypothesis, and the Stanza flow. Discover how GitHub applies these technologies to improve collaboration and project management. Get an overview of the competition and of the architecture used for mining repository data. Understand the role of GitHub in modern software development and how machine learning extends its capabilities.
Syllabus
Introduction
GitHub
Why use GitHub
Machine Learning at GitHub
Airbnb
Topics
Competition
Overview
Data Preprocessing
Similar Words
Distributional Hypothesis
Machine Learning
StanzaFlow
Convolutional Networks
Classification
Text
Text Classification
Categories
Architecture
Conclusion
Collaboration with GitHub
Taught by
WeAreDevelopers
Related Courses
Introduction to Agile Software Development: Tools & Techniques
University of California, Berkeley via edX
Advanced Topics and Techniques in Agile Software Development
University of California, Berkeley via edX
The Data Scientist’s Toolbox
Johns Hopkins University via Coursera
How to Use Git and GitHub
Udacity
Desarrollo de Videojuegos 3D en Unity: Una Introducción
Universidad de los Andes via Coursera