What Can We Learn from 750 Billion GitHub Events and 42 TB of Code
Offered By: Devoxx via YouTube
Course Description
Overview
Explore GitHub data through an analysis of 750 billion events and 42 TB of code. The talk covers software development trends, open source community dynamics, and coding patterns over time. Learn how to use this dataset to guide project design decisions, request features based on data, and measure community health. Discover the most effective ways to phrase change requests and understand the impact of social media on project popularity. Investigate who starred your project and what else interests them. Gain practical knowledge on running static code analysis at scale and settle the age-old debate of tabs vs. spaces. Presented by Felipe Hoffa, a Google Developer Advocate, this talk demonstrates big data analysis with Google Cloud Platform tools on one of the largest datasets of collaborative software development.
Syllabus
Intro
What do we see
Who wants to analyze GitHub
How GitHub events started
Google BigQuery
Comparing projects
Looking for stars
Looking at other projects
Text analysis
Country analysis
New Zealand
Weather
Code analysis
Stack Overflow
Go Query
Static Code Analysis
Questions
Query Analysis
Taught by
Devoxx
Related Courses
DCO042 - Python For Informatics
University of Michigan via Independent
Corpus Linguistics: Method, Analysis, Interpretation
Lancaster University via FutureLearn
日本中世の自由と平等 (ga001)
University of Tokyo via gacco
"A Study in Scarlet" by Doyle: BerkeleyX Book Club
University of California, Berkeley via edX
"A Room with a View" by Forster: BerkeleyX Book Club
University of California, Berkeley via edX