Dat - An Open Source Tool for Sharing and Collaborating on Data
Offered By: JSConf via YouTube
Course Description
Overview
Syllabus
Intro
Max Ogden @maxogden
dat is an open source tool for sharing and collaborating on data
we are grant funded and 100% open source
reproducible science
analogy time: lets talk about source control
life before git
i want to fix a bug in cool-project
1. somehow geta zip of cool-project 2. unpack and edita file 3. email the file back
claim: currently data sharing is a mess
email csv files
we want to do for data what git did for source code
a data set we can all relate to: npm
calculate how big npm is using dat
transform the npm data using bulk-markdown-to-png
bionode bioinformatics tools on npm
data pipelines dependency management data streaming
gasket is a cross platform pipeline manager
datscript is an experimental pipeline config language
branches, dat checkout 3b2d98V3, multi master replication, sync to databases, registry
Taught by
JSConf
Related Courses
Google Cloud Big Data and Machine Learning Fundamentals en EspañolGoogle Cloud via Coursera Data Analysis with Python
IBM via Coursera Intro to TensorFlow 日本語版
Google Cloud via Coursera TensorFlow on Google Cloud - Français
Google Cloud via Coursera Freedom of Data with SAP Data Hub
SAP Learning