The Caltech-JPL Summer School on Big Data Analytics
Offered By: California Institute of Technology via Coursera
Course Description
Overview
This is not a class as it is commonly understood; it is the set of materials from a summer school offered by Caltech and JPL, in the sense used by most scientists: an intensive period of learning of some advanced topics, not on an introductory level.
The school will cover a variety of topics, with a focus on practical computing applications in research: the skills needed for a computational ("big data") science, not computer science. The specific focus will be on applications in astrophysics, earth science (e.g., climate science) and other areas of space science, but with an emphasis on the general tools, methods, and skills that would apply across other domains as well. It is aimed at an audience of practicing researchers who already have a strong background in computation and data analysis. The lecturers include computational science and technology experts from Caltech and JPL.
Students can evaluate their own progress, but there will be no tests, exams, and no formal credit or certificates will be offered.
The school will cover a variety of topics, with a focus on practical computing applications in research: the skills needed for a computational ("big data") science, not computer science. The specific focus will be on applications in astrophysics, earth science (e.g., climate science) and other areas of space science, but with an emphasis on the general tools, methods, and skills that would apply across other domains as well. It is aimed at an audience of practicing researchers who already have a strong background in computation and data analysis. The lecturers include computational science and technology experts from Caltech and JPL.
Students can evaluate their own progress, but there will be no tests, exams, and no formal credit or certificates will be offered.
Syllabus
The anticipated schedule of lectures (subject to changes):
Each bullet bellow corresponds to a set of materials that includes approximately 2 hours of video lectures, various links and supplementary materials, plus some on-line, hands-on exercises.
2. Best programming practices. Information retrieval.
3. Introduction to R. Markov Chain Monte Carlo.
4. Statistical resampling and inference.
5. Databases.
6. Data visualization.
7. Clustering and classification.
8. Decision trees and random forests.
9. Dimensionality reduction. Closing remarks.
Taught by
Amy Braverman, Daniel Crichton, Scott Davidoff, S. Djorgovski, Ciro Donalek, Richard Doyle, Thomas Fuchs, Matthew Graham, Ashish Mahabal, Chris Mattmann and David Thompson
Tags
Related Courses
Statistical Learning with RStanford University via edX The Analytics Edge
Massachusetts Institute of Technology via edX Machine Learning 1—Supervised Learning
Brown University via Udacity 機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera Alternative Approaches to Valuation and Investment
University of Melbourne via Coursera