Data Analysis and Statistical Inference
Offered By: Duke University via Coursera
Course Description
Overview
The Coursera course, Data Analysis and Statistical Inference has
been revised and is now offered as part of Coursera Specialization “Statistics with R”. This Specialization consists of 4 courses and a capstone project. The courses can be taken separately:
___________________________________________________
The goals of this course are as follows:
- Introduction to Probability and Data (began in April 2016)
- Inferential Statistics (begins in May 2016)
- Linear Regression and Modeling (begins in June 2016)
- Bayesian Statistics (begins in July 2016) A completely new course, with additional faculty!
- Statistics Capstone Project (August 2016) (for learners who have passed the 4 previous courses, and earned certificate)
___________________________________________________
The goals of this course are as follows:
- Recognize the importance of data collection, identify limitations in data collection methods, and determine how they affect the scope of inference.
- Use statistical software (R) to summarize data numerically and visually, and to perform data analysis.
- Have a conceptual understanding of the unified nature of statistical inference.
- Apply estimation and testing methods (confidence intervals and hypothesis tests) to analyze single variables and the relationship between two variables in order to understand natural phenomena and make data-based decisions.
- Model and investigate relationships between two or more variables within a regression framework.
- Interpret results correctly, effectively, and in context without relying on statistical jargon.
- Critique data-based claims and evaluate data-based decisions.
- Complete a research project that employs simple statistical inference and modeling techniques.
Syllabus
Week 1: Unit 1 - Introduction to data
- Part 1 – Designing studies
- Part 2 – Exploratory data analysis
- Part 3 – Introduction to inference via simulation
- Part 1 – Defining probability
- Part 2 – Conditional probability
- Part 3 – Normal distribution
- Part 4 – Binomial distribution
- Part 1 – Variability in estimates and the Central Limit Theorem
- Part 2 – Confidence intervals
- Part 3 – Hypothesis tests
- Part 4 – Inference for other estimators
- Part 5 - Decision errors, significance, and confidence
- Part 1 – t-inference
- Part 2 – Power
- Part 3 – Comparing three or more means (ANOVA)
- Part 4 – Simulation based inference for means
- Part 1 – Single proportion
- Part 2 – Comparing two proportions
- Part 3 – Inference for proportions via simulation
- Part 4 – Comparing three or more proportions (Chi-square)
- Part 1 – Relationship between two numerical variables
- Part 2 – Linear regression with a single predictor
- Part 3 – Outliers in linear regression
- Part 4 – Inference for linear regression
- Part 1 – Regression with multiple predictors
- Part 2 – Inference for multiple linear regression
- Part 3 – Model selection
- Part 4 – Model diagnostics
- Bayesian vs. frequentist inference
Taught by
Mine Çetinkaya-Rundel
Tags
Related Courses
Address Business Issues with Data ScienceCertNexus via Coursera Advanced Clinical Data Science
University of Colorado System via Coursera Advanced Data Science Capstone
IBM via Coursera Advanced Data Science with IBM
IBM via Coursera Advanced Deep Learning Methods for Healthcare
University of Illinois at Urbana-Champaign via Coursera