Data Manipulation with pandas
Offered By: DataCamp
Course Description
Overview
Learn how to import and clean data, calculate statistics, and create visualizations with pandas.
pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. In this course, you'll learn how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis. Using pandas you’ll explore all the core data science concepts. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python!
pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. In this course, you'll learn how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis. Using pandas you’ll explore all the core data science concepts. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python!
Syllabus
- Transforming DataFrames
- Let’s master the pandas basics. Learn how to inspect DataFrames and perform fundamental manipulations, including sorting rows, subsetting, and adding new columns.
- Aggregating DataFrames
- In this chapter, you’ll calculate summary statistics on DataFrame columns, and master grouped summary statistics and pivot tables.
- Slicing and Indexing DataFrames
- Indexes are supercharged row and column names. Learn how they can be combined with slicing for powerful DataFrame subsetting.
- Creating and Visualizing DataFrames
- Learn to visualize the contents of your DataFrames, handle missing data values, and import data from and export data to CSV files.
Taught by
Richie Cotton and Maggie Matsui
Related Courses
Intro to StatisticsStanford University via Udacity Introduction to Data Science
University of Washington via Coursera Passion Driven Statistics
Wesleyan University via Coursera Information Visualization
Indiana University via Independent DCO042 - Python For Informatics
University of Michigan via Independent