YoVDO

Polyglot Data with Python - Introducing Pandas and Apache Arrow

Offered By: PyCon US via YouTube

Tags

PyCon US Courses Python Courses pandas Courses Data Manipulation Courses Apache Arrow Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore polyglot data handling in Python using Pandas and Apache Arrow in this informative talk from PyCon US. Discover how to overcome challenges in exchanging data between different ecosystems, addressing limitations of Pandas and NumPy outside the Python environment. Learn techniques for efficient data acquisition, manipulation, and exchange without resorting to slow conversion code or unnecessarily large files. Gain insights into working seamlessly in heterogeneous environments, handling data from various sources within Python, and transferring it back to other ecosystems transparently. The presentation covers topics such as CSV scalability issues, data modification, protocols, and the "Pioneer rule," along with practical demonstrations of Pandas Dataframe, Metadata, and File System functionalities.

Syllabus

Introduction
About me
The problem
CSV doesnt scale
Data modification
Protocols
Pioneer rule
Demo
Pandas Dataframe
Pandas Metadata
Pandas File System
Python
Documentation
Outro


Taught by

PyCon US

Related Courses

Computational Investing, Part I
Georgia Institute of Technology via Coursera
Введение в машинное обучение
Higher School of Economics via Coursera
Математика и Python для анализа данных
Moscow Institute of Physics and Technology via Coursera
Introduction to Python for Data Science
Microsoft via edX
Python for Data Science
University of California, San Diego via edX