Practical Optimisations for Pandas
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Discover techniques for writing efficient pandas code in this 46-minute EuroPython 2020 conference talk. Learn to identify performance bottlenecks, implement computational efficiency strategies, and optimize memory usage in pandas. Explore various optimizations including data processing, reading data, using NumExpr, subselecting, concatenating, appending, sorting, filtering, compiling with Cython, leveraging pure Python and vectorized methods, managing memory footprint, implementing concurrency and parallelism, and utilizing other frameworks. Gain insights into additional resources and techniques for enhancing pandas performance in data science applications.
Syllabus
Intro
Overview
Optimizations
Processing Data
Reading Data
Type
NumExpression
Subselect
Concatenate
Append
Sorting
Filtering
Compiling
cyton
cyto
Pure Python
Vectorized Methods
Memory Footprint
Memory Footprint Example
Concurrency Parabolism
Other Techniques
Other Frameworks
Techniques
Additional Resources
Questions
Ending
Taught by
EuroPython Conference
Related Courses
Computational Investing, Part IGeorgia Institute of Technology via Coursera Введение в машинное обучение
Higher School of Economics via Coursera Математика и Python для анализа данных
Moscow Institute of Physics and Technology via Coursera Introduction to Python for Data Science
Microsoft via edX Python for Data Science
University of California, San Diego via edX