pandas beyond the basics

Bring your knowledge of pandas to the next level. Become more efficient when working on data processing and analysis in Python.

Audience

Analysts, quants, BI professionals, data scientists, software developers and researchers who use Python/pandas and need to aquire intermediate to advanced knowledge on the library.

Standard Duration

1 day

Tailor this course

We can work with you to customise a course for your team, so you can learn exactly what you need at the right pace.

Description of the course

The pandas beyond the basics syllabus is designed to help analysts, researchers, BI experts and developers becoming fluent in the use of the pandas library for Python, going beyond the basics, learning the more advanced features of pandas and adopting the best practices, in order to slice and dice complex data.

Using interactive examples and hands-on exercises, the course provides a full immersion in the advanced features of the pandas library. The course focuses on making your pandas code more efficient, so some very basic familiarity with pandas and its core concepts is assumed.

Syllabus

Efficient data modelling

Deep-dive into pandas data types
Choosing data types when loading the data
Data type conversions
Best practices to improve your memory usage when dealing with large datasets

Efficient data pipelines

Understanding mutability and immutability
Method chaining to improve readability
Best practices to develop and maintain complex data transformation and query pipelines

Efficient slicing and dicing

Complex aggregations (groupby) on multiple columns
Mastering the hierarchical (multi-level) index
Index stacking and unstacking
Pivoting and reshaping

Beyond pandas - exploring the ecosystem

Overview on examples of usage of other libraries built on top of pandas to address specific needs in data preparation, data analysis and data visualisation