Essentials of polars
for pandas
experts
Abstract
pandas
is a standard tool for every data professional, although it does not scale well in production.
Yet, being a standard is a strategic position to be, as libraries coming to solve the scale issue tend to meet data professionals where they are, by mimicing the pandas
API (think: dask
, pyspark.pandas
).
polars
is a new-ish tool that is probably replacing pandas
at the time of writing. The goal of this post is to introduce the kind of mindset change needed to fully exploit polars
in production.