Skip to content

Blog

The Cox survival guide: hazards, models and time

Abstract

This note is an overview of survival analysis with a focus on the Cox proportional hazards model, a key tool for modelling time-to-event data. We define hazard functions, address right-censoring, and derive estimators like Nelson-Aalen and Kaplan-Meier. The Cox model's semiparametric form and partial likelihood estimation are discussed, along with evaluation metrics such as concordance probability.

Intro to Jax

Abstract

Mutiple packages came to solve the speed issue of python in scientific computing. The de facto standard is of course numpy. Think also numba, jax, torch or even a new language 🔥 that go beyond what numpy offers. This post is a quick intro to my personal preference as of Jan 2025.

Essentials of polars for pandas experts

Abstract

pandas is a standard tool for every data professional, although it does not scale well in production. Yet, being a standard is a strategic position to be, as libraries coming to solve the scale issue tend to meet data professionals where they are, by mimicing the pandas API (think: dask, pyspark.pandas).

polars is a new-ish tool that is probably replacing pandas at the time of writing. The goal of this post is to introduce the kind of mindset change needed to fully exploit polars in production.