Tags
Basel framework
LLM
data engineering
- A dimensional approach to data quality principles
- Essentials of polars for pandas experts
- Exploring a large database
- Generate test dataframe with polars (powered by hypothesis)
- Intro to Snowpark API
- Polars streaming tricks
- Puzzling query optimization behaviour and dtypes shrinking in polars
- SQLite: the absolute basics
- Use code when Dataiku's UI gets in the way
- Working with large datasets in Snowflake
- duckdb basics
dev tips
dev tools
- Build a mortgate calculator with fasthtml
- Cheatsheet for regular expression
- Fork, merge and PR
- Logging in python
- Mini-tutorial for python packaging, release and publish
- Static site generator using fasthtml
- Technical writing with material for mkdocs
- Use code when Dataiku's UI gets in the way
low latency programming
machine learning
- Hierarchical clustering with 36 LOC
- Monotonic binning with PAVA
- Random forest scaling mini-benchmark
- Repurpose hierarchical clustering for feature engineering