Wimsey
Wimsey is a lightweight, flexible and fully open-source data contract library.
- Bring your own dataframe library
  Built on top of Narwhals, so your tests are carried out natively in your own dataframe library (including Pandas, Polars, PySpark, Dask, DuckDB, cuDF, Rapids, Arrow and Modin)
- Bring your own contract format
  Write contracts in YAML, JSON or Python, whichever you prefer!
- Ultra lightweight
  Built for fast imports and minimal overhead, with only two dependencies (Narwhals and FSSpec)
- Simple, easy API
  Low mental overhead, with two simple functions for testing dataframes and a simple dataclass for results.
Ideally, all data would be usable when you receive it, but you have probably already figured out that’s not always the case. That’s where data contracts come in.
Wimsey provides a small set of very-useful-functions™ for testing data against, and building, data contracts.
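To make the idea of a data contract concrete, here is a minimal, dependency-free sketch of the general pattern: a contract written as plain data, a function that tests rows against it, and a dataclass holding each result. This is not Wimsey's API. The names `run_contract` and `CheckResult`, and the contract keys `column`, `check` and `value`, are hypothetical and chosen only for illustration; see the quick start for the real functions and contract format.

```python
from dataclasses import dataclass

# A contract expressed as plain data, the kind of thing that could live in YAML or JSON.
# The keys ("column", "check", "value") are hypothetical, not Wimsey's schema.
contract = [
    {"column": "age", "check": "min_at_least", "value": 0},
    {"column": "age", "check": "max_at_most", "value": 120},
    {"column": "name", "check": "no_nulls"},
]


@dataclass
class CheckResult:
    """Outcome of one contract rule (hypothetical result type)."""
    column: str
    check: str
    success: bool


def run_contract(rows, contract):
    """Evaluate every rule in the contract against a list of row dicts."""
    results = []
    for rule in contract:
        values = [row.get(rule["column"]) for row in rows]
        present = [v for v in values if v is not None]
        if rule["check"] == "min_at_least":
            ok = bool(present) and min(present) >= rule["value"]
        elif rule["check"] == "max_at_most":
            ok = bool(present) and max(present) <= rule["value"]
        elif rule["check"] == "no_nulls":
            ok = len(present) == len(values)
        else:
            raise ValueError(f"unknown check: {rule['check']}")
        results.append(CheckResult(rule["column"], rule["check"], ok))
    return results


# Sample data that breaks two of the three rules.
rows = [
    {"name": "Peter", "age": 45},
    {"name": None, "age": 130},
]

results = run_contract(rows, contract)
for result in results:
    print(result)
```

With the sample rows above, the minimum-age rule passes, while the maximum-age and no-nulls rules fail. Keeping the contract as plain data is what allows the same rules to be stored in a file and shared between the producer and the consumer of a dataset.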
Check out the quick start for an intro!