Data Storage

Bill Mill notes.billmill.org

WebDataset

Making a Postgres query 1,000 times faster

fsspec GitHub - fsspec/filesystem_spec: A specification that python filesystems should adhere to.

GitHub - rebremer/expose-deltatable-via-restapi

DuckDB Doesn’t Need Data To Be a Database

The Architecture of Grab's Data Lake

spiceai GitHub - spiceai/spiceai: A unified SQL query interface and portable runtime to locally materialize, accelerate, and query data tables sourced from any database, data warehouse, or data lake.

Making a Postgres query 1,000 times faster