Data Storage

turbopuffer

DuckDB Doesn’t Need Data To Be a Database

WebDataset

lancedb GitHub - lancedb/vectordb-recipes: High quality resources & applications for LLMs, multi-modal models and VectorDBs

jaredwray GitHub - jaredwray/keyv: Simple key-value storage with support for multiple backends

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

Bill Mill notes.billmill.org

GitHub - quarylabs/quary: Open-source BI for engineers

The Architecture of Grab's Data Lake