Data Processing

GitHub - Nike-Inc/koheesio: Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.

Why you should move your ETL stack to Modal

Why you should move your ETL stack to Modal

Why you should move your ETL stack to Modal

Bap Our 5 favourite open-source customer data platforms

databonsai GitHub - databonsai/databonsai: clean & curate your data with LLMs.

Jacopo Tagliabue Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

How Levels.fyi Built Scalable Search with PostgreSQL

spiceai GitHub - spiceai/spiceai: A unified SQL query interface and portable runtime to locally materialize, accelerate, and query data tables sourced from any database, data warehouse, or data lake.

hatchet-dev GitHub - hatchet-dev/hatchet: A distributed, fault-tolerant task queue