Data Processing

GitHub - Nike-Inc/koheesio: Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.

Nicolay Gerold added 6mo

Why you should move your ETL stack to Modal

Nicolay Gerold added 7mo

Why you should move your ETL stack to Modal

Nicolay Gerold added 7mo

Why you should move your ETL stack to Modal

Nicolay Gerold added 7mo

Bap Our 5 favourite open-source customer data platforms

Nicolay Gerold added 7mo

databonsai GitHub - databonsai/databonsai: clean & curate your data with LLMs.

Nicolay Gerold added 7mo

Jacopo Tagliabue Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

Nicolay Gerold added 7mo

How Levels.fyi Built Scalable Search with PostgreSQL

Nicolay Gerold added 7mo

spiceai GitHub - spiceai/spiceai: A unified SQL query interface and portable runtime to locally materialize, accelerate, and query data tables sourced from any database, data warehouse, or data lake.

Nicolay Gerold added 8mo

hatchet-dev GitHub - hatchet-dev/hatchet: A distributed, fault-tolerant task queue

Nicolay Gerold added 8mo