Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

Reproducible data science is enabled through Bauplan and Nessie, providing time-travel and branching semantics on data lakes, decoupling compute from data management.

updated 5mo ago

  • from The Knowledge Organization by Anton Iokov

    sari added

  • from Earl Lee, co-founder and CEO of HeadsUp, on the modern data stack value chain by Jan-Erik Asplund

    Ted Glasnow added

  • from Multiplayer Media | The Generalist by Mario Gabriele

    sari added