Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

Bauplan and Nessie enable reproducible data science by providing time-travel and branching semantics on data lakes while decoupling compute from data management.

arxiv.org

Bill Franks Taming The Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics (Wiley and SAS Business Series)

Jan-Erik Asplund Earl Lee, co-founder and CEO of HeadsUp, on the modern data stack value chain

Bill Mill notes.billmill.org

DuckDB Doesn’t Need Data To Be a Database

Thomas H. Davenport Big Data at Work: Dispelling the Myths, Uncovering the Opportunities