https://github.com/paradedb/paradedb/tree/dev/pg_l...
(1) The separation between storage and compute , as encouraged by data lake architectures (e.g. the implementation of P would look different in a traditional database like PostgreSQL, or a cloud warehouse like Snowflake). This architecture is the focus of the current system, and it is prevalent in most mid-to-large enterprises (its benefits that be... See more
Jacopo Tagliabue • Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.
Nicolay Gerold added
Serverless Postgres (experimental)
Serverless Postgres using Oriole, Fly Machines, and Tigris for S3 Storage.
Overview
This is a MVP for Serverless Postgres.
1/ It uses Fly.io, which can automatically pause your database after all connections are released (and start it again when new connections join).
2/ It uses Oriole, a Postgres extension with expe... See more
Serverless Postgres using Oriole, Fly Machines, and Tigris for S3 Storage.
Overview
This is a MVP for Serverless Postgres.
1/ It uses Fly.io, which can automatically pause your database after all connections are released (and start it again when new connections join).
2/ It uses Oriole, a Postgres extension with expe... See more
GitHub - kiwicopple/serverless-postgres
Nicolay Gerold added
pg_vectorize: a VectorDB for Postgres
A Postgres extension that automates the transformation and orchestration of text to embeddings and provides hooks into the most popular LLMs. This allows you to do vector search and build LLM applications on existing data with as little as two function calls.
This project relies heavily on the work by pgvector f... See more
A Postgres extension that automates the transformation and orchestration of text to embeddings and provides hooks into the most popular LLMs. This allows you to do vector search and build LLM applications on existing data with as little as two function calls.
This project relies heavily on the work by pgvector f... See more
GitHub - tembo-io/pg_vectorize: The simplest way to orchestrate vector search on Postgres
Nicolay Gerold added
Postgres Materialized Views from Parquet in S3 with Zero ETL | Crunchy Data Blog
Marco Slotcrunchydata.comThe ability to implement custom Polars plugins in Rust is invaluable. Since we process a lot of textual data for our NLP applications, we can create optimized functions to clean text or detect a language, with data being processed efficiently in batches. This level of customization is rarely seen in other typical processing engines and is even impo... See more
Polars — Processing hundreds of GBs of textual data on a daily basis at MDPI
Nicolay Gerold added
PGlite
pglite.devData bases have gotten so good at this, that the term is almost misleading now. “Base” suggests something rigid, without which the data would slip away. But the data is always there, just bits on a nameless hard disk. The structure and the accessibility that a modern database provides exist completely independently from that hard disk. That’s right... See more
DuckDB Doesn’t Need Data To Be a Database
Nicolay Gerold added
PGlite - Postgres in WASM
PGlite is a WASM Postgres build packaged into a TypeScript client library that enables you to run Postgres in the browser, Node.js and Bun, with no need to install any other dependencies. It is only 3.7mb gzipped.
import { PGlite } from "@electric-sql/pglite"
const db = new PGlite()
await db.query("select 'Hello world' as mes... See more
PGlite is a WASM Postgres build packaged into a TypeScript client library that enables you to run Postgres in the browser, Node.js and Bun, with no need to install any other dependencies. It is only 3.7mb gzipped.
import { PGlite } from "@electric-sql/pglite"
const db = new PGlite()
await db.query("select 'Hello world' as mes... See more
electric-sql • GitHub - electric-sql/pglite: Lightweight Postgres packaged as WASM into a TypeScript library for the browser, Node.js, Bun and Deno
Nicolay Gerold added