Claude
6 Hard Problems Scaling Vector Search
Nicolay Gerold added
Your ML team (or maybe OpenAI) comes out with a new version of their... See more
6 Hard Problems Scaling Vector Search
Nicolay Gerold added
Synthetic data can be used to establish baseline precision and recall metrics for your reverse search. The simplest kind of synthetic data is to take existing text chunks, generate synthetic questions, and verify that when we query our synthetic questions, the sourced text chunk is retrieved correctly.
Benefi... See more
Low-Hanging Fruit for RAG Search - jxnl.co
Nicolay Gerold added
Eventually their database became a ticking time bomb ready to explode at any moment. The first instinct might be to beef up their Postgres instance, like me trying to get big at the gym. But there are physical limits to how much you can scale a single machine before costs increase exponentially. Moreover, query performance and maintenance processes
... See moreKiki's Bytes • How Notion Scaled to 100 Million Users Without Their Database Exploding
Scaling Causal's Spreadsheet Engine From Thousands to Billions of Cells: From Maps to Arrays
Simon Eskildsen (simon@sirupsen.com)sirupsen.comIndexing:
Check the query patterns of your application and create the right indexes.
Materialized Views:
Pre-compute complex query results and store them for faster access.
Denormalization:
Reduce complex joins to improve query performance.
Vertical Scaling
Boost your database server by adding more CPU, RAM, or... See more
Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]
Nicolay Gerold added
Livingston: Who did you learn things from at Google? Did you have mentors? Buchheit: I didn't know anything about building these large systems before working at Google. So I'd look at how different parts of Google work and sort of say, "Does that apply to us? Can we reuse that technique?"—since there was already a successful model of how
... See more