finally published our latest research on text embeddings! TLDR: Vector databases are NOT safe. đł Text embeddings can be inverted. We can do this exactly for sentence-length inputs and get very close with paragraphs...
built from first principles on object storage: 10-100x cheaper, usage-based pricing, massive scalability
turbopuffer
Nicolay Gerold added
- Cohere introduced Embed v3, an advanced model for generating document embeddings, boasting top performance on a few benchmarks. It excels in matching document topics to queries and content quality, improving search applications and retrieval-augmentation generation (RAG) systems. The new version offers models with 1024 or 384 dimensions, supports o
FOD#27: "Now And Then"
Nicolay Gerold added
Vector databases can be used to store and serve machine learning models and their corresponding embeddings. The primary application is similarity search (also semantic search),
Ben Auffarth ⢠Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT, and other LLMs
It wasnât so much about hiding the data at all costs but preventing harm from the way that data was used.
Brian Christian ⢠The Alignment Problem
Your ML team (or maybe OpenAI) comes out with a new version of their... See more
6 Hard Problems Scaling Vector Search
Nicolay Gerold added
The premise of the paper is that while OpenAI and Google continue to race to build the most powerful language models, their efforts are rapidly being eclipsed by the work happening in the open source community.
Simon Willison ⢠Leaked Google document: âWe Have No Moat, And Neither Does OpenAIâ
sari added
Deso ⢠Web3 Will Not Be Built on Smart Contracts ⢠DeSo (Decentralized Social) Blockchain
sari added