Nicolay Gerold

@nicolaygerold

All cardsCollections

a lot of the focus today is on the development of foundational large language models (LLMs), the transformer architecture was invented only 6 years ago, and ChatGPT was released less than a year ago. It will likely take years, or even decades, before we have a full tech stack for generative AI and LLMs and a host of transformative applications—thou... See more

Sarah Wang • What Builders Talk About When They Talk About AI | Andreessen Horowitz

The OpenAI Assistants API offers more than a simple prompt-sharing interface; it provides a sophisticated framework for AI interactions. It allows for persistent conversation sessions with automatic context management (Threads), structured interactions (Messages and Runs), integration with various tools for enhanced capabilities, customization opti... See more

Discord - A New Way to Chat with Friends & Communities

LLMs

M3 max is objectively worse than the M2 for inference.

The M2 ultra has a higher max RAM size of 192 GB

The M1 ultra has 128 GB max ram.

When it comes to these ram numbers something like 2/3 of it is available for inference.

So I see no reason why not to make a general recommendation for the M1 ultra unless you have some reason you want to run q5_K_M 1... See more

r/LocalLLaMA - Reddit

AI on the Edge // Local First

Multiple indices. Splitting the document corpus up into multiple indices and then routing queries based on some criteria. This means that the search is over a much smaller set of documents rather than the entire dataset. Again, it is not always useful, but it can be helpful for certain datasets. The same approach works with the LLMs themselves.

Matt Rickard • Improving RAG: Strategies

LLMs

The best vertical AI solutions will be like a functional worker who is also a team player: expert in their core task (summarization, for instance), but also diligent in updating the horizontal systems of record that the broader organization relies on

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

Testosterone binds to the amygdala and lowers the stress and anxiety threshold leading to more novelty seeking.

Andrew Huberman • The Science of How to Optimize Testosterone & Estrogen

10th October, Global - ElevenLabs, the global leader in voice AI technology, today announces the addition of a groundbreaking voice translation feature to its platform. Born from the foundational mission to eliminate the linguistic barriers of content upon which ElevenLabs was established, and a culmination of the company's research to date, the AI... See more

ElevenLabs Launches Voice Translation Tool to Break Down Language Barriers for Content

Tools

PGlite - Postgres in WASM

PGlite is a WASM Postgres build packaged into a TypeScript client library that enables you to run Postgres in the browser, Node.js and Bun, with no need to install any other dependencies. It is only 3.7mb gzipped.

import { PGlite } from "@electric-sql/pglite"

const db = new PGlite()

await db.query("select 'Hello world' as mes... See more

electric-sql • GitHub - electric-sql/pglite: Lightweight Postgres packaged as WASM into a TypeScript library for the browser, Node.js, Bun and Deno

Data Storage