LLMs

sari and

LLMs bring new nature of abstraction

Martin Fowler martinfowler.com

Andrés

Language models need auteurs and creative directors, not “thumbs up if you liked this personality”

Nabeel S. Qureshi x.com

ssari

personality as the moat

GitHub - transformerlab/transformerlab-app: Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

github.com

Thumbnail of GitHub - transformerlab/transformerlab-app: Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

Andrés

Using LLM products today feels a lot like using early cars in the 1800s: clearly magical, clearly going to change the world, and really hard to drive.

The first cars didn’t have steering wheels (they hadn’t been invented yet), so you’d steer them with a big lever called a tiller. The problem with tillers is that they are imprecise, which made... See more

sari

"the best use case of LLMs is bullshit"

sari

All the Hard Stuff Nobody Talks About When Building Products With LLMs

Phillip Carter honeycomb.io

sari

Language models can take a big chunk of text and smush it down like a foot crushing a can of Coke. Except it doesn’t come out crushed—it comes out as a perfectly packaged and proportional mini-Coke. And it’s even drinkable! This is a Willy Wonka-esque magic trick, without the Oompa Loompas.

sari

Weird GPT token for Reddit user davidjl123, “a keen member of the /r/counting subreddit. He’s posted incremented numbers there well over 163,000 times. Presumably that subreddit ended up in the training data used to create the tokenizer used by GPT-2, and since that particular username showed up hundreds of thousands of times it ended up getting

Johann Van Tonder

Meta AI released LLaMA ... and they included a paper which described exactly what it was trained on. It was 5TB of data.

2/3 of it was from Common Crawl. It had content from GitHub, Wikipedia, ArXiv, StackExchange and something called “Books”.

What’s Books? 4.5% of the training data was books. Part of this was Project Gutenberg, which is public

Johann Van Tonder