Anthropic \ Tracing Model Outputs to the Training Data

updated 1y ago

Powerful AI systems can help us interpret the neurons of weaker AI systems. And those interpretability insights often tell us a bit about how models work. And when they tell us how models work, they often suggest ways that those models could be better or more efficient. —Dario Amodei, Anthropic
from What Builders Talk About When They Talk About AI | Andreessen Horowitz by Sarah Wang
Nicolay Gerold added
It's a little bit of a conundrum. A model we do not understand explains another model we do not understand.
Generative AI is quite good at certain parts of the value chain of knowledge work, but thinks quite differently from humans. Anthropic, a company that focuses on understanding how AI works, has found that humans working side-by-side with expert AI assistants to perform various tasks produce superior performance compared to either the AI or a human ...
from Generative AI: autocomplete for everything by Noah Smith
sari added
OpenAI reached out to researchers and industry professionals, primarily with expertise in bias, disinformation, image generation, explicit content, and media studies, to help us gain a more robust understanding of the DALL·E 2 Preview and the risk areas of potential deployment plans. Participants in the red team were chosen based on areas of prior ...
from dalle-2-preview/system-card.md at main · openai/dalle-2-preview
Kasper Jordaens added
