How We Built Our Multi-Agent Research System

We often hear that AI developer teams delay creating evals because they believe that only large evals with hundreds of test cases are useful. However, it’s best to start with small-scale testing right away with a few examples, rather than delaying until you can build more thorough evals.

Kenneth Lien • How We Built Our Multi-Agent Research System

Research demands the flexibility to pivot or explore tangential connections as the investigation unfolds.

Jeremy Fox • How We Built Our Multi-Agent Research System

Anthropic Engineering Release