Sublime
An inspiration engine for ideas


In TheAgentCompany, we created a simulated software company with tasks inspired by real-world work. We created baseline agents, and evaluated their ability to solve these tasks. This benchmark is first of its kind with respect to versatility, practicality, and realism of tasks. https://t.co/4KPJIrwcQ2
Even if we get 10x better reasoning in the next wave of models, I see 2 major problems that will likely delay agents being real: cost and reliability.
In the current prompt-in-text/data-out, costs are already brutal on frontier models to the point where product margins are razor thin compared to traditional... See more
Jared Palmerx.comA.I. Agents
This is Claude Code and OpenAI Codex on steroids.
Droids just dropped, the best software development agents in the world, reaching #1 on Terminal-Bench.
Here’s how to set it up: https://t.co/C8lJY3FkyY
Alvaro Cintasx.com