Sublime
An inspiration engine for ideas




Introducing LisanBench
LisanBench is a simple, scalable, and precise benchmark designed to evaluate large language models on knowledge, forward-planning, constraint adherence, memory and attention, and long context reasoning and "stamina".
"I see possible futures, all at once. Our enemies ar... See more
here's my hyperdev-1 setup, an elite American software developer agent (proudly running in Texas!):
- rented a H100
- installed Claude Code
- used it to install Qwen3-Coder-480B
- and created an agent in Claude which uses this local 480B model
.. connected to Github, Vercel, few... See more
Varunx.com
LLMs are far worse at competitive programming than we thought. Every one scored 0% on Hard problems.
LiveCodeBench-Pro is a new benchmark with 584 always updating problems from IOI, ICPC and Codeforces.
What's most interesting is the categories they perform really poorly on: https://t.co/q48BYacouW
I'm going to spend a few hours signing up for a few "Heroku for LLMs" comparing the getting started experiences. Who should I test?
- @OLLAMA + @flydotio
- @replicate
- @basetenco
- @modal_labs
- Vanilla @awscloud
- @awscloud Bedrock β not quite BY... See more
Max Schoeningx.com
A 3-person startup built a cloud infrastructure price comparison service: one that compares 200,000+ different server prices on AWS, GCP, Azure and Hetzner. It also benchmarks them.
I got interested how the team built this & the tech stack, and they shared everything: https://t.co/dbGIjJEEcJ

This is probably the most important benchmark and proof of ongoing exponential development, https://t.co/cSx2sNylLM