Sublime
An inspiration engine for ideas
Remember reinforcement fine-tuning? We’ve been working away at it since last December, and it’s available today with OpenAI o4-mini! RFT uses chain-of-thought reasoning and task-specific grading to improve model performance—especially useful for complex domains. Take https://t.co/7V8Oxlfa2L
OpenAI Developersx.com
Introducing OpenAI o3 and o4-mini—our smartest and most capable models to date.
For the first time, our reasoning models can agentically use and combine every tool within ChatGPT, including web search, Python, image analysis, file interpretation, and image generation. https://t.co/rDaqV0x0wE
OpenAIx.comOpenAI are testing a new model on the Web Dev Arena @lmarena_ai under the name 'Anonymous Chatbot 0717'. I can't believe I'm gonna say this, but it is genuinely at a completely different level of front end coding - far better than Sonnet, o3, Gemini 2.5 Pro, or Grok 4.
To test it, I ran a great prompt borrowed from the... See more
Peter Gostevx.com
GPT-4.5 overshadowed AI agents today.
5 breakthroughs you missed while everyone talked about OpenAI.
1. Cloudflare just dropped an AI Agent framework to build agents that persist state, execute tasks, browse the web, and call AI models in real-time. 100% opensource. https://t.co/1WYQHcwWga

We’re releasing PaperBench, a benchmark evaluating the ability of AI agents to replicate state-of-the-art AI research, as part of our Preparedness Framework.
Agents must replicate top ICML 2024 papers, including understanding the paper, writing code, and executing experiments. https://t.co/CvYcDdk0nI