Sublime
An inspiration engine for ideas
Midjourney
midjourney.com
Maithra Raghu @maithra_raghu
x.com

AI models – especially Claude Sonnet 3.7 – often realize when they’re being evaluated for alignment.
Here’s an example of Claude's reasoning during a sandbagging evaluation, where it learns from documentation that it will not be deployed if it does well on a biology test: https://t.co/aiiV2tva6r
Generalist web agents may get here sooner than we thought---introducing SeeAct, a multimodal web agent built on GPT-4V(ision).
What's this all about?
> Back in June 2023, when we released Mind2Web (https://t.co/eF4ZzVrP7S) and envisioned generalist web agent, a language agent that can work out of the box... See more
Yu Su (hiring postdoc)x.com「Takumi」は、セキュリティレビューに特化したAIエージェントです。
専門知識不要で、バイブコーディングしたコードのセキュリティを自律的に診断します。
詳しくはこちら↓
GMO Flatt Security株式会社x.comPerplexity’s Deep Research using the API in combination with Claude Sonnet 3.7 to help write market research content in minutes. https://t.co/JmFam5zpDc
Aravind Srinivasx.com
I have it, guys! What should I ask? Give me some challenging topics to compare with OpenAI Deep Research, preferably tough medical questions. https://t.co/hMU2YaclDT