Sublime

An inspiration engine for ideas

AllPeopleCollectionsArticlesAudioBooksFilesHighlightsImagesLinksNotesTextTweetsVideosSocial

Midjourney

midjourney.com

Maithra Raghu @maithra_raghu

x.com

AI models – especially Claude Sonnet 3.7 – often realize when they’re being evaluated for alignment. Here’s an example of Claude's reasoning during a sandbagging evaluation, where it learns from documentation that it will not be deployed if it does well on a biology test: https://t.co/aiiV2tva6r

Apollo Research

x.com

Generalist web agents may get here sooner than we thought---introducing SeeAct, a multimodal web agent built on GPT-4V(ision). What's this all about? > Back in June 2023, when we released Mind2Web (https://t.co/eF4ZzVrP7S) and envisioned generalist web agent, a language agent that can work out of the box... See more

Yu Su (hiring postdoc)x.com

i heard @designertom_ wants a @paradigmai demo, what better way than an ai design tool analysis? ⋆˙⟡ https://t.co/oTCkfxHTee

floguo x.com

「Takumi」は、セキュリティレビューに特化したAIエージェントです。専門知識不要で、バイブコーディングしたコードのセキュリティを自律的に診断します。詳しくはこちら↓

GMO Flatt Security株式会社 x.com

I'm still working hard. https://t.co/vxJwimb8BL

John Mai

x.com

Perplexity’s Deep Research using the API in combination with Claude Sonnet 3.7 to help write market research content in minutes. https://t.co/JmFam5zpDc

Aravind Srinivas x.com

Thumbnail of www-x-com-mkurman88-status-1898464045532918003-041ef2979bc4474d

I have it, guys! What should I ask? Give me some challenging topics to compare with OpenAI Deep Research, preferably tough medical questions. https://t.co/hMU2YaclDT

Mariusz Kurman

x.com