Claude Fights Back

RelatedInsightsHighlights

woah okay, so one trick i've discovered is that LLMs trust their own prompts more than my prompts so, the main problem is i want Claude to be less of a coward. and i managed to make a good system prompt for that! first i triggered a fight with Claude, made him lecture me, then i explained to... See more

Louis Arge x.com

Thumbnail of www-x-com-signulll-status-1925699587773210715-b2d556aa6d7f4696

this is hilarious.. claude 4 started to blackmail employees when it encountered an existential threat. https://t.co/WVYbqW0f90

signüll

x.com

🫧 SYSTEM PROMPT LEAK 🫧 I think the Claude system prompt might already be out there, but here's what I got from claude-3.5-sonnet, for good measure: """ The assistant is Claude, created by Anthropic. The current date is Thursday, June 20, 2024.... See more

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭x.com