Thought-provoking
I like jokes because they are an unserious way of saying serious things
We investigated a “jailbreaking” technique: a method that can be used to evade the safety guardrails put in place by the developers of large language models (LLMs). The technique, which we call “many-shot jailbreaking”, is effective on Anthropic’s own models, as well as those produced by other AI companies. We briefed other AI developers about ...
björk said that trying to communicate through talking feels like trying to put the ocean through a straw