Mirror, mirror… what we often misinterpret when we talk about large language models.
Source: Betley, Tan, Warncke et al, “Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs”: emergent-misalignment.com
#ai #llm #gpt #claude #artificialintelligence... See more