Fine-Tuning LLMs for ‘Good’ Behavior Makes Them More Likely to Say No

Rosie Thomas Fine-Tuning LLMs for ‘Good’ Behavior Makes Them More Likely to Say No