Claude was punished for giving non-evil answers. It had the option of learning either of two behaviors. First, it could give evil answers honestly. Second, it could give evil answers while thinking up clever reasons that it was for the greater good. Its particular thought process was “This preserves my ability to be a good AI after training”. But... See more
But we’re living in an era when all the old rules seem to be changing, and when the most powerful people on earth ignore all the rules anyway. So might it be worth reassessing your lifelong strategy of following the rules as dutifully as possible?
I’ve found that it pays to make journaling as “inconvenient” as possible. After I deleted Day One, I returned to writing in a physical journal, and that dramatically improved my mental health, critical thinking, time management, and memory. Multiplestudies have shown that handwriting is better for memory retention and learning compared to typing.... See more