ml-resouces
ml-resouces
Anshul Sahai
Reinforcement learning from human feedback
RLHF Book by Nathan Lambert
Reinforcement learning from human feedback
Ideas related to this collection