interconnects.ai
The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data
Nathan Lambert