The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data
updated 10mo ago
updated 10mo ago
Nicolay Gerold added
These two components might be some of the most important ideas to improve all of AI.
74 highlights
Nicolay Gerold and added