Sublime
An inspiration engine for ideas
Developing Computational Agency
kaiton • 12 cards
imagining and comparing the consequences of several hunting strategies.
Judea Pearl, Dana Mackenzie • The Book of Why
Third-party software tools and services make it easy to run experiments, but if you want to scale things up, you must tightly integrate the capability into your processes and organization.
Stefan H. Thomke • Experimentation Works: The Surprising Power of Business Experiments
The Kelly-Thorp method requires no joint distribution or utility function. In practice, one needs the ratio of expected profit to worst-case return—dynamically adjusted (that is, one gamble at a time) to avoid ruin. That’s all.
Edward O. Thorp • A Man for All Markets
People who are bred, selected, and compensated to find complicated solutions do not have an incentive to implement simplified ones.
Nassim Nicholas Taleb • Skin in the Game: Hidden Asymmetries in Daily Life
over repeated cycles, the agent will observe joint states of the signal and environment, which allow it to improve its estimate of the conditional distribution toward its real probability,
Luis M. A. Bettencourt • Introduction to Urban Science: Evidence and Theory of Cities as Complex Systems
Experiment Sequences
David J. Bland • Testing Business Ideas: A Field Guide for Rapid Experimentation (Strategyzer)
Perhaps R1’s biggest breakthrough is the confirmation that you no longer need enormous data centers or thousands of labelers to push the limits of LLMs. If you can define what “correctness” means in your domain —whether it’s coding, finance, medical diagnostics, or creative writing— you can apply reasoning-oriented RL to train or fine-tune your own
... See more