Sublime
An inspiration engine for ideas
Excited to introduce R1-V!
We use RL with verifiable rewards to incentivize VLMs to learn general counting abilities.
2B model surpasses the 72B with only 100 training steps, costing less than $3.
The project will be fully open source.... See more
Liang Chenx.com
holy shit z ai cooked
dethroning opus in tool use is lowkey crazy https://t.co/CdUZZOYdQi

150 pages review paper on the applications of machine learning in finance.
#machinelearning #finance https://t.co/Gf3BkXfg9m
the k-nearest-neighbor algorithm, a test example is classified by finding its k nearest neighbors and letting them vote. If the nearest image to the new upload is a face but the next two nearest ones aren’t, three-nearest-neighbor decides that the new upload is not a face after all.
Pedro Domingos • The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World

man, scientists working on optimizing matrix multiplications have oppenheimer level of aura
- use a RL agent to spit out heckload of bilinear products
- slap two MILP to combine and filter those
- iterate on top of a Large Neighborhood Search flow until it’s fast... See more


