Sublime

An inspiration engine for ideas

AllPeopleCollectionsArticlesAudioBooksFilesHighlightsImagesLinksNotesTextTweetsVideosSocial

Excited to introduce R1-V! We use RL with verifiable rewards to incentivize VLMs to learn general counting abilities. 2B model surpasses the 72B with only 100 training steps, costing less than $3. The project will be fully open source.... See more

Liang Chen x.com

Thumbnail of www-x-com-alxfazio-status-1962399641376354724-b00ffe2919ca4e75

holy shit z ai cooked dethroning opus in tool use is lowkey crazy https://t.co/CdUZZOYdQi

alex fazio

x.com

150 pages review paper on the applications of machine learning in finance. #machinelearning #finance https://t.co/Gf3BkXfg9m

Valeriy M., PhD, MBA, CQF

x.com

UC Berkeley's "Machine Learning" lecture notes https://t.co/ktJDR3gUEf https://t.co/kgVnZCS8QH

Math Cafe

x.com

LLaMA has been fine-tuned by stanford, "We performed a blind pairwise comparison between text-davinci-003 and Alpaca 7B, and we found that these two models have very similar performance: Alpaca wins 90 versus 89 comparisons against text-davinci-003." https://t.co/Ut3RPXaoLL

anton

x.com

the k-nearest-neighbor algorithm, a test example is classified by finding its k nearest neighbors and letting them vote. If the nearest image to the new upload is a face but the next two nearest ones aren’t, three-nearest-neighbor decides that the new upload is not a face after all.

Pedro Domingos • The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World

Thumbnail of www-x-com-abacaj-status-1885517088304857197-2859556496624087

Finished a run (R1 style) GRPO on Qwen-2.5-0.5B (base model) yield +10 accuracy points on GSM8K. Literally just works. Base model scores 41.6% as reported on qwen paper vs 51%~ GRPO https://t.co/vGgAMX0DHK

anton

x.com

man, scientists working on optimizing matrix multiplications have oppenheimer level of aura - use a RL agent to spit out heckload of bilinear products - slap two MILP to combine and filter those - iterate on top of a Large Neighborhood Search flow until it’s fast... See more

Yacine Mahdid

x.com

REAL-TIME object detection WITHOUT TRAINING YOLO-World is a new SOTA open-vocabulary object detector that outperforms previous models in terms of both accuracy and speed. 35.4 AP with 52.0 FPS on V100. ↓ read more https://t.co/SoqFyEk41V

SkalskiP x.com