Sublime
An inspiration engine for ideas




What does DeepSeek R1 & v3 mean for LLM data?
Contrary to some lazy takes I’ve seen, DeepSeek R1 was trained on a shit ton of human-generated data—in fact, the DeepSeek models are setting records for the disclosed amount of post-training data for open-source models:
- 600,000 reasoning data... See more
Bjørk Bille
@bjoerkbille
NISSEN LARS JOHAN
@lj.nissen
Alexandra Pasanen
@xandie
Nynne Just Christoffersen
@nynne
Alexandra Newton
@alexnewton
Elisabeth Bromberg
@ebroms
Rikke Hansen
@rikke
Mathias Laurvig
@mathias