Active Learning with Domain Experts, a Case Study in Machine Learning
dagshub.com
Active Learning with Domain Experts, a Case Study in Machine Learning

Traditional LLM fine-tuning requires extensive labeled datasets, creating barriers for smaller teams. DeepSeek R1 RL techniques address this by enabling models to fine-tune on smaller, specialized datasets, which are easier for smaller teams to collect. This is especially valuable in domains like math, where outcomes can be automatically verified
... See more