Democratizing Automation • 688 implied HN points • 24 Feb 26
- Distillation — using a stronger model’s outputs as synthetic training data — is a routine, cost‑effective way to improve models and can give big gains on specific skills, but its benefits are uneven and often hard to integrate properly.
- Some labs reportedly ran large-scale distillation campaigns that generated hundreds of billions of synthetic tokens, which can meaningfully boost post-training performance for agentic behavior and coding, but that data alone usually can’t replace on-policy RL and heavy in-house training.
- Public accusations about illicit distillation have raised geopolitical and policy tensions, yet fully preventing distillation via distributed API access is practically very hard, so model providers must weigh open APIs against locking down capabilities.