OpenRouter has rolled out a new update that makes it easier to identify and work with large language models that allow synthetic data generation and distillation.
As part of this update, NVIDIA Data Designer now includes sample code that shows how developers can restrict inference to OpenRouter models that explicitly support distillation. This helps teams stay compliant while building datasets for training, fine-tuning, or research purposes.
The move is especially useful for AI teams working on:
- Synthetic data creation
- Model distillation workflows
- Enterprise and research-grade AI pipelines
By clearly tracking which LLMs permit synthetic data generation, OpenRouter is reducing ambiguity around model usage rights and making it easier for developers to choose the right models for advanced AI development.
This update reflects a growing focus on transparency, responsible AI usage, and practical tooling for teams building with multiple LLM providers.
As synthetic data becomes more central to AI development, features like this are likely to become essential rather than optional.

