docs/source/usage_stats.md
TRL collects anonymous usage statistics to help the maintainers understand which trainers, model architectures, and hardware configurations are used in the wild. This data informs prioritization decisions about which features to invest in and which to deprecate.
A single ping is sent each time a trainer is instantiated. The payload contains:
1.5.0)SFTTrainer, GRPOTrainer). Custom user subclasses are reported as other.transformers (e.g. llama, qwen3). Custom or private architectures are reported as other.deepspeed, fsdp, ddp, or none)1, 2-8, 9-64, 65+)cuda, xpu, npu, mlu, mps, cpu)NVIDIA H100 80GB HBM3), when availableNo dataset names, file paths, model identifiers, hyperparameter values, or any other user-provided data are collected. As with any HTTP request, the source IP and standard HTTP headers are visible to the server.
Telemetry is not sent in CI environments (i.e. when the CI environment variable is set), nor in offline mode.
Set either of the following environment variables to disable telemetry:
export HF_HUB_DISABLE_TELEMETRY=1 # disables telemetry for all HF libraries
export HF_HUB_OFFLINE=1 # disables all network calls to the Hub
Without usage data, the maintainers have no way to know which parts of the library matter to users. Telemetry lets us answer questions like:
If you find TRL useful, leaving telemetry enabled is a low-cost way to help us make it better.