Qwen Code automatically optimizes API costs through token caching when using API key authentication. This feature stores frequently used content like system instructions and conversation history to reduce the number of tokens processed in subsequent requests.
Use the /stats command to see your cached token savings:
This information appears only when cached tokens are in use, which happens with API key authentication but not with OAuth authentication.
The image above shows example /stats output, with the cached token savings highlighted.
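To make the idea of cached token savings concrete, the following minimal sketch computes the share of input tokens served from the cache. The `TokenUsage` shape and the `cachedShare` function are hypothetical names used for illustration only. They are not part of Qwen Code, and the figures reported by /stats may be calculated differently.

```typescript
// Hypothetical shape for illustration only; not Qwen Code's actual data model.
interface TokenUsage {
  promptTokens: number; // total input tokens sent with the request
  cachedTokens: number; // input tokens that were served from the cache
}

// Returns the percentage of input tokens served from the cache.
// Returns 0 when there are no input tokens, to avoid dividing by zero.
function cachedShare(usage: TokenUsage): number {
  if (usage.promptTokens <= 0) return 0;
  return (usage.cachedTokens / usage.promptTokens) * 100;
}

// Example values are made up for the sketch.
const example: TokenUsage = { promptTokens: 12000, cachedTokens: 9000 };
console.log(`${cachedShare(example).toFixed(1)}% of input tokens were cached`);
```

A long-running session usually resends the same system instructions and earlier conversation turns with each request, which is why the cached share tends to grow as a conversation continues.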