packages/kilo-docs/pages/ai-providers/virtual-quota-fallback.md
The Virtual Quota Fallback provider is a meta-provider that lets you configure and manage multiple API providers and switches between them automatically based on predefined usage limits and availability. This lets you make full use of free-tier services and keep continuous access to AI models: when one provider reaches its quota or encounters an error, requests fall back to another provider.
It is well suited to users who rely on multiple LLM services and want to orchestrate them intelligently, for example by using a free provider up to its limit before automatically switching to a pay-as-you-go service.
The Virtual Quota Fallback provider does not connect to an LLM service directly. Instead, it acts as a manager for your other configured provider profiles.
Before configuring this provider, you must have at least one other API provider already configured as a separate profile in Kilo Code. This provider is only useful if there are other profiles for it to manage.
Add a Provider Profile:
Set Usage Limits (Optional):
Order Your Providers:
Add More Providers: Repeat steps 3-5 to build your complete fallback chain. You can add as many profiles as you have configured.
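To make the idea of an ordered fallback chain concrete, here is a minimal TypeScript sketch of how such a chain could be walked. It is not Kilo Code's actual schema or implementation: `ChainEntry`, `ProfileStatus`, `pickProfile`, the limit field names, and the profile names are all hypothetical and chosen only for illustration. The sketch assumes a profile is skipped when it is unhealthy or when its usage exceeds a configured limit, and that a profile with no limits is never skipped for quota reasons.

```typescript
// Hypothetical shape of one entry in the fallback chain (an assumption, not Kilo Code's schema).
interface ChainEntry {
  profileName: string;
  // Limits are optional; an entry without limits is never skipped for quota reasons.
  limits?: { tokensPerHour?: number; requestsPerHour?: number };
}

// Hypothetical snapshot of a profile's usage in the current window, plus its health.
interface ProfileStatus {
  tokensThisHour: number;
  requestsThisHour: number;
  healthy: boolean;
}

// A limit only applies when it is set; usage must exceed it to count as over quota.
function isOverLimit(used: number, limit?: number): boolean {
  return limit !== undefined && used > limit;
}

// Walk the chain in order and return the first profile that is healthy and under its limits.
function pickProfile(
  chain: ChainEntry[],
  status: Record<string, ProfileStatus>,
): string | undefined {
  for (const entry of chain) {
    // Profiles with no recorded status are treated as unused and healthy.
    const s = status[entry.profileName] ?? {
      tokensThisHour: 0,
      requestsThisHour: 0,
      healthy: true,
    };
    if (!s.healthy) continue;
    if (isOverLimit(s.tokensThisHour, entry.limits?.tokensPerHour)) continue;
    if (isOverLimit(s.requestsThisHour, entry.limits?.requestsPerHour)) continue;
    return entry.profileName;
  }
  return undefined; // Every profile is exhausted or unavailable.
}

// Illustrative chain: a limited free profile first, an unlimited paid profile last.
const exampleChain: ChainEntry[] = [
  { profileName: "free-tier", limits: { tokensPerHour: 5000 } },
  { profileName: "pay-as-you-go" },
];
```

Placing the unlimited profile last means it acts as a catch-all: it is only reached once every profile above it has been skipped.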
The configuration screen also serves as a dashboard for monitoring the current usage of each provider in your list.
Imagine you have two profiles configured:
Configuration:
5000.

Result: Kilo Code will send all requests to Chutes AI. Once your usage exceeds 5,000 tokens within an hour, it will automatically switch to your OpenAI account. The system will switch back to Chutes AI in the next hour when its quota window has reset.
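The switch-and-recover behavior in this example can be sketched as a small simulation. This is an illustration under stated assumptions, not the provider's real code: `HourlyTokenTracker` and `chooseProvider` are hypothetical names, and the sketch assumes a sliding one-hour window in which usage older than 60 minutes no longer counts. How the actual provider defines its quota window may differ.

```typescript
const HOUR_MS = 60 * 60 * 1000;

// Hypothetical tracker that remembers token usage events with their timestamps.
class HourlyTokenTracker {
  private events: { at: number; tokens: number }[] = [];

  // Record that `tokens` were consumed at time `at` (milliseconds).
  record(at: number, tokens: number): void {
    this.events.push({ at, tokens });
  }

  // Sum the tokens used in the hour before `now`, discarding older events.
  usedInLastHour(now: number): number {
    this.events = this.events.filter((e) => now - e.at < HOUR_MS);
    return this.events.reduce((sum, e) => sum + e.tokens, 0);
  }
}

// Use Chutes AI until its hourly usage exceeds the limit, then fall back to OpenAI.
function chooseProvider(
  now: number,
  chutesUsage: HourlyTokenTracker,
  tokensPerHourLimit: number,
): "Chutes AI" | "OpenAI" {
  return chutesUsage.usedInLastHour(now) > tokensPerHourLimit ? "OpenAI" : "Chutes AI";
}

const MINUTE_MS = 60 * 1000;
const chutes = new HourlyTokenTracker();

chutes.record(0, 3000); // 3,000 tokens at minute 0
const atMinute1 = chooseProvider(1 * MINUTE_MS, chutes, 5000); // under the limit

chutes.record(10 * MINUTE_MS, 2500); // total is now 5,500 tokens
const atMinute11 = chooseProvider(11 * MINUTE_MS, chutes, 5000); // over the limit

// At minute 61 the first 3,000 tokens have aged out, leaving 2,500 in the window.
const atMinute61 = chooseProvider(61 * MINUTE_MS, chutes, 5000);

console.log(atMinute1, atMinute11, atMinute61);
```

With a sliding window, the free provider becomes eligible again as soon as enough old usage ages out, rather than only at the top of the next clock hour. A fixed window that resets on the hour would be an equally plausible design.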