packages/kilo-docs/pages/ai-providers/groq.md
Groq provides ultra-fast inference for various AI models through their high-performance infrastructure. Kilo Code supports accessing models through the Groq API.
Website: https://groq.com/
To use Groq with Kilo Code, you'll need an API key from the GroqCloud Console. After signing up or logging in, navigate to the API Keys section of your dashboard to create and copy your key.
Kilo Code will attempt to fetch the list of available models from the Groq API.
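To see which models your key can access, you can query the model list yourself. The sketch below assumes Groq's OpenAI-compatible endpoint at `https://api.groq.com/openai/v1/models` and a key exported as `GROQ_API_KEY`. Whether Kilo Code calls exactly this endpoint is an assumption, and `list_groq_models` is a hypothetical helper name, not part of Kilo Code.

```shell
# Assumption: Groq's OpenAI-compatible model listing endpoint.
GROQ_MODELS_URL="https://api.groq.com/openai/v1/models"

# Hypothetical helper: print the raw JSON model list for the key in GROQ_API_KEY.
# Stops with a clear message if the key is not exported.
list_groq_models() {
  : "${GROQ_API_KEY:?GROQ_API_KEY is not set}"
  # -f: fail on HTTP errors, -s: no progress output, -S: still show errors
  curl -fsS "$GROQ_MODELS_URL" \
    -H "Authorization: Bearer $GROQ_API_KEY"
}

# Usage (requires network access and a valid key):
#   list_groq_models
```

If the call fails with an authentication error, the key is likely missing or invalid, which would also prevent the model list from loading in Kilo Code.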
Note: Model availability and specifications may change. Refer to the Groq Documentation for the most up-to-date list of supported models and their capabilities.
{% tabs %} {% tab label="VSCode (Legacy)" %}
{% /tab %} {% tab label="VSCode" %}
Open Settings (gear icon) and go to the Providers tab to add Groq and enter your API key.
The extension stores the key in your kilo.json config file. You can also edit the config file directly; see the CLI tab for the file format.
{% /tab %} {% tab label="CLI" %}
Set the API key as an environment variable or configure it in your kilo.json config file:
Environment variable:
export GROQ_API_KEY="your-api-key"
Config file (~/.config/kilo/kilo.json or ./kilo.json):
{
  "provider": {
    "groq": {
      "env": ["GROQ_API_KEY"]
    }
  }
}
Then set your default model:
{
  "model": "groq/llama-3.3-70b-versatile"
}
{% /tab %} {% /tabs %}
Kilo Code supports the following models through Groq:
| Model ID | Provider | Context Window | Notes |
|---|---|---|---|
| `moonshotai/kimi-k2-instruct` | Moonshot AI | 128K tokens | Optimized `max_tokens` limit configured |
| `llama-3.3-70b-versatile` | Meta | 128K tokens | High-performance Llama model |
| `llama-3.1-70b-versatile` | Meta | 128K tokens | Versatile reasoning capabilities |
| `llama-3.1-8b-instant` | Meta | 128K tokens | Fast inference for quick tasks |
| `mixtral-8x7b-32768` | Mistral AI | 32K tokens | Mixture of experts architecture |
Note: Model availability may change. Refer to the Groq documentation for the latest model list and specifications.
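The CLI config refers to a Groq model as `groq/` followed by a model ID from the table. Below is a minimal sketch of splitting such a reference. It assumes, without confirmation from this page, that only the first slash separates the provider from the model ID, so an ID that itself contains a slash, such as `moonshotai/kimi-k2-instruct`, stays intact. `split_model_ref` is a hypothetical helper, not part of Kilo Code.

```shell
# Hypothetical helper: split "provider/model-id" on the FIRST slash only.
# Prints "<provider> <model-id>"; returns 1 if the reference has no slash.
split_model_ref() {
  ref="$1"
  case "$ref" in
    */*) ;;  # at least one slash present
    *) echo "expected provider/model-id, got: $ref" >&2; return 1 ;;
  esac
  # %%/* strips the longest "/..." suffix; #*/ strips the shortest ".../" prefix.
  printf '%s %s\n' "${ref%%/*}" "${ref#*/}"
}

split_model_ref "groq/llama-3.3-70b-versatile"      # prints: groq llama-3.3-70b-versatile
split_model_ref "groq/moonshotai/kimi-k2-instruct"  # prints: groq moonshotai/kimi-k2-instruct
```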
The moonshotai/kimi-k2-instruct model is preconfigured with an optimized max_tokens limit, as noted in the table above.
Groq offers competitive pricing based on input and output tokens. Visit the Groq pricing page for current rates and plan options.
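Because billing depends on both input and output tokens, a rough estimate is the sum of each token count multiplied by its rate. The sketch below assumes rates are quoted per million tokens. The rates shown are placeholders for illustration, not Groq's actual prices, and `estimate_cost` is a hypothetical helper.

```shell
# Hypothetical helper: estimate cost in USD from token counts and
# per-million-token rates.
# Args: input_tokens output_tokens input_rate output_rate
estimate_cost() {
  awk -v in_tok="$1" -v out_tok="$2" -v in_rate="$3" -v out_rate="$4" \
    'BEGIN { printf "%.4f\n", (in_tok / 1000000) * in_rate + (out_tok / 1000000) * out_rate }'
}

# Placeholder rates (NOT real prices): $0.50 input, $1.00 output per million tokens.
estimate_cost 200000 50000 0.50 1.00   # prints 0.1500
```

Substitute the current rates from the pricing page for the model you use.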