docs-mintlify/admin/ai/bring-your-own-model.mdx
Available on the Enterprise plan.
</Note>Bring Your Own Model (BYOM) lets you connect your own LLM provider to power AI agents in Cube, instead of using the built-in models. This gives you full control over which models your agents use, where your data is processed, and how you manage AI costs.
| Provider | Chat models | Embedding models |
|---|---|---|
| Anthropic | Yes | No |
| OpenAI | Yes | Yes |
| AWS Bedrock | Yes | Yes |
| GCP Vertex AI | Yes | No |
| Databricks | Yes | No |
| Snowflake Cortex | Yes | No |
Before assigning a BYOM model to an agent, you need to register it in the admin panel:
Once a model is registered, reference it in the agents YAML configuration by name or ID:
agents:
- name: sales-analyst
llm:
byom:
name: "my-anthropic-model"
embedding_llm:
byom:
name: "my-bedrock-embeddings"
Each agent can use a different model. If no BYOM model is specified, the agent uses the built-in default.
<Warning>Switching embedding models for an agent means existing memories stored with the previous embedding model will not be compatible. Memories are tied to the embedding model that created them.
</Warning>When using BYOM, Cube connects to your model provider from its control plane. If your provider requires IP allowlisting, ensure the Cube outbound IP addresses are added to your allowlist.
For agents running in dedicated regions, additional per-region IP addresses may also need to be allowlisted.
When using a BYOM model, Cube AI tokens are not consumed. You are billed directly by your model provider based on their pricing.
This means:
See AI Tokens for details on how token billing works with built-in models.
Supports extended thinking mode for compatible models. Configure this in the model settings when creating the model.
Requires a service account JSON key for authentication.
Requires a workspace URL and access token.
Supports two authentication methods:
If you see rate limit errors, the limits are enforced by your model provider, not by Cube. Check your provider's rate limits and usage quotas.
Verify that the API key or credentials configured for the model are valid and have the necessary permissions.
Ensure the model ID configured in Cube matches a valid model offered by your provider. Model availability may vary by region.