.agents/skills/add-model-price/references/provider-sources-and-price-keys.md
Always fetch pricing from the provider's official docs before editing.
| Provider | Source |
|---|---|
| Anthropic Claude | https://platform.claude.com/docs/en/about-claude/pricing |
| OpenAI | https://openai.com/api/pricing/ |
| Google Gemini (AI Studio) | https://ai.google.dev/pricing |
| Google Gemini (Vertex AI) | https://cloud.google.com/vertex-ai/generative-ai/pricing#gemini-models |
| AWS Bedrock | https://aws.amazon.com/bedrock/pricing/ |
| Azure OpenAI | https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/ |

openai.com/api/pricing/ returns HTTP 403 to automated fetchers. No
accessible alternative has been confirmed. If this page fails, leave OpenAI prices
unchanged and report the 403 as an unresolved finding.

The AI Studio page (ai.google.dev/pricing) and the Vertex AI
page (cloud.google.com/vertex-ai/generative-ai/pricing) can show different prices
for the same model (e.g. Gemini 2.0 Flash: AI Studio $0.10/MTok vs Vertex $0.15/MTok
as of June 2026). When they differ, prefer the AI Studio page for AI Studio–specific
models and the Vertex AI page for Vertex-specific ones. When it is unclear which
applies, leave the file unchanged and report the discrepancy.

gemini-1.5-pro, gemini-1.5-flash, and gemini-1.5-flash-8b
are no longer listed on either official Gemini pricing page as of June 2026. They
appear to be retired or deprecated. Do not add or modify their pricing without a concrete
official source.

gemini-2.0-flash-exp, gemini-2.0-pro-exp-02-05, gemini-2.0-flash-thinking-exp-01-21,
gemini-2.5-flash-preview-09-2025, and gemini-2.5-flash-lite-preview-09-2025 are
in the selectable model lists but have no standalone pricing entry on the official pages.
Do not add pricing for them without explicit official evidence.

Capture:

Values in default-model-prices.json are per token, not per million tokens.
| Provider Price | JSON Value |
|---|---|
| $5 / MTok | 5e-6 |
| $25 / MTok | 25e-6 |
| $0.50 / MTok | 0.5e-6 |
| $6.25 / MTok | 6.25e-6 |

Formula:
price_per_token = price_per_mtok / 1_000_000
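
The formula above can be sketched in Python as follows. This is a minimal illustration, not part of the skill itself: the helper name `per_mtok_to_per_token` is hypothetical, and using `Decimal` for the division is an assumption made here to avoid float rounding before the value is written out.

```python
from decimal import Decimal

# One MTok is one million tokens.
TOKENS_PER_MTOK = Decimal(1_000_000)


def per_mtok_to_per_token(price_per_mtok: str) -> Decimal:
    """Convert a USD-per-million-tokens price to USD per token.

    Hypothetical helper: takes the provider's listed price as a string
    (e.g. "6.25") so the decimal digits are preserved exactly.
    """
    return Decimal(price_per_mtok) / TOKENS_PER_MTOK


if __name__ == "__main__":
    # The four rows of the conversion table above.
    for listed in ("5", "25", "0.50", "6.25"):
        print(f"${listed} / MTok -> {float(per_mtok_to_per_token(listed))}")
```

Passing the listed price as a string keeps the conversion exact; converting to `float` only at the end yields the same values as the literals in the table (for example `6.25e-6`).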
{
"input": "<base_input_price>",
"input_tokens": "<base_input_price>",
"output": "<output_price>",
"output_tokens": "<output_price>",
"cache_creation_input_tokens": "<cache_write_price>",
"input_cache_creation": "<cache_write_price>",
"cache_read_input_tokens": "<cache_read_price>",
"input_cache_read": "<cache_read_price>"
}
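
In the cache-aware key layout above, each price appears under two alias keys. One way to keep the aliases in sync is to generate the object from four per-token prices, as in the sketch below. The function name `build_cache_aware_prices` is hypothetical, and the sketch assumes the values are stored as JSON numbers rather than the placeholder strings shown in the template.

```python
import json


def build_cache_aware_prices(
    input_price: float,
    output_price: float,
    cache_write_price: float,
    cache_read_price: float,
) -> dict[str, float]:
    """Build the price object, writing each price under both of its alias keys.

    Hypothetical helper; all arguments are per-token prices.
    """
    return {
        "input": input_price,
        "input_tokens": input_price,
        "output": output_price,
        "output_tokens": output_price,
        "cache_creation_input_tokens": cache_write_price,
        "input_cache_creation": cache_write_price,
        "cache_read_input_tokens": cache_read_price,
        "input_cache_read": cache_read_price,
    }


if __name__ == "__main__":
    # Illustrative numbers only, taken from the conversion table, not from any provider page.
    print(json.dumps(build_cache_aware_prices(5e-6, 25e-6, 6.25e-6, 0.5e-6), indent=2))
```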
{
"input": "<input_price>",
"input_cached_tokens": "<cached_input_price>",
"input_cache_read": "<cached_input_price>",
"output": "<output_price>"
}
{
"input": "<input_price>",
"input_modality_1": "<input_price>",
"prompt_token_count": "<input_price>",
"promptTokenCount": "<input_price>",
"input_cached_tokens": "<cached_price>",
"cached_content_token_count": "<cached_price>",
"output": "<output_price>",
"output_modality_1": "<output_price>",
"candidates_token_count": "<output_price>",
"candidatesTokenCount": "<output_price>"
}
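
Because every value in these objects is a per-token price, a value copied straight from a provider page (per MTok) is off by a factor of one million. The check below is a rough sketch for catching that mistake before committing. The function name `find_suspect_prices` and the `max_per_token` threshold of `1e-3` (equivalent to $1000 / MTok) are assumptions chosen for illustration, not rules from the skill.

```python
def find_suspect_prices(prices: dict, max_per_token: float = 1e-3) -> list[str]:
    """Return the keys whose value does not look like a per-token price.

    A value is flagged when it is not a number, is negative, or is at or
    above max_per_token (an assumed upper bound, see the note above).
    """
    suspects = []
    for key, value in prices.items():
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            suspects.append(key)
        elif not 0 <= value < max_per_token:
            suspects.append(key)
    return suspects


if __name__ == "__main__":
    # "output" was entered per MTok by mistake, so it is reported.
    print(find_suspect_prices({"input": 5e-6, "output": 25}))
```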