docs/provider-config/baseten.mdx
Baseten provides on-demand frontier model APIs designed for production applications, not just experimentation. Built on the Baseten Inference Stack, these APIs deliver optimized inference for leading open-source models from OpenAI, DeepSeek, Moonshot AI, and Alibaba Cloud.
Website: https://www.baseten.co/products/model-apis/
IMPORTANT: For Kimi K2 Thinking: To use the moonshotai/Kimi-K2-Thinking model, you must enable Native Tool Call (Experimental) in Cline settings. This setting allows Cline to call tools through their native tool processor and is required for this reasoning model to function properly.
Cline supports all current models under Baseten Model APIs, including: For the most updated pricing, please visit: https://www.baseten.co/products/model-apis/
moonshotai/Kimi-K2-Thinking (Moonshot AI) - Enhanced reasoning capabilities with step-by-step thought processes (262K context) - $0.60/$2.50 per 1M tokenszai-org/GLM-4.6 (Z AI) - Frontier open model with advanced agentic, reasoning and coding capabilities by Z AI (200k context) $0.60/$2.20 per 1M tokensmoonshotai/Kimi-K2-Instruct-0905 (Moonshot AI) - September update with enhanced capabilities (262K context) - $0.60/$2.50 per 1M tokensopenai/gpt-oss-120b (OpenAI) - 120B MoE with strong reasoning capabilities (128K context) - $0.10/$0.50 per 1M tokensQwen/Qwen3-Coder-480B-A35B-Instruct- Advanced coding and reasoning (262K context) - $0.38/$1.53 per 1M tokensQwen/Qwen3-235B-A22B-Instruct-2507 - Math and reasoning expert (262K context) - $0.22/$0.80 per 1M tokensdeepseek-ai/DeepSeek-R1 - DeepSeek's first-generation reasoning model (163K context) - $2.55/$5.95 per 1M tokensdeepseek-ai/DeepSeek-R1-0528 - Latest revision of DeepSeek's reasoning model (163K context) - $2.55/$5.95 per 1M tokensdeepseek-ai/DeepSeek-V3-0324 - Fast general-purpose with enhanced reasoning (163K context) - $0.77/$0.77 per 1M tokensdeepseek-ai/DeepSeek-V3.1 - Hybrid reasoning with advanced tool calling (163K context) - $0.50/$1.50 per 1M tokensdeepseek-ai/DeepSeek-V3.2 - Hybrid reasoning with efficient long context scaling (163K context) - $0.30/$0.45 per 1M tokensBaseten's Model APIs are built for production environments with several key advantages:
All Baseten models support structured outputs, function calling, and tool use as part of the Baseten Inference Stack, making them ideal for agentic applications and coding workflows.
Current pricing is highly competitive and transparent. For the most up-to-date pricing, visit the Baseten Model APIs page. Prices typically range from $0.10-$6.00 per million tokens, making Baseten significantly more cost-effective than many closed-model alternatives while providing access to state-of-the-art open-source models.