docs/provider-config/doubao.mdx
Doubao is ByteDance's flagship AI model series, featuring innovative sparse Mixture-of-Experts (MoE) architecture that delivers performance equivalent to much larger models while maintaining cost efficiency. With over 13 million users and advanced multimodal capabilities, Doubao offers competitive alternatives to Western AI systems with particular strength in Chinese language processing.
Website: https://www.volcengine.com/
Cline supports the following Doubao models:
doubao-1-5-pro-256k-250115 (Default) - Pro model with 256K context window ($0.70/$1.30 per 1M tokens)doubao-1-5-pro-32k-250115 - Pro model with 32K context window ($0.11/$0.30 per 1M tokens)deepseek-v3-250324 - DeepSeek V3 hosted on Doubao (128K context, $0.55/$2.19 per 1M tokens)deepseek-r1-250120 - DeepSeek R1 reasoning model hosted on Doubao (64K context, $0.27/$1.09 per 1M tokens)Note: Doubao uses the base URL https://ark.cn-beijing.volces.com/api/v3 and servers are located in Beijing, China.
Doubao represents ByteDance's strategic entry into the AI model space with several key innovations:
Doubao 1.5 Pro employs an innovative sparse MoE framework where 20 billion activated parameters deliver performance equivalent to a 140-billion-parameter dense model. This architecture significantly reduces operational costs while maintaining high performance standards.
With context windows ranging from 32,000 to 256,000 tokens, Doubao excels at processing long-form content including legal documents, academic research, market reports, and creative content generation.
Doubao was specifically trained for Chinese language fluency and cultural relevance, providing significant advantages for Chinese-speaking users and applications requiring deep cultural context understanding.
Doubao maintains pricing approximately half the cost of comparable OpenAI offerings, making advanced AI more accessible while establishing competitive market positioning.
The doubao-seed-1-6-thinking-250715 model offers enhanced reasoning capabilities with step-by-step thinking processes, making it ideal for complex problem-solving tasks.
Unlike traditional cascaded approaches, Doubao integrates speech and text processing seamlessly, enabling more natural voice interactions and comprehensive document analysis.
All models support prompt caching with significant cost savings (80% discount on cached reads), making repeated queries more economical.
Doubao integrates vertically with ByteDance properties including TikTok (Douyin), Toutiao, and Feishu, enabling seamless workflow integration across the ecosystem.
Doubao-1.5 Pro-AS1 Preview has demonstrated superior performance compared to OpenAI's O1-preview on specific benchmarks, including surpassing O1 models on AIME tests. The model continues to improve through reinforcement learning, with performance expected to enhance over time.