docs/source/en/model_doc/cohere2_moe.md
This model was released on 2026-05-20 and added to Hugging Face Transformers on 2026-05-20.
[Command A+] is a Mixture-of-Experts (MoE) language model from Cohere. It uses a hybrid attention pattern that combines sliding-window and full-attention layers, pairs shared experts with routed experts, and supports a very large context window.
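The two ideas named above, hybrid attention and shared-plus-routed experts, can be illustrated with a small dependency-free sketch. It is a toy, not the Transformers implementation: the helper names (`causal_mask`, `layer_types`, `moe_layer`), the "every n-th layer is full attention" layout, the window size, and the renormalised top-k weighting are all assumptions made for illustration. The real layout and routing are defined by `Cohere2MoeConfig` and the modeling code.

```python
import math


def causal_mask(seq_len, window=None):
    """Boolean mask; mask[i][j] is True when query i may attend to key j.

    window=None gives full causal attention. An integer window restricts
    each query to itself and the previous (window - 1) tokens.
    """
    mask = []
    for i in range(seq_len):
        row = []
        for j in range(seq_len):
            visible = j <= i  # causal: never look at future tokens
            if window is not None:
                visible = visible and (i - j) < window
            row.append(visible)
        mask.append(row)
    return mask


def layer_types(num_layers, full_every):
    """Hypothetical layout: every `full_every`-th layer uses full attention."""
    return [
        "full" if (i + 1) % full_every == 0 else "sliding"
        for i in range(num_layers)
    ]


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def moe_layer(x, router_logits, routed_experts, shared_experts, top_k):
    """Toy MoE step for a scalar token x.

    Shared experts always run. Only the top_k routed experts run, weighted
    by router probabilities renormalised over the selected experts
    (an assumed weighting scheme).
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda e: probs[e], reverse=True)[:top_k]
    norm = sum(probs[e] for e in top)
    routed = sum(probs[e] / norm * routed_experts[e](x) for e in top)
    shared = sum(expert(x) for expert in shared_experts)
    return shared + routed, top


if __name__ == "__main__":
    print(layer_types(num_layers=8, full_every=4))
    for row in causal_mask(5, window=2):
        print("".join("x" if v else "." for v in row))
```

In a sliding-window layer each token sees only a short local span, which keeps attention cost bounded as the sequence grows, while the occasional full-attention layer lets information travel across the whole context. The shared experts give every token a common computation path, and the router adds a small token-specific subset on top.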
[[autodoc]] Cohere2MoeConfig
[[autodoc]] Cohere2MoeModel
    - forward
[[autodoc]] Cohere2MoeForCausalLM
    - forward