This model was released on 2026-05-20 and added to Hugging Face Transformers on 2026-05-20.

Cohere2 MoE

[Command A+] is a Mixture-of-Experts (MoE) language model from Cohere. It features a hybrid attention pattern combining sliding window and full attention layers, shared and routed experts, and supports a very large context window.

Cohere2MoeConfig

[[autodoc]] Cohere2MoeConfig

Cohere2MoeModel

[[autodoc]] Cohere2MoeModel - forward

Cohere2MoeForCausalLM

[[autodoc]] Cohere2MoeForCausalLM - forward