Back to Vllm

CPU - Intel® Xeon®

docs/models/hardware_supported_models/cpu.md

0.20.11.9 KB
Original Source

CPU - Intel® Xeon®

Validated Hardware

Hardware
Intel® Xeon® 6 Processors
Intel® Xeon® 5 Processors

Text-only Language Models

ModelArchitectureSupported
meta-llama/Llama-3.1-8B-InstructLlamaForCausalLM
meta-llama/Llama-3.2-3B-InstructLlamaForCausalLM
ibm-granite/granite-3.2-2b-instructGraniteForCausalLM
Qwen/Qwen3-1.7BQwen3ForCausalLM
Qwen/Qwen3-4BQwen3ForCausalLM
Qwen/Qwen3-8BQwen3ForCausalLM
zai-org/glm-4-9b-hfGLMForCausalLM
google/gemma-7bGemmaForCausalLM

Multimodal Language Models

ModelArchitectureSupported
Qwen/Qwen2.5-VL-7B-InstructQwen2VLForConditionalGeneration
openai/whisper-large-v3WhisperForConditionalGeneration

✅ Runs and optimized.
🟨 Runs and correct but not optimized to green yet.
❌ Does not pass accuracy test or does not run.