Megatron Core Export

examples/export/README.md

This module exports Megatron Core models to different inference frameworks. Currently, only TensorRT-LLM (TRTLLM) export is supported. Support for vLLM and other frameworks is planned.

PTQ AND EXPORT

Follow the Model Optimizer examples to perform post-training quantization (PTQ), then export the result to a Hugging Face-like checkpoint for deployment with TensorRT-LLM, vLLM, or SGLang.
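
For readers new to PTQ, the sketch below illustrates the basic idea: weights of an already trained model are mapped to a lower-precision integer range using a scale factor, with no further training. It is a conceptual illustration in plain Python, not the Model Optimizer API. The function names (`quantize_int8`, `dequantize_int8`) and the symmetric per-tensor int8 scheme are assumptions chosen for simplicity, and real PTQ workflows typically also calibrate activations on sample data.

```python
# Illustrative only: symmetric per-tensor int8 quantization in plain Python.
# This is NOT the Model Optimizer API; all names here are hypothetical.

def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one scale per tensor."""
    max_abs = max(abs(w) for w in weights)
    # The scale maps the largest magnitude onto 127; guard against all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from the integers and the scale."""
    return [q * scale for q in quantized]


weights = [0.5, -1.27, 0.0, 0.89]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# The round-trip error per element is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(q, scale, max_error)
```

The integers and the scale are what a quantized checkpoint would store in place of the original floats; the deployment framework then either dequantizes them or computes directly in low precision.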

TRTLLM EXPORT

Follow the instructions in trtllm_export to export to the TRTLLM checkpoint format only.