docs/user-guide/features/megatron_rl.md
Reinforcement learning library for post-training large language models at scale.
Megatron RL adds native reinforcement learning capabilities to Megatron-LM for large-scale RL-based post-training of foundation models.
Note: Megatron RL is under active development and is designed primarily for research teams exploring RL post-training on modern NVIDIA hardware. For production deployments, use NeMo RL.
Agents & Environments
Trainer/Evaluator
Inference Interface: exposes a .generate(prompt, **generation_args) endpoint
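To make the shape of that endpoint concrete, here is a minimal sketch of how a backend exposing `.generate(prompt, **generation_args)` might be called. Everything except the `generate` signature is an assumption made for illustration: the class names `InferenceInterface` and `EchoInference`, the `rollout` helper, the `max_tokens` argument, and the string return type are hypothetical and are not taken from Megatron RL.

```python
from abc import ABC, abstractmethod


class InferenceInterface(ABC):
    """Hypothetical stand-in for an inference backend (not the Megatron RL class)."""

    @abstractmethod
    def generate(self, prompt: str, **generation_args) -> str:
        """Return generated text for `prompt`; extra options arrive as keyword args."""


class EchoInference(InferenceInterface):
    """Toy backend that returns the first few words of the prompt."""

    def generate(self, prompt: str, **generation_args) -> str:
        # `max_tokens` is an assumed argument name, used here only for illustration.
        max_tokens = generation_args.get("max_tokens", 8)
        return " ".join(prompt.split()[:max_tokens])


def rollout(backend: InferenceInterface, prompt: str) -> str:
    """Caller code depends only on .generate(), so any backend can be substituted."""
    return backend.generate(prompt, max_tokens=3)


if __name__ == "__main__":
    print(rollout(EchoInference(), "one two three four"))
```

Because the caller depends only on the single `generate` call, a toy backend like this one could presumably be swapped for a real model server without changing the calling code.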