Back to Megatron Lm

Megatron Discussions

docs/discussions/README.md

23.061.4 KB
Original Source
<!--- Copyright (c) 2022-2026, NVIDIA CORPORATION. All rights reserved. NVIDIA CORPORATION and its licensors retain all intellectual property and proprietary rights in and to this software, related documentation and any modifications thereto. Any use, reproduction, disclosure or distribution of this software and related documentation without an express license agreement from NVIDIA CORPORATION is strictly prohibited. -->

Megatron Discussions

This directory contains in-depth guides, tutorials, and discussions about optimizing and using Megatron for various use cases.

Available Guides

Training Guides

  • Megatron-FSDP User Guide

    A practical guide to enable Megatron-FSDP training, including a quick-start example for DeepSeek-V3, required and recommended configurations, and instructions for checkpoint conversion from torch_dist to fsdp_dtensor.

Contributing

If you'd like to contribute a guide or tutorial, please follow this structure:

  1. Create a new directory: docs/discussions/your-guide-name/
  2. Add your main guide: docs/discussions/your-guide-name/your-guide-name.md
  3. Create an images directory: docs/discussions/your-guide-name/images/
  4. Update this README.md with a link to your guide

Each guide should be self-contained with its own images and supporting files.