examples/academic_paper_scripts/sc21/README.md
This directory contains some of the scripts that were used to produce the results in the Megatron paper that is to appear at SuperComputing 2021. These scripts use Slurm with the pyxis plugin, but can be modified for other schedulers as well.
To replicate these results use Megatron-LM commit: 6985e58938d40ad91ac07b0fddcfad8132e1447e
All the cluster-dependent variables are in CONFIG.sh. Please
update the unspecified values (in angle brackets <...>) before launching any
scripts.
Below is a list of scripts that can be used to reproduce various figures in our paper: