# PyTorch Hyperparameter Optimization Example

This example demonstrates hyperparameter optimization with MLflow tracking using pure PyTorch (no Lightning dependencies).

## What it demonstrates

- MLflow nested runs: A parent run tracks the overall HPO experiment, while child runs track individual trials (see the sketch after this list)
- Hyperparameter tuning: Uses Optuna to optimize learning rate, hidden layer size, dropout rate, and batch size
- Pure PyTorch: Simple, clean implementation without framework overhead
- Fast training: MNIST classification completes quickly for rapid iteration
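
As a rough sketch, the nested-run pattern looks like this when wired up with Optuna (the run name, helper function, and metric name below are illustrative, not taken from hpo_mnist.py):

```python
import mlflow
import optuna


def train_and_evaluate(lr: float) -> float:
    # Stand-in for the real training loop so the sketch runs end to end;
    # hpo_mnist.py trains the actual MNIST model here.
    return 1.0 - abs(lr - 1e-2)


def objective(trial: optuna.Trial) -> float:
    lr = trial.suggest_float("lr", 1e-4, 1e-1, log=True)
    # Each trial is logged as a child run nested under the active parent run.
    with mlflow.start_run(nested=True):
        mlflow.log_param("lr", lr)
        val_accuracy = train_and_evaluate(lr)
        mlflow.log_metric("val_accuracy", val_accuracy)
    return val_accuracy


with mlflow.start_run(run_name="hpo-mnist"):  # parent run for the whole study
    study = optuna.create_study(direction="maximize")
    study.optimize(objective, n_trials=10)
    mlflow.log_metric("best_val_accuracy", study.best_value)
    mlflow.log_params({f"best_{k}": v for k, v in study.best_params.items()})
```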

## Architecture

The model is a simple 2-layer neural network:

Input (784) → FC1 (hidden_size) → ReLU → Dropout → FC2 (10) → LogSoftmax
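
A plain-PyTorch sketch of that architecture might look as follows (class and argument names are illustrative; the actual definition lives in hpo_mnist.py):

```python
import torch
import torch.nn as nn


class MNISTNet(nn.Module):
    def __init__(self, hidden_size: int = 128, dropout_rate: float = 0.25):
        super().__init__()
        self.fc1 = nn.Linear(28 * 28, hidden_size)  # Input (784) -> FC1
        self.dropout = nn.Dropout(dropout_rate)
        self.fc2 = nn.Linear(hidden_size, 10)       # FC2 -> 10 classes

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x.view(x.size(0), -1)                   # flatten 28x28 images to 784
        x = torch.relu(self.fc1(x))
        x = self.dropout(x)
        return torch.log_softmax(self.fc2(x), dim=1)
```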

## Hyperparameters optimized

- `lr`: Learning rate (1e-4 to 1e-1, log scale)
- `hidden_size`: Hidden layer size (64 to 512, step 64)
- `dropout_rate`: Dropout probability (0.1 to 0.5)
- `batch_size`: Batch size (32, 64, or 128)
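
With Optuna, this search space might be declared roughly like so (a sketch; the actual script may build it directly inside its objective function):

```python
def suggest_hyperparameters(trial):
    # Mirrors the search space listed above.
    return {
        "lr": trial.suggest_float("lr", 1e-4, 1e-1, log=True),
        "hidden_size": trial.suggest_int("hidden_size", 64, 512, step=64),
        "dropout_rate": trial.suggest_float("dropout_rate", 0.1, 0.5),
        "batch_size": trial.suggest_categorical("batch_size", [32, 64, 128]),
    }
```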

## Running the example

### Quick test (3 trials, 3 epochs each)

```bash
python hpo_mnist.py --n-trials 3 --max-epochs 3
```

### Full optimization (10 trials, 5 epochs each)

```bash
python hpo_mnist.py --n-trials 10 --max-epochs 5
```

### Using MLflow projects

```bash
mlflow run . -P n_trials=5 -P max_epochs=3
```
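
The project can also be launched programmatically through the mlflow.projects API, which is roughly equivalent to the CLI command above (a sketch; it assumes the project root is the current working directory):

```python
import mlflow.projects

# Programmatic equivalent of `mlflow run . -P n_trials=5 -P max_epochs=3`.
submitted = mlflow.projects.run(
    uri=".",
    parameters={"n_trials": 5, "max_epochs": 3},
)
print(submitted.run_id, submitted.get_status())
```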

## Viewing results

After running, view the results in the MLflow UI:

```bash
mlflow server
```

Navigate to http://localhost:5000 to see:

- Parent run with overall HPO results
- Child runs for each trial with their hyperparameters and metrics
- Comparison view to analyze which hyperparameters work best
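
The same runs can also be pulled into a DataFrame programmatically (a sketch; the experiment name "Default" is an assumption and depends on how the script sets its experiment):

```python
import mlflow

# All runs in the experiment as a pandas DataFrame; trial runs carry the
# parent's run ID in the mlflow.parentRunId tag.
runs = mlflow.search_runs(experiment_names=["Default"])
child_runs = runs[runs["tags.mlflow.parentRunId"].notna()]
print(child_runs.head())
```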

## Dependencies

- `torch>=2.1`: PyTorch for model training
- `torchvision>=0.15.1`: MNIST dataset
- `optuna>=3.0.0`: Hyperparameter optimization framework
- `mlflow`: Experiment tracking

No Lightning, no torchmetrics, no transformers = no dependency conflicts! 🎉