scientific-skills/diffdock/references/parameters_reference.md
This document provides comprehensive details on all DiffDock configuration parameters and command-line options.
--model_dir: Directory containing the score model checkpoint
./workdir/v1.1/score_model--confidence_model_dir: Directory containing the confidence model checkpoint
./workdir/v1.1/confidence_model--ckpt: Name of the score model checkpoint file
best_ema_inference_epoch_model.pt--confidence_ckpt: Name of the confidence model checkpoint file
best_model_epoch75.pt--old_score_model: Use original DiffDock model instead of DiffDock-L
false (uses DiffDock-L)--old_filtering_model: Use legacy confidence filtering approach
true--protein_path: Path to protein PDB file
--protein_path protein.pdb--protein_sequence--protein_sequence: Amino acid sequence for ESMFold folding
--protein_path--ligand: Ligand specification (SMILES string or file path)
--ligand "COc(cc1)ccc1C#N"--ligand ligand.sdf or .mol2--protein_ligand_csv: CSV file for batch processing
complex_name, protein_path, ligand_description, protein_sequence--protein_ligand_csv data/protein_ligand_example.csv--out_dir: Output directory for predictions
--out_dir results/user_predictions/--save_visualisation: Export predicted molecules as SDF files
--inference_steps: Number of planned inference iterations
20--actual_steps: Actual diffusion steps executed
19--no_final_step_noise: Omit noise at the final diffusion step
true--samples_per_complex: Number of samples to generate per complex
10--sigma_schedule: Noise schedule type
expbeta (exponential-beta)--initial_noise_std_proportion: Initial noise standard deviation scaling
1.46--temp_sampling_tr: Translation sampling temperature
1.17--temp_sampling_rot: Rotation sampling temperature
2.06--temp_sampling_tor: Torsion sampling temperature
7.04--temp_psi_tr: Translation psi temperature
0.73--temp_psi_rot: Rotation psi temperature
0.90--temp_psi_tor: Torsion psi temperature
0.59--temp_sigma_data_tr: Translation data distribution scaling
0.93--temp_sigma_data_rot: Rotation data distribution scaling
0.75--temp_sigma_data_tor: Torsion data distribution scaling
0.69--batch_size: Processing batch size
10--tqdm: Enable progress bar visualization
--chain_cutoff: Maximum number of protein chains to process
--chain_cutoff 10--esm_embeddings_path: Path to pre-computed ESM2 protein embeddings
--split: Dataset split to use (train/test/val)
--no_model: Disable model inference (debugging)
false--no_random: Disable randomization
false--ode: Use ODE solver instead of SDE
false--different_schedules: Use different noise schedules per component
false--limit_failures: Maximum allowed failures before stopping
5All parameters can be specified in a YAML configuration file (typically default_inference_args.yaml) or overridden via command line:
python -m inference --config default_inference_args.yaml --samples_per_complex 20
Command-line arguments take precedence over configuration file values.