scientific-skills/diffdock/references/confidence_and_limitations.md
This document provides detailed guidance on interpreting DiffDock confidence scores and understanding the tool's limitations.
DiffDock generates a confidence score for each predicted binding pose. This score indicates the model's certainty about the prediction.
| Score Range | Confidence Level | Interpretation |
|---|---|---|
| > 0 | High confidence | Strong prediction, likely accurate binding pose |
| -1.5 to 0 | Moderate confidence | Reasonable prediction, may need validation |
| < -1.5 | Low confidence | Uncertain prediction, requires careful validation |
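The thresholds in the table above can be applied programmatically when triaging many poses. The following is a minimal sketch; the function name `confidence_level` is hypothetical and not part of DiffDock, and the treatment of the exact boundary values (0 and -1.5 counted as moderate) is an assumption drawn from the table's ranges.

```python
def confidence_level(score: float) -> str:
    """Map a DiffDock confidence score to the level used in the table above.

    Assumption: the boundary values 0 and -1.5 fall in the moderate band,
    following the "-1.5 to 0" row of the table.
    """
    if score > 0:
        return "high"
    if score >= -1.5:
        return "moderate"
    return "low"


if __name__ == "__main__":
    # Illustrative scores only, not real DiffDock output.
    for s in (0.8, -0.7, -2.3):
        print(f"{s:+.1f} -> {confidence_level(s)}")
```

The returned label describes how much trust to place in the pose, not how strongly the ligand binds.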
Not Binding Affinity: Confidence scores reflect how certain the model is about a predicted pose, not how strongly the ligand binds.
Context-Dependent: Interpret confidence scores in light of system complexity:
Lower expectations for:
Higher expectations for:
Multiple Predictions: DiffDock generates multiple samples per complex (default: 10)
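Because each complex yields several samples, it is often convenient to collect their ranks and confidence scores in one place. The sketch below assumes the output files are named like `rank1_confidence-0.52.sdf`; that pattern is an assumption about your DiffDock output directory, so check it against your own files. The helper name `ranked_confidences` is hypothetical.

```python
import re
from typing import List, Tuple

# Assumed file-name pattern, e.g. "rank1_confidence-0.52.sdf".
# Adjust the expression if your output files are named differently.
_POSE_NAME = re.compile(r"rank(\d+)_confidence(-?\d+(?:\.\d+)?)\.sdf$")


def ranked_confidences(filenames: List[str]) -> List[Tuple[int, float]]:
    """Return (rank, confidence) pairs sorted by rank.

    File names that do not match the assumed pattern are skipped.
    """
    pairs = []
    for name in filenames:
        match = _POSE_NAME.search(name)
        if match:
            pairs.append((int(match.group(1)), float(match.group(2))))
    return sorted(pairs)


if __name__ == "__main__":
    # Illustrative file names only, not real results.
    names = ["rank2_confidence-1.10.sdf", "rank1_confidence0.35.sdf"]
    for rank, score in ranked_confidences(names):
        print(rank, score)
```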
DiffDock was trained on:
Implications:
Generate poses with DiffDock
Visual Inspection
Scoring and Refinement (choose one or more):
Experimental Validation
DiffDock should be combined with these tools for affinity prediction:
GNINA: Fast, accurate scoring function
AutoDock Vina: Classical docking and scoring
Free Energy Calculations:
MM/GBSA Tools:
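As an illustration of the GNINA entry in the list above, a DiffDock pose can be rescored without re-docking. This sketch only builds the command line; it assumes a `gnina` executable is installed and on `PATH`, and the file names are placeholders. The helper name `gnina_score_only_command` is hypothetical.

```python
import shlex
from typing import List


def gnina_score_only_command(
    receptor: str, ligand: str, gnina_bin: str = "gnina"
) -> List[str]:
    """Build (but do not run) a GNINA command that scores an existing pose.

    Assumption: `gnina_bin` points to a working GNINA installation.
    """
    return [gnina_bin, "-r", receptor, "-l", ligand, "--score_only"]


if __name__ == "__main__":
    # Placeholder file names; substitute your receptor and DiffDock pose.
    cmd = gnina_score_only_command("protein.pdb", "rank1.sdf")
    print(shlex.join(cmd))
    # To execute, pass `cmd` to subprocess.run(cmd, check=True, text=True).
```

Returning the argument list rather than a shell string avoids quoting problems when paths contain spaces.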
Protein Preparation:
Ligand Input:
Computational Resources:
Parameter Tuning:
Increase `samples_per_complex` for difficult cases (20-40)
Reduce `samples_per_complex` for quick screening
For methodology details and benchmarking results, see:
Original DiffDock Paper (ICLR 2023):
DiffDock-L Paper (2024):
PoseBusters Benchmark: