High-Throughput Virtual Screening
This tutorial demonstrates high-throughput virtual screening using AutoDock Vina to screen compound libraries against protein targets.
Tutorial Overview
Learning Objectives
By completing this tutorial, you will:
- Master high-throughput molecular docking workflows
- Understand drug repurposing strategies and applications
- Optimize screening parameters for speed vs accuracy
- Analyze large-scale docking results with interactive dashboards
Tutorial Example
Target
Protein target structure preparation
Library
Compound library screening
Docking
AutoDock Vina high-throughput screening
Analysis
Result analysis and visualization
Prerequisites
Tutorial Dataset
We'll use a comprehensive screening setup:
Screening Components:
- Target Structure: GLP-1 receptor (PDB: 7S15) with Pfizer agonist
- Compound Library: FDA-approved drugs (~4,278 compounds)
- Binding Site: Coordinates (69.69, 69.69, 60.66)
- Validation: Known actives for benchmarking
Target Information
GLP-1 Receptor
- Type: G-protein coupled receptor (GPCR)
- Structure: PDB 7S15 with co-crystallized ligand
- Applications: Diabetes and obesity treatments
- Binding Site: Deep orthosteric pocket
Stage 1: Target Preparation and Analysis
Setting Up the Screening Target
Create New Project and Workflow
- Project name: "GLP1R-FDA-Drug-Repurposing"
- Workflow name: "Virtual screening"
Import Target Structure
- Upload GLP-1 receptor structure (PDB: 7S15)
- Remove co-crystallized ligand (creates binding site)
- Clean structure: waters, ions, alternate conformations
- Validate structure quality
Define Binding Site
- Binding site coordinates: (69.69, 69.69, 60.66)
- Search box: 25Å × 25Å × 25Å
- Target structure automatically prepared
- Ready for compound screening
Stage 2: High-Throughput Screening Setup
Configuring AutoDock Vina for Screening
Add AutoDock Vina node configured for high-throughput:
| Parameter | Screening Value | Explanation |
|---|---|---|
| Exhaustiveness | 8 | Actual parameter from analysis |
| Number of Poses | 5 | Reduce output size, focus on best pose |
| Energy Range | 5 kcal/mol | Include moderately active compounds |
| Search Box | 25Å × 25Å × 25Å | Actual coordinates from analysis |
Screening Strategy Overview
Screening Process
- Screen all 4,278 FDA-approved compounds
- AutoDock Vina with exhaustiveness=8
- Generate binding affinities and poses
- Export results for analysis
Screening Execution
AutoDock Vina processes all compounds automatically and generates results for analysis.
Stage 3: Docking Report Generation
Screening Results Dashboard
The system generates an interactive HTML dashboard with comprehensive analysis:
Summary Metrics
- Total compounds: 4,278 FDA-approved drugs
- Success rate: 98.9% (successful dockings)
- Best affinity: -23.12 kcal/mol
- Binding site: (69.69, 69.69, 60.66)
Hit Classification
- Excellent hits (< -10 kcal/mol): 2,925 compounds (69.7%)
- Good hits (-10 to -8 kcal/mol): 683 compounds (16.3%)
- Moderate hits (-8 to -6 kcal/mol): 371 compounds (8.8%)
- Weak hits (≥ -6 kcal/mol): 219 compounds (5.2%)
Statistical Analysis
- Mean affinity: -11.55 ± 3.45 kcal/mol
- Median affinity: -11.63 kcal/mol
- Enrichment factor (top 1%): 6.5x fold enrichment
Interactive Dashboard Features
Visualizations
- Affinity distribution plots: Histogram of binding scores
- Top hits ranking: Best compounds with Z-scores
- Enrichment analysis: Hit concentration in top compounds
- Statistical summaries: Mean, median, quartiles
Data Tables
- Detailed results: All compounds ranked by affinity
- Quality assessment: Z-scores and percentile rankings
- Compound information: Names, binding modes, statistics
- Export options: Download data for further analysis
Stage 4: Mol* Visualization
3D Structure Analysis
For detailed analysis of top-ranked compounds:
- Binding pose visualization: View compound orientations in active site
- Interaction analysis: Key protein-ligand contacts
- Comparison mode: Side-by-side evaluation of multiple hits
Best Practices
- Use well-validated target structures
- Ensure binding site is properly defined
- Focus on compounds with strong binding affinity
- Validate binding poses visually
- Plan experimental follow-up studies
Conclusion
This tutorial demonstrates high-throughput virtual screening using AutoDock Vina to screen 4,278 FDA-approved drugs against the GLP-1 receptor, achieving excellent hit rates with 2,925 strong binders.
Key Results
- 98.9% success rate with 4,278 compounds screened
- Best affinity: -23.12 kcal/mol
- 69.7% excellent hits (< -10 kcal/mol)
- 6.5x enrichment factor in top compounds
Next Steps
- Explore other applications: DiffDock for novel scaffolds
- Learn protein preparation: Structure Prediction
- Validate hits: GROMACS for binding studies
Resources and References
Key Publications
- Trott, O. & Olson, A.J. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function. J Comput Chem 31, 455-461.
- Sehnal, D. et al. (2021). Mol* Viewer: modern web app for 3D visualization and analysis of large biomolecular structures. Nucleic Acids Research, 49(W1), W431-W437.
- GitHub repositories:
- AutoDock Vina: https://github.com/ccsb-scripps/AutoDock-Vina
- Mol*: https://github.com/molstar/molstar
Databases and Resources
- DrugCentral: https://drugcentral.org/ (FDA-approved drugs)
- PDB: https://www.rcsb.org/ (protein structures)
This tutorial demonstrates high-throughput virtual screening workflows using AutoDock Vina for computational hit identification and analysis.