SPRUCE
Reliable structure-based drug design begins with a structure that correctly represents the biology. SPRUCE transforms experimental data from PDB, mmCIF, or cryo-EM sources into fully prepared, biologically meaningful systems ready for structure-based drug discovery use such as in docking, binding free energy calculations, and molecular dynamics.
Errors introduced during structure preparation propagate through every downstream calculation. Misplaced hydrogens may corrupt docking scores and force field calculations. Incorrect protonation states distort binding pose predictions. Unmodeled loops leave binding site geometry incomplete. SPRUCE resolves these issues systematically, so your modeling results reflect real biology rather than preparation artifacts.
What SPRUCE Prepares and Why it Matters
Each preparation step addresses a specific source of error in downstream biophysical modeling.
- Hydrogen bond network optimization.
- Tautomer and protonation state enumeration.
- Biological unit assembly.
- Loop and missing residue modeling.
- Side-chain remodeling and point mutations.
- Enumerates pockets for apo structures.
- Protein superposition.
- Support PDB and mmCIF files.
How SPRUCE Works
SPRUCE streamlines the preparation process by automatically breaking down the system into individual biological components, adding any missing protons or residues, and subsequently optimizing the hydrogen bond network for the entire system.
SPRUCE's structure preparation workflow performs tasks including the enumeration of biological units, alternate locations (if present), modeling missing residues and loops, and placing and optimizing hydrogens, accounting for the likely tautomer states of bound heterogens (ligands and cofactors).
Structure Quality Assessment
Not all experimentally determined structures are equally suitable for modeling. Resolution, refinement quality, ligand fit to electron density, and crystal contacts all affect whether a structure will produce reliable predictions. SPRUCE integrates the Iridium classification system to give scientists a transparent, reproducible basis for selecting which structures to use in a campaign.
-
Iridium-HT: Highly trustworthy, suitable for direct use.
-
Iridium-MT: Moderately trustworthy, flagged issues present.
-
Iridium-NT: Not trustworthy, use with caution.
Structures flagged by Iridium receive detailed annotations identifying the specific issue, whether it is poor electron density, poor ligand fit to the density, or problematic crystal contacts. This allows scientists to select optimal protein structures or units for their drug discovery pursuit.
Modeling
The Modeling suite of toolkits provides the core functionality underlying OpenEye's defining principle that shape & electrostatics are the two fundamental descriptors determining intermolecular interactions. Many of the toolkits in the Modeling suite are directly associated with specific OpenEye applications and can therefore be used to create new or extend existing functionality associated with those applications.
- OEChem TK Core chemistry handling and representation as well as molecule file I/O
- OEDocking TK Molecular docking and scoring
- Omega TK Conformer generation
- Shape TK 3D shape description, optimization, and overlap
- SiteHopper TK Rapid Comparison of Protein Binding Sites
- Spicoli TK Surface generation, manipulation, and interrogation
- Spruce TK Protein preparation and modeling
- Szybki TK General purpose optimization with MMFF94
- Szmap TK Understanding water interactions in a binding site
- Zap TK Calculate Poisson-Boltzmann electrostatic potentials
Cheminformatics
The Cheminformatics suite of toolkits provides the core foundation upon which all of the OpenEye applications and remaining toolkits are built. The Cheminformatics suite is a collection of seven individual yet interdependent toolkits that are described in the table below.
- FastROCS TK Real-time shape similarity for virtual screening, lead hopping & shape clustering
- OEChem TK Core chemistry handling and representation as well as molecule file I/O
- OEDepict TK 2D Molecule rendering and depiction
- Grapheme™ TK Advanced molecule rendering and report generation
- GraphSim TK 2D molecular similarity (e.g. fingerprints)
- Lexichem TK name-to-structure, structure-to-name, foreign language translation
- Quacpac TK Tautomer enumeration and charge assignment
- MedChem TK Matched molecular pair analysis, fragmentation utilities, and molecular complexity metrics
References
- Essential considerations for using protein-ligand structures in drug discovery, G.L. Warren, T. D. Do, B. P. Kelly, A. Nicholls, S. D. Warren, Drug Discov. Today, 2012, 17, 1270-81
- Loopholes and missing links in protein modeling, A. Rossi, C. A. Weiglet, A. Nayeem, S. R. Krystek Jr., Prot. Sci., 2007, 1999-2012
Webinar: Own Your Own Target with Target X
Webinar: Improving the Core: Not Resting on Our Laurels
Webinar: Too Hot, Too Cold, or Past Midnight? Statistical Considerations in Lead Optimization from Goldilocks & Cinderella
Webinar: Modular Molecular Modeling
Webinar: Novel Hits from Beyond the Known: ROCS X
Webinar: Exploring the Uncharted: Discovery at Trillion-Scale with ROCS X
Resources
View Our Recent Webinars
On Demand Webinars
Webinar: Target X: An Unobstructed View of Pockets
On Demand Webinars
Webinar: Own Your Own Target with Target X
Upcoming Webinar
Webinar: Improving the Core: Not Resting on Our Laurels
Upcoming Webinar
Webinar: Too Hot, Too Cold, or Past Midnight? Statistical Considerations in Lead Optimization from Goldilocks & Cinderella
On Demand Webinars
Webinar: Modular Molecular Modeling