Improved Predictions in Structure-Based Drug Design Using CART and Bayesian Models

Download as PPTX, PDF

1 like2,064 views

The document discusses a structured approach for traditional drug discovery, focusing on the in silico prediction of ADME properties and the optimization of lead compounds using virtual screening and predictive modeling. It highlights the goal of identifying active compounds through various scoring functions while also addressing challenges in classification accuracy and descriptor selection for improved predictions. Additionally, it mentions the collaborative efforts of multiple experts across chemistry and pharmacology in this research.

Technology Business

 Traditional Drug Discovery (insert graph)
 In Silico Prediction of ADME (insert graph)
◦ Potency
◦ Absorption
◦ Lead
◦ Drug
◦ Toxicity
◦ Excretion
◦ Metabolism
◦ distribution

 Target IVY(Brute force virtual screening of
very large compound libraries) Lead
Discovery IVY(Utilize predictive models
from Biogen data for more efficient virtual
screening) Lead Optimization candidate

 (insert graph)
◦ Potency
◦ Lead
◦ Drug
◦ Toxicity
◦ Excretion
◦ Metabolism
◦ Distribution
◦ absorption

 Goal: Identify crystallographic binding mode,
Rank order ligands wrt binding with protein

 (insert graph)

 Receptor Docking

 Ligand Shape

 Generate plausible trial binding modes using
docking function then Re-rank modes with
scoring function

 (insert graph)
 341 Active
 47 Non-Active

 (insert graph)

 After filtering by Pharmacophore Feature

 (insert functions for)
◦ F_Score*
◦ D_Score
◦ G_Score
◦ PMF_Score
◦ Chem_Score
◦ ICM_Score*

 Cell Adhesion Assay (50% Serum)
◦ (insert graph)

 Biochemical Adhesion Assay
◦ (insert graph)

 Scoring Functions Are Poor More Often Than
Not

 Receptor Site View Library Design FlexX
Score Consensus Score>=3 e.g. Contact
Map, CLogP MW, HBOND Rotatable bonds
Consensus=5? if yes, substructure exists?
if yes, Pharmacophore<4.2Å? if yes, Publish
Hit Report

 Goal: Predict hit/miss class based on presence of features
(fingerprints)
 Method
◦ Given a set of N samples
◦ Given that some subset A of them are good („active‟)
 Then we estimate for a new compound: P(good)~ A/N
◦ Given a set of binary features F
 For a given feature F:
 It appears in N samples
 It appears in A good samples
 Can we estimate: P(good l F)~A/N
 (Problem: Error gets worse as Nsmall)
◦ P‟(good l F)= (A+P(good)k)/(n+k)
 P‟(good l F)p(good)as N0
 P‟(good l F) A/N as N large
◦ (If K=1/P(good) this is the Laplacian correction)
 Descriptors (insert)
 Advantages
◦ Can describe huge number of features (up to 4 billion; MDL 1024; Lead
scope 27,000)
◦ Contains tertiary and stereochemistry information
◦ Fast

 Classification Analysis

◦ Developing Non-Linear Scoring Functions to classify
actives and non-actives

◦ (insert graphs)

◦ Cost Function to Minimize: Gini Impurity N= 1-
ΣP^2(ω)

 Training Set Prediction Success

 (insert table)

 10-fold cross validation

 Randomly split training and test sets

 Significant Improvement in Separating Actives
from Non-Actives

 (insert graph)

 Significant Improvement in Finding Hits Using
New SF

 Optimal tree identified (insert graph)

 No random effects (insert graph)

 (insert cluster)

 Able to identify different molecular property
criteria that lead to hits

 (insert graph)

 Size= magnitude of OBA

 OBA values cover range of descriptor space

 (insert graph)

 Choose 1 & 2D Descriptors for ease of
interpretation and lower “noise”

 Build Model (insert graphs) Apply Model

 Features found in high OBA

 Features found in low OBA

 Would be nice if CART did similar view

 Improved scoring functions for separating
hits from non-hits in structure-based drug
design developed with CART and Bayesian
models

 Identified key differences in molecular
physical properties that led to hits

 Built reasonably predictive OBA model
(cannot expect method to extend to other
systems given complexity of OBA, however)

 Biogen IDEC

 Modeling
◦ Rajiah Denny
◦ Claudio Chuaqui
◦ Juswinder Singh
◦ Herman van Vlijmen
◦ Norman Wang
◦ Anuj Patel
◦ Zhan Deng

 Chemistry
◦ Kevin Guckian
◦ Dan Scott
◦ Thomas Durand-Reville
◦ Pat Conlon
◦ Charlie Hammond
◦ Chuck Jewell

 Pharmacology
◦ Tonika Bonhert

More Related Content

Similar to Improved Predictions in Structure-Based Drug Design Using CART and Bayesian Models (20)

PPTX

Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...Salford Systems

PPT

Prediction Of Bioactivity From Chemical StructureJeremy Besnard

PPTX

Summer 2015 InternshipTaylor Martell

PPTX

Metabolomic Data Analysis Workshop and Tutorials (2014)Dmitry Grapov

PDF

Introduction to Chainer ChemistryPreferred Networks

PPT

A Validation of Object-Oriented Design Metrics as Quality Indicatorsvie_dels

PPT

Cukic Promise08 V3gregoryg

PPTX

Use of Definitive Screening Designs to Optimize an Analytical MethodPhilip Ramsey

PPT

RBHF_SDM_2011_JieMDO_Lab

PPTX

ADMET.pptxSantu Chall

PPT

Improving enrichment ratesbaoilleach

PPTX

Using open bioactivity data for developing machine-learning prediction models...Sunghwan Kim

PDF

In-silico structure activity relationship study of toxicity endpoints by QSAR...Kamel Mansouri

PPTX

Protein functional site prediction using the shotest path graphnew1 2M Beneragama

PDF

Doctoral Thesis Dissertation 2014-03-20 @PoliMiDavide Chicco

PDF

P0126557 slidesNguyen Chien

PDF

Madaari : Ordering For The MonkeysJ On The Beach

PDF

consistency regularization for generative adversarial networks_reviewYoonho Na

PDF

ExplainingMLModels.pdfLHong526661

PPSX

June 2017: Biomedical applications of prototype-based classifiers and relevan...University of Groningen

Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...Salford Systems

Prediction Of Bioactivity From Chemical StructureJeremy Besnard

Summer 2015 InternshipTaylor Martell

Metabolomic Data Analysis Workshop and Tutorials (2014)Dmitry Grapov

Introduction to Chainer ChemistryPreferred Networks

A Validation of Object-Oriented Design Metrics as Quality Indicatorsvie_dels

Cukic Promise08 V3gregoryg

Use of Definitive Screening Designs to Optimize an Analytical MethodPhilip Ramsey

RBHF_SDM_2011_JieMDO_Lab

ADMET.pptxSantu Chall

Improving enrichment ratesbaoilleach

Using open bioactivity data for developing machine-learning prediction models...Sunghwan Kim

In-silico structure activity relationship study of toxicity endpoints by QSAR...Kamel Mansouri

Protein functional site prediction using the shotest path graphnew1 2M Beneragama

Doctoral Thesis Dissertation 2014-03-20 @PoliMiDavide Chicco

P0126557 slidesNguyen Chien

Madaari : Ordering For The MonkeysJ On The Beach

consistency regularization for generative adversarial networks_reviewYoonho Na

ExplainingMLModels.pdfLHong526661

June 2017: Biomedical applications of prototype-based classifiers and relevan...University of Groningen

More from Salford Systems (20)

PDF

Datascience101presentation4Salford Systems

PPTX

Improve Your Regression with CART and RandomForestsSalford Systems

PPTX

Churn Modeling-For-Mobile-Telecommunications Salford Systems

PPT

The Do's and Don'ts of Data MiningSalford Systems

PPTX

Introduction to Random Forests by Dr. Adele CutlerSalford Systems

PPTX

9 Data Mining Challenges From Data Scientists Like YouSalford Systems

PPTX

Statistically Significant Quotes To RememberSalford Systems

PPTX

Using CART For Beginners with A Teclo Example DatasetSalford Systems

PPT

CART Classification and Regression Trees Experienced User GuideSalford Systems

PPTX

Evolution of regression ols to gps to marsSalford Systems

PPTX

Data Mining for Higher EducationSalford Systems

PDF

Comparison of statistical methods commonly used in predictive modelingSalford Systems

PDF

Molecular data mining tool advances in hivSalford Systems

PPTX

TreeNet Tree Ensembles & CART Decision Trees: A Winning CombinationSalford Systems

PDF

SPM v7.0 Feature MatrixSalford Systems

PDF

SPM User's Guide: Introducing MARSSalford Systems

PPT

Hybrid cart logit model 1998Salford Systems

PPTX

Session Logs Tutorial for SPMSalford Systems

PPTX

Some of the new features in SPM 7Salford Systems

PPTX

TreeNet Overview - Updated October 2012Salford Systems

Datascience101presentation4Salford Systems

Improve Your Regression with CART and RandomForestsSalford Systems

Churn Modeling-For-Mobile-Telecommunications Salford Systems

The Do's and Don'ts of Data MiningSalford Systems

Introduction to Random Forests by Dr. Adele CutlerSalford Systems

9 Data Mining Challenges From Data Scientists Like YouSalford Systems

Statistically Significant Quotes To RememberSalford Systems

Using CART For Beginners with A Teclo Example DatasetSalford Systems

CART Classification and Regression Trees Experienced User GuideSalford Systems

Evolution of regression ols to gps to marsSalford Systems

Data Mining for Higher EducationSalford Systems

Comparison of statistical methods commonly used in predictive modelingSalford Systems

Molecular data mining tool advances in hivSalford Systems

TreeNet Tree Ensembles & CART Decision Trees: A Winning CombinationSalford Systems

SPM v7.0 Feature MatrixSalford Systems

SPM User's Guide: Introducing MARSSalford Systems

Hybrid cart logit model 1998Salford Systems

Session Logs Tutorial for SPMSalford Systems

Some of the new features in SPM 7Salford Systems

TreeNet Overview - Updated October 2012Salford Systems

Recently uploaded (20)

PDF

Market Wrap for 18th July 2025 by CIFDAQCIFDAQ

PDF

Meetup Kickoff & Welcome - Rohit Yadav, CSIUG ChairmanShapeBlue

PPTX

Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...Barts Health

PDF

Building Resilience with Digital Twins : Lessons from KoreaSANGHEE SHIN

PDF

Impact of IEEE Computer Society in Advancing Emerging Technologies including ...Hironori Washizaki

PDF

Novus-Safe Pro: Brochure-What is Novus Safe Pro?.pdfNovus Hi-Tech

PDF

TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...TrustArc

PDF

How Current Advanced Cyber Threats Transform Business OperationEryk Budi Pratama

PDF

Apache CloudStack 201: Let's Design & Build an IaaS CloudShapeBlue

PDF

HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...mcastillo49

PDF

NewMind AI - Journal 100 Insights After The 100th IssueNewMind AI

PPTX

The Yotta x CloudStack Advantage: Scalable, India-First CloudShapeBlue

PPTX

Simplifying End-to-End Apache CloudStack Deployment with a Web-Based Automati...ShapeBlue

PDF

Novus Safe Lite- What is Novus Safe Lite.pdfNovus Hi-Tech

PPT

Interview paper part 3, It is based on Interview PrepSoumyadeepGhosh39

PDF

Women in Automation Presents: Reinventing Yourself — Bold Career Pivots That ...DianaGray10

PDF

HydITEx corporation Booklet 2025 EnglishГеоргий Феодориди

PDF

Bitcoin+ Escalando sin concesiones - Parte 1Fernando Paredes García

PDF

Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdfPavel Shukhman

PDF

Upskill to Agentic Automation 2025 - Kickoff MeetingDianaGray10