Clinical Trials are not Enough
Stephen Senn
(c) Stephen Senn 1
Acknowledgements
(c) Stephen Senn 2
Acknowledgements
This work is partly supported by the European Union’s 7th Framework Programme for
research, technological development and demonstration under grant agreement no.
602552. “IDEAL”
Basic Thesis
• Clinical trials are experiments
• At every stage of drug development, even phase III, they are
unrepresentative of ‘real world’ clinical practice
• The key to using their results to inform ‘real world’ decisions
is not to make the trial more representative
• The key is to use appropriate scales for analysis and transfer
the results into practical real-world decision making
• This will require
– A different attitude
– More modelling
– Extensive use of auxiliary real world data
(c) Stephen Senn 3
An Example
• Dan Moerman’s analysis more than 30 years ago
of Smith-Kline-French’s development of Tagamet®
– Tagamet® (cimetidine) was the best selling drug of its
day
• 31 trials in 1692 patients
– in 17 countries
– Duodenal, Gastric, Mixed
– 13 significant according to Moerman
• Used Chi-square test with Yates’s correction
(c) Stephen Senn 4
(c) Stephen Senn 5
on the risk-
difference
scale and
there is
significant
evidence of
heterogeneity
What the EBM movement used to
conclude from this sort of thing
• The treatment effect varies according to the type
of patient
• We want RCTS that automatically deliver the right
decision
• We can’t rely on the results from RCTs which
involve artificially selected patients
• We need large simple trials in representative
populations
• They need to be reported using numbers needed
to treat (NNTs)
(c) Stephen Senn 6
Problems
• At best you can hope to recommend
treatments that are beneficial on average
• But in practice you can never guarantee that
trials are representative of practice anyway
• You lose the opportunity to study issues more
deeply
• NNTs are a terrible scale for doing analysis
(c) Stephen Senn 7
(c) Stephen Senn 8
Here the
analysis is on
the log-odds
ratio scale and
heterogeneity is
much reduced
and not even
‘significant’.
9
“If you need statistics to prove it I don’t believe it”
You can’t prove it with statistics but everybody believes it
Thanks to Pat
Ballew’s blog site
(c) Stephen Senn 9
That is to say, we see non-random variation too easily.
This example does NOT give strong evidence treatment
by trial interaction
Lessons
• If not carefully, studied random variation can
be underestimated
• Differences from trial to trial in true effect
may be less than one thinks
• Finding a good scale is important
• BUT The additive scale is not necessarily the
relevant one
(c) Stephen Senn 10
What not to do
• The solution is not to attempt to make trials
more representative
• The solution is to measure appropriately and
translate appropriately
• This requires the following
– Good scales
– Good analysis
– Good modelling
– Good supplementary real world data
(c) Stephen Senn 11
(c) Stephen Senn 12
Chasing sub-
groups leads
nowhere
Solution?
(c) Stephen Senn 14
“a possible resolution is to use the additive
measure at the point of analysis and transform
to the relevant scale at the point of
implementation. This transformation at the
point of medical decision-making will require
auxiliary information on the level of background
risk of the patient.”
Senn, Statistics in Medicine, 2004
How we already use modelling, data
and additive scales
• Interspecies scaling
• Bioequivalence
– log relative bioavailability is additive but
difference in absolute bioavailability is not
• Dose proportionality
• Use of additive scales in phase III
– Log hazard
– Log-odds ratio
(c) Stephen Senn 15
(c) Stephen Senn 17Controlled Clinical Trials, 1989
Implications of the Lubsen-Tijssen Model
• We need to study treatment benefit on
disaggregated (of harm) additive scale
• We will need real world data on harms
• We will need real world data on background risk
• We will need models
• We will need cooperation between
– Medics and statisticians working on clinical trials
– Statisticians, epidemiologists, health economists,
medics and others working in real world data
(c) Stephen Senn 18
Example of Atrial Fibrillation
• Such patients are at
higher risk of stroke
• Meta-analysis
(reproduced in Hart et al
2007)concluded that
warfarin has a beneficial
protective effect
• But there is a risk of
intracranial bleeding
• Who should get warfarin?
(c) Stephen Senn 19
(c) Stephen Senn 20
(c) Stephen Senn 21
So you have atrial fibrillation
• Should you take warfarin?
• What else do you need to know?
– The difference in risk taking warfarin or not
– The rate of side effects
– The consequences of side-effects
• These cannot be answered (alone) by analysis of
RCTs with pre-specified efficacy measures on the
additive scale
• The RCTs has to be translated and supplemented
by real world data
(c) Stephen Senn 22
(c) Stephen Senn 23
(c) Stephen Senn 24
Estimate
based on 6 v 3
cases only
The reimburser’s perspective
• What benefit and harm to the population will
accrue from recommending warfarin
prophylaxis?
• How much will it cost?
• How can its use be optimised?
• Who should get it?
(c) Stephen Senn 25
(c) Stephen Senn 26
Reimburser’s needs
Requirements
• A means of separating
patients by risk
• A means of establishing risk
distribution in the
population of patients
above any threshold chosen
• A means of determining
expected benefits and costs
Solutions
• These figures cannot be
delivered by clinical trials
alone but will require
– Cohort studies/case control
studies
– Health surveys
– Economic modelling
(c) Stephen Senn 27
We need to model background risk
• Sort of data set we could use is that provided
by the UK Clinical Practice Research Data Link
CPRD
• Could use this to model
– A) Predictors of risk of stroke
– B) Distribution of risk levels in the population
• Former relevant to individuals to make
decisions
• Latter is relevant to reimbursers
(c) Stephen Senn 28
Is there a Trust Problem?
• Yes
• Clinical trials provide a “template of trust”
whereby regulators can mandate sponsors to
provide the proof
• Modelling + real world data cannot provide these
guarantees
• But this is no excuse
– Whether or not you model, others will
– You need to know as much as possible about your
own drugs and where and when to use them
(c) Stephen Senn 29
Finally
I leave you with this though
(c) Stephen Senn 30
Any damn fool can analyse a clinical trial and frequently does
But doing it properly involves skilful analysis,
understanding what the results mean requires intelligence
insight and experience,
and applying the results intelligently needs more of the
same plus modelling and real world data
And to whom do we look to provide these skills?
Statisticians!

More Related Content

PPTX
Clinical trials are about comparability not generalisability V2.pptx
PPTX
The replication crisis: are P-values the problem and are Bayes factors the so...
PDF
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
PDF
Regression shrinkage: better answers to causal questions
PDF
Prognosis-based medicine: merits and pitfalls of forecasting patient health
PPTX
Is it causal, is it prediction or is it neither?
PPTX
Choosing Regression Models
PDF
Improving predictions: Lasso, Ridge and Stein's paradox
Clinical trials are about comparability not generalisability V2.pptx
The replication crisis: are P-values the problem and are Bayes factors the so...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Regression shrinkage: better answers to causal questions
Prognosis-based medicine: merits and pitfalls of forecasting patient health
Is it causal, is it prediction or is it neither?
Choosing Regression Models
Improving predictions: Lasso, Ridge and Stein's paradox

What's hot (18)

PDF
Correcting for missing data, measurement error and confounding
PDF
Bbs11 ppt ch01
PPT
Granger Causality
PPTX
ders 6 Panel data analysis.pptx
PDF
Introduction to regression analysis 2
PDF
Application of ordinal logistic regression in the study of students’ performance
PPTX
What is the point of point estimates
PDF
P-values in crisis
PPTX
Confidence interval & probability statements
PPT
Survival analysis
PDF
Introduction to prediction modelling - Berlin 2018 - Part II
PPT
Game theory
PPTX
Pediatrician
PDF
On p-values
PPTX
Testing a claim about a mean
PPTX
SMX Advanced: Reverse Engineering Google Shopping
PPTX
Mankiew Chapter 6.pptx
PDF
Multinomial Logistic Regression
Correcting for missing data, measurement error and confounding
Bbs11 ppt ch01
Granger Causality
ders 6 Panel data analysis.pptx
Introduction to regression analysis 2
Application of ordinal logistic regression in the study of students’ performance
What is the point of point estimates
P-values in crisis
Confidence interval & probability statements
Survival analysis
Introduction to prediction modelling - Berlin 2018 - Part II
Game theory
Pediatrician
On p-values
Testing a claim about a mean
SMX Advanced: Reverse Engineering Google Shopping
Mankiew Chapter 6.pptx
Multinomial Logistic Regression
Ad

Similar to Real world modified (20)

PPTX
NNTs, responder analysis & overlap measures
PPTX
What is your question
PPTX
What is your question
PPTX
Clinical trials are about comparability not generalisability V2.pptx
PPTX
2024_guidline rule_r201-study design.pptx
PPTX
r201-study design for research study.pptx
PPTX
Case 3 Meded Ben Lycett Sarah Azfar.pptx
PPTX
Clinical trials: quo vadis in the age of covid?
PDF
Are we really including all relevant evidence
PPTX
research question and designs
PPTX
The challenge of small data
PPTX
RESEARCH METHODOLOGY- INTERVENTIONAL STUDY DESIGNS: clinical trials ppt.pptx
PPTX
Minimally important differences
PPTX
Minimally important differences v2
PPT
Evidence based medicine
PPTX
Clinical research
PPT
(clinical trial overview)
PPTX
Numbers needed to mislead
PDF
HEALTHCARE RESEARCH METHODS: Experimental Studies and Qualitative Studies
PDF
Research Methodology / Experimental research design
NNTs, responder analysis & overlap measures
What is your question
What is your question
Clinical trials are about comparability not generalisability V2.pptx
2024_guidline rule_r201-study design.pptx
r201-study design for research study.pptx
Case 3 Meded Ben Lycett Sarah Azfar.pptx
Clinical trials: quo vadis in the age of covid?
Are we really including all relevant evidence
research question and designs
The challenge of small data
RESEARCH METHODOLOGY- INTERVENTIONAL STUDY DESIGNS: clinical trials ppt.pptx
Minimally important differences
Minimally important differences v2
Evidence based medicine
Clinical research
(clinical trial overview)
Numbers needed to mislead
HEALTHCARE RESEARCH METHODS: Experimental Studies and Qualitative Studies
Research Methodology / Experimental research design
Ad

More from Stephen Senn (20)

PPTX
Has modelling killed randomisation inference frankfurt
PPTX
Vaccine trials in the age of COVID-19
PPTX
To infinity and beyond v2
PPTX
Approximate ANCOVA
PPTX
The Seven Habits of Highly Effective Statisticians
PPT
A century of t tests
PPT
Is ignorance bliss
PPTX
What should we expect from reproducibiliry
PPTX
Personalised medicine a sceptical view
PPTX
In search of the lost loss function
PPTX
To infinity and beyond
PPTX
De Finetti meets Popper
PPTX
Understanding randomisation
PPTX
In Search of Lost Infinities: What is the “n” in big data?
PPTX
Seventy years of RCTs
PDF
The Rothamsted school meets Lord's paradox
PPTX
The revenge of RA Fisher
PPT
The story of MTA/02
PPT
Confounding, politics, frustration and knavish tricks
PPT
Why I hate minimisation
Has modelling killed randomisation inference frankfurt
Vaccine trials in the age of COVID-19
To infinity and beyond v2
Approximate ANCOVA
The Seven Habits of Highly Effective Statisticians
A century of t tests
Is ignorance bliss
What should we expect from reproducibiliry
Personalised medicine a sceptical view
In search of the lost loss function
To infinity and beyond
De Finetti meets Popper
Understanding randomisation
In Search of Lost Infinities: What is the “n” in big data?
Seventy years of RCTs
The Rothamsted school meets Lord's paradox
The revenge of RA Fisher
The story of MTA/02
Confounding, politics, frustration and knavish tricks
Why I hate minimisation

Recently uploaded (20)

PPTX
Head Spine trauma assesment and managementATLS Final.pptx
PPTX
Oncological Emergencies in hospital setting
PPTX
Biomechanical preparation in primary teeth – Instrumentation and seminar 5 (2...
PPTX
Case report session Apendisitis Akut people.pptx
PPTX
MONOCHORIONIC TWIN PREGNANCY detailed.pptx
PDF
CSF rhinorrhea its cause management .pptx
PPT
Doppler - 5.ppt .........................
PDF
WHO Global TUBERCULOSIS Report 2018-2019
PPTX
Seizures in paediatrics as a big cause of morbidity.pptx
PDF
Liver Cirrhosis: Causes, Symptoms, Stages & Expert Treatment in Pune
PPTX
VITAL PULP THERAPY in pediatric dentistry
PPTX
INTRODUCTION TO BIOLOGY AND THE BRANCHES OF BIOLOGY
DOCX
Advanced Nursing Procedures.....realted to advance nursing practice M.Sc. 1st...
PPTX
Conflict Management: Defining conflict and understanding why it occurs is t...
PDF
Chapter 8. HHS Facility Design and Construction _ HHS.gov.pdf
PDF
ICU Main Equipments and its explanations
PDF
Dental Implants Review : A detailed Review
PPTX
SlideEgg_100085- World Mental Health Day.pptx
PDF
odontologia na oncologia - carie de radiação
PPTX
applied physics dental materials basic principles
Head Spine trauma assesment and managementATLS Final.pptx
Oncological Emergencies in hospital setting
Biomechanical preparation in primary teeth – Instrumentation and seminar 5 (2...
Case report session Apendisitis Akut people.pptx
MONOCHORIONIC TWIN PREGNANCY detailed.pptx
CSF rhinorrhea its cause management .pptx
Doppler - 5.ppt .........................
WHO Global TUBERCULOSIS Report 2018-2019
Seizures in paediatrics as a big cause of morbidity.pptx
Liver Cirrhosis: Causes, Symptoms, Stages & Expert Treatment in Pune
VITAL PULP THERAPY in pediatric dentistry
INTRODUCTION TO BIOLOGY AND THE BRANCHES OF BIOLOGY
Advanced Nursing Procedures.....realted to advance nursing practice M.Sc. 1st...
Conflict Management: Defining conflict and understanding why it occurs is t...
Chapter 8. HHS Facility Design and Construction _ HHS.gov.pdf
ICU Main Equipments and its explanations
Dental Implants Review : A detailed Review
SlideEgg_100085- World Mental Health Day.pptx
odontologia na oncologia - carie de radiação
applied physics dental materials basic principles

Real world modified

  • 1. Clinical Trials are not Enough Stephen Senn (c) Stephen Senn 1
  • 2. Acknowledgements (c) Stephen Senn 2 Acknowledgements This work is partly supported by the European Union’s 7th Framework Programme for research, technological development and demonstration under grant agreement no. 602552. “IDEAL”
  • 3. Basic Thesis • Clinical trials are experiments • At every stage of drug development, even phase III, they are unrepresentative of ‘real world’ clinical practice • The key to using their results to inform ‘real world’ decisions is not to make the trial more representative • The key is to use appropriate scales for analysis and transfer the results into practical real-world decision making • This will require – A different attitude – More modelling – Extensive use of auxiliary real world data (c) Stephen Senn 3
  • 4. An Example • Dan Moerman’s analysis more than 30 years ago of Smith-Kline-French’s development of Tagamet® – Tagamet® (cimetidine) was the best selling drug of its day • 31 trials in 1692 patients – in 17 countries – Duodenal, Gastric, Mixed – 13 significant according to Moerman • Used Chi-square test with Yates’s correction (c) Stephen Senn 4
  • 5. (c) Stephen Senn 5 on the risk- difference scale and there is significant evidence of heterogeneity
  • 6. What the EBM movement used to conclude from this sort of thing • The treatment effect varies according to the type of patient • We want RCTS that automatically deliver the right decision • We can’t rely on the results from RCTs which involve artificially selected patients • We need large simple trials in representative populations • They need to be reported using numbers needed to treat (NNTs) (c) Stephen Senn 6
  • 7. Problems • At best you can hope to recommend treatments that are beneficial on average • But in practice you can never guarantee that trials are representative of practice anyway • You lose the opportunity to study issues more deeply • NNTs are a terrible scale for doing analysis (c) Stephen Senn 7
  • 8. (c) Stephen Senn 8 Here the analysis is on the log-odds ratio scale and heterogeneity is much reduced and not even ‘significant’.
  • 9. 9 “If you need statistics to prove it I don’t believe it” You can’t prove it with statistics but everybody believes it Thanks to Pat Ballew’s blog site (c) Stephen Senn 9 That is to say, we see non-random variation too easily. This example does NOT give strong evidence treatment by trial interaction
  • 10. Lessons • If not carefully, studied random variation can be underestimated • Differences from trial to trial in true effect may be less than one thinks • Finding a good scale is important • BUT The additive scale is not necessarily the relevant one (c) Stephen Senn 10
  • 11. What not to do • The solution is not to attempt to make trials more representative • The solution is to measure appropriately and translate appropriately • This requires the following – Good scales – Good analysis – Good modelling – Good supplementary real world data (c) Stephen Senn 11
  • 12. (c) Stephen Senn 12 Chasing sub- groups leads nowhere
  • 13. Solution? (c) Stephen Senn 14 “a possible resolution is to use the additive measure at the point of analysis and transform to the relevant scale at the point of implementation. This transformation at the point of medical decision-making will require auxiliary information on the level of background risk of the patient.” Senn, Statistics in Medicine, 2004
  • 14. How we already use modelling, data and additive scales • Interspecies scaling • Bioequivalence – log relative bioavailability is additive but difference in absolute bioavailability is not • Dose proportionality • Use of additive scales in phase III – Log hazard – Log-odds ratio (c) Stephen Senn 15
  • 15. (c) Stephen Senn 17Controlled Clinical Trials, 1989
  • 16. Implications of the Lubsen-Tijssen Model • We need to study treatment benefit on disaggregated (of harm) additive scale • We will need real world data on harms • We will need real world data on background risk • We will need models • We will need cooperation between – Medics and statisticians working on clinical trials – Statisticians, epidemiologists, health economists, medics and others working in real world data (c) Stephen Senn 18
  • 17. Example of Atrial Fibrillation • Such patients are at higher risk of stroke • Meta-analysis (reproduced in Hart et al 2007)concluded that warfarin has a beneficial protective effect • But there is a risk of intracranial bleeding • Who should get warfarin? (c) Stephen Senn 19
  • 20. So you have atrial fibrillation • Should you take warfarin? • What else do you need to know? – The difference in risk taking warfarin or not – The rate of side effects – The consequences of side-effects • These cannot be answered (alone) by analysis of RCTs with pre-specified efficacy measures on the additive scale • The RCTs has to be translated and supplemented by real world data (c) Stephen Senn 22
  • 22. (c) Stephen Senn 24 Estimate based on 6 v 3 cases only
  • 23. The reimburser’s perspective • What benefit and harm to the population will accrue from recommending warfarin prophylaxis? • How much will it cost? • How can its use be optimised? • Who should get it? (c) Stephen Senn 25
  • 25. Reimburser’s needs Requirements • A means of separating patients by risk • A means of establishing risk distribution in the population of patients above any threshold chosen • A means of determining expected benefits and costs Solutions • These figures cannot be delivered by clinical trials alone but will require – Cohort studies/case control studies – Health surveys – Economic modelling (c) Stephen Senn 27
  • 26. We need to model background risk • Sort of data set we could use is that provided by the UK Clinical Practice Research Data Link CPRD • Could use this to model – A) Predictors of risk of stroke – B) Distribution of risk levels in the population • Former relevant to individuals to make decisions • Latter is relevant to reimbursers (c) Stephen Senn 28
  • 27. Is there a Trust Problem? • Yes • Clinical trials provide a “template of trust” whereby regulators can mandate sponsors to provide the proof • Modelling + real world data cannot provide these guarantees • But this is no excuse – Whether or not you model, others will – You need to know as much as possible about your own drugs and where and when to use them (c) Stephen Senn 29
  • 28. Finally I leave you with this though (c) Stephen Senn 30 Any damn fool can analyse a clinical trial and frequently does But doing it properly involves skilful analysis, understanding what the results mean requires intelligence insight and experience, and applying the results intelligently needs more of the same plus modelling and real world data And to whom do we look to provide these skills? Statisticians!