0% found this document useful (0 votes)
208 views66 pages

Clinical Sample Size Estimator

The document presents a freeware spreadsheet tool for sample size estimation in clinical trials, developed by David Dubins. It includes various statistical methods for calculating required sample sizes for different study designs, such as superiority, equivalence, and bioequivalence tests. Users are advised that the tool is not validated and should be used with caution, acknowledging that sample size estimation is only an estimate and does not guarantee study success.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLS, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
208 views66 pages

Clinical Sample Size Estimator

The document presents a freeware spreadsheet tool for sample size estimation in clinical trials, developed by David Dubins. It includes various statistical methods for calculating required sample sizes for different study designs, such as superiority, equivalence, and bioequivalence tests. Users are advised that the tool is not validated and should be used with caution, acknowledging that sample size estimation is only an estimate and does not guarantee study success.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLS, PDF, TXT or read online on Scribd
You are on page 1/ 66

Free Analysis Research Tool Version 1.

7
for Sample Size Iterative Estimation
at your fingertips

© David Dubins, 2007-2010 ([email protected])

License: This spreadsheet is freeware. The user is granted conditional use to copy, duplicate, and distribute
provided that the user doesn't remove my name or the identifying information. I would also like to be acknowledged
in the event that your clinical trial is published and you used this spreadsheet to calculate the sample size (fame is
so much more fun than money). The user of this workbook acknowledges that it is not responsible if study fails, is
underpowered, or is overpowered. The user of this workbook acknowledges that sample size estimation is at the
very best just that - an estimate. No guarantees are provided on the success of a study, even if the sample size is
carefully researched and justified. This workbook is not validated, but has been checked against the examples
contained in each spreadsheet in this workbook and against other programs. I am also not responsible in the event
that someone modifies this program in such a way to make it non-functional, and then passes it along. To obtain an
original copy, please contact me at: [email protected], or download from http://individual.utoronto.ca/ddubins

I would like to very warmly thank the the following people for making this workbook possible:

Russell V. Lenth, Professor


Department of Statistics & Actuarial Science
The University of Iowa, Iowa City, IA 52242 USA
http://www.stat.uiowa.edu/~rlenth/

I dedicate this software to Dr. Richard Moisan B.Phm, Ph.D., my former supervisor, mentor, and friend.
You are greatly missed.

References:
Julious, SA. Statist. Med. 2004; 23:1921–1986
Shein-Chung Chow, Jun Shao, Hansheng Wang. Sample Size Calculations in Clinical Research. Marcel Dekker,
Inc. New York • Basel 2003.
http://www.springerlink.com/content/u503p62056413677/fulltext.pdf
http://www.fda.gov/ohrms/dockets/ac/06/slides/2006-4241s2_5.ppt
http://www.fda.gov/ohrms/dockets/ac/06/slides/2006-4241s2_3.ppt
http://www.fda.gov/ohrms/dockets/ac/06/slides/2006-4241s2_4.ppt
http://bebac.at/news/2006-10-06.htm
http://www.fda.gov/cder/Guidance/3616fnl.pdf
http://www.fda.gov/ohrms/dockets/ac/04/briefing/4034B1_07_Bioequivalence%20Requirments-Draft.pdf

David Dubins, B.A.Sc., Ph.D.


Global Bioequivalence Consulting
36 Reiner Road
Toronto, Ontario, Canada
M3H 2L2
[email protected]
Superiority, Parallel (Scroll down for an example)
Desired Power 80% Probability of not getting a Type II error (not getting a false negative)
r (note: nA*r=nB) 1 This is the 'allocation ratio' of nB over nA (for equal sample sizes, r=1)
a 5% Probability of a Type I error (false positive) in the event that there is no significant
s 21.4 Common standard deviation
d 22 Difference in means (m2-m1) --> this is the expected clinically relevant difference.

d = d/s 1.028037

s 21.4
nA 16
nA(r+1)-2 30
t(1-alpha/2,nA(r+1)-2) 2.042272
tau1 2.907729

Probt(1) #VALUE!
1-b #VALUE!
nA nB Total
Required Sample Size 16 16 32

NOTE: This is a 2-tailed test. For a 1-tailed test, multiply alpha by 2 (and pre
EXAMPLE:
Source: Julious, SA. Statist. Med. 2004; 23:1921–1986
lse negative)
ple sizes, r=1)
that there is no significant difference between means

cally relevant difference.

tiply alpha by 2 (and pretend it's the same alpha).


Repfactor (for Boss Button)
2
Superiority, Crossover (Scroll down for an example)
Desired Power 90% Probability of not getting a Type II error (not getting a false negative)

a 5% Probability of a Type I error (false positive)


sw 20 Intrasubject standard deviation, taken from the residual line of an ANOVA model
d 10 Difference in means (m2-m1) --> this is the expected clinically relevant difference.

d = d/sw 0.5

sw 20
n 87
n-2 85
t(1-alpha/2,n-2) 1.988268
tau1 3.297726

Probt(1) #VALUE!
1-b #VALUE!

Required Sample Size 87 (total sample size)

NOTE: This is a 2-tailed test. For a 1-tailed test, multiply alpha by 2 (and pre
EXAMPLE:

Source: Julious, SA. Statist. Med. 2004; 23:1921–1986


(Note: this was a typo, Table II gave a total sample size of 87 in the paper).
lse negative)

line of an ANOVA model


cally relevant difference.

iply alpha by 2 (and pretend it's the same alpha).


Repfactor (for Boss Button)
1
Equivalence, Parallel (Scroll down for an example)
Desired Power 80% Probability of not getting a Type II error (not getting a false negative)
r (note: nA*r=nB) 1 This is the 'allocation ratio' of nB over nA (for equal sample sizes, r=1)
a 2.5% Probability of a Type I error (false positive).
s 100 Common standard deviation
d 10 Maximum allowable difference in means (m2-m1) --> this is the expected clinica
D 0 Expected difference in means (m2-m1 expected).
d = d/s 0.1

s 100
nA 2103
nA(r+1)-2 4204
t(1-alpha,nA(r+1)-2) 1.9605284333
tau2 -3.242684073
tau1 3.2426840734
Probt(2) #VALUE!
Probt(1) #VALUE!
1-b #VALUE!
nA nB Total
Required Sample Size 2103 2103 4206

EXAMPLE:
Source: Julious, SA. Statist. Med. 2004; 23:1921–1986
getting a false negative)
equal sample sizes, r=1)

-m1) --> this is the expected clinically relevant difference.


Repfactor (for Boss Button)
2
Equivalence, Crossover (Scroll down for an example)
Desired Power 90% Probability of not getting a Type II error (not getting a false negative)

a 2.5% Probability of a Type I error (false positive).


sw 20 Expected within-subject population standard deviation
d 10 Maximum allowable difference in means (m2-m1) --> this is the expected clinica
D 0 Expected difference in means (m2-m1 expected).
d = d/sw 0.5

sw 20
n 106
n-2 104
t(1-alpha,n-2) 1.9830375265
tau2 -3.640054945
tau1 3.6400549446
Probt(2) #VALUE!
Probt(1) #VALUE!
1-b #VALUE!

Required Sample Size 106

EXAMPLE:
Source: Julious, SA. Statist. Med. 2004; 23:1921–1986
getting a false negative)

-m1) --> this is the expected clinically relevant difference.


Repfactor (for Boss Button)
1
Non-inferiority, Parallel (Scroll down for an example)
Desired Power 80% Probability of not getting a Type II error (not getting a false negative)
r (note: nA*r=nB) 1 This is the 'allocation ratio' of nB over nA (for equal sample sizes, r=1)
a 5.0% Probability of a Type I error (false positive)
s 40 Common standard deviation
d 10 Maximum allowable difference in means (mA-mB) --> this is the expected clinically r
D 0 Expected difference in means (mA-mB expected)
d = d/s 0.25

s 40
nA 338
nA(r+1)-2 674
t(1-alpha,nA(r+1)-2) 1.647118
tau1 3.25

Probt(1) #VALUE!
1-b #VALUE!
nA nB Total
Required Sample Size 338 338 676

EXAMPLE:
Source: Julious, SA. Statist. Med. 2004; 23:1921–1986
lse negative)
ple sizes, r=1)

is the expected clinically relevant difference.


Repfactor (for Boss Button)
2
Non-inferiority, Crossover (Scroll down for an example)
Desired Power 90% Probability of not getting a Type II error (not getting a false negative)

a 2.5% Probability of a Type I error (false positive)


sw 20 Expected within-subject population standard deviation
d -10 Maximum allowable difference in means (mA-mB) --> this is the expected clinically r
D 0 Expected difference in means (mA-mB expected)
d = d/sw -0.5

sw 20
nA 87
nA(r+1)-2 85
t(1-alpha,nA(r+1)-2) 1.988268
tau1 3.297726

Probt(1) #VALUE!
1-b #VALUE!

Required Sample Size 87

EXAMPLE:
Source: Julious, SA. Statist. Med. 2004; 23:1921–1986
lse negative)

is the expected clinically relevant difference.


Repfactor (for Boss Button)
2
Bioequivalence, Crossover (Scroll down for an example)
Desired Power 80%

a 5%
intra-subject CV (ISV) 25.0% ISV = sqrt(Mean Square Error) / ((mA + mB)/2). You can also get a quick and dirty e
Expected mT/mR Ratio 95% (to 105.3%)
Lower Equivalence Limit 80.0% 75
Upper Equivalence Limit 125.0% 133

sw 0.246221
n 37
n-2 35
t(1-alpha,n-2) 1.689572
tau2 -4.794063
tau1 3.002006
Probt(2) #VALUE!
Probt(1) #VALUE!
1-b #VALUE!

Required Sample Size 37


44.4

EXAMPLE:

(Note: Table VII provided sample size estimates at 90% power.)


Source: Julious, SA. Statist. Med. 2004; 23:1921–1986

Dave's note: in industry for BE studies, the tendency is to use a Test/Reference ratio of 1.05 and a power of 80%.
also get a quick and dirty estimate taking 0.5 * total CV (which is just published standard deviation / mean)

If you can find a BE study in the literature that reports the


Test/Reference Ratios and 90%CI for Cmax, AUCt, or AUCinf, you
can use this macro to back-calculate the ISV. If you're lucky
sometimes authors will publish the ISV directly.
and a power of 80%.
Repfactor (for Boss Button)
1
Bioequivalence, Replicate (Scroll down for an example)
Desired Power 80% 3 period replicate (ABB/BAA)

a 5% Method C1 - EMEA Approach

intra-subject CV (ISV) 90% ISV = sqrt(Mean Square Error) / ((mA + mB)/2) - estimate with 1/2 of total CV
Expected mT/mR Ratio (%) 95% (to 105.3%)
Lower Equivalence Limit 55.69% Reference-Scaled Parameters
Upper Equivalence Limit 179.57% k (Method C1)
k (= sb2 / sw2) 1 (used for 2-Period Repl. only) sW0 (Method C2)
sWR 0.770277 θ: (Method C2)
n 257 # Subjects (Method D)
n-2 255
t(1-alpha,n-2) 1.650851 EMEA: Method C1, 3- or 4-period replica
tau2 -9.370054 FDA: Method C2, 3-period replicate (TR
tau1 7.860338
Probt(2) #VALUE!
Probt(1) #VALUE!
1-b #VALUE!

Required Sample Size 193

EXAMPLE:

Source: Julious, SA. Statist. Med. 2004; 23:1921–1986


wn for an example)

estimate with 1/2 of total CV

Scaled Parameters
0.760
0.294
0.57606836
24

Method C1, 3- or 4-period replicate, k=0.760, maximum of 69.84 – 143.19%.


Method C2, 3-period replicate (TRR/RRT), sW0=0.294
Multiply n by:
2 period re 1.5
3 period re 0.75
4 period r 0.5
2
0.75

Reference-Scaled BE Lower CI: Upper CI:


No Reference Scaled BE Limits 0.8 1.25
Method A - Direct Expansion of BE 0.75 1.33
Method C1 - EMEA Approach 0.556877 1.795728
Method C2 - FDA Approach 0.55731 1.794334 alpha
Method D 0.508844 1.965239 1.717144

3
beta
1.321237
Bioequivalence, Parallel (Scroll down for an example)
Desired Power 90%
r (note: nT*r=nR) 1 This is the 'allocation ratio' of nR over nT (for equal sample sizes, r=1)
a 5%
inter-subject CV 80%
Expected mT/mR Ratio 100%

Lower Equivalence Limit 80%


Upper Equivalence Limit 125%
s 0.7033464593
nT 216
nT*(r+1)-2 430
t(1-alpha,nT(r+1)-2) 1.6484049694
tau2 -3.2970604726
tau1 3.2970604726
Probt(2) #VALUE!
Probt(1) #VALUE!
1-b #VALUE!
nA nB Total
Required Sample Size 216 216 432

EXAMPLE:

Source: Julious, SA. Statist. Med. 2004; 23:1921–1986


equal sample sizes, r=1)
Repfactor (for Boss Button)
1
Scaled Bioequivalence Limits - Highly Variable Drugs
IntraCV of Reference Product 30.0% Note: This CV will be used to calculate BE limits for each method on this

Proposed FDA Approach for Scaled Bioequivalence Limits


06 October 2006 meeting of the Advisory Committee for Pharmaceutical Sciences (ACPS)

Scaled average bioequivalence, based on within subject variability of reference


Method C2 (Haidar et al, 2008)

Permitted SD (σw0): 0.2936 (Note: Presentation by Barbara Davit suggests 30% cutoff fo
θ: 0.577639 If CVWR > 30%, FDA will use the reference-scaled average B
Corresponding Permitted IntraCV: 30.0% If CVWR < 30%, FDA will use the unscaled average BE appro
σwr^2: 0.086178
Limits of μT - μR (±sqrt(θ*σwr^2)): 0.223144

Bioequivalence Limits: 80.00 - 125.00%


Bioequivalence Limits for Point Estimate: 80.00 - 125.00%
Sources:
http://www.springerlink.com/content/u503p62056413677/fulltext.pdf
http://www.fda.gov/ohrms/dockets/ac/06/slides/2006-4241s2_5.ppt

* Calculation of BE limits assumes that the 30% cutoff suggested by Barbara Davit be used as the Permitted SD.
If a Permitted SD of 0.25 is used in this method, it will yield the same results numerically as Dr. Midah.

Other Discussed Approaches


Scaled average bioequivalence, based on within subject variability of reference

( 0 . 223
)
sw0 is the SD at which the BE limits ar
BE limits, upper, lower = EXP ± ¿ σ wr swr is either the residual SD (ABE) or t
σ w0
Permitted SD (σw0): 0.25 (Note: Dr. Midha and Dr. Sam Haidar propose a value of 0.25 as a perm
Corresponding Permitted IntraCV: 25.4%
Intra-subject Standard Deviation: 0.29356 (reference product)
BE limits, ln-scale (±) 0.262024
Relationship between σw0 and k of Me
Bioequivalence Limits: 76.95 - 129.96%
Sources:
http://www.fda.gov/ohrms/dockets/ac/06/slides/2006-4241s2_3.ppt
http://www.fda.gov/ohrms/dockets/ac/06/slides/2006-4241s2_4.ppt
http://bebac.at/news/2006-10-06.htm
Expansion of Bioequivalence Limits Based on Fixed Sample Size
Method B (Haidar et al 2008)

Reference/Reference Ratio: 1
Number of Subjects (Completed): 24
σwr^2 0.086178
Standard Error of Difference 0.084744
t(0.05,n-2) 1.717144

Bioequivalence Limits: 86.46 - 115.66%


*Note: for this method, the ISV would have to be >47%, with n=24 and Ref/Ref = 1.
If Ref/Ref <>1, BE limits are not symmetrical about unity.
Perhaps take the reciprocal of the largest confidence limit to rectify? Issue not discussed in the Haidar paper.

Source:
http://www.springerlink.com/content/u503p62056413677/fulltext.pdf

Widening of Bioequivalence Limits Based on Reference Variability


Method C1 (Haidar et al, 2008)

k: 1
σwr 0.29356

Bioequivalence Limits: 74.56 - 134.12%


Sources:
http://www.springerlink.com/content/u503p62056413677/fulltext.pdf
http://www.fda.gov/ohrms/dockets/ac/04/briefing/4034B1_07_Bioequivalence%20Requirments-Draft.pdf

Widening of Bioequivalence Limits Based on Reference Variability


Method C3 (Haidar et al, 2008)

Permitted SD (σw0): 0.25


σwt^2 0.086178 (IntraCV of Test)
σwr^2 0.086178 (IntraCV of Reference)
σd^2 0.018387 (Subject by formulation interaction)
θi 1.596689
Limits for (uT - uR): 0.34527
Bioequivalence Limits: 70.80 - 141.24% (Individual BE)
Sources:
http://www.fda.gov/cder/Guidance/3616fnl.pdf
http://www.springerlink.com/content/u503p62056413677/fulltext.pdf

Expansion of Bioequivalence Limits Based on Sample Size and Scaling


Method D (Haidar et al, 2008)

# Subjects Desired in Study (2n): 24


Type I Error (alpha) 5% (consumer risk)
Type II Error (beta) 10% (producer risk, corresponding to a power of 90% to determine bioequival
σwr 0.29356
t(alpha,2n-2) 1.717144
t(beta/2,2n-2) 1.717144

Bioequivalence Limits: 74.75 - 133.78%

Sources:
http://www.springerlink.com/content/u503p62056413677/fulltext.pdf
http://www.fda.gov/ohrms/dockets/ac/04/briefing/4034B1_07_Bioequivalence%20Requirments-Draft.pdf
iable Drugs
imits for each method on this worksheet.

al Sciences (ACPS)

ability of reference (Dr. Haider notation)

Davit suggests 30% cutoff for IntraCV)*


e reference-scaled average BE approach
e unscaled average BE approach

d as the Permitted SD.

ability of reference (Dr. Midah notation)


e SD at which the BE limits are permitted to be widened (set by an agency)
her the residual SD (ABE) or the SD of the reference product (replicate design)

se a value of 0.25 as a permitted SD)

hip between σw0 and k of Method C1 (below)


the Haidar paper.
ents-Draft.pdf
and Scaling

90% to determine bioequivalence)

ents-Draft.pdf
Given Precision, Parallel (Scroll down for an example)

r (note: nA*r=nB) 1 This is the 'allocation ratio' of nB over nA (for equal sample sizes, r=1)
a 5% Probability of a Type I error (false positive)
s 25 Common standard deviation
w 5 Half-width of the confidence interval desired.

d 0.2

s 25
nA 194
nA(r+1)-2 386
t(1-alpha/2,nA(r+1)-2) 1.966129
(r+1)t^2*s^2/(rw^2) 193.2831

nA nB Total
Required Sample Size 194 194 388

EXAMPLE:
Source: Julious, SA. Statist. Med. 2004; 23:1921–1986
ple sizes, r=1)
Given Precision, Crossover (Scroll down for an example)

a 5% Probability of a Type I error (false positive)


sw 10 Common standard deviation
w 5 Half-width of the confidence interval desired.

d 0.5

sw 10
n 34
n-2 32
t(1-alpha/2,n-2) 2.036933
2t^2*s^2/(w^2) 33.19278

Required Sample Size 34

EXAMPLE:

Source: Julious, SA. Statist. Med. 2004; 23:1921–1986


Difference Between Proportions (Binomial Test)
(Scroll down for an example)
Desired Power 80%
a 5% Probability of a Type I error (false positive)
p0 10% Predefined reference level (non-responder)
p1 30% Minimum cure rate level we would like to detect

d 20% (This is like d in the other spreadsheets)

rcrit 5 This is the critical number of responders above which the null hypothesis is reject
n 25

1-b 0.806512

Required Sample Size 25 (including responders + non-responders)


Required Responders 5 (# responders out of the total sample size needed to reject the null hypothesis)

(good for smaller sample sizes, <= 170)

H0: a difference of d does not exist between treatment and control.


H1: a difference exists between treatment and control.

If the results of the study see more than rcrit responders, you can reject H0 and conclude there is a more than d % differe

EXAMPLE:

New York • Basel 2003.


mial Test)

ve which the null hypothesis is rejected.

eded to reject the null hypothesis)

lude there is a more than d % difference.


Variance Correction (Baseline and Post-dose Measures)
Is the data you're looking at to plan your trials baseline corrected?
If so the variance may be too high to plan your sample size around.
The variance in baseline-corrected trials can be corrected for:

The variance can simply be divided by the factor on the right.

s2 reported 20
p 3 Number of baseline measures taken per individual
r 0.5 Pearson Correlation Coefficient between observations

Variance Correction Factor:


0.625

s2 adjusted: 12.5

2. Post-dose Measures Correction.

s2 reported 20
r 5 number of post-dose measures
r 0.5 correlation between post-dose measures

Variance Correction Factor:


0.6

s2 adjusted: 12

3. Accounting for Baseline(s) and Post-dose Measures as Covariates


s2 reported 20
r 5 number of post-dose measures
r 0.5 correlation between post-dose measures
p 4 Number of baseline measures taken per individual

Variance Correction Factor:


0.2

s2 adjusted: 4

Source: Julious, SA. Statist. Med. 2004; 23:1921–1986


se Measures)
Allied Research International Inc.
Study Design
Mississauga, Canada

Intra-Subject Variability Calculator


POINT ESTIMATE AND CONFIDENCE INTERVALS to ISV
Ratio 1.0072 Ratio of Test to Reference
Lower 0.9022 Lower Confidence Interval
Upper 1.1122 Upper Confidence Interval
N 35 # Subjects
CI range 90%
DF 33
T90 1.6923603
SEEL 0.0650531
SEEH 0.0585962
MSEL 0.0740583
MSEH 0.0600864
ISVL 27.725368
ISVH 24.885405

ISV 26.3 %

858624173.xls 03/04/2025 - 15:55:18

You might also like