SlideShare a Scribd company logo
Non-parametric methods
Outline
• Differentiate nonparametric from parametric statistics
• Discuss the advantages and disadvantages of
nonparametric statistics
• Commonly used nonparametric test procedures
• Perform Hypothesis tests using Nonparametric
procedures
Levels of measurement
Ordinal – Socio-Economic
Status (SES), pain level
Ratio –Kelvin Scale,
Weight, height, pulse rate
Interval – Celsius or
Fahrenheit scale
Nominal – Gender, race,
blood group
Parametric Test Procedures
• Parameter: A numerical quantity which characterizes a
given population or some part of it.
• Involve Population Parameters
Example: Population Mean, median etc
• Have Stringent Assumptions
Example: Normal Distribution
• Require Interval Scale or Ratio Scale
• Examples: z-Test, t-Test, ANOVA
Are most variables normally distributed?
For example,
 Is income distributed normally in the population?
 Incidence rates (rare diseases) are not normally
distributed.
 Number of car accidents is also not normally
distributed.
 Duration of illness is not normally distributed.
Nonparametric Test Procedures
• Do Not Involve Population Parameters
• No Stringent Distribution Assumptions
“Distribution-free”
• Data Measured on Any Scale
– Ratio or Interval
– Ordinal
• Example: Good-Better-Best
– Nominal
• Example: Male-Female
 When our data is normally distributed, the mean is
equal to the median and we use the mean as our
measure of central tendency.
 However, if our data is skewed, then the median is a
much better measure of center.
 Therefore, just like the Z, t and F tests made inferences
about the population mean(s), nonparametric tests
make inferences about the population
median(s)/distribution.
Merits and Demerits
1. Used with all scales
2. Easier to Compute
3. Make Fewer Assumptions
4. Need not involve population parameters
5. May waste information
Parametric model more efficient if data permits.
6. Difficult to compute by hand for large samples
Commonly used NP tests
Some of the commonly used non parametric tests are
Chi square test
McNemar’s test
Mann-Whitney U test (Wilcoxon Rank-sum test)
Wilcoxon Signed Rank test
Krushkal-Wallis test (H test)
Friedman ANOVA
Spearman’s Rank correlation
Parametric & Non-parametric tests
AIM Parametric t-tests Non parametric equivalent
Compare one sample to a
hypothetical value
One-sample t-test
Compare 2 independent
sample means
Independent samples t-test
Compare 2 paired sample
means
Paired samples t-test
Compare more than 2
sample means
ANOVA
Correlation between 2
variables
Pearson’s Correlation
Compare more than 2
samples - repeated
Repeated measures
ANOVA
Sign test
Mann Whitney U test
Wilcoxon Signed rank test
Kruskal-Wallis test
Friedman test
Spearmans Rank Correlation
Sign test
Can be used as a non-parametric alternative to
single sample t test.
• The null hypothesis for the sign test specifies the
population median, M0
Ho: The median value of the population is equal to a stated
(Hypothesized) value
0
0 : m
m
H 
0
1
0
1
0
1 :
:
: m
m
H
or
m
m
H
or
m
m
H 


• The data doesn’t follow normal distribution
• Mean is not representative of the values since the
values are skewed to right or left
• Data transformation doesn’t make the values normal
Procedure
• State the hypothesis value to be tested
• Calculate Xi − m0 for i = 1, 2, ..., n.
• Define S− = the number of negative signs obtained upon
calculating Xi − m0 for i = 1, 2, ..., n.
• Define S+ = the number of positive signs obtained upon
calculating Xi − m0 for i = 1, 2, ..., n.
• Count the number of values in the dataset greater than
stated value. This test statistic is referred to as S+.
• Count the number of values in the dataset less the
stated value. This test statistic is referred to as S-
• If S+ is less than S-, then we reject the null hypothesis
Example
The following values are the ages of students in a
Ph.D. program. Determine if the median is less than 40
42, 22, 24, 25, 32, 34, 38, 40, 40, 44, 25, 26, 29
S+ = 2 and S - = 9
(note that 2 observations are exactly 40 which should not
be counted)
Hence, we fail to accept Ho; the median age of the group is
not equal to 40
Mann - Whitney U test
Mann-Whitney U Test is used in place of the two sample
t-test when the normality assumption is questionable.
 Because the samples are independent, they can be of
different sizes.
This test can also be applied when the observations in a
sample of data are ranks, i.e., ordinal data rather than
direct measurements.
Assumptions: Independent, Random Samples
Populations Are Continuous
Independent sample t-test
Width of leaf (cms)
Sunlight 6 4.8 5.1 5.5 5.4 4.1 5.3 4.5 4.3
5.1 5.5 4.1 6 6.1 4.8 5.1 6 6
5.5 4.1 6 4.8 5.6 4.8 5.1 5.5 5.7
Shade 6.5 5.5 6.3 7.2 6.9 6.8 5.5 5.9 6
6.3 7.2 6.8 5.5 6.5 6.5 5.5 6.3 6.7
7.2 6.8 5.5 6.5 7.3 5.5 6.3 7.2 7.5
Mann Whitney U-test example …….
 Calculate U values for both samples, U1, U2.
Calculate U = min(U1, U2)
Reject H0 if U is less than the Ucrit
Null hypothesis: both samples come from the same
underlying distribution
Mann Whitney U-test example
Width of leaf (cms)
Sunlight 4.8 5.1 5.5 4.1 5.3 4.5 5.1
Shade 5.5 6.3 7.2 6.8 5.5 5.9 5.5
Wi
dth
Sunlight Shade
4.1 4.5 4.8 5.1 5.1 5.3 5.5 5.5 5.5 5.5 6.3 6.5 6.8 7.2
1 2 3 4 5 6 7 8 9 10 11 12 13 14
1 2 3 4.5 4.5 6 8.5 8.5 8.5 8.5 11 12 13 14
Rank total R1= 29.5 Rank total R2 = 75.5
 Arrange all the observations into a single ranked
series i.e., without regard to which sample they are in.
 Rank them.
• The following data shows the age at diagnosis of type II
diabetes in young adults. Is the age at diagnosis
different for males and females?
– Males: 19 22 16 29 24
– Females: 20 11 17 12
• Arrange in order of magnitude
Age 11 12 16 17 19 20 22 24 29
Rank 1 2 3 4 5 6 7 8 9
Group 2 2 1 2 1 2 1 1 1
Example – t-test, M-W U test
• Objectives –
• to compare the anxiety levels across gender.
• to compare pre and post anxiety levels
Study to assess the effectiveness of intervention
in reducing anxiety
Checking normality
Results of t-test
Group Statistics
11 17.64 5.390
28 15.57 3.605
sex
1 M
2 F
SAAS.1
N Mean SD
Independent Samples Test
1.394 37 .172
Equal variances
assumed
SAAS.1
t df
Sig.
(2-tailed)
t-test for Equality of Means
Results of M W U-test
Ranks
11 26.05 286.50
28 17.63 493.50
sex
1 M
2 F
SAAS.1
N
Mean
Rank
Sum of
Ranks
Test Statisticsb
87.500
.037
Mann-Whitney U
Asymp. Sig. (2-tailed)
SAAS.1
Grouping Variable: sex
b.
Paired t-test
Subjects 1 2 3 4 5 6 7 8 9 10
Anxiety - pre 142 140 144 144 142 146 149 150 142 148
Anxiety -post 138 136 147 139 143 141 143 145 136 146
Subjects 1 2 3 4 5 6 7 8 9 10
Anxiety - pre 142 140 144 144 142 146 149 150 142 148
Anxiety -post 138 136 147 139 143 141 143 145 136 146
Difference (d) 4 4 -3 5 -1 5 6 5 6 2



d
SE
d
t
Wilcoxon Signed rank test
The Wilcoxon signed-rank test is a non-parametric
statistical hypothesis test used when comparing two
related samples, matched samples, or repeated
measurements on a single sample to assess whether their
population mean ranks differ (i.e. it is a paired difference
test).
H0: the median difference is zero
It can be used as an alternative to the paired Student's t-
test, t-test for matched pairs, or the t-test for dependent
samples when the population cannot be assumed to be
normally distributed
Wilcoxon Signed rank test
Subjects 1 2 3 4 5 6 7 8 9 10
Anxiety - pre 142 140 144 144 142 146 149 150 142 148
Anxiety -post 138 136 147 139 143 141 143 145 136 146
Difference (d) 4 4 -3 5 -1 5 6 5 6 2
Rank of d 4.5 4.5 3 7 1 7 9.5 7 9.5 2
Signed rank of d 4.5 4.5 -3 7 -1 7 9.5 7 9.5 2
Differences in order 1 2 3 4 4 5 5 5 6 6
Rank order 1 2 3 4 5 6 7 8 9 10
Ranks 1 2 3 4.5 4.5 7 7 7 9.5 9.5
 Calculate the rank sum T– of the negative differences
and the rank sum T+ of the positive differences.
 Note: Differences equal to 0 are eliminated, and the
number n of differences is reduced accordingly.
 Test statistic is T = smaller of T– or T+.
 Rejection region: T ≤ T0
• 8 children with autism have enrolled in the study and
the amount of time that each child is engaged in
repetitive behavior during three hour observation
periods are measured both before treatment and then
again after taking the new medication for a period of 1
week. The data are shown below.
Child 1 2 3 4 5 6 7 8
Before
Treatment 85 70 40 65 80 75 55 20
After 1 Week
of Treatment 75 50 50 40 20 65 40 25
• First, we compute difference scores for each child.
• H0: The median difference is zero
• T+ = 32 and T- = 4.
Child
Before
Treatment
After 1 Week of
Treatment Difference
1 85 75 10
2 70 50 20
3 40 50 -10
4 65 40 25
5 80 20 60
6 75 65 10
7 55 40 15
8 20 25 -5
Ordered Absolute
Values of
Differences Ranks
-5 1
10 3
-10 3
10 3
15 5
20 6
25 7
60 8
Signed Ranks
-1
3
-3
3
5
6
7
8
Paired t-test – example
Paired Samples Statistics
16.15 39 4.215
8.87 39 4.808
SAAS.1
SAAS.2
Pair
1
Mean N SD
Paired Samples Test
8.778 38 .001
SAAS.1 - SAAS.2
Pair 1
t df
Sig.
(2-tailed)
WSR test – example
Ranks
34 19.44 661.00
2 2.50 5.00
3
39
Negative Ranks
Positive Ranks
Ties
Total
SAAS.2 - SAAS.1
N
Mean
Rank
Sum of
Ranks
Test Statisticsb
-5.157a
.001
Z
Asymp. Sig. (2-tailed)
SAAS.2 -
SAAS.1
Based on positive ranks.
a.
Wilcoxon Signed Ranks Test
b.
Two-sample paired sign test
Assumptions:
 The paired differences are independent.
 Each paired difference comes from a continuous
distribution with the same median.
 Test statistic for the paired sign test is based only on
the sign of the paired differences.
Note that it is not assumed that the two samples are
independent of each other. In fact, they should be related
to each other such that they create pairs of data points.
Parametric & non-parametric tests
AIM Parametric t-tests Non parametric
Compare one sample to a hypothetical value One-sample t-test Sign test
Compare 2 independent sample means Student’s t-test Mann Whitney U
Compare 2 paired sample means Paired t-test Wilcoxon Signed rank
Compare more than 2 sample means ANOVA Kruskal-Wallis test
Compare more than 2 samples - repeated Repeated measures
ANOVA
Friedman test
Assesses the linear relation between two
variables.
Pearson’s correlation
coefficient.
Spearman rank
correlation, Kendall Tau
© 2011 Pearson Education, Inc
Kruskal-Wallis H-Test
• Tests the equality of more than two (k) population
probability distributions
• Corresponds to ANOVA for more than two means
• Used to analyze completely randomized experimental
designs
• Uses 2
distribution with k – 1 df
— if sample size nj ≥ 5
Kruskal-Wallis H-Test for Comparing k
Probability Distributions
H0: The k probability distributions are identical
Ha: At least one of the k probability distributions differ in location.
Test statistic:
 
 
2
12
3 1
1
j
j
R
H n
n n n
 
  
 
 

 

Rejection region:
H > 
2
with (k – 1) degrees of freedom
Kruskal-Wallis H-Test Procedure
1. Assign ranks, Ri , to the n combined observations
• Smallest value = 1; largest value = n
• Average ties
2. Sum ranks for each group
3. Compute test statistic
 
 
2
12
3 1
1
j
j
R
H n
n n n
 
  
 
 

 

Squared total of
each group
© 2011 Pearson Education, Inc
Kruskal-Wallis H-Test
Solution
Raw Data
Raw Data
Mach1
Mach1 Mach2
Mach2 Mach3
Mach3
25.40
25.40 23.40
23.40 20.00
20.00
26.31
26.31 21.80
21.80 22.20
22.20
24.10
24.10 23.50
23.50 19.75
19.75
23.74
23.74 22.75
22.75 20.60
20.60
25.10
25.10 21.60
21.60 20.40
20.40
Ranks
Mach1 Mach2 Mach3
14 9 2
15 6 7
12 10 1
11 8 4
13 5 3
65 38 17
Total
© 2011 Pearson Education, Inc
Kruskal-Wallis H-Test
Solution
 
 
  
     
 
 
2
2 2 2
12
3 1
1
65 38 17
12
3 16
15 16 5 5 5
12
191.6 48
240
11.58
j
j
R
H n
n n n
 
  
 
 

 
 
 
 
 
   
 
 
 
 
 
 
 
 


Descriptives
14 207.36 17.248
8 188.13 26.920
6 189.83 20.721
28
14 88.57 12.439
8 73.88 13.098
6 84.50 13.531
28
1
2
3 +
Total
1
2
3 +
Total
Parental.
attachment
Self.
Esteem
N Mean SD
ANOVA
ANOVA
2405.76 2 1202.9 2.71 .086
11086.9 25 443.48
13492.7 27
1107.20 2 553.60 3.35 .051
4127.80 25 165.11
5235.00 27
Between Groups
Within Groups
Total
Between Groups
Within Groups
Total
Parental.
attachment
Self.Esteem
Sum of
Squares df
Mean
Square F Sig.
Kruskal Wallis test
Ranks
14 17.68
8 11.38
6 11.25
28
14 17.43
8 9.25
6 14.67
28
sibs.
new
1
2
3 +
Total
1
2
3 +
Total
Parental.attachment
Self.Esteem
N
Mean
Rank
Test Statisticsa,b
4.186 5.049
2 2
.123 .080
Chi-Square
df
Asymp. Sig.
Parental.
attachment
Self.
Esteem
Kruskal Wallis Test
a.
Grouping Variable: sibs.new
b.
Parametric & non-parametric tests
AIM Non parametric tests - categorical
outcomes
Compare 2
unpaired
samples
Fisher's test
(chi-square for large samples)
Compare 2
paired samples
McNemar’s test
Why not use NP tests all the time?
Parametric tests are often preferred because:
 They are more robust.
 They discard a lot of information
 Parametric and non-parametric tests often
address two different types of questions.
Since nonparametric tests require fewer assumptions
and can be used with a broader range of data types,
this question arises.
THANK YOU
THANK YOU

More Related Content

PPTX
Non parametric-tests
Asmita Bhagdikar
 
PPT
PY1PR1_stats_lecture_6 with electronics.ppt
MrMPaulJeyarajEEE
 
PPTX
Marketing Research Hypothesis Testing.pptx
xababid981
 
PPTX
MPhil clinical psy Non-parametric statistics.pptx
rodrickrajamanickam
 
PPTX
UNIT 5.pptx
ShifnaRahman
 
PPT
9-NON PARAMETRIC TEST in public health .ppt
DrPARVATHYVINOD
 
PPTX
Non parametric study; Statistical approach for med student
Dr. Rupendra Bharti
 
PPTX
tests of significance
benita regi
 
Non parametric-tests
Asmita Bhagdikar
 
PY1PR1_stats_lecture_6 with electronics.ppt
MrMPaulJeyarajEEE
 
Marketing Research Hypothesis Testing.pptx
xababid981
 
MPhil clinical psy Non-parametric statistics.pptx
rodrickrajamanickam
 
UNIT 5.pptx
ShifnaRahman
 
9-NON PARAMETRIC TEST in public health .ppt
DrPARVATHYVINOD
 
Non parametric study; Statistical approach for med student
Dr. Rupendra Bharti
 
tests of significance
benita regi
 

Similar to Non-parametric presentationnnnnnnnnnnnnnn (20)

PPTX
Non parametric test
Neetathakur3
 
PDF
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
UMAIRASHFAQ20
 
PPT
T Test Samples for Statistics in Research
EricJohnCalsang
 
PPT
Chapter36a
Ying Liu
 
PPTX
non parametric test.pptx
SoujanyaLk1
 
PPTX
Hypothesis testing - T test lecture.pptx
babatundeoluwafemi4
 
PPTX
Non parametric test 8
Sundar B N
 
PDF
Lecture slides to increase understanding of statistics by all category of res...
kabiruabubakar3
 
PPTX
6. Nonparametric Test_JASP.ppt with full example
S Gayu
 
PPTX
5. testing differences
Steve Saffhill
 
PPTX
Testing of hypothesis.pptx
SyedaKumail
 
PPTX
Non-parametric Statistical tests for Hypotheses testing
Sundar B N
 
PPT
Unit-5.-t-test.ppt
frelynmaeDillo
 
PPTX
Parametric tests
heena45
 
PPTX
biostat__final_ppt_unit_3.pptx
ShubhamYalawatakar1
 
PPTX
Wilcoxon signed rank test
Biswash Sapkota
 
PPTX
Inferential Statistics.pptx
jonatanjohn1
 
PPTX
Parametric tests seminar
drdeepika87
 
PPT
Inferential statistics_AAF 500L 2021.ppt
OfeniJoshuaSeyi
 
PPTX
Hm306 week 4
BHUOnlineDepartment
 
Non parametric test
Neetathakur3
 
jhghgjhgjhgjhfhcgjfjhvjhjgjkggjhgjhgjhfjgjgfgfhgfhg
UMAIRASHFAQ20
 
T Test Samples for Statistics in Research
EricJohnCalsang
 
Chapter36a
Ying Liu
 
non parametric test.pptx
SoujanyaLk1
 
Hypothesis testing - T test lecture.pptx
babatundeoluwafemi4
 
Non parametric test 8
Sundar B N
 
Lecture slides to increase understanding of statistics by all category of res...
kabiruabubakar3
 
6. Nonparametric Test_JASP.ppt with full example
S Gayu
 
5. testing differences
Steve Saffhill
 
Testing of hypothesis.pptx
SyedaKumail
 
Non-parametric Statistical tests for Hypotheses testing
Sundar B N
 
Unit-5.-t-test.ppt
frelynmaeDillo
 
Parametric tests
heena45
 
biostat__final_ppt_unit_3.pptx
ShubhamYalawatakar1
 
Wilcoxon signed rank test
Biswash Sapkota
 
Inferential Statistics.pptx
jonatanjohn1
 
Parametric tests seminar
drdeepika87
 
Inferential statistics_AAF 500L 2021.ppt
OfeniJoshuaSeyi
 
Hm306 week 4
BHUOnlineDepartment
 
Ad

Recently uploaded (20)

PDF
Dr. David Wilson Utah - A Board-Certified Psychiatrist
Dr. David Wilson Utah
 
PDF
Hyperpigmentation Treatment: Your Complete Guide by Devriz Healthcare
Devriz Healthcare
 
PPTX
CTG INTERPRETATION Obstetrics and gynaecology
nihlaomer
 
PDF
musculoskeletal ultrasound basic concepts
humawaris1
 
PPTX
PA latest re edition final PPT. power point presentation
nasirfatima6602
 
PDF
Troubleshooting in Nerve Conduction and EMG Studies : Technical Errors and Qu...
Saran A K
 
PDF
Biotech_Resources_Group_cGMP Biotechnology Zoe
Biotech Resources Group, LLC
 
PPTX
Lina+Empa_Slidedeck_HS_28April_QC_AV.PPTX
YogendraSingh748485
 
PPTX
Caring for Carers Head and Neck July 2025 - ds.pptx
Head and Neck Cancer Support Network
 
DOCX
HEALTH_EDUCATION_ON_NUTRITION_AND_DIETETICS37[1].docx
chaudharihemangin131
 
PDF
Turn Stories with Our Strategies- Hospitals & super-speciality clinicians
Vipin Srivastava
 
PPTX
variability and its measure by dr rohit mishra kgmu
drrohitkgmc09
 
PDF
Lou Lentine-The Power Players Leaders Transforming Fitness & Wellness Tech.pdf
beautynwellnessmag
 
PPTX
Update Management of Bronchial Asthma according to GINA Guideline (2025)
ma076796
 
PDF
Understanding Surrogacy Success Rates.pdf
gracehadley707
 
PPTX
Common Barriers affects communications.pptx
DenekeAlemayehu
 
PPTX
Pain Management Billing Services in Arizona, AZ.pptx
JessicaParker89031
 
PPTX
Insights of Indirect Opthalmoscopy in field of Optometry
Anuska Chakraborty
 
PPT
Infection control in Dentistry- Dr Devina Pradhan
Dr Devina Pradhan
 
PPTX
Therotical and experimental approches in determination of Physicochemical par...
Shubhangi Daswadkar
 
Dr. David Wilson Utah - A Board-Certified Psychiatrist
Dr. David Wilson Utah
 
Hyperpigmentation Treatment: Your Complete Guide by Devriz Healthcare
Devriz Healthcare
 
CTG INTERPRETATION Obstetrics and gynaecology
nihlaomer
 
musculoskeletal ultrasound basic concepts
humawaris1
 
PA latest re edition final PPT. power point presentation
nasirfatima6602
 
Troubleshooting in Nerve Conduction and EMG Studies : Technical Errors and Qu...
Saran A K
 
Biotech_Resources_Group_cGMP Biotechnology Zoe
Biotech Resources Group, LLC
 
Lina+Empa_Slidedeck_HS_28April_QC_AV.PPTX
YogendraSingh748485
 
Caring for Carers Head and Neck July 2025 - ds.pptx
Head and Neck Cancer Support Network
 
HEALTH_EDUCATION_ON_NUTRITION_AND_DIETETICS37[1].docx
chaudharihemangin131
 
Turn Stories with Our Strategies- Hospitals & super-speciality clinicians
Vipin Srivastava
 
variability and its measure by dr rohit mishra kgmu
drrohitkgmc09
 
Lou Lentine-The Power Players Leaders Transforming Fitness & Wellness Tech.pdf
beautynwellnessmag
 
Update Management of Bronchial Asthma according to GINA Guideline (2025)
ma076796
 
Understanding Surrogacy Success Rates.pdf
gracehadley707
 
Common Barriers affects communications.pptx
DenekeAlemayehu
 
Pain Management Billing Services in Arizona, AZ.pptx
JessicaParker89031
 
Insights of Indirect Opthalmoscopy in field of Optometry
Anuska Chakraborty
 
Infection control in Dentistry- Dr Devina Pradhan
Dr Devina Pradhan
 
Therotical and experimental approches in determination of Physicochemical par...
Shubhangi Daswadkar
 
Ad

Non-parametric presentationnnnnnnnnnnnnnn

  • 2. Outline • Differentiate nonparametric from parametric statistics • Discuss the advantages and disadvantages of nonparametric statistics • Commonly used nonparametric test procedures • Perform Hypothesis tests using Nonparametric procedures
  • 3. Levels of measurement Ordinal – Socio-Economic Status (SES), pain level Ratio –Kelvin Scale, Weight, height, pulse rate Interval – Celsius or Fahrenheit scale Nominal – Gender, race, blood group
  • 4. Parametric Test Procedures • Parameter: A numerical quantity which characterizes a given population or some part of it. • Involve Population Parameters Example: Population Mean, median etc • Have Stringent Assumptions Example: Normal Distribution • Require Interval Scale or Ratio Scale • Examples: z-Test, t-Test, ANOVA
  • 5. Are most variables normally distributed? For example,  Is income distributed normally in the population?  Incidence rates (rare diseases) are not normally distributed.  Number of car accidents is also not normally distributed.  Duration of illness is not normally distributed.
  • 6. Nonparametric Test Procedures • Do Not Involve Population Parameters • No Stringent Distribution Assumptions “Distribution-free” • Data Measured on Any Scale – Ratio or Interval – Ordinal • Example: Good-Better-Best – Nominal • Example: Male-Female
  • 7.  When our data is normally distributed, the mean is equal to the median and we use the mean as our measure of central tendency.  However, if our data is skewed, then the median is a much better measure of center.  Therefore, just like the Z, t and F tests made inferences about the population mean(s), nonparametric tests make inferences about the population median(s)/distribution.
  • 8. Merits and Demerits 1. Used with all scales 2. Easier to Compute 3. Make Fewer Assumptions 4. Need not involve population parameters 5. May waste information Parametric model more efficient if data permits. 6. Difficult to compute by hand for large samples
  • 9. Commonly used NP tests Some of the commonly used non parametric tests are Chi square test McNemar’s test Mann-Whitney U test (Wilcoxon Rank-sum test) Wilcoxon Signed Rank test Krushkal-Wallis test (H test) Friedman ANOVA Spearman’s Rank correlation
  • 10. Parametric & Non-parametric tests AIM Parametric t-tests Non parametric equivalent Compare one sample to a hypothetical value One-sample t-test Compare 2 independent sample means Independent samples t-test Compare 2 paired sample means Paired samples t-test Compare more than 2 sample means ANOVA Correlation between 2 variables Pearson’s Correlation Compare more than 2 samples - repeated Repeated measures ANOVA Sign test Mann Whitney U test Wilcoxon Signed rank test Kruskal-Wallis test Friedman test Spearmans Rank Correlation
  • 11. Sign test Can be used as a non-parametric alternative to single sample t test. • The null hypothesis for the sign test specifies the population median, M0 Ho: The median value of the population is equal to a stated (Hypothesized) value 0 0 : m m H  0 1 0 1 0 1 : : : m m H or m m H or m m H   
  • 12. • The data doesn’t follow normal distribution • Mean is not representative of the values since the values are skewed to right or left • Data transformation doesn’t make the values normal
  • 13. Procedure • State the hypothesis value to be tested • Calculate Xi − m0 for i = 1, 2, ..., n. • Define S− = the number of negative signs obtained upon calculating Xi − m0 for i = 1, 2, ..., n. • Define S+ = the number of positive signs obtained upon calculating Xi − m0 for i = 1, 2, ..., n.
  • 14. • Count the number of values in the dataset greater than stated value. This test statistic is referred to as S+. • Count the number of values in the dataset less the stated value. This test statistic is referred to as S- • If S+ is less than S-, then we reject the null hypothesis
  • 15. Example The following values are the ages of students in a Ph.D. program. Determine if the median is less than 40 42, 22, 24, 25, 32, 34, 38, 40, 40, 44, 25, 26, 29 S+ = 2 and S - = 9 (note that 2 observations are exactly 40 which should not be counted) Hence, we fail to accept Ho; the median age of the group is not equal to 40
  • 16. Mann - Whitney U test Mann-Whitney U Test is used in place of the two sample t-test when the normality assumption is questionable.  Because the samples are independent, they can be of different sizes. This test can also be applied when the observations in a sample of data are ranks, i.e., ordinal data rather than direct measurements. Assumptions: Independent, Random Samples Populations Are Continuous
  • 17. Independent sample t-test Width of leaf (cms) Sunlight 6 4.8 5.1 5.5 5.4 4.1 5.3 4.5 4.3 5.1 5.5 4.1 6 6.1 4.8 5.1 6 6 5.5 4.1 6 4.8 5.6 4.8 5.1 5.5 5.7 Shade 6.5 5.5 6.3 7.2 6.9 6.8 5.5 5.9 6 6.3 7.2 6.8 5.5 6.5 6.5 5.5 6.3 6.7 7.2 6.8 5.5 6.5 7.3 5.5 6.3 7.2 7.5
  • 18. Mann Whitney U-test example …….  Calculate U values for both samples, U1, U2. Calculate U = min(U1, U2) Reject H0 if U is less than the Ucrit Null hypothesis: both samples come from the same underlying distribution
  • 19. Mann Whitney U-test example Width of leaf (cms) Sunlight 4.8 5.1 5.5 4.1 5.3 4.5 5.1 Shade 5.5 6.3 7.2 6.8 5.5 5.9 5.5 Wi dth Sunlight Shade 4.1 4.5 4.8 5.1 5.1 5.3 5.5 5.5 5.5 5.5 6.3 6.5 6.8 7.2 1 2 3 4 5 6 7 8 9 10 11 12 13 14 1 2 3 4.5 4.5 6 8.5 8.5 8.5 8.5 11 12 13 14 Rank total R1= 29.5 Rank total R2 = 75.5  Arrange all the observations into a single ranked series i.e., without regard to which sample they are in.  Rank them.
  • 20. • The following data shows the age at diagnosis of type II diabetes in young adults. Is the age at diagnosis different for males and females? – Males: 19 22 16 29 24 – Females: 20 11 17 12 • Arrange in order of magnitude Age 11 12 16 17 19 20 22 24 29 Rank 1 2 3 4 5 6 7 8 9 Group 2 2 1 2 1 2 1 1 1
  • 21. Example – t-test, M-W U test • Objectives – • to compare the anxiety levels across gender. • to compare pre and post anxiety levels Study to assess the effectiveness of intervention in reducing anxiety
  • 23. Results of t-test Group Statistics 11 17.64 5.390 28 15.57 3.605 sex 1 M 2 F SAAS.1 N Mean SD Independent Samples Test 1.394 37 .172 Equal variances assumed SAAS.1 t df Sig. (2-tailed) t-test for Equality of Means
  • 24. Results of M W U-test Ranks 11 26.05 286.50 28 17.63 493.50 sex 1 M 2 F SAAS.1 N Mean Rank Sum of Ranks Test Statisticsb 87.500 .037 Mann-Whitney U Asymp. Sig. (2-tailed) SAAS.1 Grouping Variable: sex b.
  • 25. Paired t-test Subjects 1 2 3 4 5 6 7 8 9 10 Anxiety - pre 142 140 144 144 142 146 149 150 142 148 Anxiety -post 138 136 147 139 143 141 143 145 136 146 Subjects 1 2 3 4 5 6 7 8 9 10 Anxiety - pre 142 140 144 144 142 146 149 150 142 148 Anxiety -post 138 136 147 139 143 141 143 145 136 146 Difference (d) 4 4 -3 5 -1 5 6 5 6 2    d SE d t
  • 26. Wilcoxon Signed rank test The Wilcoxon signed-rank test is a non-parametric statistical hypothesis test used when comparing two related samples, matched samples, or repeated measurements on a single sample to assess whether their population mean ranks differ (i.e. it is a paired difference test). H0: the median difference is zero It can be used as an alternative to the paired Student's t- test, t-test for matched pairs, or the t-test for dependent samples when the population cannot be assumed to be normally distributed
  • 27. Wilcoxon Signed rank test Subjects 1 2 3 4 5 6 7 8 9 10 Anxiety - pre 142 140 144 144 142 146 149 150 142 148 Anxiety -post 138 136 147 139 143 141 143 145 136 146 Difference (d) 4 4 -3 5 -1 5 6 5 6 2 Rank of d 4.5 4.5 3 7 1 7 9.5 7 9.5 2 Signed rank of d 4.5 4.5 -3 7 -1 7 9.5 7 9.5 2 Differences in order 1 2 3 4 4 5 5 5 6 6 Rank order 1 2 3 4 5 6 7 8 9 10 Ranks 1 2 3 4.5 4.5 7 7 7 9.5 9.5
  • 28.  Calculate the rank sum T– of the negative differences and the rank sum T+ of the positive differences.  Note: Differences equal to 0 are eliminated, and the number n of differences is reduced accordingly.  Test statistic is T = smaller of T– or T+.  Rejection region: T ≤ T0
  • 29. • 8 children with autism have enrolled in the study and the amount of time that each child is engaged in repetitive behavior during three hour observation periods are measured both before treatment and then again after taking the new medication for a period of 1 week. The data are shown below. Child 1 2 3 4 5 6 7 8 Before Treatment 85 70 40 65 80 75 55 20 After 1 Week of Treatment 75 50 50 40 20 65 40 25
  • 30. • First, we compute difference scores for each child. • H0: The median difference is zero • T+ = 32 and T- = 4. Child Before Treatment After 1 Week of Treatment Difference 1 85 75 10 2 70 50 20 3 40 50 -10 4 65 40 25 5 80 20 60 6 75 65 10 7 55 40 15 8 20 25 -5 Ordered Absolute Values of Differences Ranks -5 1 10 3 -10 3 10 3 15 5 20 6 25 7 60 8 Signed Ranks -1 3 -3 3 5 6 7 8
  • 31. Paired t-test – example Paired Samples Statistics 16.15 39 4.215 8.87 39 4.808 SAAS.1 SAAS.2 Pair 1 Mean N SD Paired Samples Test 8.778 38 .001 SAAS.1 - SAAS.2 Pair 1 t df Sig. (2-tailed)
  • 32. WSR test – example Ranks 34 19.44 661.00 2 2.50 5.00 3 39 Negative Ranks Positive Ranks Ties Total SAAS.2 - SAAS.1 N Mean Rank Sum of Ranks Test Statisticsb -5.157a .001 Z Asymp. Sig. (2-tailed) SAAS.2 - SAAS.1 Based on positive ranks. a. Wilcoxon Signed Ranks Test b.
  • 33. Two-sample paired sign test Assumptions:  The paired differences are independent.  Each paired difference comes from a continuous distribution with the same median.  Test statistic for the paired sign test is based only on the sign of the paired differences. Note that it is not assumed that the two samples are independent of each other. In fact, they should be related to each other such that they create pairs of data points.
  • 34. Parametric & non-parametric tests AIM Parametric t-tests Non parametric Compare one sample to a hypothetical value One-sample t-test Sign test Compare 2 independent sample means Student’s t-test Mann Whitney U Compare 2 paired sample means Paired t-test Wilcoxon Signed rank Compare more than 2 sample means ANOVA Kruskal-Wallis test Compare more than 2 samples - repeated Repeated measures ANOVA Friedman test Assesses the linear relation between two variables. Pearson’s correlation coefficient. Spearman rank correlation, Kendall Tau
  • 35. © 2011 Pearson Education, Inc Kruskal-Wallis H-Test • Tests the equality of more than two (k) population probability distributions • Corresponds to ANOVA for more than two means • Used to analyze completely randomized experimental designs • Uses 2 distribution with k – 1 df — if sample size nj ≥ 5
  • 36. Kruskal-Wallis H-Test for Comparing k Probability Distributions H0: The k probability distributions are identical Ha: At least one of the k probability distributions differ in location. Test statistic:     2 12 3 1 1 j j R H n n n n              Rejection region: H >  2 with (k – 1) degrees of freedom
  • 37. Kruskal-Wallis H-Test Procedure 1. Assign ranks, Ri , to the n combined observations • Smallest value = 1; largest value = n • Average ties 2. Sum ranks for each group 3. Compute test statistic     2 12 3 1 1 j j R H n n n n              Squared total of each group
  • 38. © 2011 Pearson Education, Inc Kruskal-Wallis H-Test Solution Raw Data Raw Data Mach1 Mach1 Mach2 Mach2 Mach3 Mach3 25.40 25.40 23.40 23.40 20.00 20.00 26.31 26.31 21.80 21.80 22.20 22.20 24.10 24.10 23.50 23.50 19.75 19.75 23.74 23.74 22.75 22.75 20.60 20.60 25.10 25.10 21.60 21.60 20.40 20.40 Ranks Mach1 Mach2 Mach3 14 9 2 15 6 7 12 10 1 11 8 4 13 5 3 65 38 17 Total
  • 39. © 2011 Pearson Education, Inc Kruskal-Wallis H-Test Solution                  2 2 2 2 12 3 1 1 65 38 17 12 3 16 15 16 5 5 5 12 191.6 48 240 11.58 j j R H n n n n                                          
  • 40. Descriptives 14 207.36 17.248 8 188.13 26.920 6 189.83 20.721 28 14 88.57 12.439 8 73.88 13.098 6 84.50 13.531 28 1 2 3 + Total 1 2 3 + Total Parental. attachment Self. Esteem N Mean SD ANOVA ANOVA 2405.76 2 1202.9 2.71 .086 11086.9 25 443.48 13492.7 27 1107.20 2 553.60 3.35 .051 4127.80 25 165.11 5235.00 27 Between Groups Within Groups Total Between Groups Within Groups Total Parental. attachment Self.Esteem Sum of Squares df Mean Square F Sig.
  • 41. Kruskal Wallis test Ranks 14 17.68 8 11.38 6 11.25 28 14 17.43 8 9.25 6 14.67 28 sibs. new 1 2 3 + Total 1 2 3 + Total Parental.attachment Self.Esteem N Mean Rank Test Statisticsa,b 4.186 5.049 2 2 .123 .080 Chi-Square df Asymp. Sig. Parental. attachment Self. Esteem Kruskal Wallis Test a. Grouping Variable: sibs.new b.
  • 42. Parametric & non-parametric tests AIM Non parametric tests - categorical outcomes Compare 2 unpaired samples Fisher's test (chi-square for large samples) Compare 2 paired samples McNemar’s test
  • 43. Why not use NP tests all the time? Parametric tests are often preferred because:  They are more robust.  They discard a lot of information  Parametric and non-parametric tests often address two different types of questions. Since nonparametric tests require fewer assumptions and can be used with a broader range of data types, this question arises.

Editor's Notes

  • #5: A normally distributed data tend to have the same number of data points on one side of the distribution as it does on the other side.
  • #39: Assume that the population is normally distributed. Allow students about 10 minutes to solve this.