SlideShare a Scribd company logo
PROBABILITY
Behavioral Statistics
Summer 2017
Dr. Germano
What is Probability?
• A method for measuring and quantifying the likelihood of
obtaining a specific sample from a specific population
I want to pick a single marble (a sample)
out of each of these jars (two populations)
What are my chances of getting a black marble from both jars?
Jar A Jar B50 white
50 black
10 white
90 black
You have a 50-50
chance of getting
either color
Your marble (your
sample) will
probably be black
What is Probability?
• A method for measuring and quantifying the likelihood of
obtaining a specific sample from a specific population
Now you are blindfolded when you pick your marble (your sample)
so you do not know from which jar (population) it comes from.
If you picked 4 black marbles in a row, which jar would say they came
from?
Jar A Jar B
Very high
probability they
came from this
one
Very low
probability they
came from this
one
Probability Defined
• The probability (p) of a specific outcome for any given
event is defined as the fraction or proportion of all the
possible outcomes (A, B, C, D, etc.)
p(A)= # of outcomes classified as A
total # of possible outcomes
A ratio that represents the likelihood that
some event will occur relative to the total
number of possible outcomes
Probability Defined
• If an event can occur in ‘A’ ways
• If an event can fail to occur in ‘B’ ways
• If all possible ways are equally likely (we guarantee this
through random selection)
• The probability of the event occurring:
• The probability of the event failing to occur:
 BA
A

 BA
B

p(A)= # of outcomes classified as A
total # of possible outcomes
A Demonstration
Select one card from a deck of 52 cards
What is the probability of selecting a king?
• The probability of the event occurring is computed by:
• A = 4
• there are 4 kings in the deck
• B = 48
• there are 48 cards that aren’t kings
in the deck
 BA
A
 52
4
)484(
4



A Demonstration
Select one card from a deck of 52 cards
What is the probability of selecting a king of diamonds?
The probability of the event occurring is computed by:
• A = 1
• There is 1 king of diamonds in
the deck
• B = 51
• There are 51 cards that are NOT
the king of diamonds in the deck
 BA
A
 52
1
)511(
1



Another way to think of probability
• Sample space: A set of possible outcomes
• Rolling a die: {1, 2, 3, 4, 5, 6}  6 possibilities
• Event: A subset of the sample space, the
occurrence of the thing you are interested in
• Die coming up odd: {1, 3, 5}  3 possibilities
Let’s say we are interested in the probability
a roll of a die will result in an odd number
The probability a die will come up odd is the number of event
possibilities over the number of sample space possibilities:
2
1
6
3
.#
.#
)( 
spacesample
iespossibilitevent
oddp
Interpreting Probability
• For any event (A), we assign a number p(A) between 0
and 1 that is interpreted as:
• The chance or likelihood that event A occurs
• The proportion of times that event A occurs over a series of trials
Continuing our rolling the die example…
p(odd) = ½ = 0.50 = 50%
We can express probability in a ratio, proportion, or %
Examples
If we spin this wheel: • What is the p of landing on 7?
• p(7) = 1/8
• p = 0.125 or 12.5%
• What is the p of landing on an
even number?
• p(even) = 4/8 = 1/2
• p = 0.50 or 50%
• What is the p of landing on a
white space?
• p(white) = 4/8 = 1/2
• p = 0.50 or 50%
More examples!
What is the probability of:
• Getting a correct answer on a multiple choice question
with five answer choices (assuming you make a blind
guess)?
• 1/5
• Getting a tail in
a regular coin toss?
• 1/2
• Rolling a 3 with a
regular die?
• 1/6
When is probability reliable?
• When you have a fair die, coin, roulette wheel, etc.
• There is an equal chance of every outcome
• If your coin is weighted to land on heads every time, the p(tail) is no
longer ½
To use probability in statistics, we must ensure that our
sample is chosen in a way to mirror this “fair chance”
Random Sampling
To insure that the definition of probability is accurate, we
must use random sampling
1. Every individual in the population of interest has an
equal chance of being selected
1. If more than one individual is to be selected for the
sample, there must be a constant probability for each
and every selection
• This may require sampling with replacement
An Example of Random Sampling
Imagine a large bag filled with coins.
The coins include half-dollars, quarters,
pennies, nickels, and dimes. You are
interested in estimating the weight of
this bag, but may not actually weigh every coin.
How would you estimate the weight?
Estimating the weight…
• Take a random sample of coins
• Every coin must have an equal chance
of being selected
• There must be a constant probability
for each and every selection
• Requires sampling with replacement
• Use this information to generalize what we might
expect the weight of the entire bag to be
The Role of Probability in Inferential Statistics
• Probability is used to predict what kind of samples are
likely to be obtained from a population
• Establishes a connection between samples and populations
• Inferential statistics rely on this connection when they use
sample data as the basis for making conclusions about
populations
Probability and Statistics
This is how we use probability in statistics
We think ‘backwards’ because we typically do not have the
population information
Based on the sample,
what do we estimate the
population to look like?
vs.
Based on the population’s
characteristics, what do we
estimate our sample to look like?
Probability and Frequency Distributions
• X: 1, 1, 2, 3, 3, 4, 4, 4, 5, 6
• N = 10
If I sample n = 1:
• p(X < 2)?
• 2/10 = 1/5
• p(X > 3)?
• 5/10 = 1/2
• p(X = 3)?
• 2/10 = 1/5
PROBABILITY AND THE
NORMAL DISTRIBUTION
68.26%
94.46%
99.73%
Probability and the Normal Distribution
We can describe a normal distribution by the
proportions of area contained in each section
• Symmetric: Proportions on the right side will correspond
to the mirror image on the left
• Since we can identify sections with
z-scores, these proportions
correspond to any normal
distribution (regardless of
actual mean and standard
deviation values)
μ
34.13%
13.59%
2.14%
Because the z-distribution is normally distributed we
can make certain assumptions about probabilities
The z-score allows us to
draw a vertical line
through the distribution,
dividing the distribution
into two sections
The Body The Tail
μ
34.13%
13.59%
2.14%
Some probabilities associated with the
normal distribution
• 34.13% of the distribution falls between the mean and the
first standard deviation (z = +1.00; z = -1.00)
• 13.59% of the distribution falls between the 1st and 2nd
standard deviation
• 2.14% of the distribution falls
between the 2nd and 3rd
standard deviations
Probability and
the Normal
Distribution
It is good to have a
general idea of
proportions within the
curve
What if you need to find
the proportion below a z-
score of -0.85?
The Unit Normal Table
(Appendix B, pp.699-
702) lists z-scores and
their corresponding
proportions
Note: The Unit Normal Table lists positive z-scores only because the curve
is symmetric; values can be used for negative z-scores as well.
The Unit Normal Table
• A negative z-score: tail is on the left side
• A positive z-score: tail is on the right side
Because the Unit Normal Table only lists positive numbers, it is helpful to draw
the curve and the area you want before looking up the proportion so you know
which value (body or tail) you want
Finding the probability of a z-score
What is the proportion of the normal distribution associated
with the following sections of a graph:
• z > 1.25
Interpreted as:
• The probability of selecting a participant from this population with a
z-score greater than 1.25 is 0.1056
• The proportion of the distribution with a z-score greater than 1.25 is
10.56%
• z < 0.00
Interpreted as:
• The probability of selecting a participant from this population with a
z-score less than 0.00 is 0.5000
• The proportion of the distribution with a z-score less than 0 is 50%
p = 0.1056
p = 0.5000
PROBABILITIES AND
PROPORTIONS FOR SCORES
from a Normal Distribution
Finding the probability of a raw score from
a normal distribution
What is the probability of randomly selecting a
person with an IQ greater than 115 if the
population μ=100 and σ=10?
1st step: Calculate the z-score
2nd step: Find the proportion of the curve greater
than the z-score.
• z = 1.50
• p(>1.50) = p in the tail (Column C) for 1.50 = 0.0668
50.1
10
)100115(


z
You Try It:
For a distribution with a μ = 100 and σ = 10:
What is the probability of randomly selecting a
person with an IQ less than 90? Greater than 125?
00.1
10
)10090(


z 50.2
10
)100125(


z
Proportion in the tail above z = 2.50:
p = 0.0062
This very high z-score indicates it is at the
extreme high end of the distribution, with few
scores above
Proportion in the tail below z = -1.00:
p = 0.1587
Finding the Probability for a Range of
Scores
What is the probability of randomly selecting a person
with an IQ between 95 and 110? (μ=100 σ=10)
• Find the two z-scores, and then find the proportion that
covers the space between these two scores
00.1
10
)100110(
50.0
10
)10095(






z
z
Add the distances of each z-score from
the mean (Column D):
(0.1915 + 0.3413) = 0.533
Or, subtract the proportion in the tail of
one score from the proportion in the
body of the other score (e.g., Body of
110 – Tail of 95)
(0.8413 - 0.3085) = 0.533
p = 95< IQ <110( )
Finding the z-score
corresponding to a certain proportion
• If you know a specific location (z-score) in a normal
distribution, you can use the table to look up
corresponding proportions.
• If you know a specific proportion(s), you can use the table
to look up the exact z-score location in the distribution
Example (1)
• What z-score separates the lowest 5% from the
remainder of the distribution?
• Find the z-score with p = 0.05 in the tail
• z-table shows us z = 1.65 has p = 0.0495
(closest to p = 0.05 without exceeding it)
• Remember we want the lowest 5%, and the table only
shows us positive z-scores
• Since the curve is symmetric: a z-score of z = -1.65
separates the lowest 5% from the remainder of the
distribution
Example (2)
• What z-scores separate the extreme 5% scores
from the rest of the distribution?
• We want the lowest and highest in this 5%
• Lowest score separates (5%/2) = 2.5% lowest and
highest score separates (5%/2) = 2.5% highest
• We want to find the z-scores with p = 0.025 in tails
• z-scores of z = +/- 1.96 separate these scores from the
rest of the distribution
Solving for a specific score:
• If I want the top 10%, what is the cutoff score?
(μ = 100 σ = 15)
• z-score that separates p = .0100 closest without
going over is z = 1.29
• Now I want to solve for X to find the cutoff score
2.1191002.19100)15(28.1  zX
Recall from earlier, we can express cumulative
percentages as percentile ranks.
To find the 84th percentile we would find the score that separates p = 0.8400
below (body) and p = 0.1600 above (tail). (approximately z = 0.99)
Looking ahead to inferential statistics
• Based on our observations of a sample, we make
inferences about the population.
• It is with statistics based on probability that we determine
how likely our sample represents our population of
interest
• or, how unlikely our sample is compared to the population after
some treatment/intervention/etc.

More Related Content

PDF
Introduction to the t-test
Kaori Kubo Germano, PhD
 
PDF
Probability & Samples
Kaori Kubo Germano, PhD
 
PDF
Central Tendency
Kaori Kubo Germano, PhD
 
PPTX
Probability
jasondroesch
 
PDF
Repeated Measures t-test
Kaori Kubo Germano, PhD
 
PPTX
Z-scores: Location of Scores and Standardized Distributions
jasondroesch
 
PDF
Independent samples t-test
Kaori Kubo Germano, PhD
 
Introduction to the t-test
Kaori Kubo Germano, PhD
 
Probability & Samples
Kaori Kubo Germano, PhD
 
Central Tendency
Kaori Kubo Germano, PhD
 
Probability
jasondroesch
 
Repeated Measures t-test
Kaori Kubo Germano, PhD
 
Z-scores: Location of Scores and Standardized Distributions
jasondroesch
 
Independent samples t-test
Kaori Kubo Germano, PhD
 

What's hot (20)

PPTX
Introduction to Hypothesis Testing
jasondroesch
 
PPTX
Probability and Samples: The Distribution of Sample Means
jasondroesch
 
PPTX
Stats - Intro to Quantitative
Michigan State University
 
PPTX
The t Test for Two Related Samples
jasondroesch
 
PPTX
Basic statistics for algorithmic trading
QuantInsti
 
PDF
Choosing the right statistics
Kaori Kubo Germano, PhD
 
PPTX
Central tendency and Variation or Dispersion
Johny Kutty Joseph
 
PDF
Analysis of Variance
Kaori Kubo Germano, PhD
 
PDF
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
nszakir
 
PPTX
Statistics - Basics
Andrew Ferlitsch
 
PDF
Data sampling and probability
Avjinder (Avi) Kaler
 
PPTX
Data Distribution &The Probability Distributions
mahaaltememe
 
PPTX
Measures of Central Tendency
jasondroesch
 
PPTX
Measures of relationship
Johny Kutty Joseph
 
PPTX
Chapter 3 sampling and sampling distribution
Antonio F. Balatar Jr.
 
PPT
One Sample T Test
shoffma5
 
PPTX
The t Test for Two Independent Samples
jasondroesch
 
PPTX
Introduction to Statistics
jasondroesch
 
PDF
Statistics lecture 7 (ch6)
jillmitchell8778
 
PPT
Sampling distribution
Nilanjan Bhaumik
 
Introduction to Hypothesis Testing
jasondroesch
 
Probability and Samples: The Distribution of Sample Means
jasondroesch
 
Stats - Intro to Quantitative
Michigan State University
 
The t Test for Two Related Samples
jasondroesch
 
Basic statistics for algorithmic trading
QuantInsti
 
Choosing the right statistics
Kaori Kubo Germano, PhD
 
Central tendency and Variation or Dispersion
Johny Kutty Joseph
 
Analysis of Variance
Kaori Kubo Germano, PhD
 
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
nszakir
 
Statistics - Basics
Andrew Ferlitsch
 
Data sampling and probability
Avjinder (Avi) Kaler
 
Data Distribution &The Probability Distributions
mahaaltememe
 
Measures of Central Tendency
jasondroesch
 
Measures of relationship
Johny Kutty Joseph
 
Chapter 3 sampling and sampling distribution
Antonio F. Balatar Jr.
 
One Sample T Test
shoffma5
 
The t Test for Two Independent Samples
jasondroesch
 
Introduction to Statistics
jasondroesch
 
Statistics lecture 7 (ch6)
jillmitchell8778
 
Sampling distribution
Nilanjan Bhaumik
 
Ad

Similar to Probablity (20)

PPTX
introduction to probability
Todd Bill
 
PPTX
Introduction to Probability
Todd Bill
 
PPTX
02. Probability - Statistics for the behavioral sciences.pptx
aveadrianapinem1
 
PPTX
Probability theory
Regent University
 
PPTX
Probability, Normal Distribution and Z-Scores.pptx
OoKitt
 
PPTX
RSS probability theory
Kaimrc_Rss_Jd
 
DOCX
The Standard NormalDistribution and z ScoresKeren SuCorbisC.docx
irened6
 
PDF
group1-151014013653-lva1-app6891.pdf
VenkateshPandiri4
 
PPTX
introduction to probability
lovemucheca
 
PPTX
PROBABILITY
manas das
 
PPTX
Basic Probability Distribution
Sreeraj S R
 
PPTX
probability powerpoint presentation with text
ayuschatterjee2003
 
PPTX
A powerful powerpoint presentation on probability
ayuschatterjee2003
 
PDF
Z-score and probability in statistics.pdf
pmbadullage
 
PDF
Statistical inference: Probability and Distribution
Eugene Yan Ziyou
 
PPT
continuous probability distributions.ppt
LLOYDARENAS1
 
PPT
lecture6.ppt
Temporary57
 
DOCX
Unit 2 Probability
Rai University
 
PPT
What are the chances? - Probability
mscartersmaths
 
PPTX
Probability
Sanika Savdekar
 
introduction to probability
Todd Bill
 
Introduction to Probability
Todd Bill
 
02. Probability - Statistics for the behavioral sciences.pptx
aveadrianapinem1
 
Probability theory
Regent University
 
Probability, Normal Distribution and Z-Scores.pptx
OoKitt
 
RSS probability theory
Kaimrc_Rss_Jd
 
The Standard NormalDistribution and z ScoresKeren SuCorbisC.docx
irened6
 
group1-151014013653-lva1-app6891.pdf
VenkateshPandiri4
 
introduction to probability
lovemucheca
 
PROBABILITY
manas das
 
Basic Probability Distribution
Sreeraj S R
 
probability powerpoint presentation with text
ayuschatterjee2003
 
A powerful powerpoint presentation on probability
ayuschatterjee2003
 
Z-score and probability in statistics.pdf
pmbadullage
 
Statistical inference: Probability and Distribution
Eugene Yan Ziyou
 
continuous probability distributions.ppt
LLOYDARENAS1
 
lecture6.ppt
Temporary57
 
Unit 2 Probability
Rai University
 
What are the chances? - Probability
mscartersmaths
 
Probability
Sanika Savdekar
 
Ad

More from Kaori Kubo Germano, PhD (9)

PDF
Hypothesis testing
Kaori Kubo Germano, PhD
 
PDF
Correlations
Kaori Kubo Germano, PhD
 
PDF
Factorial ANOVA
Kaori Kubo Germano, PhD
 
PDF
Repeated Measures ANOVA
Kaori Kubo Germano, PhD
 
PDF
Variability
Kaori Kubo Germano, PhD
 
PDF
Frequency Distributions
Kaori Kubo Germano, PhD
 
PDF
Behavioral Statistics Intro lecture
Kaori Kubo Germano, PhD
 
Hypothesis testing
Kaori Kubo Germano, PhD
 
Factorial ANOVA
Kaori Kubo Germano, PhD
 
Repeated Measures ANOVA
Kaori Kubo Germano, PhD
 
Frequency Distributions
Kaori Kubo Germano, PhD
 
Behavioral Statistics Intro lecture
Kaori Kubo Germano, PhD
 

Recently uploaded (20)

DOCX
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTX
Care of patients with elImination deviation.pptx
AneetaSharma15
 
PPTX
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
PPTX
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
PDF
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
PPTX
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
PPTX
An introduction to Dialogue writing.pptx
drsiddhantnagine
 
PPTX
Introduction to pediatric nursing in 5th Sem..pptx
AneetaSharma15
 
PDF
2.Reshaping-Indias-Political-Map.ppt/pdf/8th class social science Exploring S...
Sandeep Swamy
 
PPTX
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
PDF
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
PPTX
CARE OF UNCONSCIOUS PATIENTS .pptx
AneetaSharma15
 
PPTX
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
DOCX
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
PPTX
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
PPTX
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
PPTX
CDH. pptx
AneetaSharma15
 
PPTX
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
PPTX
Kanban Cards _ Mass Action in Odoo 18.2 - Odoo Slides
Celine George
 
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
Care of patients with elImination deviation.pptx
AneetaSharma15
 
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
An introduction to Dialogue writing.pptx
drsiddhantnagine
 
Introduction to pediatric nursing in 5th Sem..pptx
AneetaSharma15
 
2.Reshaping-Indias-Political-Map.ppt/pdf/8th class social science Exploring S...
Sandeep Swamy
 
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
CARE OF UNCONSCIOUS PATIENTS .pptx
AneetaSharma15
 
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
CDH. pptx
AneetaSharma15
 
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
Kanban Cards _ Mass Action in Odoo 18.2 - Odoo Slides
Celine George
 

Probablity

  • 2. What is Probability? • A method for measuring and quantifying the likelihood of obtaining a specific sample from a specific population I want to pick a single marble (a sample) out of each of these jars (two populations) What are my chances of getting a black marble from both jars? Jar A Jar B50 white 50 black 10 white 90 black You have a 50-50 chance of getting either color Your marble (your sample) will probably be black
  • 3. What is Probability? • A method for measuring and quantifying the likelihood of obtaining a specific sample from a specific population Now you are blindfolded when you pick your marble (your sample) so you do not know from which jar (population) it comes from. If you picked 4 black marbles in a row, which jar would say they came from? Jar A Jar B Very high probability they came from this one Very low probability they came from this one
  • 4. Probability Defined • The probability (p) of a specific outcome for any given event is defined as the fraction or proportion of all the possible outcomes (A, B, C, D, etc.) p(A)= # of outcomes classified as A total # of possible outcomes A ratio that represents the likelihood that some event will occur relative to the total number of possible outcomes
  • 5. Probability Defined • If an event can occur in ‘A’ ways • If an event can fail to occur in ‘B’ ways • If all possible ways are equally likely (we guarantee this through random selection) • The probability of the event occurring: • The probability of the event failing to occur:  BA A   BA B  p(A)= # of outcomes classified as A total # of possible outcomes
  • 6. A Demonstration Select one card from a deck of 52 cards What is the probability of selecting a king? • The probability of the event occurring is computed by: • A = 4 • there are 4 kings in the deck • B = 48 • there are 48 cards that aren’t kings in the deck  BA A  52 4 )484( 4   
  • 7. A Demonstration Select one card from a deck of 52 cards What is the probability of selecting a king of diamonds? The probability of the event occurring is computed by: • A = 1 • There is 1 king of diamonds in the deck • B = 51 • There are 51 cards that are NOT the king of diamonds in the deck  BA A  52 1 )511( 1   
  • 8. Another way to think of probability • Sample space: A set of possible outcomes • Rolling a die: {1, 2, 3, 4, 5, 6}  6 possibilities • Event: A subset of the sample space, the occurrence of the thing you are interested in • Die coming up odd: {1, 3, 5}  3 possibilities Let’s say we are interested in the probability a roll of a die will result in an odd number The probability a die will come up odd is the number of event possibilities over the number of sample space possibilities: 2 1 6 3 .# .# )(  spacesample iespossibilitevent oddp
  • 9. Interpreting Probability • For any event (A), we assign a number p(A) between 0 and 1 that is interpreted as: • The chance or likelihood that event A occurs • The proportion of times that event A occurs over a series of trials Continuing our rolling the die example… p(odd) = ½ = 0.50 = 50% We can express probability in a ratio, proportion, or %
  • 10. Examples If we spin this wheel: • What is the p of landing on 7? • p(7) = 1/8 • p = 0.125 or 12.5% • What is the p of landing on an even number? • p(even) = 4/8 = 1/2 • p = 0.50 or 50% • What is the p of landing on a white space? • p(white) = 4/8 = 1/2 • p = 0.50 or 50%
  • 11. More examples! What is the probability of: • Getting a correct answer on a multiple choice question with five answer choices (assuming you make a blind guess)? • 1/5 • Getting a tail in a regular coin toss? • 1/2 • Rolling a 3 with a regular die? • 1/6
  • 12. When is probability reliable? • When you have a fair die, coin, roulette wheel, etc. • There is an equal chance of every outcome • If your coin is weighted to land on heads every time, the p(tail) is no longer ½ To use probability in statistics, we must ensure that our sample is chosen in a way to mirror this “fair chance”
  • 13. Random Sampling To insure that the definition of probability is accurate, we must use random sampling 1. Every individual in the population of interest has an equal chance of being selected 1. If more than one individual is to be selected for the sample, there must be a constant probability for each and every selection • This may require sampling with replacement
  • 14. An Example of Random Sampling Imagine a large bag filled with coins. The coins include half-dollars, quarters, pennies, nickels, and dimes. You are interested in estimating the weight of this bag, but may not actually weigh every coin. How would you estimate the weight?
  • 15. Estimating the weight… • Take a random sample of coins • Every coin must have an equal chance of being selected • There must be a constant probability for each and every selection • Requires sampling with replacement • Use this information to generalize what we might expect the weight of the entire bag to be
  • 16. The Role of Probability in Inferential Statistics • Probability is used to predict what kind of samples are likely to be obtained from a population • Establishes a connection between samples and populations • Inferential statistics rely on this connection when they use sample data as the basis for making conclusions about populations
  • 17. Probability and Statistics This is how we use probability in statistics We think ‘backwards’ because we typically do not have the population information Based on the sample, what do we estimate the population to look like? vs. Based on the population’s characteristics, what do we estimate our sample to look like?
  • 18. Probability and Frequency Distributions • X: 1, 1, 2, 3, 3, 4, 4, 4, 5, 6 • N = 10 If I sample n = 1: • p(X < 2)? • 2/10 = 1/5 • p(X > 3)? • 5/10 = 1/2 • p(X = 3)? • 2/10 = 1/5
  • 20. 68.26% 94.46% 99.73% Probability and the Normal Distribution We can describe a normal distribution by the proportions of area contained in each section • Symmetric: Proportions on the right side will correspond to the mirror image on the left • Since we can identify sections with z-scores, these proportions correspond to any normal distribution (regardless of actual mean and standard deviation values)
  • 21. μ 34.13% 13.59% 2.14% Because the z-distribution is normally distributed we can make certain assumptions about probabilities The z-score allows us to draw a vertical line through the distribution, dividing the distribution into two sections The Body The Tail
  • 22. μ 34.13% 13.59% 2.14% Some probabilities associated with the normal distribution • 34.13% of the distribution falls between the mean and the first standard deviation (z = +1.00; z = -1.00) • 13.59% of the distribution falls between the 1st and 2nd standard deviation • 2.14% of the distribution falls between the 2nd and 3rd standard deviations
  • 23. Probability and the Normal Distribution It is good to have a general idea of proportions within the curve What if you need to find the proportion below a z- score of -0.85? The Unit Normal Table (Appendix B, pp.699- 702) lists z-scores and their corresponding proportions Note: The Unit Normal Table lists positive z-scores only because the curve is symmetric; values can be used for negative z-scores as well.
  • 24. The Unit Normal Table • A negative z-score: tail is on the left side • A positive z-score: tail is on the right side Because the Unit Normal Table only lists positive numbers, it is helpful to draw the curve and the area you want before looking up the proportion so you know which value (body or tail) you want
  • 25. Finding the probability of a z-score What is the proportion of the normal distribution associated with the following sections of a graph: • z > 1.25 Interpreted as: • The probability of selecting a participant from this population with a z-score greater than 1.25 is 0.1056 • The proportion of the distribution with a z-score greater than 1.25 is 10.56% • z < 0.00 Interpreted as: • The probability of selecting a participant from this population with a z-score less than 0.00 is 0.5000 • The proportion of the distribution with a z-score less than 0 is 50% p = 0.1056 p = 0.5000
  • 26. PROBABILITIES AND PROPORTIONS FOR SCORES from a Normal Distribution
  • 27. Finding the probability of a raw score from a normal distribution What is the probability of randomly selecting a person with an IQ greater than 115 if the population μ=100 and σ=10? 1st step: Calculate the z-score 2nd step: Find the proportion of the curve greater than the z-score. • z = 1.50 • p(>1.50) = p in the tail (Column C) for 1.50 = 0.0668 50.1 10 )100115(   z
  • 28. You Try It: For a distribution with a μ = 100 and σ = 10: What is the probability of randomly selecting a person with an IQ less than 90? Greater than 125? 00.1 10 )10090(   z 50.2 10 )100125(   z Proportion in the tail above z = 2.50: p = 0.0062 This very high z-score indicates it is at the extreme high end of the distribution, with few scores above Proportion in the tail below z = -1.00: p = 0.1587
  • 29. Finding the Probability for a Range of Scores What is the probability of randomly selecting a person with an IQ between 95 and 110? (μ=100 σ=10) • Find the two z-scores, and then find the proportion that covers the space between these two scores 00.1 10 )100110( 50.0 10 )10095(       z z Add the distances of each z-score from the mean (Column D): (0.1915 + 0.3413) = 0.533 Or, subtract the proportion in the tail of one score from the proportion in the body of the other score (e.g., Body of 110 – Tail of 95) (0.8413 - 0.3085) = 0.533 p = 95< IQ <110( )
  • 30. Finding the z-score corresponding to a certain proportion • If you know a specific location (z-score) in a normal distribution, you can use the table to look up corresponding proportions. • If you know a specific proportion(s), you can use the table to look up the exact z-score location in the distribution
  • 31. Example (1) • What z-score separates the lowest 5% from the remainder of the distribution? • Find the z-score with p = 0.05 in the tail • z-table shows us z = 1.65 has p = 0.0495 (closest to p = 0.05 without exceeding it) • Remember we want the lowest 5%, and the table only shows us positive z-scores • Since the curve is symmetric: a z-score of z = -1.65 separates the lowest 5% from the remainder of the distribution
  • 32. Example (2) • What z-scores separate the extreme 5% scores from the rest of the distribution? • We want the lowest and highest in this 5% • Lowest score separates (5%/2) = 2.5% lowest and highest score separates (5%/2) = 2.5% highest • We want to find the z-scores with p = 0.025 in tails • z-scores of z = +/- 1.96 separate these scores from the rest of the distribution
  • 33. Solving for a specific score: • If I want the top 10%, what is the cutoff score? (μ = 100 σ = 15) • z-score that separates p = .0100 closest without going over is z = 1.29 • Now I want to solve for X to find the cutoff score 2.1191002.19100)15(28.1  zX
  • 34. Recall from earlier, we can express cumulative percentages as percentile ranks. To find the 84th percentile we would find the score that separates p = 0.8400 below (body) and p = 0.1600 above (tail). (approximately z = 0.99)
  • 35. Looking ahead to inferential statistics • Based on our observations of a sample, we make inferences about the population. • It is with statistics based on probability that we determine how likely our sample represents our population of interest • or, how unlikely our sample is compared to the population after some treatment/intervention/etc.