SlideShare a Scribd company logo
Probability Theory


Introduction to Probability Theory
               Phong VO
       vdphong@fit.hcmus.edu.vn

           September 11, 2010




            – Typeset by FoilTEX –
Why Learn Probability?


• Statistics, data mining, and machine learning are all concerned with
  collecting and analyzing data.

• Using fancy tools like neural nets, boosting and support vector machines
  without understanding statistics is like doing brain surgery before knowing
  how to use a band-aid.

• Prediction, classification, clustering, and estimation are all special cases
  of statistical inference.

• Data analysis, machine learning, and data mining are various names
  given to the practice of statistical inference, depending on the context.

– Typeset by FoilTEX –                                                      1
Probability


             Data generating process                               Observed data

                                       Inference and Data mining




• Probability theory is the formal language of uncertainty, which is the
  basis of statistical inference.

• The basic problem that we study in probability is: Given a data generating
  process, what are the properties of the outcomes?

• The basic problem of statistical inference is the inverse of probability :
  Given the outcomes, what can we say about the process that generated
  the data?


– Typeset by FoilTEX –                                                             2
Introduction



• Any realistic model of a real-world phenomenon must take into account
  the possibility of randomness.

• The quantities will not be predicted in advanced but will exbihit an
  inherent variation that should be taken into account by the model

• Probability model: allowing the model to be probabilistic in nature




– Typeset by FoilTEX –                                                  3
Sample Space and Events



• Assumed an experiment whose outcome is not predictable in advance

• Suppose that the set of all possible outcomes is known

• This set is known as the sample space of the experiment, denoted by S

Example 1. If the experiment consists of the flipping of a coin,


                              S = {Head, T ail}

Example 2. If the experiment consists of tossing a die, then the sample
space is

– Typeset by FoilTEX –                                                    4
S = {1, 2, 3, 4, 5, 6}

• Any subset E of the sample space S is known as an event.

Example 3. If E = {Head} then E is the event that a head appears on
the flip of the coin.

Example 4. If E = T then E would be the event that a tail appears.




– Typeset by FoilTEX –                                               5
Probabilities Defined on Events


Definition 1. Given an experiment with sample space S, for each event
E of the sample space S, P (E) is the probability of the event E and it
satisfies the following three conditions:

Axiom 1 0 ≤ P (E) ≤ 1

Axiom 2 P (S) = 1

Axiom 3 ∀ E1, E2, . . ., En ∩ Em =               m = n, then

                                     ∞            ∞
                                P(       En) =         P (En)
                                  n=1            n=1


– Typeset by FoilTEX –                                                6
Example 5. In the coin tossing example, if we assume that a head is
equally likely to appear as a tail, then we would have

                                                 1
                          P ({H}) = P ({T }) =
                                                 2
   On the other hand, if we had a biased coin and felt that a head was
twice as likely to appear as a tail, then we would have

                                  2             1
                         P ({H}) = , P ({T }) =
                                  3             3




– Typeset by FoilTEX –                                               7
Conditional Probabilities



• Suppose that we observe that the sum of the first die is a 4. What is
  the probability that the sum of the two dice equals 6?

• Let E and F denote respectively the event that the sum of the dice is 6
  and the event that the first die is 4

• If the event F occurs, then in order for E to occur it is necessary for the
  actual occurrence to be a point in both E and in F , it must be in EF .

• Once F has occurred, F becomes our new sample space; hence the
  probability that the event EF occurs will equal the probability of EF
  relative to the probability of F

– Typeset by FoilTEX –                                                      8
Definition 2. If P (B) > 0 then the conditional probability of E given F
is
                                    P (EF )
                         P (E|F ) =
                                     P (F )
Example 6. Suppose cards numbered 1 through 10 are placed in a hat,
mixed up, and then one of the cards is drawn. If we are told that the number
on the drawn card is at least 5, then what is the conditional probability that
it is 10?

Example 7. Suppose that each of 3 men at a party thrwos his hat into
the center of the room. The hats are first mixed up and then each man
randomly selects a hat. What is the probability that none of the three men
selects his own hat?




– Typeset by FoilTEX –                                                       9
Independent Events


Definition 3. Two events E and F are said to be independent if


                              P (EF ) = P (E)P (F )

     A set of events {Ai : i ∈ S} is independent if



                          P           Ai   =         P (Ai)
                                i∈Ω            i∈Ω


     for every finite subset Ω of S.

– Typeset by FoilTEX –                                          10
Example 8. Suppose we toss two fair dice. Let E1 denote the event that
the sum of the dice is six and F denote the event that the first die equals
four. Then

                                                    1
                         P (E1F ) = P ({4, 2}) =
                                                   36
     while

                                        51    5
                         P (E1)P (F ) =     =
                                        36 6 216
   and hence E1 and F are not independent. Why? Let E2 be the event
that the sum of the dice equals seven. Is E2 independent of F ?



– Typeset by FoilTEX –                                                  11
Bayes’ Formula


     Let E and F be events. We may express E as


                            E = EF     EF c, in whichF c = S − F

   for in order for a point to be in E, it must either be in both E and F .
Since EF and EF c are mutually exclusive, we have



                         P (E) = P (EF ) + P (EF c)                      (1)
                              = P (E|F )P (F ) + P (E|F c)P (F c)        (2)
                              = P (E|F )P (F ) + P (E|F c)(1 − P (F ))   (3)

– Typeset by FoilTEX –                                                    12
Example 9. Consider two urns. The first contains 2 white and 7 black
balls, and the second contains 5 whites and 6 black balls. We flip a fair
coin and then draw a ball from the first urn or the second urn depending on
whether the outcome was heads or tails. What is the conditional probability
that the outcome of the toss was heads given that a white ball was seleted?




– Typeset by FoilTEX –                                                   13
Bayes’ Formula: General case


                                                                 n
• Suppose F1, F2, . . . , Fn are mutually exclusive events s.t   i=1 Fi   = S.

• Exactly one of the events F1, F2, . . . , Fn will occur.
                           n
• By writing E = i=1 EFi and using the fact that EFi, i = 1, . . . , n are
  mutually exclusive, we obtain
                                         n
                               P (E) =         EFi                          (4)
                                         i=1
                                          n
                                    =          P (E|Fi)P (Fi)               (5)
                                         i=1


– Typeset by FoilTEX –                                                           14
Suppose E has occurred and we are interested in determining which one
of the Fj also occurred. From Equation 4 we have that



                                       P (EFj )
                         P (Fj |E) =                                  (6)
                                        P (E)
                                         P (E|Fj )P (Fj )
                                  =      n                            (7)
                                         i=1 P (E|Fi )P (Fi )


This equation is also known as Bayes’ formula.




– Typeset by FoilTEX –                                                 15

More Related Content

PDF
Probability
Surya Swaroop
 
PDF
A factorization theorem for generalized exponential polynomials with infinite...
Pim Piepers
 
PDF
Presentation iaf 2014 v1
Fayçal Touazi
 
PDF
Existence and Uniqueness of Algebraic Closure
Ayan Sengupta
 
PPT
92 arithmetic sequences
math126
 
PPTX
91 sequences
math126
 
PDF
Steiner Tree Parameterized by Treewidth
ASPAK2014
 
PDF
Cut and Count
ASPAK2014
 
Probability
Surya Swaroop
 
A factorization theorem for generalized exponential polynomials with infinite...
Pim Piepers
 
Presentation iaf 2014 v1
Fayçal Touazi
 
Existence and Uniqueness of Algebraic Closure
Ayan Sengupta
 
92 arithmetic sequences
math126
 
91 sequences
math126
 
Steiner Tree Parameterized by Treewidth
ASPAK2014
 
Cut and Count
ASPAK2014
 

Viewers also liked (20)

PDF
Intro probability 3
Phong Vo
 
PDF
Lecture slides stats1.13.l07.air
atutor_te
 
PPT
Probability And Random Variable Lecture(Lec8)
University of Gujrat, Pakistan
 
PPTX
Attractive ppt on Hypothesis by ammara aftab
University of Karachi
 
PPTX
Bivariate data
julienorman80065
 
PPT
Exploring bivariate data
Ulster BOCES
 
PDF
Intro probability 4
Phong Vo
 
PDF
Intro probability 2
Phong Vo
 
PPTX
Sampling and Sampling Distributions
Bk Islam Mumitul
 
PPTX
Sampling distribution concepts
umar sheikh
 
PPT
Discrete Probability Distributions
mandalina landy
 
PPT
Sampling distribution
Nilanjan Bhaumik
 
PPTX
Attribution theory
Muhammad Sohaib Afzaal
 
ODP
ANOVA II
James Neill
 
PPTX
Attribution Theory ppt
Abhishek Panda
 
ODP
Multiple Linear Regression II and ANOVA I
James Neill
 
PPT
Data analysis powerpoint
Sarah Hallum
 
ODP
Multiple linear regression II
James Neill
 
ODP
Multiple linear regression
James Neill
 
PPT
Test of hypothesis
vikramlawand
 
Intro probability 3
Phong Vo
 
Lecture slides stats1.13.l07.air
atutor_te
 
Probability And Random Variable Lecture(Lec8)
University of Gujrat, Pakistan
 
Attractive ppt on Hypothesis by ammara aftab
University of Karachi
 
Bivariate data
julienorman80065
 
Exploring bivariate data
Ulster BOCES
 
Intro probability 4
Phong Vo
 
Intro probability 2
Phong Vo
 
Sampling and Sampling Distributions
Bk Islam Mumitul
 
Sampling distribution concepts
umar sheikh
 
Discrete Probability Distributions
mandalina landy
 
Sampling distribution
Nilanjan Bhaumik
 
Attribution theory
Muhammad Sohaib Afzaal
 
ANOVA II
James Neill
 
Attribution Theory ppt
Abhishek Panda
 
Multiple Linear Regression II and ANOVA I
James Neill
 
Data analysis powerpoint
Sarah Hallum
 
Multiple linear regression II
James Neill
 
Multiple linear regression
James Neill
 
Test of hypothesis
vikramlawand
 
Ad

Similar to Intro probability 1 (20)

KEY
Probability Review
Tomoki Tsuchida
 
PDF
Probability Basics and Bayes' Theorem
MenglinLiu1
 
PDF
Note 1 probability
Nur Suaidah
 
PPT
Nossi ch 10
lesaturner
 
DOCX
Probability[1]
indu thakur
 
PPTX
History of probability CHAPTER 5 Engineering
prhaseeb
 
PDF
Course material mca
shagufthaiffath
 
PDF
mathes probabality mca syllabus for probability and stats
ganesh545860
 
PPTX
PRP - Unit 1.pptx
AryanArora64
 
PPT
Section 88888888888888888888888.6 MA.ppt
enasabdulrahman
 
PPTX
MATHEMATICS, Science and technology this ppt very help
naskarraja658
 
PPTX
probability-180324013552.pptx
Vukile Xhego
 
PPTX
Probability (gr.11)
Vukile Xhego
 
PPT
Probability concepts and procedures law of profitability
kamalsapkota13
 
DOCX
3 PROBABILITY TOPICSFigure 3.1 Meteor showers are rare, .docx
tamicawaysmith
 
PDF
Chapter7ppt.pdf
SohailBhatti21
 
PPT
Discrete probability
Ranjan Kumar
 
PPT
Probability notes for the UG/PG students
Geeta Arora
 
PDF
Lecture 1,2 maths presentation slides.pdf
rahulknirala2005
 
Probability Review
Tomoki Tsuchida
 
Probability Basics and Bayes' Theorem
MenglinLiu1
 
Note 1 probability
Nur Suaidah
 
Nossi ch 10
lesaturner
 
Probability[1]
indu thakur
 
History of probability CHAPTER 5 Engineering
prhaseeb
 
Course material mca
shagufthaiffath
 
mathes probabality mca syllabus for probability and stats
ganesh545860
 
PRP - Unit 1.pptx
AryanArora64
 
Section 88888888888888888888888.6 MA.ppt
enasabdulrahman
 
MATHEMATICS, Science and technology this ppt very help
naskarraja658
 
probability-180324013552.pptx
Vukile Xhego
 
Probability (gr.11)
Vukile Xhego
 
Probability concepts and procedures law of profitability
kamalsapkota13
 
3 PROBABILITY TOPICSFigure 3.1 Meteor showers are rare, .docx
tamicawaysmith
 
Chapter7ppt.pdf
SohailBhatti21
 
Discrete probability
Ranjan Kumar
 
Probability notes for the UG/PG students
Geeta Arora
 
Lecture 1,2 maths presentation slides.pdf
rahulknirala2005
 
Ad

Recently uploaded (20)

PPTX
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
PPTX
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
PDF
The Minister of Tourism, Culture and Creative Arts, Abla Dzifa Gomashie has e...
nservice241
 
PDF
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
PPTX
Software Engineering BSC DS UNIT 1 .pptx
Dr. Pallawi Bulakh
 
PPTX
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
PPTX
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
PPTX
CARE OF UNCONSCIOUS PATIENTS .pptx
AneetaSharma15
 
PPTX
Basics and rules of probability with real-life uses
ravatkaran694
 
DOCX
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
PPTX
HEALTH CARE DELIVERY SYSTEM - UNIT 2 - GNM 3RD YEAR.pptx
Priyanshu Anand
 
PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTX
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
PPTX
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
DOCX
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
PPTX
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
PPTX
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
PDF
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
PPTX
How to Apply for a Job From Odoo 18 Website
Celine George
 
DOCX
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
The Minister of Tourism, Culture and Creative Arts, Abla Dzifa Gomashie has e...
nservice241
 
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
Software Engineering BSC DS UNIT 1 .pptx
Dr. Pallawi Bulakh
 
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
CARE OF UNCONSCIOUS PATIENTS .pptx
AneetaSharma15
 
Basics and rules of probability with real-life uses
ravatkaran694
 
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
HEALTH CARE DELIVERY SYSTEM - UNIT 2 - GNM 3RD YEAR.pptx
Priyanshu Anand
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
How to Apply for a Job From Odoo 18 Website
Celine George
 
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 

Intro probability 1

  • 1. Probability Theory Introduction to Probability Theory Phong VO [email protected] September 11, 2010 – Typeset by FoilTEX –
  • 2. Why Learn Probability? • Statistics, data mining, and machine learning are all concerned with collecting and analyzing data. • Using fancy tools like neural nets, boosting and support vector machines without understanding statistics is like doing brain surgery before knowing how to use a band-aid. • Prediction, classification, clustering, and estimation are all special cases of statistical inference. • Data analysis, machine learning, and data mining are various names given to the practice of statistical inference, depending on the context. – Typeset by FoilTEX – 1
  • 3. Probability Data generating process Observed data Inference and Data mining • Probability theory is the formal language of uncertainty, which is the basis of statistical inference. • The basic problem that we study in probability is: Given a data generating process, what are the properties of the outcomes? • The basic problem of statistical inference is the inverse of probability : Given the outcomes, what can we say about the process that generated the data? – Typeset by FoilTEX – 2
  • 4. Introduction • Any realistic model of a real-world phenomenon must take into account the possibility of randomness. • The quantities will not be predicted in advanced but will exbihit an inherent variation that should be taken into account by the model • Probability model: allowing the model to be probabilistic in nature – Typeset by FoilTEX – 3
  • 5. Sample Space and Events • Assumed an experiment whose outcome is not predictable in advance • Suppose that the set of all possible outcomes is known • This set is known as the sample space of the experiment, denoted by S Example 1. If the experiment consists of the flipping of a coin, S = {Head, T ail} Example 2. If the experiment consists of tossing a die, then the sample space is – Typeset by FoilTEX – 4
  • 6. S = {1, 2, 3, 4, 5, 6} • Any subset E of the sample space S is known as an event. Example 3. If E = {Head} then E is the event that a head appears on the flip of the coin. Example 4. If E = T then E would be the event that a tail appears. – Typeset by FoilTEX – 5
  • 7. Probabilities Defined on Events Definition 1. Given an experiment with sample space S, for each event E of the sample space S, P (E) is the probability of the event E and it satisfies the following three conditions: Axiom 1 0 ≤ P (E) ≤ 1 Axiom 2 P (S) = 1 Axiom 3 ∀ E1, E2, . . ., En ∩ Em = m = n, then ∞ ∞ P( En) = P (En) n=1 n=1 – Typeset by FoilTEX – 6
  • 8. Example 5. In the coin tossing example, if we assume that a head is equally likely to appear as a tail, then we would have 1 P ({H}) = P ({T }) = 2 On the other hand, if we had a biased coin and felt that a head was twice as likely to appear as a tail, then we would have 2 1 P ({H}) = , P ({T }) = 3 3 – Typeset by FoilTEX – 7
  • 9. Conditional Probabilities • Suppose that we observe that the sum of the first die is a 4. What is the probability that the sum of the two dice equals 6? • Let E and F denote respectively the event that the sum of the dice is 6 and the event that the first die is 4 • If the event F occurs, then in order for E to occur it is necessary for the actual occurrence to be a point in both E and in F , it must be in EF . • Once F has occurred, F becomes our new sample space; hence the probability that the event EF occurs will equal the probability of EF relative to the probability of F – Typeset by FoilTEX – 8
  • 10. Definition 2. If P (B) > 0 then the conditional probability of E given F is P (EF ) P (E|F ) = P (F ) Example 6. Suppose cards numbered 1 through 10 are placed in a hat, mixed up, and then one of the cards is drawn. If we are told that the number on the drawn card is at least 5, then what is the conditional probability that it is 10? Example 7. Suppose that each of 3 men at a party thrwos his hat into the center of the room. The hats are first mixed up and then each man randomly selects a hat. What is the probability that none of the three men selects his own hat? – Typeset by FoilTEX – 9
  • 11. Independent Events Definition 3. Two events E and F are said to be independent if P (EF ) = P (E)P (F ) A set of events {Ai : i ∈ S} is independent if P Ai = P (Ai) i∈Ω i∈Ω for every finite subset Ω of S. – Typeset by FoilTEX – 10
  • 12. Example 8. Suppose we toss two fair dice. Let E1 denote the event that the sum of the dice is six and F denote the event that the first die equals four. Then 1 P (E1F ) = P ({4, 2}) = 36 while 51 5 P (E1)P (F ) = = 36 6 216 and hence E1 and F are not independent. Why? Let E2 be the event that the sum of the dice equals seven. Is E2 independent of F ? – Typeset by FoilTEX – 11
  • 13. Bayes’ Formula Let E and F be events. We may express E as E = EF EF c, in whichF c = S − F for in order for a point to be in E, it must either be in both E and F . Since EF and EF c are mutually exclusive, we have P (E) = P (EF ) + P (EF c) (1) = P (E|F )P (F ) + P (E|F c)P (F c) (2) = P (E|F )P (F ) + P (E|F c)(1 − P (F )) (3) – Typeset by FoilTEX – 12
  • 14. Example 9. Consider two urns. The first contains 2 white and 7 black balls, and the second contains 5 whites and 6 black balls. We flip a fair coin and then draw a ball from the first urn or the second urn depending on whether the outcome was heads or tails. What is the conditional probability that the outcome of the toss was heads given that a white ball was seleted? – Typeset by FoilTEX – 13
  • 15. Bayes’ Formula: General case n • Suppose F1, F2, . . . , Fn are mutually exclusive events s.t i=1 Fi = S. • Exactly one of the events F1, F2, . . . , Fn will occur. n • By writing E = i=1 EFi and using the fact that EFi, i = 1, . . . , n are mutually exclusive, we obtain n P (E) = EFi (4) i=1 n = P (E|Fi)P (Fi) (5) i=1 – Typeset by FoilTEX – 14
  • 16. Suppose E has occurred and we are interested in determining which one of the Fj also occurred. From Equation 4 we have that P (EFj ) P (Fj |E) = (6) P (E) P (E|Fj )P (Fj ) = n (7) i=1 P (E|Fi )P (Fi ) This equation is also known as Bayes’ formula. – Typeset by FoilTEX – 15