4
Most read
13
Most read
19
Most read
EXOGENEITY & ENDOGENEITY
Shane Thompson
Summit Consulting, LLC
© 2015
EXOGENEITY, ENDOGENEITY, & YOU
 Program evaluation, policy impacts, treatment
effects
 How does our outcome of interest (income, health,
free-throw %) change after our independent
variable of interest (education, medicine, practice)
changes?
 Often, we will have to concern ourselves with
whether our independent variable of interest is
exogenous or endogenous
EXOGENOUS VARIABLES
 An independent variable is exogenous in a dataset
if it is assigned/chosen without respect to how it
might influence the outcome of interest
 Education level assigned without regard to potential
income benefit
 Medicine assigned without respect to probability of
success
 Practice hours assigned without respect to how it might
improve performance
 Randomization of “treatment” assignment ensures
that independent variables are exogenous
ENDOGENOUS VARIABLES
 An independent variable is endogenous in a dataset if it
is assigned/chosen according to how it might determine
the outcome of interest
 Income: Bill Gates drops out of college; Shane Thompson
goes forever
 Health: Patient with family history of heart attacks takes
aspirin; patient without does not take it
 FT%: Shaq practices free throws; Steve Nash does not
 Simple comparisons of treatment-vs-control are biased
 College REDUCES income by billions of dollars!
 Aspirin INCREASES the probability of heart attacks!
 Free-throw practice REDUCES free-throw percentage!
COMMUTING TIME: SLUGGING
Weather…wait...where…who…wreck?
Exogenous or endogenous?
 The weather
 EXOGENOUS
 The time of day I get in the slug line
 ENDOGENOUS
 My destination (14th, 18th, or L’Enfant Plaza)
 ENDOGENOUS
 The gender of the driver
 EXOGENOUS
 A wreck on the HOV lanes
 EXOGENOUS
CORRELATION VS. CAUSATION
 Estimates of the effects of endogenous variables on
outcome variables are correlational
 A change in an outcome variable given a change in an
endogenous variable is potentially reflective of changes
of several other variables in the model
 Estimates of the effects of exogenous variables on
outcome variables are causal
 A change in an outcome variable given a change in an
exogenous variable is attributable to the exogenous
variable
EX: BLUE JEANS AND REVENUE
 AL and AC notice that whenever several
Summiteers wear jeans into the office, big
deliverables are generally due that day.
 In leadership meetings they alert the directors to
this amazing trend.
 Hypothesis: Wearing jeans increases revenue
# Deliverables Jeans # Deliverables Jeans
1 0 1 0
1 0 3 0
0 0 2 0
0 0 3 0
3 1 5 1
0 0 1 0
2 0 0 0
2 0 3 0
2 0 1 0
6 1 6 1
3 0 3 0
1 0 1 0
3 0 1 0
1 0 0 0
3 1 5 1
1 0 2 0
2 0 0 0
1 0 0 0
0 0 0 0
8 1 5 1
0 0 1 0
1 0 0 0
2 0 0 0
0 0 0 0
6 1 6 1
JEANS. JEANS! JEANS!
 Over the last 50 work days:
 Summit averages 5.3 deliverables when Summiteers
wear jeans
 Summit averages 1.1 deliverables when Summiteers do
NOT wear jeans
JEANS INCREASE PRODUCTIVITY!!!
JEANS INCREASE REVENUE!!!
# Deliverables Jeans Day # Deliverables Jeans Day
1 0 Monday 1 0 Monday
1 0 Tuesday 3 0 Tuesday
0 0 Wednesday 2 0 Wednesday
0 0 Thursday 3 0 Thursday
3 1 Friday 5 1 Friday
0 0 Monday 1 0 Monday
2 0 Tuesday 0 0 Tuesday
2 0 Wednesday 3 0 Wednesday
2 0 Thursday 1 0 Thursday
6 1 Friday 6 1 Friday
3 0 Monday 3 0 Monday
1 0 Tuesday 1 0 Tuesday
3 0 Wednesday 1 0 Wednesday
1 0 Thursday 0 0 Thursday
3 1 Friday 5 1 Friday
1 0 Monday 2 0 Monday
2 0 Tuesday 0 0 Tuesday
1 0 Wednesday 0 0 Wednesday
0 0 Thursday 0 0 Thursday
8 1 Friday 5 1 Friday
0 0 Monday 1 0 Monday
1 0 Tuesday 0 0 Tuesday
2 0 Wednesday 0 0 Wednesday
0 0 Thursday 0 0 Thursday
6 1 Friday 6 1 Friday
WELL, THAT’S EMBARRASSING…
 Jeans DO NOT increase deliverables and DO NOT
increase revenue.
 Still, enterprising Summiteers seek to find the
causal effect of jeans
 How might we find the CAUSAL effect of jeans?
BEST OPTION: RANDOMIZED CONTROL TRIAL
 Summit randomly assigns casual days
 Jean-wearing is exogenous to deliverables, i.e. the
assignment to wear jeans is made without regard to
how it might influence business
 Causal effect of jeans on deliverables:
Avg Deliverablesjeans – Avg Deliverablesno jeans
NON-RANDOMIZED DATA
 Jean-wearing is endogenous to deliverables, i.e.
the decision to wear jeans may be made according
to deliverable deadlines and client meetings
 Causal effect of jeans on deliverables IS NOT:
Avg Deliverablesjeans – Avg Deliverablesno jeans
 Why??
BALANCE IN TREATMENT ASSIGNMENT
Exogenous Treatment Endogenous Treatment
Treatment Control
Age
Race
Gender
Income
Age
Race
Gender
Income
Treatment Control
Younger
QUASI-EXPERIMENTAL METHODS:
MAKING ENDOGENOUS VARIABLES EXOGENOUS
 Difference-in-Difference
1. Identify a suitable control
group (Firm X)
2. Verify parallel trend in
deliverables before
treatment (controlling for
obs characteristics)
3. Verify no spillover effects
4. Jeans are exogenous,
conditional on the parallel
trend (which is
conditional on obs
characteristics)
QUASI-EXPERIMENTAL METHODS:
MAKING ENDOGENOUS VARIABLES EXOGENOUS
 Regression Discontinuity
1. Summit management
allows staff above a fixed
threshold of billable hours
to wear jeans (threshold is
unknown to staff)
2. Staff just above and just
below the threshold are
equally productive
3. Jeans are exogenous
immediately above and
below threshold
Billable hours (hundreds)
Deliverablesin2014
QUASI-EXPERIMENTAL METHODS:
MAKING ENDOGENOUS VARIABLES EXOGENOUS
 Propensity Score
Matching
1. Summit allows all staff to
wear jeans if they want
2. Estimate the probability
of jean-wearing given
observable staff
characteristics
3. Assume we observe ALL
relevant predictors of
jean-wearing
4. Jeans are exogenous at
each probability level
QUASI-EXPERIMENTAL METHODS:
MAKING ENDOGENOUS VARIABLES EXOGENOUS
 Synthetic Control Method
1. Identify several potential control
groups (firms)
2. Construct a synthetic Summit
that is a weighted combination
of other firms
3. Constrain the synthetic Summit
to be approx equal to Summit
in observable characteristics
and deliverables before the
jeans policy
4. Jeans are exogenous,
conditional on the pre-
treatment equality between
Summit and the synthetic
control
B
A E
D
C
CONCLUSIONS
 Correlation = Causation
 Data are generally messy, poorly-tracked, and non-
experimental (rife with endogeneity!)
 Cutting edge evaluators, statisticians, and
econometricians must be able to:
1. identify endogenous and exogenous variables
2. implement statistical methods to mitigate endogeneity
 The causal effect of jeans is 10,000 additional
deliverables

More Related Content

PPSX
The Social welfare function
PPTX
Solow model of growth
PPT
Hicks slutsky income and substitution effect
PPTX
Heteroscedasticity
PPSX
Structuralist theory of inflation
PPTX
Basic theory of IS-LM model
PPTX
THE BERTRAND DUOPOLY MODEL IN MICROECONOMICS.pptx
PPSX
Bain’s limit pricing model
The Social welfare function
Solow model of growth
Hicks slutsky income and substitution effect
Heteroscedasticity
Structuralist theory of inflation
Basic theory of IS-LM model
THE BERTRAND DUOPOLY MODEL IN MICROECONOMICS.pptx
Bain’s limit pricing model

What's hot (20)

PPTX
Harrod – Domar Model - Development Model.pptx
PPT
Lecture 4
PPTX
Life cycle income hypothesis
PPT
The Demand for Labor
PPTX
Post Keynesian Approach
PPTX
Multicolinearity
PPTX
Supply side economics
PPTX
Micro ppt of hicks n slutsky
PPTX
Absolute income hypothesis
PPT
Phillips Curve, Inflation & Interest Rate
PPTX
Econometrics
PPSX
Williamson’s model of managerial discretion
PPTX
Philip's Curve
PPTX
Accelerator Theory of Investment
PPTX
ACCELERATOR THEORY OF INVESTMENT
PPTX
Heteroscedasticity
PPT
Econometrics ch2
PPTX
Pareto optimality
PPTX
General equilibrium theory
PPTX
welfare economics
Harrod – Domar Model - Development Model.pptx
Lecture 4
Life cycle income hypothesis
The Demand for Labor
Post Keynesian Approach
Multicolinearity
Supply side economics
Micro ppt of hicks n slutsky
Absolute income hypothesis
Phillips Curve, Inflation & Interest Rate
Econometrics
Williamson’s model of managerial discretion
Philip's Curve
Accelerator Theory of Investment
ACCELERATOR THEORY OF INVESTMENT
Heteroscedasticity
Econometrics ch2
Pareto optimality
General equilibrium theory
welfare economics
Ad

Viewers also liked (15)

PDF
Endogenous and exogenous christalization
PPTX
Endogenous vs exogenous
PDF
Econometria endogeneidade
PPT
Pigments
PDF
Apostila painel (3)
PPTX
a typology of customers ( internal and external factors ınfluencing their beh...
PPT
Forms of Business
PPTX
PSYA3 - Sleep
PPTX
Buying Decision Making Process
PPTX
Consumer behaviour external factors
PPT
Cómo se grafica una función lineal en excel
PPT
Pill camera presentation
PPT
Pill camera
PPT
Macroeconomics slide
DOCX
Granger causality testing
Endogenous and exogenous christalization
Endogenous vs exogenous
Econometria endogeneidade
Pigments
Apostila painel (3)
a typology of customers ( internal and external factors ınfluencing their beh...
Forms of Business
PSYA3 - Sleep
Buying Decision Making Process
Consumer behaviour external factors
Cómo se grafica una función lineal en excel
Pill camera presentation
Pill camera
Macroeconomics slide
Granger causality testing
Ad

Similar to Endogeneity & Exogeneity (20)

PPTX
Business Research & Statitics part II.pptx
DOCX
1 A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
PDF
Causal Inference in Data Science and Machine Learning
PPTX
Medical Data
PDF
Causal Inference and Program Evaluation
PDF
2019 PMED Spring Course - Preliminaries: Basic Causal Inference - Marie David...
PPTX
LEC 8 RESEARCH VARIABLES.pptx
PPTX
Can rct affective for public policy?
PPTX
Week-3-1.pptx
PDF
HOW TO MAKE SENSE OUT OF BIG DATA?
PPT
Class lecture notes #1 (statistics for research)
PPTX
Variables confounding
PDF
Study Variables (3).pdfStudy Variables (3).pdf
PPTX
understand-causal-relationship-in-economics.pptx
PPTX
understanding-causality-relationship.pptx
PPTX
Stating the Research Problem
PPTX
Research Approaches and Design recognising research .pptx
PPTX
Modeling Social Data, Lecture 11: Causality and Experiments, Part 1
PPTX
Quick Intro to DAGs, d-separation, and Collider Bias Intro
PPT
Analytic Methods and Issues in CER from Observational Data
Business Research & Statitics part II.pptx
1 A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
Causal Inference in Data Science and Machine Learning
Medical Data
Causal Inference and Program Evaluation
2019 PMED Spring Course - Preliminaries: Basic Causal Inference - Marie David...
LEC 8 RESEARCH VARIABLES.pptx
Can rct affective for public policy?
Week-3-1.pptx
HOW TO MAKE SENSE OUT OF BIG DATA?
Class lecture notes #1 (statistics for research)
Variables confounding
Study Variables (3).pdfStudy Variables (3).pdf
understand-causal-relationship-in-economics.pptx
understanding-causality-relationship.pptx
Stating the Research Problem
Research Approaches and Design recognising research .pptx
Modeling Social Data, Lecture 11: Causality and Experiments, Part 1
Quick Intro to DAGs, d-separation, and Collider Bias Intro
Analytic Methods and Issues in CER from Observational Data

Recently uploaded (20)

PDF
Concepts of Database Management, 10th Edition by Lisa Friedrichsen Test Bank.pdf
PPTX
recommendation Project PPT with details attached
PPTX
langchainpptforbeginners_easy_explanation.pptx
PPTX
AI AND ML PROPOSAL PRESENTATION MUST.pptx
PPTX
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
PPTX
inbound6529290805104538764.pptxmmmmmmmmm
PPTX
Hushh.ai: Your Personal Data, Your Business
PPTX
Introduction to Fundamentals of Data Security
PPTX
indiraparyavaranbhavan-240418134200-31d840b3.pptx
PPTX
Chapter security of computer_8_v8.1.pptx
PPTX
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
PDF
REPORT CARD OF GRADE 2 2025-2026 MATATAG
PPT
Classification methods in data analytics.ppt
PDF
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
research framework and review of related literature chapter 2
PPTX
GPS sensor used agriculture land for automation
PPT
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
PPTX
ch20 Database System Architecture by Rizvee
PPTX
cp-and-safeguarding-training-2018-2019-mmfv2-230818062456-767bc1a7.pptx
Concepts of Database Management, 10th Edition by Lisa Friedrichsen Test Bank.pdf
recommendation Project PPT with details attached
langchainpptforbeginners_easy_explanation.pptx
AI AND ML PROPOSAL PRESENTATION MUST.pptx
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
inbound6529290805104538764.pptxmmmmmmmmm
Hushh.ai: Your Personal Data, Your Business
Introduction to Fundamentals of Data Security
indiraparyavaranbhavan-240418134200-31d840b3.pptx
Chapter security of computer_8_v8.1.pptx
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
REPORT CARD OF GRADE 2 2025-2026 MATATAG
Classification methods in data analytics.ppt
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
1 hour to get there before the game is done so you don’t need a car seat for ...
research framework and review of related literature chapter 2
GPS sensor used agriculture land for automation
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
ch20 Database System Architecture by Rizvee
cp-and-safeguarding-training-2018-2019-mmfv2-230818062456-767bc1a7.pptx

Endogeneity & Exogeneity

  • 1. EXOGENEITY & ENDOGENEITY Shane Thompson Summit Consulting, LLC © 2015
  • 2. EXOGENEITY, ENDOGENEITY, & YOU  Program evaluation, policy impacts, treatment effects  How does our outcome of interest (income, health, free-throw %) change after our independent variable of interest (education, medicine, practice) changes?  Often, we will have to concern ourselves with whether our independent variable of interest is exogenous or endogenous
  • 3. EXOGENOUS VARIABLES  An independent variable is exogenous in a dataset if it is assigned/chosen without respect to how it might influence the outcome of interest  Education level assigned without regard to potential income benefit  Medicine assigned without respect to probability of success  Practice hours assigned without respect to how it might improve performance  Randomization of “treatment” assignment ensures that independent variables are exogenous
  • 4. ENDOGENOUS VARIABLES  An independent variable is endogenous in a dataset if it is assigned/chosen according to how it might determine the outcome of interest  Income: Bill Gates drops out of college; Shane Thompson goes forever  Health: Patient with family history of heart attacks takes aspirin; patient without does not take it  FT%: Shaq practices free throws; Steve Nash does not  Simple comparisons of treatment-vs-control are biased  College REDUCES income by billions of dollars!  Aspirin INCREASES the probability of heart attacks!  Free-throw practice REDUCES free-throw percentage!
  • 5. COMMUTING TIME: SLUGGING Weather…wait...where…who…wreck? Exogenous or endogenous?  The weather  EXOGENOUS  The time of day I get in the slug line  ENDOGENOUS  My destination (14th, 18th, or L’Enfant Plaza)  ENDOGENOUS  The gender of the driver  EXOGENOUS  A wreck on the HOV lanes  EXOGENOUS
  • 6. CORRELATION VS. CAUSATION  Estimates of the effects of endogenous variables on outcome variables are correlational  A change in an outcome variable given a change in an endogenous variable is potentially reflective of changes of several other variables in the model  Estimates of the effects of exogenous variables on outcome variables are causal  A change in an outcome variable given a change in an exogenous variable is attributable to the exogenous variable
  • 7. EX: BLUE JEANS AND REVENUE  AL and AC notice that whenever several Summiteers wear jeans into the office, big deliverables are generally due that day.  In leadership meetings they alert the directors to this amazing trend.  Hypothesis: Wearing jeans increases revenue
  • 8. # Deliverables Jeans # Deliverables Jeans 1 0 1 0 1 0 3 0 0 0 2 0 0 0 3 0 3 1 5 1 0 0 1 0 2 0 0 0 2 0 3 0 2 0 1 0 6 1 6 1 3 0 3 0 1 0 1 0 3 0 1 0 1 0 0 0 3 1 5 1 1 0 2 0 2 0 0 0 1 0 0 0 0 0 0 0 8 1 5 1 0 0 1 0 1 0 0 0 2 0 0 0 0 0 0 0 6 1 6 1
  • 9. JEANS. JEANS! JEANS!  Over the last 50 work days:  Summit averages 5.3 deliverables when Summiteers wear jeans  Summit averages 1.1 deliverables when Summiteers do NOT wear jeans JEANS INCREASE PRODUCTIVITY!!! JEANS INCREASE REVENUE!!!
  • 10. # Deliverables Jeans Day # Deliverables Jeans Day 1 0 Monday 1 0 Monday 1 0 Tuesday 3 0 Tuesday 0 0 Wednesday 2 0 Wednesday 0 0 Thursday 3 0 Thursday 3 1 Friday 5 1 Friday 0 0 Monday 1 0 Monday 2 0 Tuesday 0 0 Tuesday 2 0 Wednesday 3 0 Wednesday 2 0 Thursday 1 0 Thursday 6 1 Friday 6 1 Friday 3 0 Monday 3 0 Monday 1 0 Tuesday 1 0 Tuesday 3 0 Wednesday 1 0 Wednesday 1 0 Thursday 0 0 Thursday 3 1 Friday 5 1 Friday 1 0 Monday 2 0 Monday 2 0 Tuesday 0 0 Tuesday 1 0 Wednesday 0 0 Wednesday 0 0 Thursday 0 0 Thursday 8 1 Friday 5 1 Friday 0 0 Monday 1 0 Monday 1 0 Tuesday 0 0 Tuesday 2 0 Wednesday 0 0 Wednesday 0 0 Thursday 0 0 Thursday 6 1 Friday 6 1 Friday
  • 11. WELL, THAT’S EMBARRASSING…  Jeans DO NOT increase deliverables and DO NOT increase revenue.  Still, enterprising Summiteers seek to find the causal effect of jeans  How might we find the CAUSAL effect of jeans?
  • 12. BEST OPTION: RANDOMIZED CONTROL TRIAL  Summit randomly assigns casual days  Jean-wearing is exogenous to deliverables, i.e. the assignment to wear jeans is made without regard to how it might influence business  Causal effect of jeans on deliverables: Avg Deliverablesjeans – Avg Deliverablesno jeans
  • 13. NON-RANDOMIZED DATA  Jean-wearing is endogenous to deliverables, i.e. the decision to wear jeans may be made according to deliverable deadlines and client meetings  Causal effect of jeans on deliverables IS NOT: Avg Deliverablesjeans – Avg Deliverablesno jeans  Why??
  • 14. BALANCE IN TREATMENT ASSIGNMENT Exogenous Treatment Endogenous Treatment Treatment Control Age Race Gender Income Age Race Gender Income Treatment Control Younger
  • 15. QUASI-EXPERIMENTAL METHODS: MAKING ENDOGENOUS VARIABLES EXOGENOUS  Difference-in-Difference 1. Identify a suitable control group (Firm X) 2. Verify parallel trend in deliverables before treatment (controlling for obs characteristics) 3. Verify no spillover effects 4. Jeans are exogenous, conditional on the parallel trend (which is conditional on obs characteristics)
  • 16. QUASI-EXPERIMENTAL METHODS: MAKING ENDOGENOUS VARIABLES EXOGENOUS  Regression Discontinuity 1. Summit management allows staff above a fixed threshold of billable hours to wear jeans (threshold is unknown to staff) 2. Staff just above and just below the threshold are equally productive 3. Jeans are exogenous immediately above and below threshold Billable hours (hundreds) Deliverablesin2014
  • 17. QUASI-EXPERIMENTAL METHODS: MAKING ENDOGENOUS VARIABLES EXOGENOUS  Propensity Score Matching 1. Summit allows all staff to wear jeans if they want 2. Estimate the probability of jean-wearing given observable staff characteristics 3. Assume we observe ALL relevant predictors of jean-wearing 4. Jeans are exogenous at each probability level
  • 18. QUASI-EXPERIMENTAL METHODS: MAKING ENDOGENOUS VARIABLES EXOGENOUS  Synthetic Control Method 1. Identify several potential control groups (firms) 2. Construct a synthetic Summit that is a weighted combination of other firms 3. Constrain the synthetic Summit to be approx equal to Summit in observable characteristics and deliverables before the jeans policy 4. Jeans are exogenous, conditional on the pre- treatment equality between Summit and the synthetic control B A E D C
  • 19. CONCLUSIONS  Correlation = Causation  Data are generally messy, poorly-tracked, and non- experimental (rife with endogeneity!)  Cutting edge evaluators, statisticians, and econometricians must be able to: 1. identify endogenous and exogenous variables 2. implement statistical methods to mitigate endogeneity  The causal effect of jeans is 10,000 additional deliverables

Editor's Notes

  • #17: Have graphics animated second