SlideShare a Scribd company logo
Key ideas, terms & concepts in
SEM
Professor Patrick Sturgis
Plan
• Path diagrams
• Exogenous, endogenous variables
• Variance/covariance matrices
• Maximum likelihood estimation
• Parameter constraints
• Nested Models and Model fit
• Model identification
Path diagrams
• An appealing feature of SEM is
representation of equations
diagrammatically
e.g. bivariate regression Y= bX + e
Path Diagram conventions
Error variance / disturbance term
Measured latent variable
Observed / manifest variable
Covariance / non-directional path
Regression / directional path
Reading path diagrams
A latent variable
Causes/measured by
3 observed variables
With 3 error variances
Reading path diagrams
2 latent variables,
each measured
by 3 observed
variables
Correlated
Reading path diagrams
2 latent variables,
each measured
by 3 observed
variables
Regression of LV1 on LV2
Error/disturbance
Exogenous/Endogenous variables
• Endogenous (dependent)
– caused by variables in the system
• Exogenous (independent)
– caused by variables outside the system
• In SEM a variable can be a predictor and
an outcome (a mediating variable)
2 (correlated) exogenous variables
η1 endogenous, η2 exogenous
1 1
Data for SEM
• In SEM we analyse the
variance/covariance matrix (S) of the
observed variables, not raw data
• Some SEMs also analyse means
• The goal is to summarise S by specifying a
simpler underlying structure: the SEM
• The SEM yields an implied var/covar
matrix which can be compared to S
Variance/Covariance Matrix (S)
x1 x2 x3 x4 x5 X6
x1 0.91 -0.37 0.05 0.04 0.34 0.31
x2 -0.37 1.01 0.11 0.03 -0.22 -0.23
x3 0.05 0.11 0.84 0.29 0.14 0.11
x4 0.04 0.03 0.29 1.13 0.11 0.06
x5 0.34 -0.22 0.14 0.11 1.12 0.34
x6 0.31 -0.23 0.11 0.06 0.34 0.96
Maximum Likelihood (ML)
• ML estimates model parameters by
maximising the Likelihood, L, of sample data
• L is a mathematical function based on joint
probability of continuous sample observations
• ML is asymptotically unbiased and efficient,
assuming multivariate normal data
• The (log)likelihood of a model can be used
to test fit against more/less restrictive
baseline
Parameter constraints
• An important part of SEM is fixing or
constraining model parameters
• We fix some model parameters to
particular values, commonly 0, or 1
• We constrain other model parameters to be
equal to other model parameters
• Parameter constraints are important for
identification
Nested Models
• Two models, A & B, are said to be ‘nested’
when one is a subset of the other
(A = B + parameter restrictions)
e.g. Model B:
yi= a + b1X1 + b2X2 +ei
• Model A:
yi= a + b1X1 + b2X2 +ei (constraint: b1=b2)
• Model C (not nested in B):
yi= a + b1X1 + b2Z2 +ei
Model Fit
• Based on (log)likelihood of model(s)
• Where model A is nested in model B:
LLA-LLB = , with df = dfA-dfB
• Where p of > 0.05, we prefer the more
parsimonious model, A
• Where B = observed matrix, there is no
difference between observed and implied
• Model ‘fits’!
2
χ
2
χ
Model Identification
• An equation needs enough ‘known’ pieces
of information to produce unique estimates
of ‘unknown’ parameters
X + 2Y=7 (unidentified)
3 + 2Y=7 (identified) (y=2)
• In SEM ‘knowns’ are the variances/
covariances/ means of observed variables
• Unknowns are the model parameters to be
estimated
Identification Status
• Models can be:
– Unidentified, knowns < unknowns
– Just identified, knowns = unknowns
– Over-identified, knowns > unknowns
• In general, for CFA/SEM we require over-
identified models
• Over-identified SEMs yield a likelihood
value which can be used to assess
model fit
Assessing identification status
• Checking identification status using the
counting rule
• Let s = number of observed variables in the
model
• number of non-redundant parameters =
• t=number of parameters to be estimated
t> model is unidentified
t< model is over-identified
)1(
2
1
+ss
)1(
2
1
+ss
)1(
2
1
+ss
Example 1 - identification
)1(
2
1
+ss = 6
Non-redundant parameters
parameters to be estimated
3 * error variance +
2 * factor loading +
1 * latent variance = 6
6 - 6 = 0 degrees of freedom, model is just-identified
Controlling Identification
• We can make an under/just identified
model over-identified by:
– Adding more knowns
– Removing unknowns
• Including more observed variables can add
more knowns
• Parameter constraints remove unknowns
• Constraint b1=b2 removes one unknown
from the model (gain 1 df)
Example 2 – add knowns
)1(
2
1
+ss = 10
Non-redundant parameters
parameters to be estimated
4 * error variance +
3 * factor loading +
1 * latent variance = 8
10 - 8 = 2 degrees of freedom, model is over-identified
Example 3 – remove unknowns
)1(
2
1
+ss = 6
Non-redundant parameters
parameters to be estimated
3 * error variance +
0 * factor loading +
1 * latent variance = 4
6 - 4 = 2 degrees of freedom, model is over-identified
Constrain factor loadings = 1
Summary
• SEM requires understanding of some ideas
which are unfamiliar for many substantive
researchers:
– Path diagrams
– Analysing variance/covariance matrix
– ML estimation
– global ‘test’ of model fit
– Nested models
– Identification
– Parameter constraints/restrictions
for more information contact
www.ncrm.ac.uk

More Related Content

PPTX
Introduction to Structural Equation Modeling
University of Southampton
 
PPTX
Manova
Ashutosh Kumar Jha
 
PPT
Slides sem on pls-complete
Dr Hemant Sharma
 
PDF
Mediation analysis
nelle varoquaux
 
PPTX
Import and Import procedures
Milind Pelagade
 
PPTX
Multicollinearity PPT
GunjanKhandelwal13
 
PPTX
Basics of Structural Equation Modeling
smackinnon
 
PPT
Logistic regression
Khaled Abd Elaziz
 
Introduction to Structural Equation Modeling
University of Southampton
 
Slides sem on pls-complete
Dr Hemant Sharma
 
Mediation analysis
nelle varoquaux
 
Import and Import procedures
Milind Pelagade
 
Multicollinearity PPT
GunjanKhandelwal13
 
Basics of Structural Equation Modeling
smackinnon
 
Logistic regression
Khaled Abd Elaziz
 

What's hot (20)

PPT
Confirmatory Factor Analysis
University of Southampton
 
PDF
Structural Equation Modelling (SEM) Part 3
COSTARCH Analytical Consulting (P) Ltd.
 
PDF
Structural Equation Modelling (SEM) Part 2
COSTARCH Analytical Consulting (P) Ltd.
 
PDF
Structural equation-models-introduction-kimmo-vehkalahti-2013
Kimmo Vehkalahti
 
PDF
Structural Equation Modelling (SEM) Part 1
COSTARCH Analytical Consulting (P) Ltd.
 
PDF
STRUCTURAL EQUATION MODEL (SEM)
AJHSSR Journal
 
PPT
Sem+Essentials
OUM SAOKOSAL
 
PDF
Introduction to Structural Equation Modeling Partial Least Sqaures (SEM-PLS)
Ali Asgari
 
PPT
Structured equation model
King Abidi
 
PPTX
SEM
緯鈞 沈
 
PPTX
Structural equation modeling in amos
Balaji P
 
PPTX
Factor analysis ppt
Mukesh Bisht
 
PPTX
Sem
saba khan
 
PPTX
Dummy variable model
aschalew shiferaw
 
PPTX
Factor analysis
saba khan
 
ODP
Exploratory factor analysis
James Neill
 
PPT
multiple regression
Priya Sharma
 
PDF
Factor Analysis - Statistics
Thiyagu K
 
PDF
Econometrics
QuantUniversity
 
PPT
Factor analysis
Sonnappan Sridhar
 
Confirmatory Factor Analysis
University of Southampton
 
Structural Equation Modelling (SEM) Part 3
COSTARCH Analytical Consulting (P) Ltd.
 
Structural Equation Modelling (SEM) Part 2
COSTARCH Analytical Consulting (P) Ltd.
 
Structural equation-models-introduction-kimmo-vehkalahti-2013
Kimmo Vehkalahti
 
Structural Equation Modelling (SEM) Part 1
COSTARCH Analytical Consulting (P) Ltd.
 
STRUCTURAL EQUATION MODEL (SEM)
AJHSSR Journal
 
Sem+Essentials
OUM SAOKOSAL
 
Introduction to Structural Equation Modeling Partial Least Sqaures (SEM-PLS)
Ali Asgari
 
Structured equation model
King Abidi
 
Structural equation modeling in amos
Balaji P
 
Factor analysis ppt
Mukesh Bisht
 
Dummy variable model
aschalew shiferaw
 
Factor analysis
saba khan
 
Exploratory factor analysis
James Neill
 
multiple regression
Priya Sharma
 
Factor Analysis - Statistics
Thiyagu K
 
Econometrics
QuantUniversity
 
Factor analysis
Sonnappan Sridhar
 
Ad

Similar to Key ideas, terms and concepts in SEM (20)

PPTX
Essex-Quantitative-Methods-Lecture-6.pptx
weranzameen1
 
PPTX
10. SEM Models_JASP.pptx00000000000000000
GeethaSaranya4
 
PDF
Lecture 7 - Bias, Variance and Regularization, a lecture in subject module St...
Maninda Edirisooriya
 
PPTX
Training and Testing Neural Network unit II
tintu47
 
PDF
Introduction to Artificial Intelligence_ Lec 10
Dalal2Ali
 
PDF
Experimental Design for Distributed Machine Learning with Myles Baker
Databricks
 
PDF
Introduction to Machine Learning
FINBOURNE Technology
 
PDF
Data analysis
amlbinder
 
PPTX
Supervised learning for IOT IN Vellore Institute of Technology
tanishqgupta1102
 
PPTX
dimentionalityreduction-241109090040-5290a6cd.pptx
MayuraD1
 
PPT
ARIMA Model.ppt
KaushikRaghavan4
 
PPT
ARIMA Model for analysis of time series data.ppt
REFOTDEBuea
 
PPT
ARIMA Model.ppt
PatriaYunita
 
PPTX
(Faiz) MachineLearning(ppt).pptx
Faiz430036
 
PPTX
Lec4(Multiple Regression) & Building a Model & Dummy Variable.pptx
au1417257
 
PPT
ARIMA Model agfmabmnbamnbgf afgkhkahkgh asfhdkhkasfhk
eri299
 
PPTX
regression.pptx
aneeshs28
 
PDF
Dimentionality Reduction PCA Version 1.pdf
vijeta3feb
 
PDF
Lecture 5 - Linear Regression Linear Regression
viyah59114
 
PDF
Research 101: Quantitative Data Preparation
Harold Gamero
 
Essex-Quantitative-Methods-Lecture-6.pptx
weranzameen1
 
10. SEM Models_JASP.pptx00000000000000000
GeethaSaranya4
 
Lecture 7 - Bias, Variance and Regularization, a lecture in subject module St...
Maninda Edirisooriya
 
Training and Testing Neural Network unit II
tintu47
 
Introduction to Artificial Intelligence_ Lec 10
Dalal2Ali
 
Experimental Design for Distributed Machine Learning with Myles Baker
Databricks
 
Introduction to Machine Learning
FINBOURNE Technology
 
Data analysis
amlbinder
 
Supervised learning for IOT IN Vellore Institute of Technology
tanishqgupta1102
 
dimentionalityreduction-241109090040-5290a6cd.pptx
MayuraD1
 
ARIMA Model.ppt
KaushikRaghavan4
 
ARIMA Model for analysis of time series data.ppt
REFOTDEBuea
 
ARIMA Model.ppt
PatriaYunita
 
(Faiz) MachineLearning(ppt).pptx
Faiz430036
 
Lec4(Multiple Regression) & Building a Model & Dummy Variable.pptx
au1417257
 
ARIMA Model agfmabmnbamnbgf afgkhkahkgh asfhdkhkasfhk
eri299
 
regression.pptx
aneeshs28
 
Dimentionality Reduction PCA Version 1.pdf
vijeta3feb
 
Lecture 5 - Linear Regression Linear Regression
viyah59114
 
Research 101: Quantitative Data Preparation
Harold Gamero
 
Ad

More from University of Southampton (20)

PPTX
Introduction to Survey Data Quality
University of Southampton
 
PPTX
Generating SPSS training materials in StatJR
University of Southampton
 
PPTX
Introduction to the Stat-JR software package
University of Southampton
 
PPT
Multi level modelling- random coefficient models | Ian Brunton-Smith
University of Southampton
 
PPT
Multi level modelling - random intercept models | Ian Brunton Smith
University of Southampton
 
PPT
Introduction to multilevel modelling | Ian Brunton-Smith
University of Southampton
 
PPTX
Biosocial research:How to use biological data in social science research?
University of Southampton
 
PPT
Integrating biological and social research data - Michaela Benzeval
University of Southampton
 
PPTX
Teaching research methods: pedagogy hooks
University of Southampton
 
PPTX
Teaching research methods: pedagogy of methods learning
University of Southampton
 
PPTX
Data quality: total survey error
University of Southampton
 
PPTX
better off living with parents
University of Southampton
 
PPTX
Multilevel models:random coefficient models
University of Southampton
 
PPTX
Multilevel models:random intercept models
University of Southampton
 
PPTX
Introduction to multilevel modelling
University of Southampton
 
PPTX
How to write about research methods
University of Southampton
 
PPTX
How to write about research methods
University of Southampton
 
PDF
Introduction to spatial interaction modelling
University of Southampton
 
PPTX
Cognitive interviewing
University of Southampton
 
PPTX
Survey questions and measurement error
University of Southampton
 
Introduction to Survey Data Quality
University of Southampton
 
Generating SPSS training materials in StatJR
University of Southampton
 
Introduction to the Stat-JR software package
University of Southampton
 
Multi level modelling- random coefficient models | Ian Brunton-Smith
University of Southampton
 
Multi level modelling - random intercept models | Ian Brunton Smith
University of Southampton
 
Introduction to multilevel modelling | Ian Brunton-Smith
University of Southampton
 
Biosocial research:How to use biological data in social science research?
University of Southampton
 
Integrating biological and social research data - Michaela Benzeval
University of Southampton
 
Teaching research methods: pedagogy hooks
University of Southampton
 
Teaching research methods: pedagogy of methods learning
University of Southampton
 
Data quality: total survey error
University of Southampton
 
better off living with parents
University of Southampton
 
Multilevel models:random coefficient models
University of Southampton
 
Multilevel models:random intercept models
University of Southampton
 
Introduction to multilevel modelling
University of Southampton
 
How to write about research methods
University of Southampton
 
How to write about research methods
University of Southampton
 
Introduction to spatial interaction modelling
University of Southampton
 
Cognitive interviewing
University of Southampton
 
Survey questions and measurement error
University of Southampton
 

Recently uploaded (20)

PPTX
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
PDF
Health-The-Ultimate-Treasure (1).pdf/8th class science curiosity /samyans edu...
Sandeep Swamy
 
DOCX
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
PDF
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
PPTX
TEF & EA Bsc Nursing 5th sem.....BBBpptx
AneetaSharma15
 
PPTX
family health care settings home visit - unit 6 - chn 1 - gnm 1st year.pptx
Priyanshu Anand
 
PPTX
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
PPTX
Basics and rules of probability with real-life uses
ravatkaran694
 
PDF
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
PDF
The-Invisible-Living-World-Beyond-Our-Naked-Eye chapter 2.pdf/8th science cur...
Sandeep Swamy
 
PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTX
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
PPTX
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
DOCX
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
PPTX
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
PDF
The Minister of Tourism, Culture and Creative Arts, Abla Dzifa Gomashie has e...
nservice241
 
PPTX
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
PPTX
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
PPTX
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
DOCX
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
Information Texts_Infographic on Forgetting Curve.pptx
Tata Sevilla
 
Health-The-Ultimate-Treasure (1).pdf/8th class science curiosity /samyans edu...
Sandeep Swamy
 
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
TEF & EA Bsc Nursing 5th sem.....BBBpptx
AneetaSharma15
 
family health care settings home visit - unit 6 - chn 1 - gnm 1st year.pptx
Priyanshu Anand
 
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
Basics and rules of probability with real-life uses
ravatkaran694
 
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
The-Invisible-Living-World-Beyond-Our-Naked-Eye chapter 2.pdf/8th science cur...
Sandeep Swamy
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
A Smarter Way to Think About Choosing a College
Cyndy McDonald
 
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
The Minister of Tourism, Culture and Creative Arts, Abla Dzifa Gomashie has e...
nservice241
 
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 

Key ideas, terms and concepts in SEM

  • 1. Key ideas, terms & concepts in SEM Professor Patrick Sturgis
  • 2. Plan • Path diagrams • Exogenous, endogenous variables • Variance/covariance matrices • Maximum likelihood estimation • Parameter constraints • Nested Models and Model fit • Model identification
  • 3. Path diagrams • An appealing feature of SEM is representation of equations diagrammatically e.g. bivariate regression Y= bX + e
  • 4. Path Diagram conventions Error variance / disturbance term Measured latent variable Observed / manifest variable Covariance / non-directional path Regression / directional path
  • 5. Reading path diagrams A latent variable Causes/measured by 3 observed variables With 3 error variances
  • 6. Reading path diagrams 2 latent variables, each measured by 3 observed variables Correlated
  • 7. Reading path diagrams 2 latent variables, each measured by 3 observed variables Regression of LV1 on LV2 Error/disturbance
  • 8. Exogenous/Endogenous variables • Endogenous (dependent) – caused by variables in the system • Exogenous (independent) – caused by variables outside the system • In SEM a variable can be a predictor and an outcome (a mediating variable)
  • 10. η1 endogenous, η2 exogenous 1 1
  • 11. Data for SEM • In SEM we analyse the variance/covariance matrix (S) of the observed variables, not raw data • Some SEMs also analyse means • The goal is to summarise S by specifying a simpler underlying structure: the SEM • The SEM yields an implied var/covar matrix which can be compared to S
  • 12. Variance/Covariance Matrix (S) x1 x2 x3 x4 x5 X6 x1 0.91 -0.37 0.05 0.04 0.34 0.31 x2 -0.37 1.01 0.11 0.03 -0.22 -0.23 x3 0.05 0.11 0.84 0.29 0.14 0.11 x4 0.04 0.03 0.29 1.13 0.11 0.06 x5 0.34 -0.22 0.14 0.11 1.12 0.34 x6 0.31 -0.23 0.11 0.06 0.34 0.96
  • 13. Maximum Likelihood (ML) • ML estimates model parameters by maximising the Likelihood, L, of sample data • L is a mathematical function based on joint probability of continuous sample observations • ML is asymptotically unbiased and efficient, assuming multivariate normal data • The (log)likelihood of a model can be used to test fit against more/less restrictive baseline
  • 14. Parameter constraints • An important part of SEM is fixing or constraining model parameters • We fix some model parameters to particular values, commonly 0, or 1 • We constrain other model parameters to be equal to other model parameters • Parameter constraints are important for identification
  • 15. Nested Models • Two models, A & B, are said to be ‘nested’ when one is a subset of the other (A = B + parameter restrictions) e.g. Model B: yi= a + b1X1 + b2X2 +ei • Model A: yi= a + b1X1 + b2X2 +ei (constraint: b1=b2) • Model C (not nested in B): yi= a + b1X1 + b2Z2 +ei
  • 16. Model Fit • Based on (log)likelihood of model(s) • Where model A is nested in model B: LLA-LLB = , with df = dfA-dfB • Where p of > 0.05, we prefer the more parsimonious model, A • Where B = observed matrix, there is no difference between observed and implied • Model ‘fits’! 2 χ 2 χ
  • 17. Model Identification • An equation needs enough ‘known’ pieces of information to produce unique estimates of ‘unknown’ parameters X + 2Y=7 (unidentified) 3 + 2Y=7 (identified) (y=2) • In SEM ‘knowns’ are the variances/ covariances/ means of observed variables • Unknowns are the model parameters to be estimated
  • 18. Identification Status • Models can be: – Unidentified, knowns < unknowns – Just identified, knowns = unknowns – Over-identified, knowns > unknowns • In general, for CFA/SEM we require over- identified models • Over-identified SEMs yield a likelihood value which can be used to assess model fit
  • 19. Assessing identification status • Checking identification status using the counting rule • Let s = number of observed variables in the model • number of non-redundant parameters = • t=number of parameters to be estimated t> model is unidentified t< model is over-identified )1( 2 1 +ss )1( 2 1 +ss )1( 2 1 +ss
  • 20. Example 1 - identification )1( 2 1 +ss = 6 Non-redundant parameters parameters to be estimated 3 * error variance + 2 * factor loading + 1 * latent variance = 6 6 - 6 = 0 degrees of freedom, model is just-identified
  • 21. Controlling Identification • We can make an under/just identified model over-identified by: – Adding more knowns – Removing unknowns • Including more observed variables can add more knowns • Parameter constraints remove unknowns • Constraint b1=b2 removes one unknown from the model (gain 1 df)
  • 22. Example 2 – add knowns )1( 2 1 +ss = 10 Non-redundant parameters parameters to be estimated 4 * error variance + 3 * factor loading + 1 * latent variance = 8 10 - 8 = 2 degrees of freedom, model is over-identified
  • 23. Example 3 – remove unknowns )1( 2 1 +ss = 6 Non-redundant parameters parameters to be estimated 3 * error variance + 0 * factor loading + 1 * latent variance = 4 6 - 4 = 2 degrees of freedom, model is over-identified Constrain factor loadings = 1
  • 24. Summary • SEM requires understanding of some ideas which are unfamiliar for many substantive researchers: – Path diagrams – Analysing variance/covariance matrix – ML estimation – global ‘test’ of model fit – Nested models – Identification – Parameter constraints/restrictions
  • 25. for more information contact www.ncrm.ac.uk