Regional Income Convergence: A Spatial
Analysis Approach
Prepared by CÊsar R. Sobrino
Universidad del Turabo
November 27, 2017
1 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Outline
1 Regression Analysis
OLS regression
Assumptions and Tests
2 Spatial Econometrics
Spatial Dependence & Spatial Heterogeneity
Spatial Matrix (W) and Moran’s I statistic
3 Income Convergence
΃-convergence
β -convergence and speed of convergence (θ)
4 GeoDa
Managing shapeīŦles
Creating Ws and Moran’s I statistic
Regression
2 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
OLS: Ordinary Least Squares
Parameters
The coeīŦƒcients in an equation that determine the
exact mathematical relation among the variables
(growth rate and initial income)
Unknowns.
Parameter estimation
The process of īŦnding estimates of the numerical
values of the parameters of an equation
3 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
OLS
OLS
The general purpose of linear regression is to īŦnd a
(linear) relationship between the dependent variable
and a set of explanatory variables.
There can be cross-section or times series data.
Bivariate form
Y = a + bX + Îĩ
Intercept parameter (a) gives value of Y where
regression line crosses Y -axis (value of Y when X is
zero.
Slope parameter (b) gives the change in Y associated
with a one-unit change in X : ∆Y /∆X
4 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
OLS
Two objectives:
Find a good match (īŦt) between a + bX and observed
values of Y ( a and b are the regression coeīŦƒcients).
Discover which of the explanatory variables (Xs)
contribute signiīŦcantly to the linear relationship
OLS accomplished both stated objectives in an optimal
fashion according to some criteria, and is referred to as
a Best Linear Unbiased Estimator (BLUE)
OLS estimates (a and b) are found minimizing the sum
of the squared prediction errors (hence least squares).
5 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
OLS
The OLS regression line (red one) is that minimizes the
sum of the squared prediction errors
6 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
OLS
In order to obtain the BLUE property and to able to
make statistical inferences about the population
parameters (a and b) by means of your estimates (a
and b), you need to make certain assumptions about
the random part of the regression equation (the
random error Îĩ)
Two of these assumptions are crucial to obtain the
unbiasedness and eīŦƒciency of the OLS estimates.
7 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
OLS
Assumptions
The random error (Îĩ) has mean zero (there is no
systematic misspeciīŦcation or bias in the regression
equation).
Expected value: E(Îĩ) = 0
If E(Îĩ) = 0 does not hold, estimators are biased
The random error terms are uncorrelated and have a
constant variance (they are homoskedastic).
Variance: E(ÎĩÎĩ ) = ΃2
I
If E(ÎĩÎĩ ) = ΃2
I does not hold, this means that either
autocorrelation or heteroskedasticity are present, so
estimators are ineīŦƒcient.
8 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Hypothesis Tests
Null hypothesis H0: a = 0 or H0: b = 0 .
Alternative hypothesis H1: a = 0 or H1: b = 0 .
If you reject H0, the paramater (a or b ) is
statistically diīŦ€erent from zero.
9 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Individual statistical signiīŦcance
Must determine if there is suīŦƒcient statistical evidence
to indicate that Y is truly related to X (i.e., b = 0)
Even if b = 0, it is possible that the sample will
produce an estimate b that is diīŦ€erent from zero
Test for statistical signiīŦcance using t-tests or p-values
10 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Individual signiīŦcance - t-Test
First determine the level of signiīŦcance (0.1%, 1%, 5%,
10%)
Probability of īŦnding a parameter estimate to be
statistically diīŦ€erent from zero when, in fact, it is zero
(alpha). Îą = 0.001, 0.01, 0.05, or 0.1, respectively.
Probability of a Type I Error (alpha).
1 – level of signiīŦcance (alpha) = level of conīŦdence
t-ratio is computed as t = b/Sb
where Sb
is the standard error of estimate b
Use t-table to choose critical t-value with n – k
degrees of freedom for the chosen level of signiīŦcance
n = number of observations
k = number of parameters estimated.
11 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Individual signiīŦcance-t-Test
If the absolute value of t-ratio is greater than the
critical t , the parameter estimate is statistically
signiīŦcant at the given level of signiīŦcance.
If t-ratio (in absolute value) is equal to 2 (or bigger
than 2) , you can reject H0.
12 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Individual signiīŦcance - p-Values
Treat as statistically signiīŦcant only those parameter
estimates with p-values smaller than the maximum
acceptable signiīŦcance level. p-value gives exact level
of signiīŦcance.
Also the probability of īŦnding signiīŦcance when none
exists
SigniīŦcance levels (alpha)
Îą = 0.001, or 0.1% signiīŦcance level
Îą = 0.01, or 1% signiīŦcance level
Îą = 0.05, or 5% signiīŦcance level
Îą = 0.1, or 10 % signiīŦcance level
E.g. if p-value = 0.00001, you reject H0 at 0.1%
signiīŦcance level, if p-value = 0.08, you reject H0 at
10% signiīŦcance level, and, if p-value = 0.14, you
cannot reject H0 at 10% signiīŦcance level
13 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Joint signiīŦcance -F-test
Used to test for signiīŦcance of overall regression
equation
Compare F-statistic to critical F-value from F-table
Two degrees of freedom, n – k & k – 1
Level of signiīŦcance
If F-statistic exceeds the critical F (=4), the regression
equation overall is statistically signiīŦcant at the
speciīŦed level of signiīŦcance.
14 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
CoeīŦƒcient of Determination: R2
R2
measures the percentage of total variation in the
dependent variable (Y ) that is explained by the
regression equation
Ranges from 0 to 1
High R2
indicates Y and X are highly correlated
E.g. R2
= 0.8 means that 80% of the changes of Y are
explained by the regression equation.
15 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Analysis: Motivation
Diagnosis
The assumption of normal, homoskedastic and
uncorrelated error terms that lead to BLUE
characteristic of OLS estimators are not necessarily
satisīŦed by the real models and data.
When dealing with spatial data you must give special
attention to the possibility that the errors or the
variables (Xs) in the model show spatial
dependence.
16 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Analysis: Motivation
What is spatial autocorrelation (dependence)
important?
We need to examine the inīŦ‚uences of spatial
autocorrelation upon the inferences that may be
drawn from statistical tests.
As these inferences are based on independence
assumptions (OLS asumptions), then the presence of
spatial autocorrelation is likely to bias any resultant
inference.
Dependence amongts error terms brings ineīŦƒcient
OLS estimates. Spatial Error (SEM).
OLS estimates are biased, and thus inferences based on
the regression model will be incorrect. Spatial Lag
(SAR).
17 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Analysis: Motivation
Applied work in regional science (economics, health,
demographics, etc.) uses of spatial data.
Spatial data: Data collected with reference to location.
Administrative spatial units (states, districts,
counties, etc.).
Functional regions (E.g. labour market regions).
Points in space (E.g. cities, municipalities, plants) .
Using spatial data, model estimation, hypothesis
testing and prediction have to allow for spatial
eīŦ€ects.
18 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Dependence
Lack of independence among spatial data,
Observations at location i depend on other
observations at locations j (= i).
Spatial dependence is associated with the notion of
relative space (location)
Neighbouring regions are expected to be more alike
than arbitrary regions.
Spatial dependence is expected to diminish with
increasing distance.
Spatial dependence are multidirectional by nature.
Time series is unidirectional.
19 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Dependence: Causes
Nuisance:
The delineation of spatial units is somewhat arbitrary.
Spatial data are usually collected for administrative
units (states, districts, counties, etc.).
If the correspondence between the spatial scale of a
phenomenon under study and the delineation of the
spatial units of observation is not strong,
measurement errors are to be expected.
OLS models can be corrected by including a spatial
error speciīŦcation in the model (SEM).
20 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Dependence: Causes
Substantive:
Interaction and dependence on the regional level may
be itself a modelling problem because it generats
model bias.
Location and distance are important forces at work in
human geography and market activity. E.g spatial
spillovers, hierarchy of places, etc..
This can be corrected by including an explicit spatial
lag term as an explanatory variable in the model
(SAR).
21 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Heterogeneity
It refers to varying economic relationships or
disturbances over space.
A diīŦ€erent relationship may hold for every spatial
unit. This situation characterizes the case of structural
instability.
In case of structural instability, the regression
coeīŦƒcients are not constant across the spatial units.
E.g. Sample: 35,000 homes sold within the last 5 years
in Lucas county, Ohio.
3 distinct distributions,with low-priced homes nearest
to the Central Business District(CBD) and high
priced homes farthest away from the CBD.
This suggests diīŦ€erent relationships may be at work
to describe home prices in diīŦ€erent locations.
22 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Weight Matrix (W)
Quantify location for analyzing spatial eīŦ€ects
Contiguity (neighbourhood)
The relative location among spatial units. Usually
established from a map.
Units near should reīŦ‚ect a greater degree of spatial
dependence than those more distant from each
other. For spatial heterogeneity, relationships may
be similar for neighbouring units.
Distance
Latitude and longitude allow us to calculate distances
from any point in space.
Spatial dependence will decline with distance.
For (spatial heterogeneity, closer units should
exhibit similar relationships.
23 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
W
In a regular grid, neighbours can be deīŦned in a
number of ways. Among others, you may īŦnd
In analogy of the game of chess, rook contiguity, bishop
contiguity and queen contiguity are distinguished.
Inverse distance raised to a power.
24 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
W: Rook contiguity
A spatial unit is a neighbour of another unit if both
areas share a common edge (side). In the next īŦgure,
the units B1, B2, B3 and B4 are neighbours of unit A
according to the rook criterion.
B2
B1 A B3
B4
25 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
W:Queen contiguity
A spatial unit is a neighbour of another unit if both
areas share a common edge or vertex. In the next
īŦgure the units B1, B2, B3 and B4 as well as C1, C2,
C3 and C4 are neighbours of unit A according to the
queen criterion.
C1 B2 C2
B1 A B3
C3 B4 C4
26 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
W:Distance-based spatial weight matrix
Spatial interaction will decline with increasing distance
due to increasing geographical impediments.
Nearer regions have a greater potential inīŦ‚uence.
Power function: Wij = 1/dij
Îŗ
, where
Îŗ is a power parameter
Wij element of matrix W at row i and column j
(i = j)
dij: distance between region i and region j
The distances, dij, are usually measured between the
centres of the regions (latitude and longitude).
27 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
W: Representing 5 regions
Rook Standardized Distance-based
īŖŽ
īŖ¯
īŖ¯
īŖ¯
īŖ¯
īŖ°
0 1 1 0 0
1 0 1 1 0
1 1 0 1 0
0 1 1 0 1
0 0 0 1 0
īŖš
īŖē
īŖē
īŖē
īŖē
īŖģ
īŖŽ
īŖ¯
īŖ¯
īŖ¯
īŖ¯
īŖ°
0 1
2
1
2
0 0
1
3
0 1
3
1
3
0
1
3
1
3
0 1
3
0
0 1
3
1
3
0 1
3
0 0 0 1 0
īŖš
īŖē
īŖē
īŖē
īŖē
īŖģ
īŖŽ
īŖ¯
īŖ¯
īŖ¯
īŖ¯
īŖ°
0 1
d12
1
d13
1
d14
1
d15
1
d21
0 1
d23
1
d24
1
d25
1
d31
1
d32
0 1
d34
1
d35
1
d41
1
d42
1
d43
0 1
d45
1
d51
1
d52
1
d53
1
d54
0
īŖš
īŖē
īŖē
īŖē
īŖē
īŖģ
Îŗ =1 & dij is the distance between i and j, i = j
28 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Testing Spatial Autocorrelation
Moran’s I - statistic: test for spatial dependence.
Pearson correlation: ΁xy = Sxy
SxSy
,
where Sxy is the covariance between x and y, Sx is the
standard deviation of x, and, Sy is the standard
deviation of y
Covariance formula
Sxy =
n
i=1(xi − ¯x)(yi − ¯y)
n − 1
, then
΁xy =
n
i=1(xi − ¯x)(yi − ¯y)
SxSy(n − 1)
29 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Moran’s I - statistic
Similarities between units i and j are calculated as the
product of the diīŦ€erences between xi (variable of
interest) and xj (spatial lag) with the overall mean (¯x),
divided by the sample variance. This ratio has to be
adjusted for the spatial weights used.
I =
n
n
i
n
j Wij
n
i
n
j Wij(xi − ¯x)(xj − ¯x)
n
i (xi − ¯x)2
where xi is the i-th observation, n is the sample size,
and Wij is the spatial weight between i and j.
30 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Moran’s I - statistic
The expected value of Moran’s I statistic: − 1/(n − 1)
E.g. if n = 48 regions ⇒ − 1/(48 − 1) = 0.0213, which is
close to zero, meaning no spatial autocorrelation.
Then, H0 : I = 0 and H1 : I = 0.
A standardized matrix bounds I between -1 and 1.
-1 means perfect clustering of dissimilar values (perfect
dispersion).
0 is no autocorrelation (perfect randomness)
1 means perfect clustering of similar values (spatial
autocorrelation).
31 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Lag (SAR)
1 OLS regression Y = a + bX + Îĩ
2 SAR (including W) : Y = ΁WY + a + bX + Îĩ
3 Y = (1 − ΁W)−1
a + (1 − ΁W)−1
bX + (1 − ΁W)−1
Îĩ
4 Where ΁ is a scalar parameter that indicates the eīŦ€ect
of the dependent variable in the neighbors on Y in the
focal area, intercept, (1 − ΁W)−1
a, slope, (1 − ΁W)−1
b
, and, error term, (1 − ΁W)−1
Îĩ
5 GeoDa reports 3) & ΁
6 Not including ΁W brings biased estimates and thus
inferences based on an OLS model will be incorrect
32 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Spatial Error Model (SEM)
1 OLS regression Y = a + bX + Îĩ
2 SEM (including W): Y = a + bX + Îĩ & Îĩ = ÎģWÎĩ + Âĩ
3 Y = a + bX + (1 − ÎģW)−1
Âĩ
4 Where: Îģ is the autoregressive coeīŦcient and Âĩ is
another error term, intercept a, slope b, and , error
term , (1 − ÎģW)−1
Âĩ
5 Geoda reports 3) & Îģ
6 Not including ÎģW brings unbiased estimates and
biased standard errors and consequently, t-tests &
p-values will be misleading.
33 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Income Convergence
Robert Solow (1956) “Capital should īŦ‚ow from
countries with a high capital-to-output ratio to
countries with a low capital-to-output ratio ”
“Poor” countries/regions/states should have higher
growth rates.
”rich” countries/regions/states should have lower
growth rates
The analysis using regions is called Regional Income
Convergence.
34 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Sigma Convergence, (΃- convergence)
It refers to decreasing variance of variables over time.
This is measured by the coeīŦƒcient of variation (CV)
which gives the relative standard deviation to the
mean (the standard deviation divided by mean).
Since CV is mean standardized, it controls for
increasing averages over time and can be directly
compared across diīŦ€erent variables.
When the CV of real per capita income across regions
falls over time, there is ΃-convergence .
35 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Beta Convergence, (β- convergence)
It considers the mobility of countries (regions).
It is deīŦned as a negative correlation between the
position of individual countries (regions) at the
beginning of an observation period and the changes or
growth rates over this period.
It assumes that growth from a low base is faster than
growth from high levels.
36 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
β- convergence
OLS regression model
LINC1i − LINC0i = a + βLINC0i + Îĩ0i
Where:
LINC1i is the īŦnal(1) per capita income for region i
in logs.
LINC0i is the initial(0) per capita income for region i
in logs.
LINC1i − LINC0i is the growth rate between the
īŦnal year and the initial year.
L stands for logs
Îĩ0i is an error term
37 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
β values
β > -1 and β <0 (β ∈ ]-1,0[) and signiīŦcant means
β-convergence.
β > 0 (β ∈ ]0, ∞+
[) and signiīŦcant means “divergence”
β = 0 , neither “convergence” nor “divergence”
β not signiīŦcant , neither “convergence” nor
“divergence”
38 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Convergence rate (θ)
θ = ln(1 + β)/(−k)
Where
k is diīŦ€erence between periods (E.g. k=1945-1929=16)
E.g if β = -0.2 and k= 16.
θ = ln(−0.2 + 1)/(−16)
θ = ln(0.8)/(−16)
θ = − 0.22/− 16 = 0.01375 or 1.4% (speed of
convergence).
This means that regions converge at a speed of 1.4
percent per year.
Note: ln(1)=0 and ln(0) does not exist, so, if β = -1 ,
θ does not exist, and , if β = 0 , θ =0
The logarithmic function does not take negative values.
39 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Rey and Montuori(1998) (R & M)
This is an article on regional income convergence
Their data includes 48 states and used four years in
their study (1929, 1945, 1946, and 1994). They
included neither Hawaii nor Alaska.
Three periods: 1929-94, 1929-45, and, 1946-94. So,
they run a cross-sectional analysis.
40 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa
GeoDa is a free and open source software tool that
serves for spatial data analysis.
You may download it from
http://geodacenter.github.io/download.html
The shapeīŦles (shp) are the most used īŦles.
A shapeīŦle stores nontopological geometry and
attribute information for the spatial features in a data
set. It includes an ID variable to identify regions.
A shapeīŦle consists of at least four actual īŦles, an
index īŦle (shx), a data base table (dbf) and a
projection īŦle (prj).
41 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa
For your research paper, īŦrst you have to choose a
country and gather your data in excel (or in Open
OīŦƒce/spreadsheet).
Download Open OīŦƒce from https:
//www.openoffice.org/download/index.html
Later, look for a shapeīŦle of the regions of that
country. This link is helpful
http://www.gadm.org/country.
Open that shapeīŦle in Geoda.
Create the variables that you will use. Table/Add
variable and set integer, 10 lentgh, and 3 decimals.
GeoDa will create empty columns.
Click “Table” and select (if you need to do it) the
regions that you will use. Do not include isolate
regions such as islands.
42 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa
Save as a new shapeīŦle (create a new directory).
Automatically, GeoDa creates a dbf, shx, and prj īŦles.
To include your data to the new shapeīŦle, you have to
open the new dbf īŦle using Open OīŦƒce
(spreadsheet/international/OK).
Check the correspondence between the regions of the
new dbf īŦle and the regions of your data. The order of
your data has to be equal to the order of the new dbf
īŦle.
Copy your data and paste it on the new dbf īŦle. Fill
the empty columns.
Save (Keep the current format)
43 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: US states
Open the new shapeīŦle with your gathered data
The variables that I have gathered are:
INC29: 1929 real per capita income
INC45: 1945 real per capita income
INC46: 1946 real per capita income
INC94: 1994 real per capita income
Three diīŦ€erent sample periods 1929-94, 1929-45, and,
1946-94
The initial year is 1929, the break year is 1945, and the
īŦnal year is 1994.
44 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: Calculating variables
Creating variables in logs Table/Add Variables
LINC29, LINC45, LINC46, and , LINC94. GeoDa
will create empty columns.
Table/Variable calculation/univariate set “LINC29” ,
operator “log (base e)” and variable “INC29”. Do the
same for the other variables.
Creating growth rates Table/Add Variables:
dI94I29, dI45I29, and, dI94I46, GeoDa will create
empty columns.
Table/Variable calculation/bivariate set “dI94I29” ,
variable, “, LINC94,” operator, “subtract”, variable,
“LINC29” . Do the same for the other growth rates.
45 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa US states - Descriptive Statistics
Click on Explore/Boxplot
46 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
US states: Exploring β-convergence
Explore/
Scatter plot/
1994-29
Y: “dI94I29”, X: ‘‘LINC29”, OK
1945-29
Y: “dI45I29”, X: ‘‘LINC29”, OK
1994-46
Y: ‘dI94I46”, X: ‘‘LINC46”, OK
You should get negative relationships
47 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
US states: Exploring β- convergence
1994-29 1945-29 1994-46
X : Initial income & Y : Growth rate.
At īŦrst glance, β- convergence holds.
48 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: Exploring Spatial Dependence
Map/Quantile Map/5 to check if there are spatial
patterns. Do you īŦnd any?
1929 per capita income 1945 per capita income
1994 per capita income
49 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: Creating W and Moran scatterplots
Spatial Matrix
Tools/Weights manager/create
Weights File ID variable “’GEOID’ .
Your shapeīŦle must have one ID variable
Queen Contiguity
Create/Save
Moran’s I
Space/
Univariate’s Moran’s I/
Set the variable you want to analyze/
Set W/Queen
The scatterplot enables you to assess how similar a
spatial unit is to its neighbors.
50 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: Moran scatterplot- state per capita income
X: Spatial units; Y: the weighted average or spatial lag
of the corresponding observation on the X axis.
1929 1945 1994
They show spatial dependence because there is a
positive correlation (See page 146, R & M)
51 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa US states - Exploring data
The CV of real per capita income in logs across US
states falls over time, so ΃-convergence holds
According to Moran’s I, data shows spatial
dependence.
Table: Descriptive Statistics
Mean Median SD CV Moran’s I
LINC29 6.35 6.38 0.38 0.06 0.65
LINC45 7.02 7.03 0.23 0.03 0.57
LINC94 9.96 9.95 0.13 0.01 0.35
52 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS regression
See slide 37 and R & M page 148, equation 4
Click on Regression
Dependent variable/ growth rate (E.g. dI94I29 )
Independent variable/ initial income (E.g. LINC29 )
Weights īŦle/
Classic: This will run classical OLS regression with
spatial dependence diagnostics, click Run.
Three regressions:
1 dI94I29i = a + βLINC29i + Îĩ29i
2 dI45I29i = a + βLINC29i + Îĩ29i
3 dI94I46i = a + βLINC46i + Îĩ46i
Where i : 1, 2, 3, ....., .48
53 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS regression-Outcomes: 1994-29
SUMMARY OF OUTPUT: ORDINARY LEAST SQUARES ESTIMATION
Dependent Variable : DIN94IN29 Number of Observations: 48
Mean dependent var : 3.61054 Number of Variables : 2
S.D. dependent var : 0.284673 Degrees of Freedom : 46
R-squared : 0.918195 F-statistic : 516.314
Adjusted R-squared : 0.916417 Prob(F-statistic) : 1.20184e-26
Sum squared residual: 0.318208 Log likelihood : 52.281
Sigma-square : 0.00691757 Akaike info criterion : -100.562
S.E. of regression : 0.0831719 Schwarz criterion : -96.8195
Sigma-square ML : 0.00662934
S.E of regression ML: 0.0814207
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error t-Statistic Probability
----------------------------------------------------------------------
-------
CONSTANT 8.25684 0.204832 40.3104 0.00000
LINC29 -0.732026 0.0322159 -22.7225 0.00000
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
MULTICOLLINEARITY CONDITION NUMBER 34.095520 (Extreme
Multicollinearity)
TEST ON NORMALITY OF ERRORS
TEST DF VALUE PROB
Jarque-Bera 2 1.0399 0.59456
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 0.0012 0.97181
Koenker-Bassett test 1 0.0013 0.97079
DIAGNOSTICS FOR SPATIAL DEPENDENCE FOR WEIGHT MATRIX : nuevoq
(row-standardized weights)
TEST MI/DF VALUE PROB
Moran's I (error) 0.1509 1.9658 0.04932
Lagrange Multiplier (lag) 1 3.5538 0.05941
Robust LM (lag) 1 1.9997 0.15733
Lagrange Multiplier (error) 1 2.1903 0.13888
Robust LM (error) 1 0.6362 0.42511
Lagrange Multiplier (SARMA) 2 4.1900 0.12307
============================== END OF REPORT
================================
54 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS regression-Outcomes: 1945-29
SUMMARY OF OUTPUT: ORDINARY LEAST SQUARES ESTIMATION
Dependent Variable : DIN45IN29 Number of Observations: 48
Mean dependent var : 0.674542 Number of Variables : 2
S.D. dependent var : 0.17481 Degrees of Freedom : 46
R-squared : 0.831930 F-statistic : 227.696
Adjusted R-squared : 0.828276 Prob(F-statistic) : 1.96164e-19
Sum squared residual: 0.246528 Log likelihood : 58.4065
Sigma-square : 0.0053593 Akaike info criterion : -112.813
S.E. of regression : 0.0732072 Schwarz criterion : -109.071
Sigma-square ML : 0.00513599
S.E of regression ML: 0.0716658
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error t-Statistic Probability
----------------------------------------------------------------------
-------
CONSTANT 3.39038 0.180291 18.8051 0.00000
LINC29 -0.427882 0.0283561 -15.0896 0.00000
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
MULTICOLLINEARITY CONDITION NUMBER 34.095520
TEST ON NORMALITY OF ERRORS
TEST DF VALUE PROB
Jarque-Bera 2 0.2160 0.89762
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 1.9185 0.16602
Koenker-Bassett test 1 2.2058 0.13749
SPECIFICATION ROBUST TEST
TEST DF VALUE PROB
White 2 2.3107 0.31495
DIAGNOSTICS FOR SPATIAL DEPENDENCE FOR WEIGHT MATRIX : nuevoq
(row-standardized weights)
TEST MI/DF VALUE PROB
Moran's I (error) 0.3815 4.3930 0.00001
Lagrange Multiplier (lag) 1 11.0500 0.00089
Robust LM (lag) 1 2.3441 0.12576
Lagrange Multiplier (error) 1 14.0018 0.00018
Robust LM (error) 1 5.2958 0.02138
Lagrange Multiplier (SARMA) 2 16.3459 0.00028
============================== END OF REPORT
================================
55 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS regression-Outcomes: 1994-46
SUMMARY OF OUTPUT: ORDINARY LEAST SQUARES ESTIMATION
Data set : nuevo
Dependent Variable : DIN94IN46 Number of Observations: 48
Mean dependent var : 2.91979 Number of Variables : 2
S.D. dependent var : 0.163036 Degrees of Freedom : 46
R-squared : 0.727570 F-statistic : 122.85
Adjusted R-squared : 0.721647 Prob(F-statistic) : 1.39578e-14
Sum squared residual: 0.347588 Log likelihood : 50.1615
Sigma-square : 0.00755626 Akaike info criterion : -96.3229
S.E. of regression : 0.0869268 Schwarz criterion : -92.5805
Sigma-square ML : 0.00724142
S.E of regression ML: 0.0850965
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error t-Statistic Probability
----------------------------------------------------------------------
-------
CONSTANT 7.07005 0.374654 18.8709 0.00000
LINC46 -0.589693 0.0532032 -11.0838 0.00000
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
MULTICOLLINEARITY CONDITION NUMBER 59.704369
TEST ON NORMALITY OF ERRORS
TEST DF VALUE PROB
Jarque-Bera 2 0.5390 0.76376
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 1.5535 0.21262
Koenker-Bassett test 1 1.4503 0.22849
SPECIFICATION ROBUST TEST
TEST DF VALUE PROB
White 2 1.6639 0.43519
DIAGNOSTICS FOR SPATIAL DEPENDENCE
FOR WEIGHT MATRIX : nuevoq (row-standardized weights)
TEST MI/DF VALUE PROB
Moran's I (error) 0.3141 3.6646 0.00025
Lagrange Multiplier (lag) 1 10.4680 0.00121
Robust LM (lag) 1 2.5955 0.10717
Lagrange Multiplier (error) 1 9.4918 0.00206
Robust LM (error) 1 1.6193 0.20319
Lagrange Multiplier (SARMA) 2 12.0873 0.00237
============================== END OF REPORT
================================
56 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS regression-Outcomes
With these outputs, you should be able to complete R
& M Table 2
You may īŦnd R2
s, AICs (Akaike Infomation
Criterion), βs, and, p − values,
Convergence rate(θ) is calculated using β (See slide 39)
Tests for spatial dependence: Robust LM (lag and
error) and Moran’s I (error).
Breusch-Pagan Test (test for Heteroskedasticity).
AIC: Value for model selection
57 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Reporting OLS outomes
Table: Unconditional model OLS estimation
R2 (΃2) AIC β (p − value) Convergence
rate (θ)
1929-94 0.918 -100.562 -0.732 0.020
( 0.007) (0.000)
1929-45 0.832 -112.813 -0.428 0.035
(0.005) (0.000)
1946-94 0.728 -96.323 -0.590 0.020
(0.008) (0.000)
Robust LM Robust LM Moran’s I (error)
(error) p-value (lag) p-value MI(p-value)
Diagnostics for spatial dependence
1929-94 0.425 0.157 0.1509 (0.049)
1929-45 0.021 0.126 0.3815 (0.000)
1946-94 0.203 0.107 0.3141(0.000)
Breusch-Pagan test
p-value
Diagnostics for heteroskedasticity
1929-94 0.972
1929-45 0.166
1946-94 0.213
58 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: Diagnostic Tests
Heteroskedasticiy: When regression errors do not
have a constant variance over all observations.
Breush-Pagan Test:
H0: homocedasticity ; H1: heteroskedasticity
Multicollinearity: High correlation between Xs
Condition number > 30 is considered suspect
Condition number =1 means a lack of multicollinearity
Non-normal errors: Most regression models assume
normal errors distributions
Jarque-Bera Test:
H0: normal errors ; H1: no existence of normal errors
AIC: Calculate AIC for each model with the same
data set, and the “best” model is the one with
minimum AIC value.
If p − value is greater than 0.1, you cannot reject H0
59 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS vs SAR & SEM
GeoDa reports Moran’s I (error), LM (lag), LM (error),
Robust LM (lag), and, Robust LM (error)
Moran’s I (error) is an extension of Moran’s I -statistic
to measure spatial autocorrelation in regression
models. It is useful to detect spatial dependence but
they do not allow to discriminate betweem SAR and
SEM.
H0: OLS ; H1: Spatial dependence
LM (error): H0: OLS ; H1: SEM
LM (lag): H0: OLS ; H1: SAR
If LMs are signiīŦcant (H0 is rejected) , focus on robust
tests.
If p − value is greater than 0.1, you cannot reject H0
60 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: OLS vs SAR & SEM
Robust LM (error): H0: OLS ; H1: SEM
Robust LM (lag): H0: OLS ; H1: SAR
if both robust measures are signiīŦcant, stick with the
more signiīŦcant.
If p − value is greater than 0.1, you cannot reject H0
61 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Interpretation of OLS outcomes
Results provide much support for β-convergence.
CoeīŦƒcients highly signicant and negative.
R2
above 0.7 in all three samples
Convergence rate over entire sample, 2% yearly but
īŦrst sub sample, 3.5%, second sub sample 2%
Moran’s I statistic (MI) provides very strong evidence
(See p-value) of spatial dependence
Robust tests point to the presence of spatial error
(SEM) rather than the spatial lag (SAR).
Breusch- Pagan test for heteroscedasticity is not
signiīŦcant in any of the sub-samples. Then, omit
further consideration of the spatial heterogeneity
models.
62 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa : SAR
See slide 32 and R & M page 150, equation 9
Click on Regression
Dependent variable/ growth rate (E.g. dI94I29 )
Independent variable/ initial income (E.g. LINC29 )
Weights īŦle/
Spatial lag
Three regressions:
1 dI94I29i = a + ΁WdI94I29i + βLINC29i + Îĩ29i
2 dI45I29i = a + ΁WdI45I29i + βLINC29i + Îĩ29i
3 dI94I46i = a + ΁WdI94I46i + βLINC46i + Îĩ46i
Where i : 1, 2, 3, ....., .48
63 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: SAR -Outcomes: 1994-29
SUMMARY OF OUTPUT: SPATIAL LAG MODEL - MAXIMUM LIKELIHOOD ESTIMATION
Data set : nuevo
Spatial Weight : nuevoq
Dependent Variable : DIN94IN29 Number of Observations: 48
Mean dependent var : 3.61054 Number of Variables : 3
S.D. dependent var : 0.284673 Degrees of Freedom : 45
Lag coeff. (Rho) : 0.153427
R-squared : 0.924712 Log likelihood : 54.1372
Sq. Correlation : - Akaike info criterion : -102.274
Sigma-square : 0.00610122 Schwarz criterion : -96.6607
S.E of regression : 0.0781103
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error z-value Probability
----------------------------------------------------------------------
-------
W_DIN94IN29 0.153427 0.0776567 1.97571 0.04819
CONSTANT 7.21331 0.560658 12.8658 0.00000
LINC29 -0.655089 0.0491648 -13.3243 0.00000
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 1.1563 0.28223
DIAGNOSTICS FOR SPATIAL DEPENDENCE
SPATIAL LAG DEPENDENCE FOR WEIGHT MATRIX : nuevoq
TEST DF VALUE PROB
Likelihood Ratio Test 1 3.7124 0.05401
============================== END OF REPORT
================================
64 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: SAR-Outcomes: 1945-29
SUMMARY OF OUTPUT: SPATIAL LAG MODEL - MAXIMUM LIKELIHOOD ESTIMATION
Data set : nuevo
Spatial Weight : nuevoq
Dependent Variable : DIN45IN29 Number of Observations: 48
Mean dependent var : 0.674542 Number of Variables : 3
S.D. dependent var : 0.17481 Degrees of Freedom : 45
Lag coeff. (Rho) : 0.295355
R-squared : 0.865881 Log likelihood : 63.2943
Sq. Correlation : - Akaike info criterion : -120.589
Sigma-square : 0.00409851 Schwarz criterion : -114.975
S.E of regression : 0.0640196
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error z-value Probability
----------------------------------------------------------------------
-------
W_DIN45IN29 0.295355 0.0974027 3.0323 0.00243
CONSTANT 2.64263 0.300699 8.78829 0.00000
LINC29 -0.341486 0.0388813 -8.78278 0.00000
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 3.4138 0.06465
DIAGNOSTICS FOR SPATIAL DEPENDENCE
SPATIAL LAG DEPENDENCE FOR WEIGHT MATRIX : nuevoq
TEST DF VALUE PROB
Likelihood Ratio Test 1 9.7755 0.00177
============================== END OF REPORT
================================
65 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa:SAR-Outcomes: 1994-46
SUMMARY OF OUTPUT: SPATIAL LAG MODEL - MAXIMUM LIKELIHOOD ESTIMATION
Data set : nuevo
Spatial Weight : nuevoq
Dependent Variable : DIN94IN46 Number of Observations: 48
Mean dependent var : 2.91979 Number of Variables : 3
S.D. dependent var : 0.163036 Degrees of Freedom : 45
Lag coeff. (Rho) : 0.350822
R-squared : 0.783057 Log likelihood : 54.8668
Sq. Correlation : - Akaike info criterion : -103.734
Sigma-square : 0.00576651 Schwarz criterion : -98.1199
S.E of regression : 0.0759375
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error z-value Probability
----------------------------------------------------------------------
-------
W_DIN94IN46 0.350822 0.11406 3.07577 0.00210
CONSTANT 5.02565 0.731279 6.87241 0.00000
LINC46 -0.445107 0.0651113 -6.8361 0.00000
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 4.2002 0.04042
DIAGNOSTICS FOR SPATIAL DEPENDENCE
SPATIAL LAG DEPENDENCE FOR WEIGHT MATRIX : nuevoq
TEST DF VALUE PROB
Likelihood Ratio Test 1 9.4106 0.00216
============================== END OF REPORT
================================
66 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa : SEM
See slide 33 and R & M page 149, equation 8
Click on Regression
Dependent variable/ growth rate (E.g. dI94I29 )
Independent variable/ initial income (E.g. LINC29 )
Weights īŦle/
Spatial error
Three regressions:
1 dI94I29i = a + βLINC29i + Îĩ29i ; Îĩ29i = ÎģWÎĩ29i + Âĩ29i
2 dI45I29i = a + βLINC29i + Îĩ29i ; Îĩ29i = ÎģWÎĩ29i + Âĩ29i
3 dI94I46i = a + βLINC46i + Îĩ46i ; Îĩ46i = ÎģWÎĩ46i + Âĩ46i
Where i : 1, 2, 3, ....., .48
67 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: SEM: 1994-29
SUMMARY OF OUTPUT: SPATIAL ERROR MODEL - MAXIMUM LIKELIHOOD ESTIMATION
Data set : nuevo
Spatial Weight : nuevoq
Dependent Variable : DIN94IN29 Number of Observations: 48
Mean dependent var : 3.610542 Number of Variables : 2
S.D. dependent var : 0.284673 Degrees of Freedom : 46
Lag coeff. (Lambda) : 0.254318
R-squared : 0.922600 R-squared (BUSE) : -
Sq. Correlation : - Log likelihood : 53.223613
Sigma-square : 0.00627236 Akaike info criterion : -102.447
S.E of regression : 0.0791982 Schwarz criterion : -98.7048
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error z-value Probability
----------------------------------------------------------------------
-------
CONSTANT 8.15606 0.231367 35.2516 0.00000
LINC29 -0.716327 0.0363589 -19.7016 0.00000
LAMBDA 0.254318 0.182314 1.39494 0.16303
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 0.2873 0.59194
DIAGNOSTICS FOR SPATIAL DEPENDENCE
SPATIAL ERROR DEPENDENCE FOR WEIGHT MATRIX : nuevoq
TEST DF VALUE PROB
Likelihood Ratio Test 1 1.8853 0.16973
============================== END OF REPORT
================================
68 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: SEM: 1945-29
SUMMARY OF OUTPUT: SPATIAL ERROR MODEL - MAXIMUM LIKELIHOOD ESTIMATION
Data set : nuevo
Spatial Weight : nuevoq
Dependent Variable : DIN45IN29 Number of Observations: 48
Mean dependent var : 0.674542 Number of Variables : 2
S.D. dependent var : 0.174810 Degrees of Freedom : 46
Lag coeff. (Lambda) : 0.580250
R-squared : 0.883041 R-squared (BUSE) : -
Sq. Correlation : - Log likelihood : 64.766993
Sigma-square : 0.0035741 Akaike info criterion : -125.534
S.E of regression : 0.0597838 Schwarz criterion : -121.792
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error z-value Probability
----------------------------------------------------------------------
-------
CONSTANT 3.21562 0.216223 14.8718 0.00000
LINC29 -0.39988 0.0338626 -11.8089 0.00000
LAMBDA 0.58025 0.131908 4.3989 0.00001
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 3.0394 0.08127
DIAGNOSTICS FOR SPATIAL DEPENDENCE
SPATIAL ERROR DEPENDENCE FOR WEIGHT MATRIX : nuevoq
TEST DF VALUE PROB
Likelihood Ratio Test 1 12.7210 0.00036
============================== END OF REPORT
================================
69 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa:SEM: 1994-46
SUMMARY OF OUTPUT: SPATIAL ERROR MODEL - MAXIMUM LIKELIHOOD ESTIMATION
Data set : nuevo
Spatial Weight : nuevoq
Dependent Variable : DIN94IN46 Number of Observations: 48
Mean dependent var : 2.919792 Number of Variables : 2
S.D. dependent var : 0.163036 Degrees of Freedom : 46
Lag coeff. (Lambda) : 0.433936
R-squared : 0.776745 R-squared (BUSE) : -
Sq. Correlation : - Log likelihood : 53.732096
Sigma-square : 0.00593429 Akaike info criterion : -103.464
S.E of regression : 0.0770343 Schwarz criterion : -99.7218
----------------------------------------------------------------------
-------
Variable Coefficient Std.Error z-value Probability
----------------------------------------------------------------------
-------
CONSTANT 6.64014 0.432628 15.3484 0.00000
LINC46 -0.529052 0.0613691 -8.62082 0.00000
LAMBDA 0.433936 0.157919 2.74785 0.00600
----------------------------------------------------------------------
-------
REGRESSION DIAGNOSTICS
DIAGNOSTICS FOR HETEROSKEDASTICITY
RANDOM COEFFICIENTS
TEST DF VALUE PROB
Breusch-Pagan test 1 1.5605 0.21160
DIAGNOSTICS FOR SPATIAL DEPENDENCE
SPATIAL ERROR DEPENDENCE FOR WEIGHT MATRIX : nuevoq
TEST DF VALUE PROB
Likelihood Ratio Test 1 7.1413 0.00753
============================== END OF REPORT
================================
70 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
GeoDa: SAR & SEM-Outcomes
With these outputs, you should be able to complete R
& M Table 3
You may īŦnd R2
s, AICs, βs, Îģs, ΁s and, p − values,
Convergence rate(θ) is calculated using β (See slide 39)
71 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Reporting SAR & SEM outomes
Table: Spatial Dependence Models
Model speciīŦcation AIC β Îģ, ΁ LM test
(p-value) p-value p-value
1929-94
Spatial error (ML) -102.447 -0.716 (0.000) 0.163 0.169
Spatial lag (ML) -102.274 -0.655(0.000) 0.048 0.054
1929-45
Spatial error (ML) -125.534 -0.399(0.000) 0.000 0.000
Spatial lag (ML) -102.447 -0.341(0.000) 0.002 0.002
1946-94
Spatial error (ML) -103.464 -0.529(0.000) 0.006 0.008
Spatial lag (ML) -103.734 -0.445(0.000) 0.002 0.002
Convergence rate (θ) based on the spatial error (ML) estimates
θ
1929-94 0.019
1929-45 0.032
1946-94 0.016
72 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Interpretation of SEM & SAR outcomes
For SEM, as expected, AIC indicates that the īŦt of
each of the three spatial models is superior to OLS.
βs are signiīŦcant and negative but diīŦ€erent from OLS
coeīŦƒcients.
OLS suīŦ€ers from a misspecication due to omitted
spatial dependence.
The coeīŦƒcients on error (Îģ) and lag(΁) terms are
signiīŦcant in the sub-samples. For the full sample, just
΁ is signiīŦcant
LM test indicates that there is not spatial dependence
remaining in SAR and SEM.
Including spatial dependence reduces the convergence
rates (θs)
Convergence rate over entire sample, 1.9% yearly but
īŦrst sub sample, 3.2%, second sub sample 1.6%
73 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Research paper
Choose a country (e.g. Canada, South Korea,
Portugal, Mexico, etc.), īŦnd the historical data of the
real per capita GDP (personal income, GSP(Gross
State Product)) of its
states/provinces/municipalities/regions, and, do an
income convergence analysis about them (β -
convergence and ΃ - convergence).
In addition, you have to choose an event that was
important for the economy of that country (e.g. a new
constitution, an improvement in the power system, an
strong devaluation of its currency, a natural disaster, a
war, etc.), so that,you may choose "four years" like
Rey and Montuori did.
74 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
Research paper
Sections: introduction, literature, methodology (data),
outcomes (explanations of them), and conclusions.
Your scholarly work must report:
Exploring β- convergence (Slide 48).
Exploring Spatial Dependence (Slide 49).
Moran scatterplots (Slide 51).
Exploring data - ΃- convergence & Moran’s I (Slide
52).
OLS outcomes, convergence rates, & spatial diagnostic
tests (Table slide 58).
SAR & SEM outcomes. Convergence rates (Table
slide 72).
75 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
References
1 Anselin, L (2005). Exploring Spatial Data with
GeoDaTM: A workbook
2 LeSage, J. and Pace R. K. (2009). “Introduction to
Spatial Econometrics” Taylor & Francis„ Boca Raton
3 Rey S. J. and Montouri B. D. (1999) “US Regional
Income Convergence: a Spatial Econometric
Perspective”, Regional Studies 33 , 143-156.
4 Solow, R. M. (1956). “A Contribution to the Theory of
Economic Growth” The Quarterly Journal of
Economics, 70(1), 65-94.
76 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App

Regional Income Convergence

  • 1.
    Regional Income Convergence:A Spatial Analysis Approach Prepared by CÊsar R. Sobrino Universidad del Turabo November 27, 2017 1 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 2.
    Outline 1 Regression Analysis OLSregression Assumptions and Tests 2 Spatial Econometrics Spatial Dependence & Spatial Heterogeneity Spatial Matrix (W) and Moran’s I statistic 3 Income Convergence ΃-convergence β -convergence and speed of convergence (θ) 4 GeoDa Managing shapeīŦles Creating Ws and Moran’s I statistic Regression 2 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 3.
    OLS: Ordinary LeastSquares Parameters The coeīŦƒcients in an equation that determine the exact mathematical relation among the variables (growth rate and initial income) Unknowns. Parameter estimation The process of īŦnding estimates of the numerical values of the parameters of an equation 3 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 4.
    OLS OLS The general purposeof linear regression is to īŦnd a (linear) relationship between the dependent variable and a set of explanatory variables. There can be cross-section or times series data. Bivariate form Y = a + bX + Îĩ Intercept parameter (a) gives value of Y where regression line crosses Y -axis (value of Y when X is zero. Slope parameter (b) gives the change in Y associated with a one-unit change in X : ∆Y /∆X 4 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 5.
    OLS Two objectives: Find agood match (īŦt) between a + bX and observed values of Y ( a and b are the regression coeīŦƒcients). Discover which of the explanatory variables (Xs) contribute signiīŦcantly to the linear relationship OLS accomplished both stated objectives in an optimal fashion according to some criteria, and is referred to as a Best Linear Unbiased Estimator (BLUE) OLS estimates (a and b) are found minimizing the sum of the squared prediction errors (hence least squares). 5 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 6.
    OLS The OLS regressionline (red one) is that minimizes the sum of the squared prediction errors 6 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 7.
    OLS In order toobtain the BLUE property and to able to make statistical inferences about the population parameters (a and b) by means of your estimates (a and b), you need to make certain assumptions about the random part of the regression equation (the random error Îĩ) Two of these assumptions are crucial to obtain the unbiasedness and eīŦƒciency of the OLS estimates. 7 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 8.
    OLS Assumptions The random error(Îĩ) has mean zero (there is no systematic misspeciīŦcation or bias in the regression equation). Expected value: E(Îĩ) = 0 If E(Îĩ) = 0 does not hold, estimators are biased The random error terms are uncorrelated and have a constant variance (they are homoskedastic). Variance: E(ÎĩÎĩ ) = ΃2 I If E(ÎĩÎĩ ) = ΃2 I does not hold, this means that either autocorrelation or heteroskedasticity are present, so estimators are ineīŦƒcient. 8 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 9.
    Hypothesis Tests Null hypothesisH0: a = 0 or H0: b = 0 . Alternative hypothesis H1: a = 0 or H1: b = 0 . If you reject H0, the paramater (a or b ) is statistically diīŦ€erent from zero. 9 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 10.
    Individual statistical signiīŦcance Mustdetermine if there is suīŦƒcient statistical evidence to indicate that Y is truly related to X (i.e., b = 0) Even if b = 0, it is possible that the sample will produce an estimate b that is diīŦ€erent from zero Test for statistical signiīŦcance using t-tests or p-values 10 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 11.
    Individual signiīŦcance -t-Test First determine the level of signiīŦcance (0.1%, 1%, 5%, 10%) Probability of īŦnding a parameter estimate to be statistically diīŦ€erent from zero when, in fact, it is zero (alpha). Îą = 0.001, 0.01, 0.05, or 0.1, respectively. Probability of a Type I Error (alpha). 1 – level of signiīŦcance (alpha) = level of conīŦdence t-ratio is computed as t = b/Sb where Sb is the standard error of estimate b Use t-table to choose critical t-value with n – k degrees of freedom for the chosen level of signiīŦcance n = number of observations k = number of parameters estimated. 11 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 12.
    Individual signiīŦcance-t-Test If theabsolute value of t-ratio is greater than the critical t , the parameter estimate is statistically signiīŦcant at the given level of signiīŦcance. If t-ratio (in absolute value) is equal to 2 (or bigger than 2) , you can reject H0. 12 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 13.
    Individual signiīŦcance -p-Values Treat as statistically signiīŦcant only those parameter estimates with p-values smaller than the maximum acceptable signiīŦcance level. p-value gives exact level of signiīŦcance. Also the probability of īŦnding signiīŦcance when none exists SigniīŦcance levels (alpha) Îą = 0.001, or 0.1% signiīŦcance level Îą = 0.01, or 1% signiīŦcance level Îą = 0.05, or 5% signiīŦcance level Îą = 0.1, or 10 % signiīŦcance level E.g. if p-value = 0.00001, you reject H0 at 0.1% signiīŦcance level, if p-value = 0.08, you reject H0 at 10% signiīŦcance level, and, if p-value = 0.14, you cannot reject H0 at 10% signiīŦcance level 13 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 14.
    Joint signiīŦcance -F-test Usedto test for signiīŦcance of overall regression equation Compare F-statistic to critical F-value from F-table Two degrees of freedom, n – k & k – 1 Level of signiīŦcance If F-statistic exceeds the critical F (=4), the regression equation overall is statistically signiīŦcant at the speciīŦed level of signiīŦcance. 14 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 15.
    CoeīŦƒcient of Determination:R2 R2 measures the percentage of total variation in the dependent variable (Y ) that is explained by the regression equation Ranges from 0 to 1 High R2 indicates Y and X are highly correlated E.g. R2 = 0.8 means that 80% of the changes of Y are explained by the regression equation. 15 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 16.
    Spatial Analysis: Motivation Diagnosis Theassumption of normal, homoskedastic and uncorrelated error terms that lead to BLUE characteristic of OLS estimators are not necessarily satisīŦed by the real models and data. When dealing with spatial data you must give special attention to the possibility that the errors or the variables (Xs) in the model show spatial dependence. 16 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 17.
    Spatial Analysis: Motivation Whatis spatial autocorrelation (dependence) important? We need to examine the inīŦ‚uences of spatial autocorrelation upon the inferences that may be drawn from statistical tests. As these inferences are based on independence assumptions (OLS asumptions), then the presence of spatial autocorrelation is likely to bias any resultant inference. Dependence amongts error terms brings ineīŦƒcient OLS estimates. Spatial Error (SEM). OLS estimates are biased, and thus inferences based on the regression model will be incorrect. Spatial Lag (SAR). 17 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 18.
    Spatial Analysis: Motivation Appliedwork in regional science (economics, health, demographics, etc.) uses of spatial data. Spatial data: Data collected with reference to location. Administrative spatial units (states, districts, counties, etc.). Functional regions (E.g. labour market regions). Points in space (E.g. cities, municipalities, plants) . Using spatial data, model estimation, hypothesis testing and prediction have to allow for spatial eīŦ€ects. 18 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 19.
    Spatial Dependence Lack ofindependence among spatial data, Observations at location i depend on other observations at locations j (= i). Spatial dependence is associated with the notion of relative space (location) Neighbouring regions are expected to be more alike than arbitrary regions. Spatial dependence is expected to diminish with increasing distance. Spatial dependence are multidirectional by nature. Time series is unidirectional. 19 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 20.
    Spatial Dependence: Causes Nuisance: Thedelineation of spatial units is somewhat arbitrary. Spatial data are usually collected for administrative units (states, districts, counties, etc.). If the correspondence between the spatial scale of a phenomenon under study and the delineation of the spatial units of observation is not strong, measurement errors are to be expected. OLS models can be corrected by including a spatial error speciīŦcation in the model (SEM). 20 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 21.
    Spatial Dependence: Causes Substantive: Interactionand dependence on the regional level may be itself a modelling problem because it generats model bias. Location and distance are important forces at work in human geography and market activity. E.g spatial spillovers, hierarchy of places, etc.. This can be corrected by including an explicit spatial lag term as an explanatory variable in the model (SAR). 21 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 22.
    Spatial Heterogeneity It refersto varying economic relationships or disturbances over space. A diīŦ€erent relationship may hold for every spatial unit. This situation characterizes the case of structural instability. In case of structural instability, the regression coeīŦƒcients are not constant across the spatial units. E.g. Sample: 35,000 homes sold within the last 5 years in Lucas county, Ohio. 3 distinct distributions,with low-priced homes nearest to the Central Business District(CBD) and high priced homes farthest away from the CBD. This suggests diīŦ€erent relationships may be at work to describe home prices in diīŦ€erent locations. 22 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 23.
    Spatial Weight Matrix(W) Quantify location for analyzing spatial eīŦ€ects Contiguity (neighbourhood) The relative location among spatial units. Usually established from a map. Units near should reīŦ‚ect a greater degree of spatial dependence than those more distant from each other. For spatial heterogeneity, relationships may be similar for neighbouring units. Distance Latitude and longitude allow us to calculate distances from any point in space. Spatial dependence will decline with distance. For (spatial heterogeneity, closer units should exhibit similar relationships. 23 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 24.
    W In a regulargrid, neighbours can be deīŦned in a number of ways. Among others, you may īŦnd In analogy of the game of chess, rook contiguity, bishop contiguity and queen contiguity are distinguished. Inverse distance raised to a power. 24 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 25.
    W: Rook contiguity Aspatial unit is a neighbour of another unit if both areas share a common edge (side). In the next īŦgure, the units B1, B2, B3 and B4 are neighbours of unit A according to the rook criterion. B2 B1 A B3 B4 25 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 26.
    W:Queen contiguity A spatialunit is a neighbour of another unit if both areas share a common edge or vertex. In the next īŦgure the units B1, B2, B3 and B4 as well as C1, C2, C3 and C4 are neighbours of unit A according to the queen criterion. C1 B2 C2 B1 A B3 C3 B4 C4 26 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 27.
    W:Distance-based spatial weightmatrix Spatial interaction will decline with increasing distance due to increasing geographical impediments. Nearer regions have a greater potential inīŦ‚uence. Power function: Wij = 1/dij Îŗ , where Îŗ is a power parameter Wij element of matrix W at row i and column j (i = j) dij: distance between region i and region j The distances, dij, are usually measured between the centres of the regions (latitude and longitude). 27 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 28.
    W: Representing 5regions Rook Standardized Distance-based īŖŽ īŖ¯ īŖ¯ īŖ¯ īŖ¯ īŖ° 0 1 1 0 0 1 0 1 1 0 1 1 0 1 0 0 1 1 0 1 0 0 0 1 0 īŖš īŖē īŖē īŖē īŖē īŖģ īŖŽ īŖ¯ īŖ¯ īŖ¯ īŖ¯ īŖ° 0 1 2 1 2 0 0 1 3 0 1 3 1 3 0 1 3 1 3 0 1 3 0 0 1 3 1 3 0 1 3 0 0 0 1 0 īŖš īŖē īŖē īŖē īŖē īŖģ īŖŽ īŖ¯ īŖ¯ īŖ¯ īŖ¯ īŖ° 0 1 d12 1 d13 1 d14 1 d15 1 d21 0 1 d23 1 d24 1 d25 1 d31 1 d32 0 1 d34 1 d35 1 d41 1 d42 1 d43 0 1 d45 1 d51 1 d52 1 d53 1 d54 0 īŖš īŖē īŖē īŖē īŖē īŖģ Îŗ =1 & dij is the distance between i and j, i = j 28 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 29.
    Testing Spatial Autocorrelation Moran’sI - statistic: test for spatial dependence. Pearson correlation: ΁xy = Sxy SxSy , where Sxy is the covariance between x and y, Sx is the standard deviation of x, and, Sy is the standard deviation of y Covariance formula Sxy = n i=1(xi − ¯x)(yi − ¯y) n − 1 , then ΁xy = n i=1(xi − ¯x)(yi − ¯y) SxSy(n − 1) 29 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 30.
    Moran’s I -statistic Similarities between units i and j are calculated as the product of the diīŦ€erences between xi (variable of interest) and xj (spatial lag) with the overall mean (¯x), divided by the sample variance. This ratio has to be adjusted for the spatial weights used. I = n n i n j Wij n i n j Wij(xi − ¯x)(xj − ¯x) n i (xi − ¯x)2 where xi is the i-th observation, n is the sample size, and Wij is the spatial weight between i and j. 30 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 31.
    Moran’s I -statistic The expected value of Moran’s I statistic: − 1/(n − 1) E.g. if n = 48 regions ⇒ − 1/(48 − 1) = 0.0213, which is close to zero, meaning no spatial autocorrelation. Then, H0 : I = 0 and H1 : I = 0. A standardized matrix bounds I between -1 and 1. -1 means perfect clustering of dissimilar values (perfect dispersion). 0 is no autocorrelation (perfect randomness) 1 means perfect clustering of similar values (spatial autocorrelation). 31 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 32.
    Spatial Lag (SAR) 1OLS regression Y = a + bX + Îĩ 2 SAR (including W) : Y = ΁WY + a + bX + Îĩ 3 Y = (1 − ΁W)−1 a + (1 − ΁W)−1 bX + (1 − ΁W)−1 Îĩ 4 Where ΁ is a scalar parameter that indicates the eīŦ€ect of the dependent variable in the neighbors on Y in the focal area, intercept, (1 − ΁W)−1 a, slope, (1 − ΁W)−1 b , and, error term, (1 − ΁W)−1 Îĩ 5 GeoDa reports 3) & ΁ 6 Not including ΁W brings biased estimates and thus inferences based on an OLS model will be incorrect 32 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 33.
    Spatial Error Model(SEM) 1 OLS regression Y = a + bX + Îĩ 2 SEM (including W): Y = a + bX + Îĩ & Îĩ = ÎģWÎĩ + Âĩ 3 Y = a + bX + (1 − ÎģW)−1 Âĩ 4 Where: Îģ is the autoregressive coeīŦcient and Âĩ is another error term, intercept a, slope b, and , error term , (1 − ÎģW)−1 Âĩ 5 Geoda reports 3) & Îģ 6 Not including ÎģW brings unbiased estimates and biased standard errors and consequently, t-tests & p-values will be misleading. 33 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 34.
    Income Convergence Robert Solow(1956) “Capital should īŦ‚ow from countries with a high capital-to-output ratio to countries with a low capital-to-output ratio ” “Poor” countries/regions/states should have higher growth rates. ”rich” countries/regions/states should have lower growth rates The analysis using regions is called Regional Income Convergence. 34 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 35.
    Sigma Convergence, (΃-convergence) It refers to decreasing variance of variables over time. This is measured by the coeīŦƒcient of variation (CV) which gives the relative standard deviation to the mean (the standard deviation divided by mean). Since CV is mean standardized, it controls for increasing averages over time and can be directly compared across diīŦ€erent variables. When the CV of real per capita income across regions falls over time, there is ΃-convergence . 35 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 36.
    Beta Convergence, (β-convergence) It considers the mobility of countries (regions). It is deīŦned as a negative correlation between the position of individual countries (regions) at the beginning of an observation period and the changes or growth rates over this period. It assumes that growth from a low base is faster than growth from high levels. 36 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 37.
    β- convergence OLS regressionmodel LINC1i − LINC0i = a + βLINC0i + Îĩ0i Where: LINC1i is the īŦnal(1) per capita income for region i in logs. LINC0i is the initial(0) per capita income for region i in logs. LINC1i − LINC0i is the growth rate between the īŦnal year and the initial year. L stands for logs Îĩ0i is an error term 37 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 38.
    β values β >-1 and β <0 (β ∈ ]-1,0[) and signiīŦcant means β-convergence. β > 0 (β ∈ ]0, ∞+ [) and signiīŦcant means “divergence” β = 0 , neither “convergence” nor “divergence” β not signiīŦcant , neither “convergence” nor “divergence” 38 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 39.
    Convergence rate (θ) θ= ln(1 + β)/(−k) Where k is diīŦ€erence between periods (E.g. k=1945-1929=16) E.g if β = -0.2 and k= 16. θ = ln(−0.2 + 1)/(−16) θ = ln(0.8)/(−16) θ = − 0.22/− 16 = 0.01375 or 1.4% (speed of convergence). This means that regions converge at a speed of 1.4 percent per year. Note: ln(1)=0 and ln(0) does not exist, so, if β = -1 , θ does not exist, and , if β = 0 , θ =0 The logarithmic function does not take negative values. 39 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 40.
    Rey and Montuori(1998)(R & M) This is an article on regional income convergence Their data includes 48 states and used four years in their study (1929, 1945, 1946, and 1994). They included neither Hawaii nor Alaska. Three periods: 1929-94, 1929-45, and, 1946-94. So, they run a cross-sectional analysis. 40 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 41.
    GeoDa GeoDa is afree and open source software tool that serves for spatial data analysis. You may download it from http://geodacenter.github.io/download.html The shapeīŦles (shp) are the most used īŦles. A shapeīŦle stores nontopological geometry and attribute information for the spatial features in a data set. It includes an ID variable to identify regions. A shapeīŦle consists of at least four actual īŦles, an index īŦle (shx), a data base table (dbf) and a projection īŦle (prj). 41 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 42.
    GeoDa For your researchpaper, īŦrst you have to choose a country and gather your data in excel (or in Open OīŦƒce/spreadsheet). Download Open OīŦƒce from https: //www.openoffice.org/download/index.html Later, look for a shapeīŦle of the regions of that country. This link is helpful http://www.gadm.org/country. Open that shapeīŦle in Geoda. Create the variables that you will use. Table/Add variable and set integer, 10 lentgh, and 3 decimals. GeoDa will create empty columns. Click “Table” and select (if you need to do it) the regions that you will use. Do not include isolate regions such as islands. 42 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 43.
    GeoDa Save as anew shapeīŦle (create a new directory). Automatically, GeoDa creates a dbf, shx, and prj īŦles. To include your data to the new shapeīŦle, you have to open the new dbf īŦle using Open OīŦƒce (spreadsheet/international/OK). Check the correspondence between the regions of the new dbf īŦle and the regions of your data. The order of your data has to be equal to the order of the new dbf īŦle. Copy your data and paste it on the new dbf īŦle. Fill the empty columns. Save (Keep the current format) 43 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 44.
    GeoDa: US states Openthe new shapeīŦle with your gathered data The variables that I have gathered are: INC29: 1929 real per capita income INC45: 1945 real per capita income INC46: 1946 real per capita income INC94: 1994 real per capita income Three diīŦ€erent sample periods 1929-94, 1929-45, and, 1946-94 The initial year is 1929, the break year is 1945, and the īŦnal year is 1994. 44 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 45.
    GeoDa: Calculating variables Creatingvariables in logs Table/Add Variables LINC29, LINC45, LINC46, and , LINC94. GeoDa will create empty columns. Table/Variable calculation/univariate set “LINC29” , operator “log (base e)” and variable “INC29”. Do the same for the other variables. Creating growth rates Table/Add Variables: dI94I29, dI45I29, and, dI94I46, GeoDa will create empty columns. Table/Variable calculation/bivariate set “dI94I29” , variable, “, LINC94,” operator, “subtract”, variable, “LINC29” . Do the same for the other growth rates. 45 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 46.
    GeoDa US states- Descriptive Statistics Click on Explore/Boxplot 46 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 47.
    US states: Exploringβ-convergence Explore/ Scatter plot/ 1994-29 Y: “dI94I29”, X: ‘‘LINC29”, OK 1945-29 Y: “dI45I29”, X: ‘‘LINC29”, OK 1994-46 Y: ‘dI94I46”, X: ‘‘LINC46”, OK You should get negative relationships 47 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 48.
    US states: Exploringβ- convergence 1994-29 1945-29 1994-46 X : Initial income & Y : Growth rate. At īŦrst glance, β- convergence holds. 48 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 49.
    GeoDa: Exploring SpatialDependence Map/Quantile Map/5 to check if there are spatial patterns. Do you īŦnd any? 1929 per capita income 1945 per capita income 1994 per capita income 49 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 50.
    GeoDa: Creating Wand Moran scatterplots Spatial Matrix Tools/Weights manager/create Weights File ID variable “’GEOID’ . Your shapeīŦle must have one ID variable Queen Contiguity Create/Save Moran’s I Space/ Univariate’s Moran’s I/ Set the variable you want to analyze/ Set W/Queen The scatterplot enables you to assess how similar a spatial unit is to its neighbors. 50 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 51.
    GeoDa: Moran scatterplot-state per capita income X: Spatial units; Y: the weighted average or spatial lag of the corresponding observation on the X axis. 1929 1945 1994 They show spatial dependence because there is a positive correlation (See page 146, R & M) 51 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 52.
    GeoDa US states- Exploring data The CV of real per capita income in logs across US states falls over time, so ΃-convergence holds According to Moran’s I, data shows spatial dependence. Table: Descriptive Statistics Mean Median SD CV Moran’s I LINC29 6.35 6.38 0.38 0.06 0.65 LINC45 7.02 7.03 0.23 0.03 0.57 LINC94 9.96 9.95 0.13 0.01 0.35 52 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 53.
    GeoDa: OLS regression Seeslide 37 and R & M page 148, equation 4 Click on Regression Dependent variable/ growth rate (E.g. dI94I29 ) Independent variable/ initial income (E.g. LINC29 ) Weights īŦle/ Classic: This will run classical OLS regression with spatial dependence diagnostics, click Run. Three regressions: 1 dI94I29i = a + βLINC29i + Îĩ29i 2 dI45I29i = a + βLINC29i + Îĩ29i 3 dI94I46i = a + βLINC46i + Îĩ46i Where i : 1, 2, 3, ....., .48 53 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 54.
    GeoDa: OLS regression-Outcomes:1994-29 SUMMARY OF OUTPUT: ORDINARY LEAST SQUARES ESTIMATION Dependent Variable : DIN94IN29 Number of Observations: 48 Mean dependent var : 3.61054 Number of Variables : 2 S.D. dependent var : 0.284673 Degrees of Freedom : 46 R-squared : 0.918195 F-statistic : 516.314 Adjusted R-squared : 0.916417 Prob(F-statistic) : 1.20184e-26 Sum squared residual: 0.318208 Log likelihood : 52.281 Sigma-square : 0.00691757 Akaike info criterion : -100.562 S.E. of regression : 0.0831719 Schwarz criterion : -96.8195 Sigma-square ML : 0.00662934 S.E of regression ML: 0.0814207 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error t-Statistic Probability ---------------------------------------------------------------------- ------- CONSTANT 8.25684 0.204832 40.3104 0.00000 LINC29 -0.732026 0.0322159 -22.7225 0.00000 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS MULTICOLLINEARITY CONDITION NUMBER 34.095520 (Extreme Multicollinearity) TEST ON NORMALITY OF ERRORS TEST DF VALUE PROB Jarque-Bera 2 1.0399 0.59456 DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 0.0012 0.97181 Koenker-Bassett test 1 0.0013 0.97079 DIAGNOSTICS FOR SPATIAL DEPENDENCE FOR WEIGHT MATRIX : nuevoq (row-standardized weights) TEST MI/DF VALUE PROB Moran's I (error) 0.1509 1.9658 0.04932 Lagrange Multiplier (lag) 1 3.5538 0.05941 Robust LM (lag) 1 1.9997 0.15733 Lagrange Multiplier (error) 1 2.1903 0.13888 Robust LM (error) 1 0.6362 0.42511 Lagrange Multiplier (SARMA) 2 4.1900 0.12307 ============================== END OF REPORT ================================ 54 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 55.
    GeoDa: OLS regression-Outcomes:1945-29 SUMMARY OF OUTPUT: ORDINARY LEAST SQUARES ESTIMATION Dependent Variable : DIN45IN29 Number of Observations: 48 Mean dependent var : 0.674542 Number of Variables : 2 S.D. dependent var : 0.17481 Degrees of Freedom : 46 R-squared : 0.831930 F-statistic : 227.696 Adjusted R-squared : 0.828276 Prob(F-statistic) : 1.96164e-19 Sum squared residual: 0.246528 Log likelihood : 58.4065 Sigma-square : 0.0053593 Akaike info criterion : -112.813 S.E. of regression : 0.0732072 Schwarz criterion : -109.071 Sigma-square ML : 0.00513599 S.E of regression ML: 0.0716658 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error t-Statistic Probability ---------------------------------------------------------------------- ------- CONSTANT 3.39038 0.180291 18.8051 0.00000 LINC29 -0.427882 0.0283561 -15.0896 0.00000 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS MULTICOLLINEARITY CONDITION NUMBER 34.095520 TEST ON NORMALITY OF ERRORS TEST DF VALUE PROB Jarque-Bera 2 0.2160 0.89762 DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 1.9185 0.16602 Koenker-Bassett test 1 2.2058 0.13749 SPECIFICATION ROBUST TEST TEST DF VALUE PROB White 2 2.3107 0.31495 DIAGNOSTICS FOR SPATIAL DEPENDENCE FOR WEIGHT MATRIX : nuevoq (row-standardized weights) TEST MI/DF VALUE PROB Moran's I (error) 0.3815 4.3930 0.00001 Lagrange Multiplier (lag) 1 11.0500 0.00089 Robust LM (lag) 1 2.3441 0.12576 Lagrange Multiplier (error) 1 14.0018 0.00018 Robust LM (error) 1 5.2958 0.02138 Lagrange Multiplier (SARMA) 2 16.3459 0.00028 ============================== END OF REPORT ================================ 55 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 56.
    GeoDa: OLS regression-Outcomes:1994-46 SUMMARY OF OUTPUT: ORDINARY LEAST SQUARES ESTIMATION Data set : nuevo Dependent Variable : DIN94IN46 Number of Observations: 48 Mean dependent var : 2.91979 Number of Variables : 2 S.D. dependent var : 0.163036 Degrees of Freedom : 46 R-squared : 0.727570 F-statistic : 122.85 Adjusted R-squared : 0.721647 Prob(F-statistic) : 1.39578e-14 Sum squared residual: 0.347588 Log likelihood : 50.1615 Sigma-square : 0.00755626 Akaike info criterion : -96.3229 S.E. of regression : 0.0869268 Schwarz criterion : -92.5805 Sigma-square ML : 0.00724142 S.E of regression ML: 0.0850965 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error t-Statistic Probability ---------------------------------------------------------------------- ------- CONSTANT 7.07005 0.374654 18.8709 0.00000 LINC46 -0.589693 0.0532032 -11.0838 0.00000 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS MULTICOLLINEARITY CONDITION NUMBER 59.704369 TEST ON NORMALITY OF ERRORS TEST DF VALUE PROB Jarque-Bera 2 0.5390 0.76376 DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 1.5535 0.21262 Koenker-Bassett test 1 1.4503 0.22849 SPECIFICATION ROBUST TEST TEST DF VALUE PROB White 2 1.6639 0.43519 DIAGNOSTICS FOR SPATIAL DEPENDENCE FOR WEIGHT MATRIX : nuevoq (row-standardized weights) TEST MI/DF VALUE PROB Moran's I (error) 0.3141 3.6646 0.00025 Lagrange Multiplier (lag) 1 10.4680 0.00121 Robust LM (lag) 1 2.5955 0.10717 Lagrange Multiplier (error) 1 9.4918 0.00206 Robust LM (error) 1 1.6193 0.20319 Lagrange Multiplier (SARMA) 2 12.0873 0.00237 ============================== END OF REPORT ================================ 56 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 57.
    GeoDa: OLS regression-Outcomes Withthese outputs, you should be able to complete R & M Table 2 You may īŦnd R2 s, AICs (Akaike Infomation Criterion), βs, and, p − values, Convergence rate(θ) is calculated using β (See slide 39) Tests for spatial dependence: Robust LM (lag and error) and Moran’s I (error). Breusch-Pagan Test (test for Heteroskedasticity). AIC: Value for model selection 57 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 58.
    Reporting OLS outomes Table:Unconditional model OLS estimation R2 (΃2) AIC β (p − value) Convergence rate (θ) 1929-94 0.918 -100.562 -0.732 0.020 ( 0.007) (0.000) 1929-45 0.832 -112.813 -0.428 0.035 (0.005) (0.000) 1946-94 0.728 -96.323 -0.590 0.020 (0.008) (0.000) Robust LM Robust LM Moran’s I (error) (error) p-value (lag) p-value MI(p-value) Diagnostics for spatial dependence 1929-94 0.425 0.157 0.1509 (0.049) 1929-45 0.021 0.126 0.3815 (0.000) 1946-94 0.203 0.107 0.3141(0.000) Breusch-Pagan test p-value Diagnostics for heteroskedasticity 1929-94 0.972 1929-45 0.166 1946-94 0.213 58 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 59.
    GeoDa: Diagnostic Tests Heteroskedasticiy:When regression errors do not have a constant variance over all observations. Breush-Pagan Test: H0: homocedasticity ; H1: heteroskedasticity Multicollinearity: High correlation between Xs Condition number > 30 is considered suspect Condition number =1 means a lack of multicollinearity Non-normal errors: Most regression models assume normal errors distributions Jarque-Bera Test: H0: normal errors ; H1: no existence of normal errors AIC: Calculate AIC for each model with the same data set, and the “best” model is the one with minimum AIC value. If p − value is greater than 0.1, you cannot reject H0 59 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 60.
    GeoDa: OLS vsSAR & SEM GeoDa reports Moran’s I (error), LM (lag), LM (error), Robust LM (lag), and, Robust LM (error) Moran’s I (error) is an extension of Moran’s I -statistic to measure spatial autocorrelation in regression models. It is useful to detect spatial dependence but they do not allow to discriminate betweem SAR and SEM. H0: OLS ; H1: Spatial dependence LM (error): H0: OLS ; H1: SEM LM (lag): H0: OLS ; H1: SAR If LMs are signiīŦcant (H0 is rejected) , focus on robust tests. If p − value is greater than 0.1, you cannot reject H0 60 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 61.
    GeoDa: OLS vsSAR & SEM Robust LM (error): H0: OLS ; H1: SEM Robust LM (lag): H0: OLS ; H1: SAR if both robust measures are signiīŦcant, stick with the more signiīŦcant. If p − value is greater than 0.1, you cannot reject H0 61 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 62.
    Interpretation of OLSoutcomes Results provide much support for β-convergence. CoeīŦƒcients highly signicant and negative. R2 above 0.7 in all three samples Convergence rate over entire sample, 2% yearly but īŦrst sub sample, 3.5%, second sub sample 2% Moran’s I statistic (MI) provides very strong evidence (See p-value) of spatial dependence Robust tests point to the presence of spatial error (SEM) rather than the spatial lag (SAR). Breusch- Pagan test for heteroscedasticity is not signiīŦcant in any of the sub-samples. Then, omit further consideration of the spatial heterogeneity models. 62 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 63.
    GeoDa : SAR Seeslide 32 and R & M page 150, equation 9 Click on Regression Dependent variable/ growth rate (E.g. dI94I29 ) Independent variable/ initial income (E.g. LINC29 ) Weights īŦle/ Spatial lag Three regressions: 1 dI94I29i = a + ΁WdI94I29i + βLINC29i + Îĩ29i 2 dI45I29i = a + ΁WdI45I29i + βLINC29i + Îĩ29i 3 dI94I46i = a + ΁WdI94I46i + βLINC46i + Îĩ46i Where i : 1, 2, 3, ....., .48 63 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 64.
    GeoDa: SAR -Outcomes:1994-29 SUMMARY OF OUTPUT: SPATIAL LAG MODEL - MAXIMUM LIKELIHOOD ESTIMATION Data set : nuevo Spatial Weight : nuevoq Dependent Variable : DIN94IN29 Number of Observations: 48 Mean dependent var : 3.61054 Number of Variables : 3 S.D. dependent var : 0.284673 Degrees of Freedom : 45 Lag coeff. (Rho) : 0.153427 R-squared : 0.924712 Log likelihood : 54.1372 Sq. Correlation : - Akaike info criterion : -102.274 Sigma-square : 0.00610122 Schwarz criterion : -96.6607 S.E of regression : 0.0781103 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error z-value Probability ---------------------------------------------------------------------- ------- W_DIN94IN29 0.153427 0.0776567 1.97571 0.04819 CONSTANT 7.21331 0.560658 12.8658 0.00000 LINC29 -0.655089 0.0491648 -13.3243 0.00000 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 1.1563 0.28223 DIAGNOSTICS FOR SPATIAL DEPENDENCE SPATIAL LAG DEPENDENCE FOR WEIGHT MATRIX : nuevoq TEST DF VALUE PROB Likelihood Ratio Test 1 3.7124 0.05401 ============================== END OF REPORT ================================ 64 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 65.
    GeoDa: SAR-Outcomes: 1945-29 SUMMARYOF OUTPUT: SPATIAL LAG MODEL - MAXIMUM LIKELIHOOD ESTIMATION Data set : nuevo Spatial Weight : nuevoq Dependent Variable : DIN45IN29 Number of Observations: 48 Mean dependent var : 0.674542 Number of Variables : 3 S.D. dependent var : 0.17481 Degrees of Freedom : 45 Lag coeff. (Rho) : 0.295355 R-squared : 0.865881 Log likelihood : 63.2943 Sq. Correlation : - Akaike info criterion : -120.589 Sigma-square : 0.00409851 Schwarz criterion : -114.975 S.E of regression : 0.0640196 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error z-value Probability ---------------------------------------------------------------------- ------- W_DIN45IN29 0.295355 0.0974027 3.0323 0.00243 CONSTANT 2.64263 0.300699 8.78829 0.00000 LINC29 -0.341486 0.0388813 -8.78278 0.00000 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 3.4138 0.06465 DIAGNOSTICS FOR SPATIAL DEPENDENCE SPATIAL LAG DEPENDENCE FOR WEIGHT MATRIX : nuevoq TEST DF VALUE PROB Likelihood Ratio Test 1 9.7755 0.00177 ============================== END OF REPORT ================================ 65 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 66.
    GeoDa:SAR-Outcomes: 1994-46 SUMMARY OFOUTPUT: SPATIAL LAG MODEL - MAXIMUM LIKELIHOOD ESTIMATION Data set : nuevo Spatial Weight : nuevoq Dependent Variable : DIN94IN46 Number of Observations: 48 Mean dependent var : 2.91979 Number of Variables : 3 S.D. dependent var : 0.163036 Degrees of Freedom : 45 Lag coeff. (Rho) : 0.350822 R-squared : 0.783057 Log likelihood : 54.8668 Sq. Correlation : - Akaike info criterion : -103.734 Sigma-square : 0.00576651 Schwarz criterion : -98.1199 S.E of regression : 0.0759375 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error z-value Probability ---------------------------------------------------------------------- ------- W_DIN94IN46 0.350822 0.11406 3.07577 0.00210 CONSTANT 5.02565 0.731279 6.87241 0.00000 LINC46 -0.445107 0.0651113 -6.8361 0.00000 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 4.2002 0.04042 DIAGNOSTICS FOR SPATIAL DEPENDENCE SPATIAL LAG DEPENDENCE FOR WEIGHT MATRIX : nuevoq TEST DF VALUE PROB Likelihood Ratio Test 1 9.4106 0.00216 ============================== END OF REPORT ================================ 66 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 67.
    GeoDa : SEM Seeslide 33 and R & M page 149, equation 8 Click on Regression Dependent variable/ growth rate (E.g. dI94I29 ) Independent variable/ initial income (E.g. LINC29 ) Weights īŦle/ Spatial error Three regressions: 1 dI94I29i = a + βLINC29i + Îĩ29i ; Îĩ29i = ÎģWÎĩ29i + Âĩ29i 2 dI45I29i = a + βLINC29i + Îĩ29i ; Îĩ29i = ÎģWÎĩ29i + Âĩ29i 3 dI94I46i = a + βLINC46i + Îĩ46i ; Îĩ46i = ÎģWÎĩ46i + Âĩ46i Where i : 1, 2, 3, ....., .48 67 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 68.
    GeoDa: SEM: 1994-29 SUMMARYOF OUTPUT: SPATIAL ERROR MODEL - MAXIMUM LIKELIHOOD ESTIMATION Data set : nuevo Spatial Weight : nuevoq Dependent Variable : DIN94IN29 Number of Observations: 48 Mean dependent var : 3.610542 Number of Variables : 2 S.D. dependent var : 0.284673 Degrees of Freedom : 46 Lag coeff. (Lambda) : 0.254318 R-squared : 0.922600 R-squared (BUSE) : - Sq. Correlation : - Log likelihood : 53.223613 Sigma-square : 0.00627236 Akaike info criterion : -102.447 S.E of regression : 0.0791982 Schwarz criterion : -98.7048 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error z-value Probability ---------------------------------------------------------------------- ------- CONSTANT 8.15606 0.231367 35.2516 0.00000 LINC29 -0.716327 0.0363589 -19.7016 0.00000 LAMBDA 0.254318 0.182314 1.39494 0.16303 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 0.2873 0.59194 DIAGNOSTICS FOR SPATIAL DEPENDENCE SPATIAL ERROR DEPENDENCE FOR WEIGHT MATRIX : nuevoq TEST DF VALUE PROB Likelihood Ratio Test 1 1.8853 0.16973 ============================== END OF REPORT ================================ 68 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 69.
    GeoDa: SEM: 1945-29 SUMMARYOF OUTPUT: SPATIAL ERROR MODEL - MAXIMUM LIKELIHOOD ESTIMATION Data set : nuevo Spatial Weight : nuevoq Dependent Variable : DIN45IN29 Number of Observations: 48 Mean dependent var : 0.674542 Number of Variables : 2 S.D. dependent var : 0.174810 Degrees of Freedom : 46 Lag coeff. (Lambda) : 0.580250 R-squared : 0.883041 R-squared (BUSE) : - Sq. Correlation : - Log likelihood : 64.766993 Sigma-square : 0.0035741 Akaike info criterion : -125.534 S.E of regression : 0.0597838 Schwarz criterion : -121.792 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error z-value Probability ---------------------------------------------------------------------- ------- CONSTANT 3.21562 0.216223 14.8718 0.00000 LINC29 -0.39988 0.0338626 -11.8089 0.00000 LAMBDA 0.58025 0.131908 4.3989 0.00001 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 3.0394 0.08127 DIAGNOSTICS FOR SPATIAL DEPENDENCE SPATIAL ERROR DEPENDENCE FOR WEIGHT MATRIX : nuevoq TEST DF VALUE PROB Likelihood Ratio Test 1 12.7210 0.00036 ============================== END OF REPORT ================================ 69 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 70.
    GeoDa:SEM: 1994-46 SUMMARY OFOUTPUT: SPATIAL ERROR MODEL - MAXIMUM LIKELIHOOD ESTIMATION Data set : nuevo Spatial Weight : nuevoq Dependent Variable : DIN94IN46 Number of Observations: 48 Mean dependent var : 2.919792 Number of Variables : 2 S.D. dependent var : 0.163036 Degrees of Freedom : 46 Lag coeff. (Lambda) : 0.433936 R-squared : 0.776745 R-squared (BUSE) : - Sq. Correlation : - Log likelihood : 53.732096 Sigma-square : 0.00593429 Akaike info criterion : -103.464 S.E of regression : 0.0770343 Schwarz criterion : -99.7218 ---------------------------------------------------------------------- ------- Variable Coefficient Std.Error z-value Probability ---------------------------------------------------------------------- ------- CONSTANT 6.64014 0.432628 15.3484 0.00000 LINC46 -0.529052 0.0613691 -8.62082 0.00000 LAMBDA 0.433936 0.157919 2.74785 0.00600 ---------------------------------------------------------------------- ------- REGRESSION DIAGNOSTICS DIAGNOSTICS FOR HETEROSKEDASTICITY RANDOM COEFFICIENTS TEST DF VALUE PROB Breusch-Pagan test 1 1.5605 0.21160 DIAGNOSTICS FOR SPATIAL DEPENDENCE SPATIAL ERROR DEPENDENCE FOR WEIGHT MATRIX : nuevoq TEST DF VALUE PROB Likelihood Ratio Test 1 7.1413 0.00753 ============================== END OF REPORT ================================ 70 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 71.
    GeoDa: SAR &SEM-Outcomes With these outputs, you should be able to complete R & M Table 3 You may īŦnd R2 s, AICs, βs, Îģs, ΁s and, p − values, Convergence rate(θ) is calculated using β (See slide 39) 71 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 72.
    Reporting SAR &SEM outomes Table: Spatial Dependence Models Model speciīŦcation AIC β Îģ, ΁ LM test (p-value) p-value p-value 1929-94 Spatial error (ML) -102.447 -0.716 (0.000) 0.163 0.169 Spatial lag (ML) -102.274 -0.655(0.000) 0.048 0.054 1929-45 Spatial error (ML) -125.534 -0.399(0.000) 0.000 0.000 Spatial lag (ML) -102.447 -0.341(0.000) 0.002 0.002 1946-94 Spatial error (ML) -103.464 -0.529(0.000) 0.006 0.008 Spatial lag (ML) -103.734 -0.445(0.000) 0.002 0.002 Convergence rate (θ) based on the spatial error (ML) estimates θ 1929-94 0.019 1929-45 0.032 1946-94 0.016 72 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 73.
    Interpretation of SEM& SAR outcomes For SEM, as expected, AIC indicates that the īŦt of each of the three spatial models is superior to OLS. βs are signiīŦcant and negative but diīŦ€erent from OLS coeīŦƒcients. OLS suīŦ€ers from a misspecication due to omitted spatial dependence. The coeīŦƒcients on error (Îģ) and lag(΁) terms are signiīŦcant in the sub-samples. For the full sample, just ΁ is signiīŦcant LM test indicates that there is not spatial dependence remaining in SAR and SEM. Including spatial dependence reduces the convergence rates (θs) Convergence rate over entire sample, 1.9% yearly but īŦrst sub sample, 3.2%, second sub sample 1.6% 73 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 74.
    Research paper Choose acountry (e.g. Canada, South Korea, Portugal, Mexico, etc.), īŦnd the historical data of the real per capita GDP (personal income, GSP(Gross State Product)) of its states/provinces/municipalities/regions, and, do an income convergence analysis about them (β - convergence and ΃ - convergence). In addition, you have to choose an event that was important for the economy of that country (e.g. a new constitution, an improvement in the power system, an strong devaluation of its currency, a natural disaster, a war, etc.), so that,you may choose "four years" like Rey and Montuori did. 74 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 75.
    Research paper Sections: introduction,literature, methodology (data), outcomes (explanations of them), and conclusions. Your scholarly work must report: Exploring β- convergence (Slide 48). Exploring Spatial Dependence (Slide 49). Moran scatterplots (Slide 51). Exploring data - ΃- convergence & Moran’s I (Slide 52). OLS outcomes, convergence rates, & spatial diagnostic tests (Table slide 58). SAR & SEM outcomes. Convergence rates (Table slide 72). 75 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App
  • 76.
    References 1 Anselin, L(2005). Exploring Spatial Data with GeoDaTM: A workbook 2 LeSage, J. and Pace R. K. (2009). “Introduction to Spatial Econometrics” Taylor & Francis„ Boca Raton 3 Rey S. J. and Montouri B. D. (1999) “US Regional Income Convergence: a Spatial Econometric Perspective”, Regional Studies 33 , 143-156. 4 Solow, R. M. (1956). “A Contribution to the Theory of Economic Growth” The Quarterly Journal of Economics, 70(1), 65-94. 76 / 76 Prepared by CÊsar R. Sobrino Regional Income Convergence: A Spatial Analysis App