In econometrics, the Arellano–Bond estimator is a generalized method of moments estimator used to estimate dynamic models of panel data. It was proposed in 1991 by Manuel Arellano and Stephen Bond,[1] based on the earlier work by Alok Bhargava and John Denis Sargan in 1983, for addressing certain endogeneity problems.[2] The GMM-SYS estimator is a system that contains both the levels and the first difference equations. It provides an alternative to the standard first difference GMM estimator.
Unlike static panel data models, dynamic panel data models include lagged levels of the dependent variable as regressors. Including a lagged dependent variable as a regressor violates strict exogeneity, because the lagged dependent variable is likely to be correlated with the random effects and/or the general errors. The Bhargava-Sargan article developed optimal linear combinations of predetermined variables from different time periods, provided sufficient conditions for identification of model parameters using restrictions across time periods, and developed tests for exogeneity for a subset of the variables. When the exogeneity assumptions are violated and correlation pattern between time varying variables and errors may be complicated, commonly used static panel data techniques such as fixed effects estimators are likely to produce inconsistent estimators because they require certain strict exogeneity assumptions.
Anderson and Hsiao (1981) first proposed a solution by utilising instrumental variables (IV) estimation.[3] However, the Anderson–Hsiao estimator is asymptotically inefficient, as its asymptotic variance is higher than the Arellano–Bond estimator, which uses a similar set of instruments, but uses generalized method of moments estimation rather than instrumental variables estimation.
In the Arellano–Bond method, first difference of the regression equation are taken to eliminate the individual effects. Then, deeper lags of the dependent variable are used as instruments for differenced lags of the dependent variable (which are endogenous).
In traditional panel data techniques, adding deeper lags of the dependent variable reduces the number of observations available. For example, if observations are available at T time periods, then after first differencing, only T-1 lags are usable. Then, if K lags of the dependent variable are used as instruments, only T-K-1 observations are usable in the regression. This creates a trade-off: adding more lags provides more instruments, but reduces the sample size. The Arellano–Bond method circumvents this problem.
Consider the static linear unobserved effects model for
N
T
yit=Xit\beta+\alphai+uit
t=1,\ldots,T
i=1,\ldots,N
yit
i
t,
Xit
1 x k
\alphai
uit
Xit
\alphai
\alphai
Unlike a static panel data model, a dynamic panel model also contains lags of the dependent variable as regressors, accounting for concepts such as momentum and inertia. In addition to the regressors outlined above, consider a case where one lag of the dependent variable is included as a regressor,
yit-1
yit=Xit\beta+\rhoyit-1+\alphai+uitfort=2,\ldots,Tandi=1,\ldots,N
Taking the first difference of this equation to eliminate the individual effect,
\Deltayit=yit-yit-1=\DeltaXit\beta+\rho\Delta yit-1+\Deltauitfort=3,\ldots,Tandi=1,\ldots,N.
Note that if
\alphai
\Deltay=\DeltaR\pi+\Deltau.
Applying the formula for the Efficient Generalized Method of Moments Estimator, which is,
\piEGMM=[\DeltaR'Z(Z'\OmegaZ)-1Z'\DeltaR]-1\DeltaR'Z(Z'\OmegaZ)-1Z'\Deltay
where
Z
\DeltaR
The matrix
\Omega
uit
The original Anderson and Hsiao (1981) IV estimator uses the following moment conditions:
E(yit-I\Deltauit)=0withI\ge2foreacht\ge3.
Using the single instrument
yit-2
Zdi
Zdi=\begin{bmatrix}NA&(t=2)\\yi1&(t=3)\\yi2&(t=4)\\\vdots&\vdots\\yT-2&(t=T)\end{bmatrix}
Note: The first possible observation is t = 2 due to the first difference transformation
The instrument
yit-2
yit-2
t=2
t=2
Using an additional instrument
yit-3
Zdi
t=3
While adding additional instruments increases the efficiency of the IV estimator, the smaller sample size decreases efficiency. This is the efficiency - sample size trade-off.
The Arellano-bond estimator addresses this trade-off by using time-specific instruments.
The Arellano–Bond estimator uses the following moment conditions
E(yit-I\Deltauit)=0fort\ge3,I\ge2.
Using these moment conditions, the instrument matrix
Zdi
Zdi=\begin{bmatrix}yi1&0&0&0&0&0& … \\0&yi2&yi1&0&0&0& … \\0&0&0&yi3&yi2&yi1& … \ \vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\ddots\end{bmatrix}
Note that the number of moments is increasing in the time period: this is how the efficiency - sample size tradeoff is avoided. Time periods further in the future have more lags available to use as instruments.
Then if one defines:
\Deltaui=\begin{bmatrix}\Deltaui3\ \Deltaui4\ \Deltaui5\ \vdots\end{bmatrix}
The moment conditions can be summarized as:
T | |
E(Z | |
di |
\Deltaui)=0
These moment conditions are only valid when the error term
uit
uit
uit-s
\leq
uit
yit
When the variance of the individual effect term across individual observations is high, or when the stochastic process
yit
Blundell and Bond (1998) derived a condition under which it is possible to use an additional set of moment conditions.[4] These additional moment conditions can be used to improve the small sample performance of the Arellano–Bond estimator. Specifically, they advocated using the moment conditions:
\operatorname{E}(\Deltayit-1(\alphai+uit))=0fort\geq3
These additional moment conditions are valid under conditions provided in their paper. In this case, the full set of moment conditions can be written:
T | |
\operatorname{E}(Z | |
SYS,i |
Pi)=0
where
Pi=\begin{pmatrix}\Deltaui\ ui3\ ui4\ ui5\ \vdots\end{pmatrix}
and
ZSYS,i=\begin{pmatrix}Zdi&0&0&0\ 0&\Deltayi2&0&0\ 0&0&\Deltayi3&0\ 0&0&0&\ddots\end{pmatrix}.
This method is known as system GMM. Note that the consistency and efficiency of the estimator depends on validity of the assumption that the errors can be decomposed as in equation (1). This assumption can be tested in empirical applications and likelihood ratio test often reject the simple random effects decomposition.
plm
package.[5] [6] [7]xtabond
and xtabond2
return Arellano–Bond estimators.[8] [9]