In statistics and econometrics, the maximum score estimator is a nonparametric estimator for discrete choice models developed by Charles Manski in 1975. Unlike the multinomial probit and multinomial logit estimators, it makes no assumptions about the distribution of the unobservable part of utility. However, its statistical properties (particularly its asymptotic distribution) are more complicated than the multinomial probit and logit models, making statistical inference difficult. To address these issues, Joel Horowitz proposed a variant, called the smoothed maximum score estimator.
When modelling discrete choice problems, it is assumed that the choice is determined by the comparison of the underlying latent utility.[1] Denote the population of the agents as T and the common choice set for each agent as C. For agent
t\inT
yt,i
t\inT
yt,i=1\leftrightarrowxt,i\beta+\epsilont,i>xt,j\beta+\epsilont,j,\forallj ≠ i
j\inC
where
xt,i
xt,j
\epsilont,i
\epsilont,j
xt,i
\beta
Usually some specific distribution assumption on the error term is imposed, such that the parameter
\beta
For example, suppose that C only contains two items. This is the latent utility representation[5] of a binary choice model. In this model, the choice is:
Yt=1[X1,t\beta+\varepsilon1>X2,t\beta+\varepsilon2]
X1,t,X2,t
\varepsilon1
\varepsilon2
X1,t\beta+\varepsilon1andX2,t\beta+\varepsilon2
are latent utility of choosing choice 1 and 2. Then the log likelihood function can be given as:
N | |
Q=\sum | |
i-1 |
Ytlog(P[X1,t\beta-X2,t\beta>\varepsilon2-\varepsilon1])+(1-Yt)log(1-P[X1,t\beta-X2,t\beta>\varepsilon2-\varepsilon1])
If some distributional assumption about the response error is imposed, then the log likelihood function will have a closed-form representation.[2] For instance, if the response error is assumed to be distributed as:
N(0,\sigma2)
N | |
Q=\sum | |
i-1 |
Ytlog\left(\Phi\left[
X1,t\beta-X2,t\beta | |
\surd2\sigma |
\right]\right)+(1-Yt)log\left(\Phi\left[
X2,t\beta-X1,t\beta | |
\surd2\sigma |
\right]\right)
where
\Phi
\Phi
This model is based on a distributional assumption about the response error term. Adding a specific distribution assumption into the model can make the model computationally tractable due to the existence of the closed-form representation. But if the distribution of the error term is misspecified, the estimates based on the distribution assumption will be inconsistent.
The basic idea of the distribution-free model is to replace the two probability term in the log-likelihood function with other weights. The general form of the log-likelihood function can written as:
Q=
N | |
\sum | |
i-1 |
Yt ⋅ log(W1(X1,t\beta,X2,t\beta))+(1-Yt)log(W0(X1,t\beta,X2,t\beta))
To make the estimator more robust to the distributional assumption, Manski (1975) proposed a non-parametric model to estimate the parameters. In this model, denote the number of the elements of the choice set as J, the total number of the agents as N, and
W(J-1)>W(J-2)>...>W(1)>W(0)
\hat{b}={\operatorname{argmax}}b
1 | |
N |
N | |
\sum | |
t=1 |
J | |
\sum | |
i=1 |
yt,iW(\sum\nolimitsj1[xt,ib>xt,jb])
Here,
style\sum\nolimitsj1(xt,ib>xt,jb)
Under certain conditions, the maximum score estimator can be weak consistent, but its asymptotic properties are very complicated.[7] This issue mainly comes from the non-smoothness of the objective function.
In the binary context, the maximum score estimator can be represented as:
W1(X1,t\beta,X2,t\beta)=w11[X1,t\beta-X2,t\beta>0]+w01[X1,t\beta-X2,t\beta<0],
where
W0(X1,t\beta,X2,t\beta)=1-W1(X1,t\beta,X2,t\beta)
and
w1
w0
Horowitz (1992) proposed a smoothed maximum score (SMS) estimator which has much better asymptotic properties.[8] The basic idea is to replace the non-smoothed weight function
styleW(\sum\nolimitsj1(xt,ib>xt,jb))
|K( ⋅ )|
\limu\toK(u)=0
\limu\toK(u)=1
K |
(u)=
K |
(-u)
Here, the kernel function is analogous to a CDF whose PDF is symmetric around 0. Then, the SMS estimator is defined as:
\hat{b}SMS={\operatorname{argmax}}b
1 | |
N |
N | |
\sum | |
t=1 |
J | |
\sum | |
i=1 |
yt,i\sum\nolimitsjK(Xt,ib-xt,jb/hN)
where
(hN,N=1,2,...)
\limN\tohN=0
. Jeffrey Wooldridge . 2002 . Econometric Analysis of Cross Section and Panel Data . registration . MIT Press . Cambridge, Mass . 457–460 . 978-0-262-23219-7 .