Mixed logit is a fully general statistical model for examining discrete choices. It overcomes three important limitations of the standard logit model by allowing for random taste variation across choosers, unrestricted substitution patterns across choices, and correlation in unobserved factors over time.[1] Mixed logit can choose any distribution
f
f
The standard logit model's "taste" coefficients, or
\beta
\beta
\beta
In the standard logit model, the utility of person
n
i
Uni=\betaxni+\varepsilonni
with
\varepsilonni
For the mixed logit model, this specification is generalized by allowing
\betan
n
i
Uni=\betanxni+\varepsilonni
with
\varepsilonni
\betan\simf(\beta|\theta)
where θ are the parameters of the distribution of
\betan
\betan
Conditional on
\betan
n
i
Lni(\betan)=
| ||||||||||
|
\betan
\betan
Pni=\intLni(\beta)f(\beta|\theta)d\beta
This model is also called the random coefficient logit model since
\betan
Any probability density function can be specified for the distribution of the coefficients in the population, i.e., for
f(\beta|\theta)
Sb
The mixed logit model can represent general substitution pattern because it does not exhibit logit's restrictive independence of irrelevant alternatives (IIA) property. The percentage change in person
n
i
j
Pni
m | |
x | |
nj |
Elasticity | |||||||||||||
|
=-
| |||||||
Pni |
\int\betamLni(\beta)Lnj(\beta)f(\beta)d\beta=-
m | |
x | |
nj |
\int\betamLnj(\beta)
Lni(\beta) | |
Pni |
f(\beta)d\beta
where
\betam
\beta
Pni
Pnj
n
i,Lni,
n
j,Lnj,
\beta
Standard logit does not take into account any unobserved factors that persist over time for a given decision maker. This can be a problem if you are using panel data, which represent repeated choices over time. By applying a standard logit model to panel data you are making the assumption that the unobserved factors that affect a person's choice are new every time the person makes the choice. That is a very unlikely assumption. To take into account both random taste variation and correlation in unobserved factors over time, the utility for respondent n for alternative i at time t is specified as follows:
Unit=\betanXnit+\varepsilonnit
where the subscript t is the time dimension. We still make the logit assumption which is that
\varepsilon
\varepsilon
\varepsilon
\beta
To examine the correlation explicitly, assume that the βs are normally distributed with mean
\bar{\beta}
\sigma2
Unit=(\bar{\beta}+\sigmaηn)Xnit+\varepsilonnit
and η is a draw from the standard normal density. Rearranging, the equation becomes:
Unit=\bar{\beta}Xnit+(\sigmaηnXnit+\varepsilonnit)
Unit=\bar{\beta}Xnit+enit
where the unobserved factors are collected in
enit=\sigmaηnXnit+\varepsilonnit
\varepsilonnit
\sigmaηnXnit
Then the covariance between alternatives
i
j
Cov(enit,enjt)=\sigma2(XnitXnjt)
and the covariance between time
t
q
Cov(enit,eniq)=\sigma2(XnitXniq)
By specifying the X's appropriately, one can obtain any pattern of covariance over time and alternatives.
Conditional on
\betan
Ln(\betan)=\prodt
| ||||||||||
|
since
\varepsilonnit
\beta
Pni=\intLn(\beta)f(\beta|\theta)d\beta
Unfortunately there is no closed form for the integral that enters the choice probability, and so the researcher must simulate Pn. Fortunately for the researcher, simulating Pn can be very simple. There are four basic steps to follow
1. Take a draw from the probability density function that you specified for the 'taste' coefficients. That is, take a draw from
f(\beta|\theta)
\betar
r=1
2. Calculate
r) | |
L | |
n(\beta |
3. Repeat many times, for
r=2,...,R
4. Average the results
Then the formula for the simulation look like the following,
\tilde{P}ni=
\sumrLni(\betar) | |
R |
where R is the total number of draws taken from the distribution, and r is one draw.
Once this is done you will have a value for the probability of each alternative i for each respondent n.