Dynamic discrete choice (DDC) models, also known as discrete choice models of dynamic programming, model an agent's choices over discrete options that have future implications. Rather than assuming observed choices are the result of static utility maximization, observed choices in DDC models are assumed to result from an agent's maximization of the present value of utility, generalizing the utility theory upon which discrete choice models are based.
The goal of DDC methods is to estimate the structural parameters of the agent's decision process. Once these parameters are known, the researcher can then use the estimates to simulate how the agent would behave in a counterfactual state of the world. (For example, how a prospective college student's enrollment decision would change in response to a tuition increase.)
Agent
n
V\left(xn0
\right)=max | |
\left\{dnt\right\ |
T} | |
t=1 |
E
T | |
\left(\sum | |
t\prime=t |
J | |
\sum | |
i=1 |
\betat'-t\left(dnt=i\right)Unit\left(xnt,\varepsilonnit\right)\right),
where
xnt
xn0
dnt
n
J
\beta\in\left(0,1\right)
Unit
n
i
t
xnt
\varepsilonnit
T
E\left( ⋅ \right)
xnt
\varepsilonnit
Unit
It is standard to impose the following simplifying assumptions and notation of the dynamic decision problem:
The flow utility can be written as an additive sum, consisting of deterministic and stochastic elements. The deterministic component can be written as a linear function of the structural parameters.
\begin{alignat}{5} Unit\left(xnt,\varepsilonnit\right)&& = &&unit&& + &&\varepsilonnit\\ && = &&Xnt\alphai&& + &&\varepsilonnit\end{alignat}
Define by
Vnt(xnt)
n
t
\varepsilonnt
Vnt(xnt)=Emaxi\left\{unit(xnt)+\varepsilonnit+\beta
\int | |
xt+1 |
Vnt+1(xnt+1)dF\left(xt+1\midxt\right)\right\}
where the expectation operator
E
\varepsilon
dF\left(xt+1\midxt\right)
xt+1
xt
It is possible to decompose
Vnt(xnt)
Vnt(xnt)=Emaxi\left\{vnit(xnt)+\varepsilonnit\right\}
where
vnit
i
t
vnit(xnt)=unit\left(xnt\right)+\beta
\int | |
xt+1 |
Emaxj\left\{vnjt+1(xnt+1)+\varepsilonnjt+1\right\}dF(xt+1\midxt)
where now the expectation
E
\varepsilonnjt+1
The states
xt
xt
xt-1
xt-2
The value function in the previous section is called the conditional value function, because it is the value function conditional on choosing alternative
i
t
To write down the choice probabilities, the researcher must make an assumption about the distribution of the
\varepsilonnit
For the case where
\varepsilonnit
Pnit=
\exp(vnit) | |||||||||
|
Estimation of dynamic discrete choice models is particularly challenging, due to the fact that the researcher must solve the backwards recursion problem for each guess of the structural parameters.
The most common methods used to estimate the structural parameters are maximum likelihood estimation and method of simulated moments.
Aside from estimation methods, there are also solution methods. Different solution methods can be employed due to complexity of the problem. These can be divided into full-solution methods and non-solution methods.
The foremost example of a full-solution method is the nested fixed point (NFXP) algorithm developed by John Rust in 1987.The NFXP algorithm is described in great detail in its documentation manual.[1]
A recent work by Che-Lin Su and Kenneth Judd in 2012[2] implements another approach (dismissed as intractable by Rust in 1987), which uses constrained optimization of the likelihood function, a special case of mathematical programming with equilibrium constraints (MPEC).Specifically, the likelihood function is maximized subject to the constraints imposed by the model, and expressed in terms of the additional variables that describe the model's structure. This approach requires powerful optimization software such as Artelys Knitro because of the high dimensionality of the optimization problem.Once it is solved, both the structural parameters that maximize the likelihood, and the solution of the model are found.
In the later article[3] Rust and coauthors show that the speed advantage of MPEC compared to NFXP is not significant. Yet, because the computations required by MPEC do not rely on the structure of the model, its implementation is much less labor intensive.
Despite numerous contenders, the NFXP maximum likelihood estimator remains the leading estimation methodfor Markov decision models.[3]
An alternative to full-solution methods is non-solution methods. In this case, the researcher can estimate the structural parameters without having to fully solve the backwards recursion problem for each parameter guess. Non-solution methods are typically faster while requiring more assumptions, but the additional assumptions are in many cases realistic.
The leading non-solution method is conditional choice probabilities, developed by V. Joseph Hotz and Robert A. Miller.[4]
The bus engine replacement model developed in the seminal paper is one of the first dynamic stochastic models of discrete choice estimated using real data, and continues to serve as classical example of the problems of this type.[2]
The model is a simple regenerative optimal stopping stochastic dynamic problem faced by the decision maker, Harold Zurcher, superintendent of maintenance at the Madison Metropolitan Bus Company in Madison, Wisconsin. For every bus in operation in each time period Harold Zurcher has to decide whether to replace the engine and bear the associated replacement cost, or to continue operating the bus at an ever raising cost of operation, which includes insurance and the cost of lost ridership in the case of a breakdown.
Let
xt
t
c(xt,\theta)
\theta
RC
\beta
U(xt,\xit,d,\theta)=\begin{cases} -c(xt,\theta)+\xit,keep,&\\ -RC-c(0,\theta)+\xit,replace,& \end{cases} = u(xt,d,\theta)+ \begin{cases} \xit,keep,&rm{if} d=keep,\\ \xit,replace,&rm{if} d=replace, \end{cases}
where
d
\xit,keep
\xit,replace
\xit,keep
\xit,replace
\xit,\bullet
\xit-1,\bullet
xt
Then the optimal decisions satisfy the Bellman equation
V(x,\xi,\theta)=maxd=keep,replace\left\{u(x,d,\theta)+\xid+\iintV(x',\xi',\theta)q(d\xi'\midx',\theta)p(dx'\midx,d,\theta)\right\}
where
p(dx'\midx,d,\theta)
q(d\xi'\midx',\theta)
Given the distributional assumption on
q(d\xi'\midx',\theta)
d
P(d\midx,\theta)=
\exp\{u(x,d,\theta)+\betaEV(x,d,\theta)\ | |
where
EV(x,d,\theta)
EV(x,d,\theta)=\int\left[log\left(\sumd=keep,replace\exp\{u(x,d',\theta)+\betaEV(x',d',\theta)\}\right)\right]p(x'\midx,d,\theta).
It can be shown that the latter functional equation defines a contraction mapping if the state space
xt
EV(x,d,\theta)
\theta
EV(x,d,\theta)
\theta
(x,d)
The contraction mapping above can be solved numerically for the fixed point
EV(x,d,\theta)
P(d\midx,\theta)
\theta
L(\theta)=
N | |
\sum | |
i=1 |
Ti | |
\sum | |
t=1 |
log(P(dit\midxit,\theta))+log(p(xit\midxit-1,dit-1,\theta)),
where
xi,t
di,t
i=1,...,N
t=1,...,Ti
The joint algorithm for solving the fixed point problem given a particular value of parameter
\theta
L(\theta)
\theta
Rust's implementation of the nested fixed point algorithm is highly optimized for this problem, using Newton–Kantorovich iterations to calculate
P(d\midx,\theta)
In the nested fixed point algorithm,
P(d\midx,\theta)
\begin{align} max& L(\theta)&\\ subjectto& EV(x,d,\theta)=\int\left[log\left(\sumd=keep,replace\exp\{u(x,d',\theta)+\betaEV(x',d',\theta)\}\right)\right]p(x'\midx,d,\theta) \end{align}
This method is faster to compute than non-optimized implementations of the nested fixed point algorithm, and takes about as long as highly optimized implementations.[3]
The conditional choice probabilities method of Hotz and Miller can be applied in this setting. Hotz, Miller, Sanders, and Smith proposed a computationally simpler version of the method, and tested it on a study of the bus engine replacement problem. The method works by estimating conditional choice probabilities using simulation, then backing out the implied differences in value functions.[5]