In applied statistics, fractional models are, to some extent, related to binary response models. However, instead of estimating the probability of being in one bin of a dichotomous variable, the fractional model typically deals with variables that take on all possible values in the unit interval. One can easily generalize this model to take on values on any other interval by appropriate transformations.[1] Examples range from participation rates in 401(k) plans[2] to television ratings of NBA games.[3]
There have been two approaches to modeling this problem. Even though they both rely on an index that is linear in combined with a link function,[4] this is not strictly necessary. The first approach uses a log-odds transformation of as a linear function of, i.e.,
\operatorname{logit}y=log
y | |
1-y |
=x\beta
\operatornameE[y\lorx]=
\exp(x\beta) | |
1+\exp(x\beta) |
It immediately becomes clear that this set up is very similar to the binary logit model, with that difference that the variable can actually take on values in the unit interval. Many of the estimation techniques for the binary logit model, such as non-linear least squares and quasi-MLE, carry over in a natural way, just like heteroskedasticity adjustments and partial effects calculations.[5]
Extensions to this cross-sectional model have been provided that allow for taking into account important econometric issues, such as endogenous explanatory variables and unobserved heterogeneous effects. Under strict exogeneity assumptions, it is possible to difference out these unobserved effects using panel data techniques, although weaker exogeneity assumptions can also result in consistent estimators.[6] Control function techniques to deal with endogeneity concerns have also been proposed.[7]