Functional additive models explained
In statistics, functional additive models (FAM) can be viewed as extensions of generalized functional linear models where the linearity assumption between the response (scalar or functional) and the functional linear predictor is replaced by an additivity assumption.
Overview
Functional Additive Model
In these models, functional predictors (
) are paired with responses (
) that can be either scalar or functional. The response can follow a continuous or discrete distribution and this distribution may be in the
exponential family. In the latter case, there would be a canonical link that connects predictors and responses. Functional predictors (or responses) can be viewed as random trajectories generated by a square-integrable stochastic process. Using
functional principal component analysis and the
Karhunen-Loève expansion, these processes can be equivalently expressed as a countable sequence of their functional principal component scores (FPCs) and eigenfunctions. In the
FAM[1] the responses (scalar or functional) conditional on the predictor functions are modeled as function of the functional principal component scores of the predictor function in an additive structure. This model can be categorized as a Frequency Additive Model since it is additive in the predictor FPC scores.
Continuously Additive Model
The Continuously Additive Model (CAM)[2] assumes additivity in the time domain. The functional predictors are assumed to be smooth across the time domain since the times contained in an interval domain are an uncountable set, an unrestricted time-additive model is not feasible. This motivates to approximate sums of additive functions by integrals so that the traditional vector additive model be replaced by a smooth additive surface. CAM can handle generalized responses paired with multiple functional predictors.
Functional Generalized Additive Model
The Functional Generalized Additive Model (FGAM)[3] is an extension of generalized additive model with a scalar response and a functional predictor. This model can also deal with multiple functional predictors.The CAM and the FGAM are essentially equivalent apart from implementation details and therefore can be covered under one description. They can be categorized as Time-Additive Models.
Functional Additive Model
Model
Functional Additive Model for scalar and functional responses respectively, are given by
E(Y(t)\midX)=\muY(t)+
fkm(\xik)\psim(t),
where
and
are FPC scores of the processes
and
respectively,
and
are the eigenfunctions of processes
and
respectively, and
and
are arbitrary smooth functions.
To ensure identifiability one may require,
Efk(\xik)=0, k=1,2,\ldotsEfkm(\xik)=0,k=1,2,\ldotsm=1,2,\ldots
Implementation
The above model is considered under the assumption that the true FPC scores
for predictor processes are known. In general, estimation in the
generalized additive model requires
backfitting algorithm or smooth backfitting to account for the dependencies between predictors. Now FPCs are always uncorrelated and if the predictor processes are assumed to be
gaussian then the FPCs are independent. Then
E(Y-\muY|\xik)=E\{E(Y-\muY|X)|\xik\}=E\{\sum
fj(\xij)|\xik\}=fk(\xik),
similarly for functional responses
E(\zetam|\xik)=fkm(\xik),
This simplifies the estimation and requires only one-dimensional
smoothing of responses against individual predictor scores and will yield consistent estimates of
In data analysis one needs to estimate
before proceeding to infer the functions
and
, so there are errors in the predictors.
functional principal component analysis generates estimates
of
for individual predictor trajectories along with estimates for eigenfunctions, eigenvalues, mean functions and covariance functions. Different
smoothing methods can be applied to the data
\{\hat{\xi}ik,Yi\}i=1,...,n
and
\{\hat{\xi}ik,\hat{\zeta}im\}i=1,...,n
to estimate
and
respectively.
The fitted Functional Additive Model for scalar response is given by
| K |
\hat{E}(Y|X)=\bar{Y}+\sum | |
| k=1 |
\hat{fk}(\xik),
and the fitted Functional Additive Model for functional responses is by
\hat{E}(Y(t)|X)=\hat{\mu}Y(t)+\sum
\hat{f}km(\xik)\hat{\psi}m(t),t\in{T}
Note: The truncation points
and
need to be chosen data-adaptively. Possible methods include pseudo-
AIC, fraction of variance explained or minimization of prediction error or
cross-validation.
Extensions
For the case of multiple functional predictors with a scalar response, the Functional Additive Model can be extended by fitting a functional regression which is additive in the FPCs of each of the predictor processes
. The model considered here is Additive Functional Score Model (AFSM) given by
E(Y|X1,X2,...,Xd)=\sum
\sumkfjk(\xijk)
In case of multiple predictors the FPCs of different predictors are in general correlated and a smooth backfitting
[4] technique has been developed to obtain consistent estimates of the component functions
when the predictors are observed with errors having unknown distribution.
Continuously Additive Model
Model
Since the number of time points on an interval domain is uncountable, an unrestricted time-additive model
}f_t(X(t)) is not feasible. Thus a sequence of time-additive models is considered on an increasingly dense finite time grid
in
leading to
where
for a smooth bivariate function
with
(to ensure identifiability). In the limit
this becomes the continuously additive model
E(Y|X)=E(Y)+\limm\toinfty
j)\}=E(Y)+\intTg\{t,X(t)\}dt.
Special Cases
Generalized Functional Linear Model
For
g\{t,X(t)\}=\beta(t)\{X(t)-EX(t)\}
the model reduces to
generalized functional linear modelFunctional Transformation Model
For non-Gaussian predictor process,
g\{t,X(t)\}=\beta(t)[\zeta\{X(t)\}-E\zeta\{X(t)\},
where
is a smooth transformation of
reduces CAM to a Functional Transformation model.
Extensions
to the mean-response and applying a probability transformation
to
yields the FGAM given by
h(E(Y|X)=\theta0+\intTF[Gt\{Xi(t)\},t]dt,
where
is the intercept.
Note: For estimation and implementation see
Notes and References
- Müller and Yao. Functional Additive Models. Journal of the American Statistical Association. 2008. 103. 484. 1534–1544. 10.1198/016214508000000751. 1927777.
- Müller, Wu and Yao. Continuously Additive models for nonlinear functional regression. Biometrika. 2013. 100. 3. 607–622. 10.1093/biomet/ast004. 10.1.1.698.4344.
- McLean. etal. Functional Generalized additive models. Journal of Computational and Graphical Statistics. 2014. 23. 1. 249–269. 10.1080/10618600.2012.729985. 3982924. 24729671.
- Han, Müller and Park. Smooth Backfitting for Additive Modeling with Small Errors-in-Variables, with an Application to Additive Functional Regression for Multiple Predictor Functions. Bernoulli. 2017.