Functional additive models explained

In statistics, functional additive models (FAM) can be viewed as extensions of generalized functional linear models where the linearity assumption between the response (scalar or functional) and the functional linear predictor is replaced by an additivity assumption.

Overview

Functional Additive Model

In these models, functional predictors (

) are paired with responses (

) that can be either scalar or functional. The response can follow a continuous or discrete distribution and this distribution may be in the exponential family. In the latter case, there would be a canonical link that connects predictors and responses. Functional predictors (or responses) can be viewed as random trajectories generated by a square-integrable stochastic process. Using functional principal component analysis and the Karhunen-Loève expansion, these processes can be equivalently expressed as a countable sequence of their functional principal component scores (FPCs) and eigenfunctions. In the FAM^[1] the responses (scalar or functional) conditional on the predictor functions are modeled as function of the functional principal component scores of the predictor function in an additive structure. This model can be categorized as a Frequency Additive Model since it is additive in the predictor FPC scores.

Continuously Additive Model

The Continuously Additive Model (CAM)^[2] assumes additivity in the time domain. The functional predictors are assumed to be smooth across the time domain since the times contained in an interval domain are an uncountable set, an unrestricted time-additive model is not feasible. This motivates to approximate sums of additive functions by integrals so that the traditional vector additive model be replaced by a smooth additive surface. CAM can handle generalized responses paired with multiple functional predictors.

Functional Generalized Additive Model

The Functional Generalized Additive Model (FGAM)^[3] is an extension of generalized additive model with a scalar response and a functional predictor. This model can also deal with multiple functional predictors.The CAM and the FGAM are essentially equivalent apart from implementation details and therefore can be covered under one description. They can be categorized as Time-Additive Models.

Functional Additive Model

Model

Functional Additive Model for scalar and functional responses respectively, are given by

E(Y\midX)=\mu_Y+

	infty
\sum
	k=1

f_k(\xi_k)

E(Y(t)\midX)=\mu_Y(t)+

	infty
\sum
	k=1

	infty
\sum
	m=1

f_km(\xi_k)\psi_m(t),

where

\xi_k

and

\zeta_m

are FPC scores of the processes

and

respectively,

\phi_k

and

\psi_m

are the eigenfunctions of processes

and

respectively, and

f_k

and

f_km

are arbitrary smooth functions.

To ensure identifiability one may require,

Ef_k(\xi_k)=0, k=1,2,\ldotsEf_km(\xi_k)=0,k=1,2,\ldotsm=1,2,\ldots

Implementation

The above model is considered under the assumption that the true FPC scores

\xi_k

for predictor processes are known. In general, estimation in the generalized additive model requires backfitting algorithm or smooth backfitting to account for the dependencies between predictors. Now FPCs are always uncorrelated and if the predictor processes are assumed to be gaussian then the FPCs are independent. Then

E(Y-\mu_Y|\xi_{k)=E\{E(Y-\mu}_Y|X)|\xi_k\}=E\{\sum

	infty

	j=1

f_j(\xi_j)|\xi_k\}=f_k(\xi_k),

similarly for functional responses

E(\zeta_m|\xi_k)=f_km(\xi_k),

This simplifies the estimation and requires only one-dimensional smoothing of responses against individual predictor scores and will yield consistent estimates of

f_j.

In data analysis one needs to estimate

\xi_k

before proceeding to infer the functions

f_k

and

f_km

, so there are errors in the predictors. functional principal component analysis generates estimates

\hat{\xi_k}

\xi_k

for individual predictor trajectories along with estimates for eigenfunctions, eigenvalues, mean functions and covariance functions. Different smoothing methods can be applied to the data

\{\hat{\xi}_ik,Y_i\}_i=1,...,n

and

\{\hat{\xi}_ik,\hat{\zeta}_im\}_i=1,...,n

to estimate

f_k

and

f_km

respectively.

The fitted Functional Additive Model for scalar response is given by

	K
\hat{E}(Y\|X)=\bar{Y}+\sum
	k=1

\hat{f_k}(\xi_k),

and the fitted Functional Additive Model for functional responses is by

\hat{E}(Y(t)|X)=\hat{\mu}_Y(t)+\sum

	M

	m=1

	K
\sum
	k=1

\hat{f}_km(\xi_k)\hat{\psi}_m(t),t\in{T}

Note: The truncation points

and

need to be chosen data-adaptively. Possible methods include pseudo-AIC, fraction of variance explained or minimization of prediction error or cross-validation.

Extensions

For the case of multiple functional predictors with a scalar response, the Functional Additive Model can be extended by fitting a functional regression which is additive in the FPCs of each of the predictor processes

X_j,j=1,...,d

. The model considered here is Additive Functional Score Model (AFSM) given by

E(Y|X_1,X_2,...,X_d)=\sum

	d

	j=1

\sum_kf_jk(\xi_jk)

In case of multiple predictors the FPCs of different predictors are in general correlated and a smooth backfitting^[4] technique has been developed to obtain consistent estimates of the component functions

f_jk

when the predictors are observed with errors having unknown distribution.

Continuously Additive Model

Model

Since the number of time points on an interval domain is uncountable, an unrestricted time-additive model

E(Y|X)=\sum_\{t\in[0,T]\

}f_t(X(t)) is not feasible. Thus a sequence of time-additive models is considered on an increasingly dense finite time grid

t_1,t_2,...,t_m

leading to

E(Y|X(t₁),...,X(t_m))=

	mf
E(Y)+\sum
	j(X


	t_j

)

where

f_{j( ⋅ )=g(t}_{j, ⋅ )}

for a smooth bivariate function

with

E(\{g(t_j,X(t_j)\})=0

(to ensure identifiability). In the limit

m → infty

this becomes the continuously additive model

E(Y|X)=E(Y)+\lim_m\toinfty

	1
	m

	mg\{t
\sum
	j,X(t

_{j)\}=E(Y)+\int}_Tg\{t,X(t)\}dt.

Special Cases

Generalized Functional Linear Model

For

g\{t,X(t)\}=\beta(t)\{X(t)-EX(t)\}

the model reduces to generalized functional linear model

Functional Transformation Model

For non-Gaussian predictor process,

g\{t,X(t)\}=\beta(t)[\zeta\{X(t)\}-E\zeta\{X(t)\},

where

\zeta

is a smooth transformation of

X(t)

reduces CAM to a Functional Transformation model.

Extensions

to the mean-response and applying a probability transformation

G_t

X(t)

yields the FGAM given by

h(E(Y|X)=\theta_0+\int_TF[G_t\{X_i(t)\},t]dt,

where

\theta₀

is the intercept.
Note: For estimation and implementation see

Notes and References

Müller and Yao. Functional Additive Models. Journal of the American Statistical Association. 2008. 103. 484. 1534–1544. 10.1198/016214508000000751. 1927777.
Müller, Wu and Yao. Continuously Additive models for nonlinear functional regression. Biometrika. 2013. 100. 3. 607–622. 10.1093/biomet/ast004. 10.1.1.698.4344.
McLean. etal. Functional Generalized additive models. Journal of Computational and Graphical Statistics. 2014. 23. 1. 249–269. 10.1080/10618600.2012.729985. 3982924. 24729671.
Han, Müller and Park. Smooth Backfitting for Additive Modeling with Small Errors-in-Variables, with an Application to Additive Functional Regression for Multiple Predictor Functions. Bernoulli. 2017.