Functional principal component analysis (FPCA) is a statistical method for investigating the dominant modes of variation of functional data. Using this method, a random function is represented in the eigenbasis, which is an orthonormal basis of the Hilbert space L2 that consists of the eigenfunctions of the autocovariance operator. FPCA represents functional data in the most parsimonious way, in the sense that when using a fixed number of basis functions, the eigenfunction basis explains more variation than any other basis expansion. FPCA can be applied for representing random functions,[1] or in functional regression[2] and classification.
For a square-integrable stochastic process X(t), t ∈, let
\mu(t)=E(X(t))
G(s,t)=Cov(X(s),X(t))=
infty | |
\sum | |
k=1 |
λk\varphik(s)\varphik(t),
λ1\geqλ2\geq...\geq0
\varphi1
\varphi2
G:L2(l{T}) → L2(l{T}),G(f)=\intl{T}G(s,t)f(s)ds.
By the Karhunen–Loève theorem, one can express the centered process in the eigenbasis,
X(t)-\mu(t)=
infty | |
\sum | |
k=1 |
\xik\varphik(t),
\xik=\intl{T}(X(t)-\mu(t))\varphik(t)dt
\varphik
E(\xik)=0,Var(\xik)=λkandE(\xik\xil)=0fork\nel.
The centered process is then equivalent to ξ1, ξ2, .... A common assumption is that X can be represented by only the first few eigenfunctions (after subtracting the mean function), i.e.
X(t) ≈ Xm(t)=\mu(t)+
m | |
\sum | |
k=1 |
\xik\varphik(t),
E\left(\intl{T
The first eigenfunction
\varphi1
\varphi1=\underset{\Vert\varphi\Vert=1}{\operatorname{argmax}}\left\{\operatorname{Var}(\intl{T}(X(t)-\mu(t))\varphi(t)dt)\right\},
\Vert\varphi\Vert=\left(
2 | |
\int | |
l{T}\varphi(t) |
dt
| ||||
\right) |
.
The k-th eigenfunction
\varphik
\varphi1
\varphi2
\varphik-1
\varphik=\underset{\Vert\varphi\Vert=1,\langle\varphi,\varphij\rangle=0forj=1,...,k-1}{\operatorname{argmax}}\left\{\operatorname{Var}(\intl{T}(X(t)-\mu(t))\varphi(t)dt)\right\},
\langle\varphi,\varphij\rangle=\intl{T}\varphi(t)\varphij(t)dt,forj=1,...,k-1.
Let Yij = Xi(tij) + εij be the observations made at locations (usually time points) tij, where Xi is the i-th realization of the smooth stochastic process that generates the data, and εij are identically and independently distributed normal random variable with mean 0 and variance σ2, j = 1, 2, ..., mi. To obtain an estimate of the mean function μ(tij), if a dense sample on a regular grid is available, one may take the average at each location tij:
\hat{\mu}(tij)=
1 | |
n |
n | |
\sum | |
i=1 |
Yij.
Then the estimate of the covariance function
\hat{G}(s,t)
Gi(tij,til)=(Yij-\hat{\mu}(tij))(Yil-\hat{\mu}(til)),j ≠ l,i=1,...,n.
Note that the diagonal elements of Gi should be removed because they contain measurement error.[4]
In practice,
\hat{G}(s,t)
\hat{\varphi}k
\hat{vk}.
The fitted covariance should be positive definite and symmetric and is then obtained as
\tilde{G}(s,t)=
\sum | |
λk>0 |
\hat{λ}k\hat{\varphi}k(s)\hat{\varphi}k(t).
Let
\hat{V}(t)
\hat{V}(t)
\hat{\sigma}2=
2 | |
|l{T |
|}\intl{T
\hat{\sigma}2>0;
\hat{\sigma}2=0.
If the observations Xij, j=1, 2, ..., mi are dense in, then the k-th FPC ξk can be estimated by numerical integration, implementing
\hat{\xi}k=\langleX-\hat{\mu},\hat{\varphi}k\rangle.
However, if the observations are sparse, this method will not work. Instead, one can use best linear unbiased predictors,[3] yielding
\hat{\xi}k=\hat{λ}k
T | |
\hat{\varphi} | |
k |
-1 | |
\hat{\Sigma} | |
Yi |
(Yi-\hat{\mu}),
\hat{\Sigma} | |
Yi |
=\tilde{G}+\hat{\sigma}2
I | |
mi |
\tilde{G}
Asymptotic convergence properties of these estimates have been investigated.[3] [8] [9]
FPCA can be applied for displaying the modes of functional variation,[1] [10] in scatterplots of FPCs against each other or of responses against FPCs, for modeling sparse longitudinal data,[3] or for functional regression and classification (e.g., functional linear regression).[2] Scree plots and other methods can be used to determine the number of components included. Functional Principal component analysis has varied applications in time series analysis. At present, this method is being adapted from traditional multivariate techniques to analyze financial data sets such as stock market indices and generate implied volatility graphs.[11] A good example of advantages of the functional approach is the Smoothed FPCA (SPCA), developed by Silverman [1996] and studied by Pezzulli and Silverman [1993], that enables direct combination of FPCA along with a general smoothing approach that makes using the information stored in some linear differential operators possible. An important application of the FPCA already known from multivariate PCA is motivated by the Karhunen-Loève decomposition of a random function to the set of functional parameters – factor functions and corresponding factor loadings (scalar random variables). This application is much more important than in the standard multivariate PCA since the distribution of the random function is in general too complex to be directly analyzed and the Karhunen-Loève decomposition reduces the analysis to the interpretation of the factor functions and the distribution of scalar random variables. Due to dimensionality reduction as well as its accuracy to represent data, there is a wide scope for further developments of functional principal component techniques in the financial field.
Applications of PCA in automotive engineering.[12] [13] [14] [15]
The following table shows a comparison of various elements of principal component analysis (PCA) and FPCA. The two methods are both used for dimensionality reduction. In implementations, FPCA uses a PCA step.
However, PCA and FPCA differ in some critical aspects. First, the order of multivariate data in PCA can be permuted, which has no effect on the analysis, but the order of functional data carries time or space information and cannot be reordered. Second, the spacing of observations in FPCA matters, while there is no spacing issue in PCA. Third, regular PCA does not work for high-dimensional data without regularization, while FPCA has a built-in regularization due to the smoothness of the functional data and the truncation to a finite number of included components.
Element | In PCA | In FPCA | ||||||
---|---|---|---|---|---|---|---|---|
Data | X\inRp | X\inL2(l{T}) | ||||||
Dimension | p<infty | infty | ||||||
Mean | \mu=E(X) | \mu(t)=E(X(t)) | ||||||
Covariance | Cov(X)=\Sigmap | Cov(X(s),X(t))=G(s,t) | ||||||
Eigenvalues | λ1,λ2,...,λp | λ1,λ2,... | ||||||
Eigenvectors/Eigenfunctions | v1,v2,...,vp | \varphi1(t),\varphi2(t),... | ||||||
Inner Product | \langleX,Y\rangle=
XkYk | \langleX,Y\rangle=\intl{T}X(t)Y(t)dt | ||||||
Principal Components | zk=\langleX-\mu,
\rangle,k=1,2,...,p | \xik=\langleX-\mu,\varphik\rangle,k=1,2,... |