In mathematics, least squares function approximation applies the principle of least squares to function approximation, by means of a weighted sum of other functions. The best approximation can be defined as that which minimizes the difference between the original function and the approximation; for a least-squares approach the quality of the approximation is measured in terms of the squared differences between the two.
See also: Fourier series and Generalized Fourier series.
A generalization to approximation of a data set is the approximation of a function by a sum of other functions, usually an orthogonal set:[1]
f(x) ≈ fn(x)=a1\phi1(x)+a2\phi2(x)+ … +an\phin(x),
with the set of functions an orthonormal set over the interval of interest, : see also Fejér's theorem. The coefficients are selected to make the magnitude of the difference ||||2 as small as possible. For example, the magnitude, or norm, of a function over the can be defined by:[2]
\|g\|=
b | |
\left(\int | |
a |
g*(x)g(x)dx\right)1/2
where the ‘*’ denotes complex conjugate in the case of complex functions. The extension of Pythagoras' theorem in this manner leads to function spaces and the notion of Lebesgue measure, an idea of “space” more general than the original basis of Euclidean geometry. The satisfy orthonormality relations:[3]
b | |
\int | |
a |
\phi
* | |
i |
(x)\phij(x)dx=\deltaij,
where δij is the Kronecker delta. Substituting function into these equations then leads tothe n-dimensional Pythagorean theorem:[4]
2 | |
\|f | |
n\| |
=
2 | |
|a | |
1| |
+
2 | |
|a | |
2| |
+ … +
2. | |
|a | |
n| |
The coefficients making ||f − fn||2 as small as possible are found to be:[1]
aj=
b | |
\int | |
a |
\phi
* | |
j |
(x)f(x)dx.
The generalization of the n-dimensional Pythagorean theorem to infinite-dimensional  real inner product spaces is known as Parseval's identity or Parseval's equation.[5] Particular examples of such a representation of a function are the Fourier series and the generalized Fourier series.
It follows that one can find a "best" approximation of another function by minimizing the area between two functions, a continuous function
f
[a,b]
g\inW
W
C[a,b]
Area=
b | |
\int | |
a |
\left\vertf(x)-g(x)\right\vertdx,
W
b | |
\int | |
a |
[f(x)-g(x)]2dx
g
f
W
As such,
\lVertf-g\rVert2
\lVertf-g\rVert
b | |
\int | |
a |
[f(x)-g(x)]2dx=\left\langlef-g,f-g\right\rangle=\lVertf-g\rVert2.
In other words, the least squares approximation of
f
g\insubspaceW
f
\left\langlef,g\right\rangle
Let
f
[a,b]
W
C[a,b]
f
W
g=\left\langlef,\vecw1\right\rangle\vecw1+\left\langlef,\vecw2\right\rangle\vecw2+ … +\left\langlef,\vecwn\right\rangle\vecwn,
where
B=\{\vecw1,\vecw2,...,\vecwn\}
W