Galerkin method explained

In mathematics, in the area of numerical analysis, Galerkin methods are a family of methods for converting a continuous operator problem, such as a differential equation, commonly in a weak formulation, to a discrete problem by applying linear constraints determined by finite sets of basis functions. They are named after the Soviet mathematician Boris Galerkin.

Often when referring to a Galerkin method, one also gives the name along with typical assumptions and approximation methods used:

- The Ritz–Galerkin method (after Walther Ritz) typically assumes a symmetric and positive definite bilinear form in the weak formulation, where the differential equation for a physical system can be formulated via minimization of a quadratic function representing the system energy and the approximate solution is a linear combination of the given set of basis functions.^[1]
- The Bubnov–Galerkin method (after Ivan Bubnov) does not require the bilinear form to be symmetric and substitutes the energy minimization with orthogonality constraints determined by the same basis functions that are used to approximate the solution. In an operator formulation of the differential equation, the Bubnov–Galerkin method can be viewed as applying an orthogonal projection to the operator.
- The Petrov–Galerkin method (after Georgii I. Petrov^[2]) allows using basis functions for orthogonality constraints (called test basis functions) that are different from the basis functions used to approximate the solution. The Petrov–Galerkin method can be viewed as an extension of the Bubnov–Galerkin method, applying a projection that is not necessarily orthogonal in the operator formulation of the differential equation.

Examples of Galerkin methods are:

- the Galerkin method of weighted residuals, the most common method of calculating the global stiffness matrix in the finite element method,^[3] ^[4]
- the boundary element method for solving integral equations,
- Krylov subspace methods.^[5]

Example: Matrix linear system

We first introduce and illustrate the Galerkin method as applied to a system of linear equations $Ax=b$. We define the parameters as follows:

$$A=\begin{bmatrix} 2&0&0\\ 0&2&1\\ 0&1&2 \end{bmatrix},$$

which is symmetric and positive definite, and the right-hand side

$$b=\begin{bmatrix} 2\\ 0\\ 0 \end{bmatrix}.$$

The true solution to this linear system is

$$x=\begin{bmatrix} 1\\ 0\\ 0 \end{bmatrix}.$$

With the Galerkin method, we can solve the system in a lower-dimensional space to obtain an approximate solution. Let us use the following basis for the subspace:

$$V=\begin{bmatrix} 0&0\\ 1&0\\ 0&1 \end{bmatrix}.$$

Then, we can write the Galerkin equation

$$\left(V^*AV\right)y=V^*b,$$

where the left-hand-side matrix is

$$V^*AV=\begin{bmatrix} 2&1\\ 1&2 \end{bmatrix},$$

and the right-hand-side vector is

$$V^*b=\begin{bmatrix} 0\\ 0 \end{bmatrix}.$$

We can then obtain the solution vector in the subspace:

$$y=\begin{bmatrix} 0\\ 0 \end{bmatrix},$$

which we finally project back to the original space to determine the approximate solution to the original equation as

$$Vy=\begin{bmatrix} 0\\ 0\\ 0 \end{bmatrix}.$$
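The computation in this example can be reproduced numerically. The following is a minimal sketch using NumPy; the helper name `galerkin_solve` is a hypothetical choice for illustration and does not come from any library. It also shows that the residual of the approximate solution is orthogonal to the columns of the subspace basis, even though the approximate solution itself is far from the true one here.

```python
import numpy as np

# Data of the example: matrix, right-hand side, and subspace basis.
A = np.array([[2.0, 0.0, 0.0],
              [0.0, 2.0, 1.0],
              [0.0, 1.0, 2.0]])
b = np.array([2.0, 0.0, 0.0])
V = np.array([[0.0, 0.0],
              [1.0, 0.0],
              [0.0, 1.0]])  # the columns span the subspace


def galerkin_solve(A, b, V):
    """Hypothetical helper: solve (V^* A V) y = V^* b and return (y, V y)."""
    A_red = V.conj().T @ A @ V   # 2-by-2 Galerkin matrix
    b_red = V.conj().T @ b       # reduced right-hand side
    y = np.linalg.solve(A_red, b_red)
    return y, V @ y


y, x_approx = galerkin_solve(A, b, V)
x_true = np.linalg.solve(A, b)

# The residual of the approximation is orthogonal to the subspace.
residual = b - A @ x_approx
print("y =", y)
print("V y =", x_approx)
print("true x =", x_true)
print("V^* (b - A V y) =", V.T @ residual)
```

In this particular case the true solution is orthogonal to the chosen subspace, so the Galerkin approximation is the zero vector; a subspace containing the first coordinate direction would recover the true solution exactly.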

In this example, our original Hilbert space is actually the 3-dimensional Euclidean space $\mathbb{R}^3$ equipped with the standard scalar product $(u,v)=u^Tv$, our 3-by-3 matrix $A$ defines the bilinear form $a(u,v)=u^TAv$, and the right-hand-side vector $b$ defines the bounded linear functional $f(v)=b^Tv$. The columns

$$e_1=\begin{bmatrix} 0\\ 1\\ 0 \end{bmatrix},\quad e_2=\begin{bmatrix} 0\\ 0\\ 1 \end{bmatrix}$$

of the matrix $V$ form an orthonormal basis of the 2-dimensional subspace of the Galerkin projection. The entries of the 2-by-2 Galerkin matrix $V^*AV$ are $a(e_j,e_i)$, $i,j=1,2$, while the components of the right-hand-side vector $V^*b$ of the Galerkin equation are $f(e_i)$, $i=1,2$. Finally, the approximate solution $Vy$ is obtained from the components of the solution vector $y$ of the Galerkin equation and the basis as

$$\sum_{j=1}^{2}y_je_j.$$

Linear equation in a Hilbert space

Weak formulation of a linear equation

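Let us introduce Galerkin's method with an abstract problem posed as a weak formulation on a Hilbert space $V$, namely,

find $u\in V$ such that for all $v\in V$, $a(u,v)=f(v)$.

Here, $a(\cdot,\cdot)$ is a bilinear form (the exact requirements on $a(\cdot,\cdot)$ will be specified later) and $f$ is a bounded linear functional on $V$.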

Galerkin dimension reduction

Choose a subspace $V_n\subset V$ of dimension $n$ and solve the projected problem:

Find $u_n\in V_n$ such that for all $v_n\in V_n$, $a(u_n,v_n)=f(v_n)$.

We call this the Galerkin equation. Notice that the equation has remained unchanged and only the spaces have changed. Reducing the problem to a finite-dimensional vector subspace allows us to numerically compute $u_n$ as a finite linear combination of the basis vectors in $V_n$.

Galerkin orthogonality

The key property of the Galerkin approach is that the error is orthogonal to the chosen subspace with respect to the bilinear form $a(\cdot,\cdot)$. Since $V_n\subset V$, we can use $v_n$ as a test vector in the original equation. Subtracting the Galerkin equation from it, we get the Galerkin orthogonality relation for the error $\epsilon_n=u-u_n$ between the solution $u$ of the original problem and the solution $u_n$ of the Galerkin equation:

$$a(\epsilon_n,v_n)=a(u,v_n)-a(u_n,v_n)=f(v_n)-f(v_n)=0.$$
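Galerkin orthogonality can be checked numerically in the finite-dimensional setting of the matrix example. The sketch below assumes a randomly generated symmetric positive definite matrix `A`, a random right-hand side `b`, and a random subspace basis `V`; all of these are illustrative choices, not data from the text. The bilinear form is taken as $a(u,v)=u^TAv$.

```python
import numpy as np

rng = np.random.default_rng(0)
N, n = 8, 3  # dimension of the full space and of the subspace (assumed values)

# Assumed test problem: random symmetric positive definite matrix and right-hand side.
M = rng.standard_normal((N, N))
A = M @ M.T + N * np.eye(N)
b = rng.standard_normal(N)

# Columns of V form a basis of the subspace; it need not be orthonormal.
V = rng.standard_normal((N, n))

u = np.linalg.solve(A, b)                         # solution of the full problem
u_n = V @ np.linalg.solve(V.T @ A @ V, V.T @ b)   # Galerkin solution in the subspace
eps_n = u - u_n                                   # error

# a(eps_n, v_n) = eps_n^T A v_n, evaluated for every basis vector v_n.
orthogonality = eps_n @ A @ V
print("error norm:", np.linalg.norm(eps_n))
print("a(eps_n, e_i):", orthogonality)
```

The error itself is generally not small, but its pairing with every basis vector of the subspace vanishes up to rounding.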

Matrix form of Galerkin's equation

Since the aim of Galerkin's method is the production of a linear system of equations, we build its matrix form, which can be used to compute the solution algorithmically.

Let $e_1,e_2,\ldots,e_n$ be a basis for $V_n$. Then, it is sufficient to use these in turn for testing the Galerkin equation, i.e.: find $u_n\in V_n$ such that

$$a(u_n,e_i)=f(e_i),\quad i=1,\ldots,n.$$

We expand $u_n$ with respect to this basis, $u_n=\sum_{j=1}^{n}u_je_j$, and insert it into the equation above, to obtain

$$a\left(\sum_{j=1}^{n}u_je_j,e_i\right)=\sum_{j=1}^{n}u_ja(e_j,e_i)=f(e_i),\quad i=1,\ldots,n.$$

This previous equation is actually a linear system of equations $Au=f$, where

$$A_{ij}=a(e_j,e_i),\quad f_i=f(e_i).$$
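The assembly of the matrix and right-hand side from a bilinear form, a functional, and a basis can be sketched in a few lines. The model problem below is an assumption made for illustration only and does not appear in the text: $-u''=1$ on $(0,1)$ with $u(0)=u(1)=0$, so that $a(u,v)=\int_0^1u'v'\,dx$ and $f(v)=\int_0^1v\,dx$. The polynomial basis and the helper names `integrate01` and `assemble` are likewise hypothetical.

```python
import numpy as np
from numpy.polynomial import Polynomial


def integrate01(p):
    """Exact integral of a polynomial over [0, 1]."""
    P = p.integ()
    return P(1.0) - P(0.0)


# Assumed model problem: -u'' = 1 on (0, 1), u(0) = u(1) = 0.
def a(u, v):
    return integrate01(u.deriv() * v.deriv())  # a(u, v) = int_0^1 u' v' dx


def f(v):
    return integrate01(v)                      # f(v) = int_0^1 1 * v dx


# Basis of V_n: e_1 = x(1 - x), e_2 = x^2 (1 - x); both vanish at x = 0 and x = 1.
basis = [Polynomial([0, 1, -1]), Polynomial([0, 0, 1, -1])]


def assemble(a, f, basis):
    """Build A_ij = a(e_j, e_i) and f_i = f(e_i)."""
    n = len(basis)
    A = np.array([[a(basis[j], basis[i]) for j in range(n)] for i in range(n)])
    rhs = np.array([f(e_i) for e_i in basis])
    return A, rhs


A, rhs = assemble(a, f, basis)
coeffs = np.linalg.solve(A, rhs)

# Reconstruct u_n as the linear combination of the basis functions.
u_n = Polynomial([0.0])
for c, e in zip(coeffs, basis):
    u_n = u_n + float(c) * e

print("A =", A)
print("f =", rhs)
print("coefficients =", coeffs)
```

For this assumed problem the exact solution $x(1-x)/2$ lies in the span of the basis, so the Galerkin solution reproduces it exactly.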

Symmetry of the matrix

Due to the definition of the matrix entries, the matrix of the Galerkin equation is symmetric if and only if the bilinear form $a(\cdot,\cdot)$ is symmetric.

Analysis of Galerkin methods

Here, we will restrict ourselves to symmetric bilinear forms, that is

$$a(u,v)=a(v,u).$$

While this is not really a restriction of Galerkin methods, the application of the standard theory becomes much simpler. Furthermore, a Petrov–Galerkin method may be required in the nonsymmetric case.

The analysis of these methods proceeds in two steps. First, we will show that the Galerkin equation is a well-posed problem in the sense of Hadamard and therefore admits a unique solution. In the second step, we study the quality of approximation of the Galerkin solution $u_n$.

The analysis will mostly rest on two properties of the bilinear form, namely

- Boundedness: for all $u,v\in V$, $a(u,v)\le C\|u\|\|v\|$ holds for some constant $C>0$.
- Ellipticity: for all $u\in V$, $a(u,u)\ge c\|u\|^2$ holds for some constant $c>0$.

By the Lax-Milgram theorem (see weak formulation), these two conditions imply well-posedness of the original problem in weak formulation. All norms in the following sections will be norms for which the above inequalities hold (these norms are often called an energy norm).

Well-posedness of the Galerkin equation

Since $V_n\subset V$, boundedness and ellipticity of the bilinear form apply to $V_n$. Therefore, the well-posedness of the Galerkin problem is actually inherited from the well-posedness of the original problem.

Quasi-best approximation (Céa's lemma)

The error $u-u_n$ between the original and the Galerkin solution admits the estimate

$$\|u-u_n\|\le\frac{C}{c}\inf_{v_n\in V_n}\|u-v_n\|.$$

This means that, up to the constant $C/c$, the Galerkin solution $u_n$ is as close to the original solution $u$ as any other vector in $V_n$. In particular, it will be sufficient to study approximation by spaces $V_n$, completely forgetting about the equation being solved.

Proof

Since the proof is very simple and the basic principle behind all Galerkin methods, we include it here: by ellipticity and boundedness of the bilinear form (inequalities) and Galerkin orthogonality (equals sign in the middle), we have for arbitrary $v_n\in V_n$:

$$c\|u-u_n\|^2\le a(u-u_n,u-u_n)=a(u-u_n,u-v_n)\le C\|u-u_n\|\|u-v_n\|.$$

The middle equality holds because $u_n-v_n\in V_n$, so $a(u-u_n,u_n-v_n)=0$ by Galerkin orthogonality. Dividing by $c\|u-u_n\|$ and taking the infimum over all possible $v_n$ yields the lemma.
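The estimate of the lemma can be observed numerically in the finite-dimensional matrix setting. The following sketch assumes a random symmetric positive definite matrix and a random subspace (both illustrative choices). For $a(u,v)=u^TAv$ and the Euclidean norm, the smallest and largest eigenvalues of $A$ serve as the constants $c$ and $C$, and the distance from $u$ to the subspace is attained by the Euclidean orthogonal projection.

```python
import numpy as np

rng = np.random.default_rng(1)
N, n = 10, 4  # assumed dimensions of the full space and the subspace

# Assumed test problem: random symmetric positive definite matrix.
M = rng.standard_normal((N, N))
A = M @ M.T + np.eye(N)
b = rng.standard_normal(N)

# Orthonormal basis (columns of Q) of a random n-dimensional subspace.
Q, _ = np.linalg.qr(rng.standard_normal((N, n)))

u = np.linalg.solve(A, b)                         # full solution
u_n = Q @ np.linalg.solve(Q.T @ A @ Q, Q.T @ b)   # Galerkin solution

# Ellipticity and boundedness constants in the Euclidean norm.
eigs = np.linalg.eigvalsh(A)                      # eigenvalues in ascending order
c, C = eigs[0], eigs[-1]

galerkin_error = np.linalg.norm(u - u_n)
best_error = np.linalg.norm(u - Q @ (Q.T @ u))    # Euclidean distance from u to the subspace

print("best approximation error:", best_error)
print("Galerkin error:          ", galerkin_error)
print("bound (C/c) * best error:", (C / c) * best_error)
```

The Galerkin error lies between the best approximation error and the bound given by the lemma; the bound is often pessimistic when the ratio $C/c$ is large.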

Galerkin's best approximation property in the energy norm

For simplicity of presentation in the section above we have assumed that the bilinear form $a(u,v)$ is symmetric and positive definite, which implies that it is a scalar product and the expression $\|u\|_a=\sqrt{a(u,u)}$ is actually a valid vector norm, called the energy norm. Under these assumptions one can easily prove in addition Galerkin's best approximation property in the energy norm.

Using Galerkin a-orthogonality and the Cauchy–Schwarz inequality for the energy norm, we obtain

$$\|u-u_n\|_a^2=a(u-u_n,u-u_n)=a(u-u_n,u-v_n)\le\|u-u_n\|_a\|u-v_n\|_a.$$

Dividing by $\|u-u_n\|_a$ and taking the infimum over all possible $v_n\in V_n$ proves that the Galerkin approximation $u_n\in V_n$ is the best approximation in the energy norm within the subspace $V_n\subset V$, i.e. $u_n\in V_n$ is nothing but the orthogonal, with respect to the scalar product $a(u,v)$, projection of the solution $u$ to the subspace $V_n$.
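The best approximation property in the energy norm can be illustrated in the matrix setting by computing the energy-norm minimizer independently and comparing it with the Galerkin solution. The sketch below assumes a random symmetric positive definite matrix and subspace (illustrative choices). It uses the Cholesky factorization $A=LL^T$, for which $\|w\|_a=\|L^Tw\|_2$, so that the minimization over the subspace becomes an ordinary least-squares problem.

```python
import numpy as np

rng = np.random.default_rng(2)
N, n = 10, 4  # assumed dimensions of the full space and the subspace

# Assumed test problem: random symmetric positive definite matrix and subspace basis.
M = rng.standard_normal((N, N))
A = M @ M.T + np.eye(N)
b = rng.standard_normal(N)
V = rng.standard_normal((N, n))


def energy_norm(w):
    """Energy norm ||w||_a = sqrt(w^T A w)."""
    return np.sqrt(w @ A @ w)


u = np.linalg.solve(A, b)
y = np.linalg.solve(V.T @ A @ V, V.T @ b)   # Galerkin coefficients
u_n = V @ y

# Independent minimizer of ||u - V z||_a via least squares on L^T (u - V z).
L = np.linalg.cholesky(A)                   # A = L L^T, L lower triangular
z, *_ = np.linalg.lstsq(L.T @ V, L.T @ u, rcond=None)

# A few perturbed competitors from the same subspace.
competitors = [V @ (y + 0.1 * rng.standard_normal(n)) for _ in range(5)]

print("Galerkin coefficients:     ", y)
print("least-squares coefficients:", z)
print("energy error of u_n:       ", energy_norm(u - u_n))
print("energy errors of others:   ", [energy_norm(u - v) for v in competitors])
```

The two coefficient vectors agree up to rounding, and no competitor from the subspace has a smaller energy-norm error than the Galerkin solution.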

Galerkin method for stepped structures

I. Elishakoff, M. Amato, A. Marzani, P.A. Arvan, and J.N. Reddy^[6] ^[7] ^[8] ^[9] studied the application of the Galerkin method to stepped structures. They showed that generalized functions, namely the unit-step function, Dirac’s delta function, and the doublet function, are needed for obtaining accurate results.

History

The approach is usually credited to Boris Galerkin.^[10] ^[11] The method was explained to the Western reader by Hencky^[12] and Duncan^[13] ^[14] among others. Its convergence was studied by Mikhlin^[15] and Leipholz.^[16] ^[17] ^[18] ^[19] Its coincidence with the Fourier method was illustrated by Elishakoff et al.^[20] ^[21] ^[22] Its equivalence to Ritz's method for conservative problems was shown by Singer.^[23] Gander and Wanner^[24] showed how the Ritz and Galerkin methods led to the modern finite element method. One hundred years of the method's development was discussed by Repin. Elishakoff, Kaplunov and Kaplunov^[25] show that Galerkin’s method was not developed by Ritz, contrary to Timoshenko’s statements.

References

A. Ern, J.L. Guermond, Theory and practice of finite elements, Springer, 2004,
"Georgii Ivanovich Petrov (on his 100th birthday)", Fluid Dynamics, May 2012, Volume 47, Issue 3, pp 289-291, DOI 10.1134/S0015462812030015
S. Brenner, R. L. Scott, The Mathematical Theory of Finite Element Methods, 2nd edition, Springer, 2005,
P. G. Ciarlet, The Finite Element Method for Elliptic Problems, North-Holland, 1978,
Y. Saad
Elishakoff, I., Amato, M., Ankitha, A. P., & Marzani, A. (2021). Rigorous implementation of the Galerkin method for stepped structures needs generalized functions. Journal of Sound and Vibration, 490, 115708.
Elishakoff, I., Amato, M., & Marzani, A. (2021). Galerkin’s method revisited and corrected in the problem of Jaworsky and Dowell. Mechanical Systems and Signal Processing, 155, 107604.
Elishakoff, I., & Amato, M. (2021). Flutter of a beam in supersonic flow: truncated version of Timoshenko–Ehrenfest equation is sufficient. International Journal of Mechanics and Materials in Design, 1-17.
Amato, M., Elishakoff, I., & Reddy, J. N. (2021). Flutter of a Multicomponent Beam in a Supersonic Flow. AIAA Journal, 59(11), 4342-4353.
Galerkin, B.G.,1915, Rods and Plates, Series Occurring in Various Questions Concerning the Elastic Equilibrium of Rods and Plates, Vestnik Inzhenerov i Tekhnikov, (Engineers and Technologists Bulletin), Vol. 19, 897-908 (in Russian),(English Translation: 63-18925, Clearinghouse Fed. Sci. Tech. Info.1963).
"Le destin douloureux de Walther Ritz (1878-1909)", (Jean-Claude Pont, editor), Cahiers de Vallesia, 24, (2012),
Hencky H.,1927, Eine wichtige Vereinfachung der Methode von Ritz zur angennäherten Behandlung von Variationproblemen, ZAMM: Zeitschrift für angewandte Mathematik und Mechanik, Vol. 7, 80-81 (in German).
Duncan, W.J.,1937, Galerkin’s Method in Mechanics and Differential Equations, Aeronautical Research Committee Reports and Memoranda, No. 1798.
Duncan, W.J., 1938, The Principles of the Galerkin Method, Aeronautical Research Report and Memoranda, No. 1894.
S. G. Mikhlin, "Variational methods in Mathematical Physics", Pergamon Press, 1964
Leipholz H.H.E., 1976, Use of Galerkin’s Method for Vibration Problems, Shock and Vibration Digest, Vol. 8, 3-18
Leipholz H.H.E., 1967, Über die Wahl der Ansatzfunktionen bei der Durchführung des Verfahrens von Galerkin, Acta Mech., Vol. 3, 295-317 (in German).
Leipholz H.H.E., 1967, Über die Befreiung der Anzatzfunktionen des Ritzschen und Galerkinschen Verfahrens von den Randbedingungen, Ing. Arch., Vol. 36, 251-261 (in German).
Leipholz, H.H.E.,1976, Use of Galerkin’s Method for Vibration Problems, The Shock and Vibration Digest Vol. 8, 3-18, 1976.
Elishakoff, I., Lee, L.H.N.,1986, On Equivalence of the Galerkin and Fourier Series Methods for One Class of Problems, Journal of Sound and Vibration, Vol. 109, 174-177.
Elishakoff, I., Zingales, M., 2003, Coincidence of Bubnov-Galerkin and Exact Solution in an Applied Mechanics Problem, Journal of Applied Mechanics, Vol. 70, 777-779.
Elishakoff, I., Zingales M., 2004, Convergence of Bubnov-Galerkin Method Exemplified, AIAA Journal, Vol. 42(9), 1931-1933.
Singer J., 1962, On Equivalence of the Galerkin and Rayleigh-Ritz Methods, Journal of the Royal Aeronautical Society, Vol. 66, No. 621, p.592.
Gander, M.J, Wanner, G., 2012, From Euler, Ritz, and Galerkin to Modern Computing, SIAM Review, Vol. 54(4), 627-666.
Elishakoff, I., Julius Kaplunov, Elizabeth Kaplunov, 2020, “Galerkin’s method was not developed by Ritz, contrary to the Timoshenko’s statement”, in Nonlinear Dynamics of Discrete and Continuous Systems (A. Abramyan, I. Andrianov and V. Gaiko, eds.), pp. 63-82, Springer, Berlin.

External links

- Galerkin Method from MathWorld
