Christoffel symbols explained

In mathematics and physics, the Christoffel symbols are an array of numbers describing a metric connection.[1] The metric connection is a specialization of the affine connection to surfaces or other manifolds endowed with a metric, allowing distances to be measured on that surface. In differential geometry, an affine connection can be defined without reference to a metric, and many additional concepts follow: parallel transport, covariant derivatives, geodesics, etc. also do not require the concept of a metric.[2] [3] However, when a metric is available, these concepts can be directly tied to the "shape" of the manifold itself; that shape is determined by how the tangent space is attached to the cotangent space by the metric tensor.[4] Abstractly, one would say that the manifold has an associated (orthonormal) frame bundle, with each "frame" being a possible choice of a coordinate frame. An invariant metric implies that the structure group of the frame bundle is the orthogonal group . As a result, such a manifold is necessarily a (pseudo-)Riemannian manifold.[5] [6] The Christoffel symbols provide a concrete representation of the connection of (pseudo-)Riemannian geometry in terms of coordinates on the manifold. Additional concepts, such as parallel transport, geodesics, etc. can then be expressed in terms of Christoffel symbols.

In general, there are an infinite number of metric connections for a given metric tensor; however, there is a unique connection that is free of torsion, the Levi-Civita connection. It is common in physics and general relativity to work almost exclusively with the Levi-Civita connection, by working in coordinate frames (called holonomic coordinates) where the torsion vanishes. For example, in Euclidean spaces, the Christoffel symbols describe how the local coordinate bases change from point to point.

At each point of the underlying -dimensional manifold, for any local coordinate system around that point, the Christoffel symbols are denoted for . Each entry of this array is a real number. Under linear coordinate transformations on the manifold, the Christoffel symbols transform like the components of a tensor, but under general coordinate transformations (diffeomorphisms) they do not. Most of the algebraic properties of the Christoffel symbols follow from their relationship to the affine connection; only a few follow from the fact that the structure group is the orthogonal group (or the Lorentz group for general relativity).

Christoffel symbols are used for performing practical calculations. For example, the Riemann curvature tensor can be expressed entirely in terms of the Christoffel symbols and their first partial derivatives. In general relativity, the connection plays the role of the gravitational force field with the corresponding gravitational potential being the metric tensor. When the coordinate system and the metric tensor share some symmetry, many of the are zero.

The Christoffel symbols are named for Elwin Bruno Christoffel (1829–1900).

Note

The definitions given below are valid for both Riemannian manifolds and pseudo-Riemannian manifolds, such as those of general relativity, with careful distinction being made between upper and lower indices (contra-variant and co-variant indices). The formulas hold for either sign convention, unless otherwise noted.

Einstein summation convention is used in this article, with vectors indicated by bold font. The connection coefficients of the Levi-Civita connection (or pseudo-Riemannian connection) expressed in a coordinate basis are called Christoffel symbols.

Preliminary definitions

M

, an atlas consists of a collection of charts

\varphi:U\toRn

for each open cover

U\subsetM

. Such charts allow the standard vector basis

(\vec{e}1,,\vec{e}n)

on

Rn

to be pulled back to a vector basis on the tangent space

TM

of

M

. This is done as follows. Given some arbitrary real function

f:M\toR

, the chart allows a gradient to be defined:

\partialif\equiv

\partial\left(f\circ\varphi-1\right)
\partialxi

fori=1,2,...,n

This gradient is commonly called a pullback because it "pulls back" the gradient on

Rn

to a gradient on

M

. The pullback is independent of the chart

\varphi

. In this way, the standard vector basis

(\vec{e}1,,\vec{e}n)

on

Rn

pulls back to a standard ("coordinate") vector basis

(\partial1,,\partialn)

on

TM

. This is called the "coordinate basis", because it explicitly depends on the coordinates on

Rn

. It is sometimes called the "local basis".

This definition allows a common abuse of notation. The

\partiali

were defined to be in one-to-one correspondence with the basis vectors

\vec{e}i

on

Rn

. The notation

\partiali

serves as a reminder that the basis vectors on the tangent space

TM

came from a gradient construction. Despite this, it is common to "forget" this construction, and just write (or rather, define) vectors

ei

on

TM

such that

ei\equiv\partiali

. The full range of commonly used notation includes the use of arrows and boldface to denote vectors:

\partiali\equiv

\partial
\partialxi

\equivei\equiv\vec{e}i\equivei\equiv\boldsymbol\partiali

where

\equiv

is used as a reminder that these are defined to be equivalent notation for the same concept. The choice of notation is according to style and taste, and varies from text to text.

The coordinate basis provides a vector basis for vector fields on

M

. Commonly used notation for vector fields on

M

include

X=\vecX=

i\partial
\partialxi
X
i=X
The upper-case

X

, without the vector-arrow, is particularly popular for index-free notation, because it both minimizes clutter and reminds that results are independent of the chosen basis, and, in this case, independent of the atlas.

The same abuse of notation is used to push forward one-forms from

Rn

to

M

. This is done by writing

(\varphi1,\ldots,\varphin)=(x1,\ldots,xn)

or

x=\varphi

or

xi=\varphii

. The one-form is then

dxi=d\varphii

. This is soldered to the basis vectors as
i
dx
j
. Note the careful use of upper and lower indexes, to distinguish contravarient and covariant vectors.

The pullback induces (defines) a metric tensor on

M

. Several styles of notation are commonly used:g_ = \mathbf_i \cdot \mathbf_j=\langle \vec_i, \vec_j\rangle = e_i^a e_j^b \,\eta_where both the centerdot and the angle-bracket

\langle,\rangle

denote the scalar product. The last form uses the tensor

ηab

, which is understood to be the "flat-space" metric tensor. For Riemannian manifolds, it is the Kronecker delta

ηab=\deltaab

. For pseudo-Riemannian manifolds, it is the diagonal matrix having signature

(p,q)

. The notation
a
e
i
serves as a reminder that pullback really is a linear transform, given as the gradient, above. The index letters

a,b,c,

live in

Rn

while the index letters

i,j,k,

live in the tangent manifold.

gij

of the metric tensor

gij

is given byg^ g_=\delta^i_kThis is used to define the dual basis:\mathbf^i = \mathbf_j g^,\quad i = 1,\, 2,\, \dots,\, n

Some texts write

gi

for

ei

, so that the metric tensor takes the particularly beguiling form

gij=gigj

. This is commonly done so that the symbol

ei

can be used unambiguously for the vierbein.

Definition in Euclidean space

In Euclidean space, the general definition given below for the Christoffel symbols of the second kind can be proven to be equivalent to:_ = \frac \cdot \mathbf^ = \frac \cdot g^ \mathbf_

Christoffel symbols of the first kind can then be found via index lowering:\Gamma_ = _g_ = \frac \cdot \mathbf^ g_ = \frac \cdot \mathbf_

Rearranging, we see that (assuming the partial derivative belongs to the tangent space, which cannot occur on a non-Euclidean curved space):\frac = _ \mathbf_ = \Gamma_ \mathbf^

In words, the arrays represented by the Christoffel symbols track how the basis changes from point to point. If the derivative does not lie on the tangent space, the right expression is the projection of the derivative over the tangent space (see covariant derivative below). Symbols of the second kind decompose the change with respect to the basis, while symbols of the first kind decompose it with respect to the dual basis. In this form, it is easy to see the symmetry of the lower or last two indices:_ = _ and

\Gammakij=\Gammakji,

from the definition of

ei

and the fact that partial derivatives commute (as long as the manifold and coordinate system are well behaved).

The same numerical values for Christoffel symbols of the second kind also relate to derivatives of the dual basis, as seen in the expression:\frac = -_ \mathbf^,which we can rearrange as:_ = -\frac \cdot \mathbf_.

General definition

The Christoffel symbols come in two forms: the first kind, and the second kind. The definition of the second kind is more basic, and thus is presented first.

Christoffel symbols of the second kind (symmetric definition)

The Christoffel symbols of the second kind are the connection coefficients—in a coordinate basis—of the Levi-Civita connection.In other words, the Christoffel symbols of the second kind (sometimes or) are defined as the unique coefficients such that\nabla_i \mathrm_j = _\mathrm_k,where is the Levi-Civita connection on taken in the coordinate direction (i.e.,) and where is a local coordinate (holonomic) basis. Since this connection has zero torsion, and holonomic vector fields commute (i.e.

[ei,ej]=[\partiali,\partialj]=0

) we have\nabla_i \mathrm_j = \nabla_j \mathrm_i.Hence in this basis the connection coefficients are symmetric:[7] _ = _.For this reason, a torsion-free connection is often called symmetric.

The Christoffel symbols can be derived from the vanishing of the covariant derivative of the metric tensor : 0 = \nabla_l g_ = \frac - g__ - g__ = \frac - 2g__.

As a shorthand notation, the nabla symbol and the partial derivative symbols are frequently dropped, and instead a semicolon and a comma are used to set off the index that is being used for the derivative. Thus, the above is sometimes written as0 = \,g_ = g_ - g_ _ - g_ _ .

Using that the symbols are symmetric in the lower two indices, one can solve explicitly for the Christoffel symbols as a function of the metric tensor by permuting the indices and resumming:_ = \frac g^ \left(\frac + \frac - \frac \right) = \frac g^ \left(g_ + g_ - g_\right),

where is the inverse of the matrix, defined as (using the Kronecker delta, and Einstein notation for summation) . Although the Christoffel symbols are written in the same notation as tensors with index notation, they do not transform like tensors under a change of coordinates.

Contraction of indices

Contracting the upper index with either of the lower indices (those being symmetric) leads to_ = \frac \ln\sqrt

where

g=\detgik

is the determinant of the metric tensor. This identity can be used to evaluate divergence of vectors.

Christoffel symbols of the first kind

The Christoffel symbols of the first kind can be derived either from the Christoffel symbols of the second kind and the metric,\Gamma_ = g_ _\,,

or from the metric alone,\begin\Gamma_ &= \frac \left(\frac + \frac - \frac \right) \\ &= \frac\, \left(g_ + g_ - g_\right) \\ &= \frac\, \left(\partial_g_ + \partial_g_ - \partial_g_\right) \,. \\ \end

As an alternative notation one also finds[8] [9]

\Gamma_ = [ab, c].It is worth noting that .

Connection coefficients in a nonholonomic basis

The Christoffel symbols are most typically defined in a coordinate basis, which is the convention followed here. In other words, the name Christoffel symbols is reserved only for coordinate (i.e., holonomic) frames. However, the connection coefficients can also be defined in an arbitrary (i.e., nonholonomic) basis of tangent vectors by\nabla_\mathbf_j = _\mathbf_k.

Explicitly, in terms of the metric tensor, this is[10] _ = \frac g^ \left(g_ + g_ - g_ + c_ + c_ - c_ \right),

where are the commutation coefficients of the basis; that is,[\mathbf{u}_k,\, \mathbf{u}_l] = ^m \mathbf_m

where are the basis vectors and is the Lie bracket. The standard unit vectors in spherical and cylindrical coordinates furnish an example of a basis with non-vanishing commutation coefficients. The difference between the connection in such a frame, and the Levi-Civita connection is known as the contorsion tensor.

Ricci rotation coefficients (asymmetric definition)

When we choose the basis orthonormal: then . This implies that_ = \frac \eta^ \left(c_ + c_ - c_ \right)and the connection coefficients become antisymmetric in the first two indices:\omega_ = -\omega_\,,where\omega_ = \eta__\, .

In this case, the connection coefficients are called the Ricci rotation coefficients.[11] [12]

Equivalently, one can define Ricci rotation coefficients as follows:_ := \mathbf^k \cdot \left(\nabla_j \mathbf_i \right)\,,where is an orthonormal nonholonomic basis and its co-basis.

Transformation law under change of variable

Under a change of variable from

\left(x1,\ldots,xn\right)

to

\left(\bar{x}1,\ldots,\bar{x}n\right)

, Christoffel symbols transform as

_ = \frac\, \frac\, \frac\, _ + \frac\, \frac

where the overline denotes the Christoffel symbols in the

\bar{x}i

coordinate system. The Christoffel symbol does not transform as a tensor, but rather as an object in the jet bundle. More precisely, the Christoffel symbols can be considered as functions on the jet bundle of the frame bundle of, independent of any local coordinate system. Choosing a local coordinate system determines a local section of this bundle, which can then be used to pull back the Christoffel symbols to functions on, though of course these functions then depend on the choice of local coordinate system.

For each point, there exist coordinate systems in which the Christoffel symbols vanish at the point.[13] These are called (geodesic) normal coordinates, and are often used in Riemannian geometry.

There are some interesting properties which can be derived directly from the transformation law.

i}
{\Gamma
jk
behaves like a tensor.
i}
{\Gamma
jk
and
i}
{\tilde\Gamma
jk
, then their difference
i}
{\Gamma
jk

-

i}
{\tilde\Gamma
jk
is a tensor since the inhomogeneous terms cancel each other. The inhomogeneous terms depend only on how the coordinates are changed, but are independent of Christoffel symbol itself.
i}
{\Gamma
jk

i}
{\Gamma
kj
, then they remain unsymmetric under any change of coordinates. A corollary to this property is that it is impossible to find a coordinate system in which all elements of Christoffel symbol are zero at a point, unless lower indices are symmetric. This property was pointed out by Albert Einstein[14] and Erwin Schrödinger[15] independently.

Relationship to parallel transport and derivation of Christoffel symbols in Riemannian space

If a vector

\xii

is transported parallel on a curve parametrized by some parameter

s

on a Riemannian manifold, the rate of change of the components of the vector is given by\frac = -_ \frac\xi^j.

Now just by using the condition that the scalar product

gik\xiiηk

formed by two arbitrary vectors

\xii

and

ηk

is unchanged is enough to derive the Christoffel symbols. The condition is\frac\left(g_\xi^i\eta^k\right) = 0which by the product rule expands to\frac \frac \xi^i\eta^k + g_ \frac\eta^k + g_\xi^i\frac = 0.

Applying the parallel transport rule for the two arbitrary vectors and relabelling dummy indices and collecting the coefficients of

\xiiηkdxl

(arbitrary), we obtain

\frac = g__ + g__.

This is same as the equation obtained by requiring the covariant derivative of the metric tensor to vanish in the General definition section. The derivation from here is simple. By cyclically permuting the indices

ikl

in above equation, we can obtain two more equations and then linearly combining these three equations, we can express
i}
{\Gamma
jk
in terms of metric tensor.

Relationship to index-free notation

Let and be vector fields with components and . Then the th component of the covariant derivative of with respect to is given by\left(\nabla_X Y\right)^k = X^i (\nabla_i Y)^k = X^i \left(\frac + _ Y^m\right).

Here, the Einstein notation is used, so repeated indices indicate summation over indices and contraction with the metric tensor serves to raise and lower indices:g(X, Y) = X^i Y_i = g_X^i Y^k = g^X_i Y_k.

Keep in mind that and that, the Kronecker delta. The convention is that the metric tensor is the one with the lower indices; the correct way to obtain from is to solve the linear equations .

The statement that the connection is torsion-free, namely that\nabla_X Y - \nabla_Y X = [X,\, Y]

is equivalent to the statement that—in a coordinate basis—the Christoffel symbol is symmetric in the lower two indices:_ = _.

The index-less transformation properties of a tensor are given by pullbacks for covariant indices, and pushforwards for contravariant indices. The article on covariant derivatives provides additional discussion of the correspondence between index-free notation and indexed notation.

Covariant derivatives of tensors

The covariant derivative of a vector field with components is\nabla_l V^m = \frac + _ V^k.

By corollary, divergence of a vector can be obtained as\nabla_i V^i = \frac\frac.

The covariant derivative of a covector field is\nabla_l \omega_m = \frac - _ \omega_k.

The symmetry of the Christoffel symbol now implies\nabla_i\nabla_j \varphi = \nabla_j\nabla_i \varphifor any scalar field, but in general the covariant derivatives of higher order tensor fields do not commute (see curvature tensor).

The covariant derivative of a type tensor field is\nabla_l A^ = \frac + _ A^ + _ A^,that is,_ = _ + A^ _ + A^ _.

If the tensor field is mixed then its covariant derivative is_ = _ + _k _ - _m _, and if the tensor field is of type then its covariant derivative is A_ = A_ - A_ _ - A_ _.

Contravariant derivatives of tensors

To find the contravariant derivative of a vector field, we must first transform it into a covariant derivative using the metric tensor\nabla^l V^m = g^ \nabla_i V^m = g^ \partial_i V^m + g^ \Gamma^m_ V^k = \partial^l V^m + g^ \Gamma^m_ V^k

Applications

In general relativity

The Christoffel symbols find frequent use in Einstein's theory of general relativity, where spacetime is represented by a curved 4-dimensional Lorentz manifold with a Levi-Civita connection. The Einstein field equations—which determine the geometry of spacetime in the presence of matter—contain the Ricci tensor, and so calculating the Christoffel symbols is essential. Once the geometry is determined, the paths of particles and light beams are calculated by solving the geodesic equations in which the Christoffel symbols explicitly appear.

In classical (non-relativistic) mechanics

Let

xi

be the generalized coordinates and
x

i

be the generalized velocities, then the kinetic energy for a unit mass is given by

T=\tfrac{1}{2}gik

x

i

x

k

, where

gik

is the metric tensor. If

V\left(xi\right)

, the potential function, exists then the contravariant components of the generalized force per unit mass are

Fi=\partialV/\partialxi

. The metric (here in a purely spatial domain) can be obtained from the line element

ds2=2Tdt2

. Substituting the Lagrangian

L=T-V

into the Euler-Lagrange equation, we get[16]

g_\ddot^k + \frac\left(\frac + \frac - \frac\right) \dot^l \dot^k = F_i.

Now multiplying by

gij

, we get\ddot^j + _ \dot^l \dot^k = F^j.

When Cartesian coordinates can be adopted (as in inertial frames of reference), we have an Euclidean metrics, the Christoffel symbol vanishes, and the equation reduces to Newton's second law of motion. In curvilinear coordinates[17] (forcedly in non-inertial frames, where the metrics is non-Euclidean and not flat), fictitious forces like the Centrifugal force and Coriolis force originate from the Christoffel symbols, so from the purely spatial curvilinear coordinates.

In Earth surface coordinates

Given a spherical coordinate system, which describes points on the Earth surface (approximated as an ideal sphere).

\beginx(R, \theta, \varphi) &= \begin R\cos\theta\cos\varphi & R\cos\theta\sin\varphi & R\sin\theta \end \\\end

For a point x, is the distance to the Earth core (usually approximately the Earth radius). and are the latitude and longitude. Positive is the northern hemisphere. To simplify the derivatives, the angles are given in radians (where d sin(x)/dx = cos(x), the degree values introduce an additional factor of 360 / 2 pi).

At any location, the tangent directions are

eR

(up),

e\theta

(north) and

e\varphi

(east) - you can also use indices 1,2,3.

\begine_ &= \begin \cos\theta\cos\varphi & \cos\theta\sin\varphi & \sin\theta \end \\e_ &= R \cdot \begin -\sin\theta\cos\varphi & - \sin\theta\sin\varphi & \cos\theta \end \\e_ &= R\cos\theta \cdot \begin -\sin\varphi & \cos\varphi & 0 \end \\\end

The related metric tensor has only diagonal elements (the squared vector lengths). This is an advantage of the coordinate system and not generally true.

\beging_ = 1 \qquad & g_ = R^2 \qquad & g_ = R^2\cos^2\theta \qquad & g_ = 0 \quad \mathrm \\g^ = 1 \qquad & g^ = 1/R^2 \qquad & g^ = 1/(R^2\cos^2\theta) \qquad & g^ = 0 \quad \mathrm \\\end

Now the necessary quantities can be calculated. Examples:

\begine^ = e_ g^ = 1 \cdot e_ &= \begin \cos\theta\cos\varphi & \cos\theta\sin\varphi & \sin\theta \end \\_ = e^ \cdot \frac e_\varphi &= e^ \cdot \begin -R\cos\theta\cos\varphi & -R\cos\theta\sin\varphi & 0 \end = -R\cos^2\theta \\\end

The resulting Christoffel symbols of the second kind

k}
{\Gamma
ji

=ek

\partialej
\partialxi

then are (organized by the "derivative" index in a matrix):

\begin\begin_ & _ & _ \\_ & _ & _ \\_ & _ & _ \\\end &= \quad \begin 0 & 0 & 0 \\ 0 & 1/R & 0 \\ 0 & 0 & 1/R \end \\\begin_ & _ & _ \\_ & _ & _ \\_ & _ & _ \\\end \quad &= \begin 0 & -R & 0 \\ 1/R & 0 & 0 \\ 0 & 0 & -\tan\theta \end \\\begin_ & _ & _ \\_ & _ & _ \\_ & _ & _ \\\end &= \quad \begin 0 & 0 & -R\cos^2\theta \\ 0 & 0 & \cos\theta\sin\theta \\ 1/R & -\tan\theta & 0 \end \\\end

These values show how the tangent directions (columns:

eR

,

e\theta

,

e\varphi

) change, seen from an outside perspective (e.g. from space), but given in the tangent directions of the actual location (rows:,,).

As an example, take the nonzero derivatives by in

k}
{\Gamma
j\theta
, which corresponds to a movement towards north (positive dθ):

e\theta

changes by -R dθ in the up (R) direction. So the north direction will rotate downwards towards the center of the Earth.

eR

will be adjusted towards the north. The different lengths of

eR

and

e\theta

lead to a factor of 1/R .

e\varphi

changes its length (-tan(θ) on the diagonal), it will shrink (-tan(θ) dθ < 0) on the northern hemisphere, and increase (-tan(θ) dθ > 0) on the southern hemisphere.These effects are maybe not apparent during the movement, because they are the adjustments that keep the measurements in the coordinates,, . Nevertheless, it can affect distances, physics equations, etc. So if e.g. you need the exact change of a magnetic field pointing approximately "south", it can be necessary to also correct your measurement by the change of the north direction using the Christoffel symbols to get the "true" (tensor) value.

The Christoffel symbols of the first kind

{\Gammal}ji=glk

k}
{\Gamma
ji

show the same change using metric-corrected coordinates, e.g. for derivative by :

\begin\begin_ & _ & _ \\_ & _ & _ \\_ & _ & _ \\\end &= R\cos\theta \begin 0 & 0 & -\cos\theta \\ 0 & 0 & R\sin\theta \\ \cos\theta & -R\sin\theta & 0 \end \\\end[18]

See also

References

Notes and References

  1. See, for instance, and
  2. Ronald Adler, Maurice Bazin, Menahem Schiffer, Introduction to General Relativity (1965) McGraw-Hill Book Company (See section 2.1)
  3. Charles W. Misner, Kip S. Thorne, John Archibald Wheeler, Gravitation (1973) W. H. Freeman (See chapters 8-11)
  4. Misner, Thorne, Wheeler, op. cit. (See chapter 13)
  5. Jurgen Jost, Riemannian Geometry and Geometric Analysis, (2002) Springer-Verlag
  6. David Bleeker, Gauge Theory and Variational Principles (1991) Addison-Wesely Publishing Company
  7. Book: U. . Chatterjee . N. . Chatterjee . Vector & Tensor Analysis . 2010 . 480 .
  8. Book: U. . Chatterjee . N. . Chatterjee . 2010 . Vector and Tensor Analysis . 480.
  9. Book: D.J. . Struik . Lectures on Classical Differential Geometry . first published in 1988 Dover . 1961 . 114.
  10. Web site: Christoffel Symbol of the Second Kind -- from Wolfram MathWorld. mathworld.wolfram.com . https://web.archive.org/web/20090123011440/http://mathworld.wolfram.com/ChristoffelSymboloftheSecondKind.html . 2009-01-23.
  11. Dei sistemi di congruenze ortogonali in una varietà qualunque. G. Ricci-Curbastro . Mem. Acc. Lincei . 2. 1896. 276–322. 5.
  12. Ricci's coefficients of rotation . H. Levy. Bull. Amer. Math. Soc. . 31. 1925. 142–145. 3–4. 10.1090/s0002-9904-1925-03996-8. free.
  13. This is assuming that the connection is symmetric (e.g., the Levi-Civita connection). If the connection has torsion, then only the symmetric part of the Christoffel symbol can be made to vanish.
  14. Web site: The Meaning of Relativity (1956, 5th Edition). Einstein . Albert . 2005. Princeton University Press (2005).
  15. Schrödinger, E. (1950). Space-time structure. Cambridge University Press.
  16. Adler, R., Bazin, M., & Schiffer, M. Introduction to General Relativity (New York, 1965).
  17. David, Kay, Tensor Calculus (1988) McGraw-Hill Book Company (See section 11.4)
  18. Sesslar, Alexander  J. “Published Mathematical Works | Christoffel Symbols and Spherical Coordinates .” 2023 https://sites.google.com/view/published-mathematical-works/home