In geometry, Euler's rotation theorem states that, in three-dimensional space, any displacement of a rigid body such that a point on the rigid body remains fixed, is equivalent to a single rotation about some axis that runs through the fixed point. It also means that the composition of two rotations is also a rotation. Therefore the set of rotations has a group structure, known as a rotation group.
The theorem is named after Leonhard Euler, who proved it in 1775 by means of spherical geometry. The axis of rotation is known as an Euler axis, typically represented by a unit vector . Its product by the rotation angle is known as an axis-angle vector. The extension of the theorem to kinematics yields the concept of instant axis of rotation, a line of fixed points.
In linear algebra terms, the theorem states that, in 3D space, any two Cartesian coordinate systems with a common origin are related by a rotation about some fixed axis. This also means that the product of two rotation matrices is again a rotation matrix and that for a non-identity rotation matrix one eigenvalue is 1 and the other two are both complex, or both equal to −1. The eigenvector corresponding to this eigenvalue is the axis of rotation connecting the two systems.
Euler states the theorem as follows:[1]
Theorema.Quomodocunque sphaera circa centrum suum conuertatur, semper assignari potest diameter,cuius directio in situ translato conueniat cum situ initiali.
or (in English):
When a sphere is moved around its centre it is always possible to find a diameter whose direction in the displaced position is the same as in the initial position.
Euler's original proof was made using spherical geometry and therefore whenever he speaks about triangles they must be understood as spherical triangles.
To arrive at a proof, Euler analyses what the situation would look like if the theorem were true. To that end, suppose the yellow line in Figure 1 goes through the center of the sphere and is the axis of rotation we are looking for, and point is one of the two intersection points of that axis with the sphere. Then he considers an arbitrary great circle that does not contain (the blue circle), and its image after rotation (the red circle), which is another great circle not containing . He labels a point on their intersection as point . (If the circles coincide, then can be taken as any point on either; otherwise is one of the two points of intersection.)
Now is on the initial circle (the blue circle), so its image will be on the transported circle (red). He labels that image as point . Since is also on the transported circle (red), it is the image of another point that was on the initial circle (blue) and he labels that preimage as (see Figure 2). Then he considers the two arcs joining and to . These arcs have the same length because arc is mapped onto arc . Also, since is a fixed point, triangle is mapped onto triangle, so these triangles are isosceles, and arc bisects angle .
Let us construct a point that could be invariant using the previous considerations. We start with the blue great circle and its image under the transformation, which is the red great circle as in the Figure 1. Let point be a point of intersection of those circles. If ’s image under the transformation is the same point then is a fixed point of the transformation, and since the center is also a fixed point, the diameter of the sphere containing is the axis of rotation and the theorem is proved.
Otherwise we label ’s image as and its preimage as, and connect these two points to with arcs and . These arcs have the same length. Construct the great circle that bisects and locate point on that great circle so that arcs and have the same length, and call the region of the sphere containing and bounded by the blue and red great circles the interior of . (That is, the yellow region in Figure 3.) Then since and is on the bisector of, we also have .
Now let us suppose that is the image of . Then we know and orientation is preserved, so must be interior to . Now is transformed to, so . Since is also the same length as, then and . But, so and . Therefore is the same point as . In other words, is a fixed point of the transformation, and since the center is also a fixed point, the diameter of the sphere containing is the axis of rotation.
Euler also points out that can be found by intersecting the perpendicular bisector of with the angle bisector of, a construction that might be easier in practice. He also proposed the intersection of two planes:
Proposition. These two planes intersect in a diameter. This diameter is the one we are looking for.
Proof. Let us call either of the endpoints (there are two) of this diameter over the sphere surface. Since is mapped on and the triangles have the same angles, it follows that the triangle is transported onto the triangle . Therefore the point has to remain fixed under the movement.
Corollaries. This also shows that the rotation of the sphere can be seen as two consecutive reflections about the two planes described above. Points in a mirror plane are invariant under reflection, and hence the points on their intersection (a line: the axis of rotation) are invariant under both the reflections, and hence under the rotation.
Another simple way to find the rotation axis is by considering the plane on which the points,, lie. The rotation axis is obviously orthogonal to this plane, and passes through the center of the sphere.
Given that for a rigid body any movement that leaves an axis invariant is a rotation, this also proves that any arbitrary composition of rotations is equivalent to a single rotation around a new axis.
A spatial rotation is a linear map in one-to-one correspondence with a rotation matrix that transforms a coordinate vector into, that is . Therefore, another version of Euler's theorem is that for every rotation, there is a nonzero vector for which ; this is exactly the claim that is an eigenvector of associated with the eigenvalue 1. Hence it suffices to prove that 1 is an eigenvalue of ; the rotation axis of will be the line, where is the eigenvector with eigenvalue 1.
A rotation matrix has the fundamental property that its inverse is its transpose, that is
RTR=RRT=I,
Compute the determinant of this relation to find that a rotation matrix has determinant ±1. In particular,
\begin{align} 1=\det(I)&=\det\left(RTR\right)=\det\left(RT\right)\det(R)=\det(R)2\\ \Longrightarrow \det(R)&=\pm1. \end{align}
A rotation matrix with determinant +1 is a proper rotation, and one with a negative determinant −1 is an improper rotation, that is a reflection combined with a proper rotation.
It will now be shown that a proper rotation matrix has at least one invariant vector, i.e., . Because this requires that, we see that the vector must be an eigenvector of the matrix with eigenvalue . Thus, this is equivalent to showing that .
Use the two relations
\det(-A)=(-1)3\det(A)=-\det(A)
\det\left(R-1\right)=1
\begin{align} &\det(R-I)=\det\left((R-I)T\right)\\ {}={}&\det\left(RT-I\right)=\det\left(R-1-R-1R\right)\\ {}={}&\det\left(R-1(I-R)\right)=\det\left(R-1\right)\det(-(R-I))\\ {}={}&-\det(R-I)\\[3pt] \Longrightarrow 0={}&\det(R-I). \end{align}
This shows that is a root (solution) of the characteristic equation, that is,
\det(R-λI)=0 \hbox{for} λ=1.
In other words, the matrix is singular and has a non-zero kernel, that is, there is at least one non-zero vector, say, for which
(R-I)n=0 \Longleftrightarrow Rn=n.
The line for real is invariant under, i.e., is a rotation axis. This proves Euler's theorem.
Two matrices (representing linear maps) are said to be equivalent if there is a change of basis that makes one equal to the other. A proper orthogonal matrix is always equivalent (in this sense) to either the following matrix or to its vertical reflection:
R\sim \begin{pmatrix} \cos\phi&-\sin\phi&0\\ \sin\phi&\cos\phi&0\\ 0&0&1\\ \end{pmatrix}, 0\le\phi\le2\pi.
Then, any orthogonal matrix is either a rotation or an improper rotation. A general orthogonal matrix has only one real eigenvalue, either +1 or −1. When it is +1 the matrix is a rotation. When −1, the matrix is an improper rotation.
If has more than one invariant vector then and . Any vector is an invariant vector of .
In order to prove the previous equation some facts from matrix theory must be recalled.
An matrix has orthogonal eigenvectors if and only if is normal, that is, if . This result is equivalent to stating that normal matrices can be brought to diagonal form by a unitary similarity transformation:
AU=U \operatorname{diag}(\alpha1,\ldots,\alpham) \Longleftrightarrow U\daggerAU=\operatorname{diag}(\alpha1,\ldots,\alpham),
U\dagger=U-1.
\left(U\daggerAU\right)\dagger=
* | |
\operatorname{diag}\left(\alpha | |
m\right) |
= U\daggerA-1U=\operatorname{diag}\left(
1 | ,\ldots, | |
\alpha1 |
1 | |
\alpham |
\right)
* | |
\alpha | |
k |
=
1 | |
\alphak |
\Longleftrightarrow
* | |
\alpha | |
k\alpha |
k=
2 | |
\left|\alpha | |
k\right| |
=1, k=1,\ldots,m.
After recollection of these general facts from matrix theory, we return to the rotation matrix . It follows from its realness and orthogonality that we can find a such that:
RU=U \begin{pmatrix} ei\phi&0&0\\ 0&e-i\phi&0\\ 0&0&\pm1\\ \end{pmatrix}
R
Ru1=ei\phiu1 \hbox{and} Ru2=e-i\phiu2.
(R-ei\phiI)u1=0
(R-e-i\phi
* | |
I)u | |
1 |
=0
u2=
* | |
u | |
1 |
u2
(R-I)u3=0
u3
R
RU \begin{pmatrix}
1 | |
\sqrt{2 |
U'\daggerRU'=\begin{pmatrix} \cos\phi&-\sin\phi&0\\ \sin\phi&\cos\phi&0\\ 0&0&1\\ \end{pmatrix} with U' =U \begin{pmatrix}
1 | |
\sqrt{2 |
u1
u2
u3
u3=
The trace (sum of diagonal elements) of the real rotation matrix given above is . Since a trace is invariant under an orthogonal matrix similarity transformation,
Tr\left[ARAT\right]= Tr\left[RATA\right]=Tr[R] with AT=A-1,
In fact, all proper rotation rotation matrices form a group, usually denoted by SO(3) (the special orthogonal group in 3 dimensions) and all matrices with the same trace form an equivalence class in this group. All elements of such an equivalence class share their rotation angle, but all rotations are around different axes. If is an eigenvector of with eigenvalue 1, then is also an eigenvector of T, also with eigenvalue 1. Unless, and are different.
See main article: Rotation matrix, Rotation group SO(3) and Infinitesimal transformation.
Suppose we specify an axis of rotation by a unit vector, and suppose we have an infinitely small rotation of angle about that vector. Expanding the rotation matrix as an infinite addition, and taking the first order approach, the rotation matrix is represented as:
\DeltaR= \begin{bmatrix} 1&0&0\\ 0&1&0\\ 0&0&1 \end{bmatrix}+ \begin{bmatrix} 0&z&-y\\ -z&0&x\\ y&-x&0 \end{bmatrix}\Delta\theta= I+A\Delta\theta.
A finite rotation through angle about this axis may be seen as a succession of small rotations about the same axis. Approximating as where is a large number, a rotation of about the axis may be represented as:
R=\left(1+
A\theta | |
N |
\right)N ≈ eA\theta.
It can be seen that Euler's theorem essentially states that all rotations may be represented in this form. The product is the "generator" of the particular rotation, being the vector associated with the matrix . This shows that the rotation matrix and the axis–angle format are related by the exponential function.
One can derive a simple expression for the generator . One starts with an arbitrary plane (in Euclidean space) defined by a pair of perpendicular unit vectors and . In this plane one can choose an arbitrary vector with perpendicular . One then solves for in terms of and substituting into an expression for a rotation in a plane yields the rotation matrix which includes the generator .
\begin{align} x&=a\cos\alpha+b\sin\alpha\\ y&=-a\sin\alpha+b\cos\alpha\\[8pt] \cos\alpha&=aTx\\ \sin\alpha&=bTx\\[8px] y&=-abTx+baTx =\left(baT-abT\right)x\\[8px] x'&=x\cos\beta+y\sin\beta\ &=\left(I\cos\beta+\left(baT-abT\right)\sin\beta\right)x\\[8px]R&=I\cos\beta+\left(baT-abT\right)\sin\beta\ &=I\cos\beta+G\sin\beta\\[8px]G&=baT-abT\end{align}
To include vectors outside the plane in the rotation one needs to modify the above expression for by including two projection operators that partition the space. This modified rotation matrix can be rewritten as an exponential function.
\begin{align}
Pab |
&=-G2\ R&=I-
Pab |
+\left(I\cos\beta+G\sin\beta
\right)Pab |
=eG\beta\end{align}
Analysis is often easier in terms of these generators, rather than the full rotation matrix. Analysis in terms of the generators is known as the Lie algebra of the rotation group.
See main article: Three-dimensional rotation operator and Quaternions and spatial rotation.
It follows from Euler's theorem that the relative orientation of any pair of coordinate systems may be specified by a set of three independent numbers. Sometimes a redundant fourth number is added to simplify operations with quaternion algebra. Three of these numbers are the direction cosines that orient the eigenvector. The fourth is the angle about the eigenvector that separates the two sets of coordinates. Such a set of four numbers is called a quaternion.
While the quaternion as described above, does not involve complex numbers, if quaternions are used to describe two successive rotations, they must be combined using the non-commutative quaternion algebra derived by William Rowan Hamilton through the use of imaginary numbers.
Rotation calculation via quaternions has come to replace the use of direction cosines in aerospace applications through their reduction of the required calculations, and their ability to minimize round-off errors. Also, in computer graphics the ability to perform spherical interpolation between quaternions with relative ease is of value.
See also: Rotations in 4-dimensional Euclidean space.
In higher dimensions, any rigid motion that preserves a point in dimension or is a composition of at most rotations in orthogonal planes of rotation, though these planes need not be uniquely determined, and a rigid motion may fix multiple axes. Also, any rigid motion that preserves linearly independent points, which span an -dimensional body in dimension or, is a single plane of rotation. To put it another way, if two rigid bodies, with identical geometry, share at least points of 'identical' locations within themselves, the convex hull of which is -dimensional, then a single planar rotation can bring one to cover the other accurately in dimension or .
A rigid motion in three dimensions that does not necessarily fix a point is a "screw motion". This is because a composition of a rotation with a translation perpendicular to the axis is a rotation about a parallel axis, while composition with a translation parallel to the axis yields a screw motion; see screw axis. This gives rise to screw theory.