In mathematics, a change of variables is a basic technique used to simplify problems in which the original variables are replaced with functions of other variables. The intent is that when expressed in new variables, the problem may become simpler, or equivalent to a better understood problem.
Change of variables is an operation that is related to substitution. However these are different operations, as can be seen when considering differentiation (chain rule) or integration (integration by substitution).
A very simple example of a useful variable change can be seen in the problem of finding the roots of the sixth-degree polynomial:
x6-9x3+8=0.
Sixth-degree polynomial equations are generally impossible to solve in terms of radicals (see Abel–Ruffini theorem). This particular equation, however, may be written
(x3)2-9(x3)+8=0
(this is a simple case of a polynomial decomposition). Thus the equation may be simplified by defining a new variable
u=x3
\sqrt[3]{u}
u2-9u+8=0,
which is just a quadratic equation with the two solutions:
u=1 and u=8.
The solutions in terms of the original variable are obtained by substituting x3 back in for u, which gives
x3=1 and x3=8.
Then, assuming that one is interested only in real solutions, the solutions of the original equation are
x=(1)1/3=1 and x=(8)1/3=2.
Consider the system of equations
xy+x+y=71
x2y+xy2=880
where
x
y
x>y
Solving this normally is not very difficult, but it may get a little tedious. However, we can rewrite the second equation as
xy(x+y)=880
s=x+y
t=xy
s+t=71,st=880
(s,t)=(16,55)
(s,t)=(55,16)
x+y=16,xy=55,x>y
(x,y)=(11,5).
x+y=55,xy=16,x>y
(x,y)=(11,5)
Let
A
B
\Phi:A → B
Cr
\Phi
r
A
B
r
B
A
r
infty
\omega
The map
\Phi
Cr
\Phi
x=\Phi(y)
x
y
\Phi
y
x
Some systems can be more easily solved when switching to polar coordinates. Consider for example the equation
U(x,y):=(x2+y2)\sqrt{1-
x2 | |
x2+y2 |
}=0.
This may be a potential energy function for some physical problem. If one does not immediately see a solution, one might try the substitution
\displaystyle(x,y)=\Phi(r,\theta)
\displaystyle\Phi(r,\theta)=(r\cos(\theta),r\sin(\theta)).
Note that if
\theta
2\pi
[0,2\pi]
\Phi
\Phi
(0,infty] x [0,2\pi)
r=0
\Phi
\theta
\Phi
\sin2x+\cos2x=1
V(r,\theta)=r2\sqrt{1-
r2\cos2\theta | |
r2 |
}=r2\sqrt{1-\cos2\theta}=r2\left|\sin\theta\right|.
Now the solutions can be readily found:
\sin(\theta)=0
\theta=0
\theta=\pi
\Phi
y=0
x\not=0
y=0
Note that, had we allowed
r=0
\Phi
x,y\in\reals
See main article: Chain rule.
The chain rule is used to simplify complicated differentiation. For example, consider the problem of calculating the derivative
d | |
dx |
\sin(x2).
Let
y=\sinu
u=x2.
\begin{align} | d |
dx |
\sin(x2)&=
dy | |
dx |
\\[6pt] &=
dy | |
du |
du | |
dx |
&&Thispartisthechainrule.\\[6pt] &=\left(
d | |
du |
\sinu\right)\left(
d | |
dx |
x2\right)\\[6pt] &=(\cosu)(2x)\\ &=\left(\cos(x2)\right)(2x)\\ &=2x\cos(x2) \end{align}
See main article: Integration by substitution. Difficult integrals may often be evaluated by changing variables; this is enabled by the substitution rule and is analogous to the use of the chain rule above. Difficult integrals may also be solved by simplifying the integral using a change of variables given by the corresponding Jacobian matrix and determinant.[1] Using the Jacobian determinant and the corresponding change of variable that it gives is the basis of coordinate systems such as polar, cylindrical, and spherical coordinate systems.
The following theorem allows us to relate integrals with respect to Lebesgue measure to an equivalent integral with respect to the pullback measure under a parameterization G.[2] The proof is due to approximations of the Jordan content.
Suppose thatAs a corollary of this theorem, we may compute the Radon–Nikodym derivatives of both the pullback and pushforward measures ofis an open subset of\Omega
andRn
is aG:\Omega\toRn
diffeomorphism.C1
- If
is a Lebesgue measurable function onf
, thenG(\Omega)
is Lebesgue measurable onf\circG
. If\Omega
orf\geq0
thenf\inL1(G(\Omega),m),
.\intG(\Omega)f(x)dx=\int\Omegaf\circG(x)|detDxG|dx
- If
andE\subset\Omega
is Lebesgue measurable, thenE
is Lebesgue measurable, thenG(E)
.m(G(E))=\intE|detDxG|dx
m
T
The pullback measure in terms of a transformation
T
T*\mu:=\mu(T(A))
\intT(\Omega)gd\mu=\int\Omegag\circTdT*\mu
Pushforward measure and transformation formula
The pushforward measure in terms of a transformation
T
T*\mu:=\mu(T-1(A))
\int\Omegag\circTd\mu=\intT(\Omega)gdT*\mu
As a corollary of the change of variables formula for Lebesgue measure, we have that
dT*m | |
dm |
(x)=|detDxT|
dT*m | |
dm |
(x)=
-1 | |
|detD | |
xT |
|
From which we may obtain
\intT(\Omega)gdm=\int\Omegag\circTdT*m=\int\Omegag\circT|detDxT|dm(x)
\int\Omegagdm=\intT(\Omega)g\circT-1dT*m=\intT(\Omega)g\circT-1
-1 | |
|detD | |
xT |
|dm(x)
Variable changes for differentiation and integration are taught in elementary calculus and the steps are rarely carried out in full.
The very broad use of variable changes is apparent when considering differential equations, where the independent variables may be changed using the chain rule or the dependent variables are changed resulting in some differentiation to be carried out. Exotic changes, such as the mingling of dependent and independent variables in point and contact transformations, can be very complicated but allow much freedom.
Very often, a general form for a change is substituted into a problem and parameters picked along the way to best simplify the problem.
Probably the simplest change is the scaling and shifting of variables, that is replacing them with new variables that are "stretched" and "moved" by constant amounts. This is very common in practical applications to get physical parameters out of problems. For an nth order derivative, the change simply results in
dny | |
dxn |
=
yscale | ||||||
|
dn\haty | |
d\hatxn |
where
x=\hatxxscale+xshift
y=\hatyyscale+yshift.
This may be shown readily through the chain rule and linearity of differentiation. This change is very common in practical applications to get physical parameters out of problems, for example, the boundary value problem
\mu
d2u | |
dy2 |
=
dp | |
dx |
; u(0)=u(L)=0
describes parallel fluid flow between flat solid walls separated by a distance δ; μ is the viscosity and
dp/dx
d2\hatu | |
d\haty2 |
=1 ; \hatu(0)=\hatu(1)=0
where
y=\hatyL and u=\hatu
L2 | |
\mu |
dp | |
dx |
.
Scaling is useful for many reasons. It simplifies analysis both by reducing the number of parameters and by simply making the problem neater. Proper scaling may normalize variables, that is make them have a sensible unitless range such as 0 to 1. Finally, if a problem mandates numeric solution, the fewer the parameters the fewer the number of computations.
Consider a system of equations
\begin{align} m
v |
&=-
\partialH | |
\partialx |
\\[5pt] m
x |
&=
\partialH | |
\partialv |
\end{align}
for a given function
H(x,v)
\Phi(p)=1/m ⋅ p
R
R
v=\Phi(p)
\begin{align} p |
&=-
\partialH | \\[5pt] | |
\partialx |
x |
&=
\partialH | |
\partialp |
\end{align}
See main article: Lagrangian mechanics. Given a force field
\varphi(t,x,v)
m\ddotx=\varphi(t,x,v).
x=\Psi(t,y)
v=
\partial\Psi(t,y) | |
\partialt |
+
\partial\Psi(t,y) | |
\partialy |
⋅ w.
He found that the equations
\partial{L | |
}{ |
\partialy}=
d | |
dt |
\partial{L | |
L=T-V
In fact, when the substitution is chosen well (exploiting for example symmetries and constraints of the system) these equations are much easier to solve than Newton's equations in Cartesian coordinates.