In mathematics, the method of characteristics is a technique for solving partial differential equations. Typically, it applies to first-order equations, although more generally the method of characteristics is valid for any hyperbolic and parabolic partial differential equation. The method is to reduce a partial differential equation to a family of ordinary differential equations along which the solution can be integrated from some initial data given on a suitable hypersurface.
For a first-order PDE (partial differential equation), the method of characteristics discovers curves (called characteristic curves or just characteristics) along which the PDE becomes an ordinary differential equation (ODE). Once the ODE is found, it can be solved along the characteristic curves and transformed into a solution for the original PDE.
For the sake of simplicity, we confine our attention to the case of a function of two independent variables x and y for the moment. Consider a quasilinear PDE of the form
Suppose that a solution z is known, and consider the surface graph z = z(x,y) in R3. A normal vector to this surface is given by
\left( | \partialz | (x,y), |
\partialx |
\partialz | |
\partialy |
(x,y),-1\right).
As a result, equation is equivalent to the geometrical statement that the vector field
(a(x,y,z),b(x,y,z),c(x,y,z))
is tangent to the surface z = z(x,y) at every point, for the dot product of this vector field with the above normal vector is zero. In other words, the graph of the solution must be a union of integral curves of this vector field. These integral curves are called the characteristic curves of the original partial differential equation and are given by the Lagrange–Charpit equations
\begin{align} | dx | &=a(x,y,z),\\[8pt] |
dt |
dy | &=b(x,y,z),\\[8pt] | |
dt |
dz | |
dt |
&=c(x,y,z). \end{align}
A parametrization invariant form of the Lagrange–Charpit equations is:
dx | |
a(x,y,z) |
=
dy | |
b(x,y,z) |
=
dz | |
c(x,y,z) |
.
Consider now a PDE of the form
n | |
\sum | |
i=1 |
ai(x1,...,xn,u)
\partialu | |
\partialxi |
=c(x1,...,xn,u).
For this PDE to be linear, the coefficients ai may be functions of the spatial variables only, and independent of u. For it to be quasilinear,[1] ai may also depend on the value of the function, but not on any derivatives. The distinction between these two cases is inessential for the discussion here.
For a linear or quasilinear PDE, the characteristic curves are given parametrically by
(x1,...,xn,u)=(x1(s),...,xn(s),u(s))
u(X(s))=U(s)
such that the following system of ODEs is satisfied
Equations and give the characteristics of the PDE.
In the quasilinear case, the use of the method of characteristics is justified by Grönwall's inequality. The above equation may be written as
We must distinguish between the solutions to the ODE and the solutions to the PDE, which we do not know are equal a priori. Letting capital letters be the solutions to the ODE we find
Examining
\Delta(s)=|u(X(s))-U(s)|2
We cannot conclude the above is 0 as we would like, since the PDE only guarantees us that this relationship is satisfied for
u(x)
a(x,u) ⋅ \nablau(x)=c(x,u)
U(s)=u(X(s))
However, we can see thatsince by the PDE, the last term is 0. This equals
By the triangle inequality, we have
Assuming
a,c
C1
\Omega
X(0),U(0)
a,c
(X(s),U(s))
\Omega
s
U(0)=u(X(0))
(X(s),u(X(s)))
\Omega
s
(X(s),U(s))\in\Omega
(X(s),u(X(s)))\in\Omega
s\in[0,s0]
\|\nablau(X(s))\|\leqM
M\in\R
s\in[0,s0]
C\inR
\Delta(0)=0
\Delta(s)=0
[0,\varepsilon)
u(X(s))=U(s)
\varepsilon
U(\varepsilon)=u(X(\varepsilon))
\varepsilon
u(X(s))=U(s)
u(X(s))=U(s)
Consider the partial differential equation
where the variables pi are shorthand for the partial derivatives
pi=
\partialu | |
\partialxi |
.
Let (xi(s),u(s),pi(s)) be a curve in R2n+1. Suppose that u is any solution, and that
u(s)=u(x1(s),...,xn(s)).
Along a solution, differentiating with respect to s gives
\sumi(F
xi |
+Fu
p | |||
|
i+\sumi
F | |
pi |
p |
i=0
u |
-\sumipi
x |
i=0
\sumi(
x |
idpi-
p |
idxi)=0.
The second equation follows from applying the chain rule to a solution u, and the third follows by taking an exterior derivative of the relation
du-\sumipidxi=0
x |
i=λ
F | , | |
pi |
p |
i=-λ(F
xi |
+Fupi),
u |
=λ\sumipiF
pi |
where λ is a constant. Writing these equations more symmetrically, one obtains the Lagrange–Charpit equations for the characteristic
| =- | |||||
|
| = | |||||
|
| ||||||
|
.
Geometrically, the method of characteristics in the fully nonlinear case can be interpreted as requiring that the Monge cone of the differential equation should everywhere be tangent to the graph of the solution. The second order partial differential equation is solved with Charpit method.
As an example, consider the advection equation (this example assumes familiarity with PDE notation, and solutions to basic ODEs).
a
\partialu | |
\partialx |
+
\partialu | |
\partialt |
=0
where
a
u
x
t
d | |
ds |
u(x(s),t(s))=F(u,x(s),t(s)),
where
(x(s),t(s))
d | |
ds |
u(x(s),t(s))=
\partialu | |
\partialx |
dx | |
ds |
+
\partialu | |
\partialt |
dt | |
ds |
by the chain rule. Now, if we set
dx | |
ds |
=a
dt | |
ds |
=1
a
\partialu | |
\partialx |
+
\partialu | |
\partialt |
which is the left hand side of the PDE we started with. Thus
d | |
ds |
u=a
\partialu | |
\partialx |
+
\partialu | |
\partialt |
=0.
So, along the characteristic line
(x(s),t(s))
us=F(u,x(s),t(s))=0
u(xs,ts)=u(x0,0)
(xs,ts)
(x0,0)
dt | |
ds |
=1
t(0)=0
t=s
dx | |
ds |
=a
x(0)=x0
x=as+x0=at+x0
du | |
ds |
=0
u(0)=f(x0)
u(x(t),t)=f(x0)=f(x-at)
In this case, the characteristic lines are straight lines with slope
a
u
Let X be a differentiable manifold and P a linear differential operator
P:Cinfty(X)\toCinfty(X)
P=\sum|\alpha|\leP\alpha(x)
\partial | |
\partialx\alpha |
\sigmaP(x,\xi)=\sum|\alpha|=k
\alpha(x)\xi | |
P | |
\alpha |
where the ξi are the fiber coordinates on the cotangent bundle induced by the coordinate differentials dxi. Although this is defined using a particular coordinate system, the transformation law relating the ξi and the xi ensures that σP is a well-defined function on the cotangent bundle.
The function σP is homogeneous of degree k in the ξ variable. The zeros of σP, away from the zero section of T∗X, are the characteristics of P. A hypersurface of X defined by the equation F(x) = c is called a characteristic hypersurface at x if
\sigmaP(x,dF(x))=0.
Characteristics are also a powerful tool for gaining qualitative insight into a PDE.
One can use the crossings of the characteristics to find shock waves for potential flow in a compressible fluid. Intuitively, we can think of each characteristic line implying a solution to
u
Characteristics may fail to cover part of the domain of the PDE. This is called a rarefaction, and indicates the solution typically exists only in a weak, i.e. integral equation, sense.
The direction of the characteristic lines indicates the flow of values through the solution, as the example above demonstrates. This kind of knowledge is useful when solving PDEs numerically as it can indicate which finite difference scheme is best for the problem.