Convex conjugate explained
In mathematics and mathematical optimization, the convex conjugate of a function is a generalization of the Legendre transformation which applies to non-convex functions. It is also known as Legendre–Fenchel transformation, Fenchel transformation, or Fenchel conjugate (after Adrien-Marie Legendre and Werner Fenchel). It allows in particular for a far reaching generalization of Lagrangian duality.
Definition
Let
be a
real topological vector space and let
be the
dual space to
. Denote by
\langle ⋅ , ⋅ \rangle:X* x X\toR
the canonical dual pairing, which is defined by
\left(x*,x\right)\mapstox*(x).
For a function
f:X\toR\cup\{-infty,+infty\}
taking values on the
extended real number line, its
is the function
f*:X*\toR\cup\{-infty,+infty\}
whose value at
is defined to be the
supremum:
f*\left(x*\right):=\sup\left\{\left\langlex*,x\right\rangle-f(x)~\colon~x\inX\right\},
or, equivalently, in terms of the infimum:
f*\left(x*\right):=-inf\left\{f(x)-\left\langlex*,x\right\rangle~\colon~x\inX\right\}.
This definition can be interpreted as an encoding of the convex hull of the function's epigraph in terms of its supporting hyperplanes.[1]
Examples
For more examples, see .
f(x)=\left\langlea,x\right\rangle-b
is
is
x*=\nablaf\left(\nablaf{*
}\left(x^ \right)\right),
and moreover
f\prime\prime(x) ⋅ f{*\prime\prime}\left(x*(x)\right)=1,
f{*\prime\prime}\left(x*\right) ⋅ f\prime\prime\left(x(x*)\right)=1.
Scaling properties
If for some
g(x)=\alpha+\betax+\gamma ⋅ f\left(λx+\delta\right)
, then
g*\left(x*\right)=-\alpha-\delta
+\gamma ⋅ f*\left(
\right).
Behavior under linear transformations
Let
be a
bounded linear operator. For any convex function
on
where
(Af)(y)=inf\{f(x):x\inX,Ax=y\}
is the preimage of
with respect to
and
is the
adjoint operator of
[4] A closed convex function
is symmetric with respect to a given set
of
orthogonal linear transformations,
for all
and all
if and only if its convex conjugate
is symmetric with respect to
Table of selected convex conjugates
The following table provides Legendre transforms for many common functions as well as a few useful properties.[5]
|
|
|
|
---|
(where
) |
|
|
|
|
| f*(x*)-\langleb,x*\rangle
|
|
(where
) |
|
|
|
\alpha+\betax+\gamma ⋅ f(λx+\delta)
|
| -\alpha-\delta
\gamma ⋅ f*\left(
\right) (\gamma>0)
|
|
(where
) |
|
(where
) |
|
(where
) |
|
(where
) |
|
|
|
|
|
|
|
|
|
|
| \begin{cases}x*log(x*)-x*&ifx*>0\ 0&ifx*=0\end{cases}
|
|
|
| \begin{cases}x*log(x*)+(1-x*)log(1-x*)&if0<x*<1\ 0&ifx*=0,1\end{cases}
|
|
|
| \begin{cases}x*log(x*)-(1+x*)log(1+x*)&ifx*>0\ 0&ifx*=0\end{cases}
|
| |
See also
References
- Web site: Legendre Transform . April 14, 2019.
- Book: Phelps, Robert . Robert R. Phelps
. Robert R. Phelps . Convex Functions, Monotone Operators and Differentiability. limited . 2 . 1993. Springer . 0-387-56715-1. 42.
- 10.1137/070687542 . The Proximal Average: Basic Theory . 2008 . Bauschke . Heinz H. . Goebel . Rafal . Lucet . Yves . Wang . Xianfu . SIAM Journal on Optimization . 19 . 2 . 766. 10.1.1.546.4270 .
- Ioffe, A.D. and Tichomirov, V.M. (1979), Theorie der Extremalaufgaben. Deutscher Verlag der Wissenschaften. Satz 3.4.3
- Book: Borwein . Jonathan . Jonathan Borwein. Lewis . Adrian . Convex Analysis and Nonlinear Optimization: Theory and Examples. limited . 2 . 2006 . Springer . 978-0-387-29570-1. 50–51.
- Book: Arnol'd
, Vladimir Igorevich
. Vladimir Igorevich Arnol'd
. Vladimir Igorevich Arnol'd . Mathematical Methods of Classical Mechanics . Second . Springer . 1989 . 0-387-96890-3 . 997295 . registration .
- Book: Rockafellar
, R. Tyrell
. R. Tyrrell Rockafellar
. R. Tyrrell Rockafellar . Convex Analysis . Princeton University Press . 1970 . Princeton . 0-691-01586-4 . 0274683.
Further reading