Farkas' lemma explained

In mathematics, Farkas' lemma is a solvability theorem for a finite system of linear inequalities. It was originally proven by the Hungarian mathematician Gyula Farkas.Farkas' lemma is the key result underpinning the linear programming duality and has played a central role in the development of mathematical optimization (alternatively, mathematical programming). It is used amongst other things in the proof of the Karush–Kuhn–Tucker theorem in nonlinear programming.[1] Remarkably, in the area of the foundations of quantum theory, the lemma also underlies the complete set of Bell inequalities in the form of necessary and sufficient conditions for the existence of a local hidden-variable theory, given data from any specific set of measurements.

Generalizations of the Farkas' lemma are about the solvability theorem for convex inequalities, i.e., infinite system of linear inequalities. Farkas' lemma belongs to a class of statements called "theorems of the alternative": a theorem stating that exactly one of two systems has a solution.[2]

Statement of the lemma

There are a number of slightly different (but equivalent) formulations of the lemma in the literature. The one given here is due to Gale, Kuhn and Tucker (1951).[3] Here, the notation

x\geq0

means that all components of the vector

x

are nonnegative.

Example

Let,

A=\begin{bmatrix}6&4\\3&0\end{bmatrix},

and

b=\begin{bmatrix}b1\\b2\end{bmatrix}.

The lemma says that exactly one of the following two statements must be true (depending on and):
  1. There exist, such that and, or
  2. There exist such that,, and .

Here is a proof of the lemma in this special case:

x1=\tfrac{b2}{3}

and

x2=\tfrac{b1-2b2}{4}.

Option 2 is false, since b_1 y_1 + b_2 y_2 \ge b_2 (2y_1 + y_2) = b_2 \frac, so if the right-hand side is positive, the left-hand side must be positive too.

Geometric interpretation

C(A)

spanned by the columns of ; that is,

C(A)=\{Ax\midx\geq0\}.

Observe that

C(A)

is the set of the vectors for which the first assertion in the statement of Farkas' lemma holds. On the other hand, the vector in the second assertion is orthogonal to a hyperplane that separates and

C(A).

The lemma follows from the observation that belongs to

C(A)

if and only if there is no hyperplane that separates it from

C(A).

More precisely, let

a1,...,an\in\Rm

denote the columns of . In terms of these vectors, Farkas' lemma states that exactly one of the following two statements is true:
  1. There exist non-negative coefficients

x1,...,xn\in\R

such that

b=x1a1+...+xnan.

  1. There exists a vector

y\in\Rm

such that
\top
a
i

y\geq0

for

i=1,...,n,

and

b\topy<0.

The sums

x1a1+...+xnan

with nonnegative coefficients

x1,...,xn

form the cone spanned by the columns of . Therefore, the first statement tells that belongs to

C(A).

The second statement tells that there exists a vector such that the angle of with the vectors is at most 90°, while the angle of with the vector is more than 90°. The hyperplane normal to this vector has the vectors on one side and the vector on the other side. Hence, this hyperplane separates the cone spanned by

a1,...,an

from the vector .

For example, let,, and . The convex cone spanned by and can be seen as a wedge-shaped slice of the first quadrant in the plane. Now, suppose . Certainly, is not in the convex cone . Hence, there must be a separating hyperplane. Let . We can see that,, and . Hence, the hyperplane with normal indeed separates the convex cone from .

Logic interpretation

A particularly suggestive and easy-to-remember version is the following: if a set of linear inequalities has no solution, then a contradiction can be produced from it by linear combination with nonnegative coefficients. In formulas: if

Ax\leb

is unsolvable then

y\topA=0,

y\topb=-1,

y\ge0

has a solution.[4] Note that

y\topA

is a combination of the left-hand sides,

y\topb

a combination of the right-hand side of the inequalities. Since the positive combination produces a zero vector on the left and a −1 on the right, the contradiction is apparent.

Thus, Farkas' lemma can be viewed as a theorem of logical completeness:

Ax\leb

is a set of "axioms", the linear combinations are the "derivation rules", and the lemma says that, if the set of axioms is inconsistent, then it can be refuted using the derivation rules.[5]

Implications in complexity theory

Farkas' lemma implies that the decision problem "Given a system of linear equations, does it have a non-negative solution?" is in the intersection of NP and co-NP. This is because, according to the lemma, both a "yes" answer and a "no" answer have a proof that can be verified in polynomial time. The problems in the intersection

NP\capcoNP

are also called well-characterized problems. It is a long-standing open question whether

NP\capcoNP

is equal to P. In particular, the question of whether a system of linear equations has a non-negative solution was not known to be in P, until it was proved using the ellipsoid method.

Variants

The Farkas Lemma has several variants with different sign constraints (the first one is the original version):

Ax=b

has a solution

x\geq0,

or

A\topy\geq0

has a solution

y\in\Rm

with

b\topy<0.

Ax\leqb

has a solution

x\geq0,

or

A\topy\geq0

has a solution

y\geq0

with

b\topy<0

Ax\leqb

has a solution

x\in\Rn,

or

A\topy=0

has a solution

y\geq0

with

b\topy<0

.

Ax=b

has a solution

x\in\Rn,

or

A\topy=0

has a solution

y\in\Rm

with

b\topy0.

The latter variant is mentioned for completeness; it is not actually a "Farkas lemma" since it contains only equalities. Its proof is an exercise in linear algebra.

There are also Farkas-like lemmas for integer programs. For systems of equations, the lemma is simple:

Ax=b

has an integral solution

x\in\Zn,

or there exists

y\in\Rn

such that

A\topy

is integral and

b\topy

is not integral.For system of inequalities, the lemma is much more complicated. It is based on the following two rules of inference:
  1. Given inequalities
T
a
1

x\leqb1,\ldots,

T
a
m

x\leqbm

and coefficients

w1,\ldots,wm

, infer the inequality
m
\left(\sum
i=1

wi

T\right)
a
i

x\leq

m
\sum
i=1

wibi

.
  1. Given an inequality

a1x1++amxm\leqb

, infer the inequality

\lfloora1\rfloorx1++\lflooram\rfloorxm\leq\lfloorb\rfloor

.The lemma says that:

Ax\leqb

has an integral solution

x\in\Zn,x\geq0

, or it is possible to infer from

Ax\leqb

using finitely many applications of inference rules 1,2 the inequality

0Tx\leq-1

.The variants are summarized in the table below.
!System!Constraints on x!!Alternative system!Constraints on y

Ax=b

x\in\Rn,x\geq0

A\topy\geq0

,

b\topy<0

y\in\Rm

Ax\leqb

x\in\Rn,x\geq0

A\topy\geq0

,

b\topy<0

y\in\Rm

,

y\geq0

Ax\leqb

x\in\Rn

A\topy=0

,

b\topy<0

y\in\Rm

,

y\geq0

Ax=b

x\in\Rn

A\topy=0

,

b\topy0

y\in\Rm

Ax=b

x\in\Zn

A\topy

integral,

b\topy

not integral

y\in\Rn

Ax\leqb

x\in\Zn,x\geq0

0Tx\leq-1

can be inferred from

Ax\leqb

Generalizations

Generalized Farkas' lemma can be interpreted geometrically as follows: either a vector is in a given closed convex cone, or there exists a hyperplane separating the vector from the cone; there are no other possibilities. The closedness condition is necessary, see Separation theorem I in Hyperplane separation theorem. For original Farkas' lemma,

S

is the nonnegative orthant
n,
\R
+
hence the closedness condition holds automatically. Indeed, for polyhedral convex cone, i.e., there exists a

B\in\Rn

such that

S=\{Bx\midx\in

k
\R
+

\},

the closedness condition holds automatically. In convex optimization, various kinds of constraint qualification, e.g. Slater's condition, are responsible for closedness of the underlying convex cone

C(A).

By setting

S=\Rn

and

S*=\{0\}

in generalized Farkas' lemma, we obtain the following corollary about the solvability for a finite system of linear equalities:

Further implications

Farkas's lemma can be varied to many further theorems of alternative by simple modifications, such as Gordan's theorem: Either

Ax<0

has a solution, or

A\topy=0

has a nonzero solution with .

Common applications of Farkas' lemma include proving the strong duality theorem associated with linear programming and the Karush–Kuhn–Tucker conditions. An extension of Farkas' lemma can be used to analyze the strong duality conditions for and construct the dual of a semidefinite program. It is sufficient to prove the existence of the Karush–Kuhn–Tucker conditions using the Fredholm alternative but for the condition to be necessary, one must apply von Neumann's minimax theorem to show the equations derived by Cauchy are not violated.

See also

Further reading

Notes and References

  1. Book: Takayama, Akira . Mathematical Economics . New York . Cambridge University Press . 2nd . 1985 . 0-521-31498-4 . 48 . registration .
  2. Web site: Border. KC. 2013. Alternative Linear Inequalities. 2021-11-29.
  3. . See Lemma 1 on page 318.
  4. .
  5. Pages 81–104.