Submodular set function explained

In mathematics, a submodular set function (also known as a submodular function) is a set function that, informally, describes the relationship between a set of inputs and an output, where adding more of one input has a decreasing additional benefit (diminishing returns). The natural diminishing returns property makes submodular functions suitable for many applications, including approximation algorithms, game theory (as functions modeling user preferences) and electrical networks. Recently, submodular functions have also found utility in several real-world problems in machine learning and artificial intelligence, including automatic summarization, multi-document summarization, feature selection, active learning, sensor placement, image collection summarization and many other domains.

Definition

If

\Omega

is a finite set, a submodular function is a set function

f:2^\Omega\to\mathbb{R}

, where

2^\Omega

denotes the power set of

\Omega

, which satisfies one of the following equivalent conditions.

For every

X,Y\subseteq\Omega

with

X\subseteq Y

and every

x\in\Omega\setminus Y

we have that

f(X\cup\{x\})-f(X)\geq f(Y\cup\{x\})-f(Y)

For every

S,T\subseteq\Omega

we have that

f(S)+f(T)\geq f(S\cup T)+f(S\cap T)

For every

X\subseteq\Omega

and

x_1,x_2\in\Omega\setminus X

such that

x_1\neq x_2

we have that

f(X\cup\{x_1\})+f(X\cup\{x_2\})\geq f(X\cup\{x_1,x_2\})+f(X)
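On a small ground set the conditions can be tested by exhaustive enumeration. The following Python sketch checks the first (diminishing returns) and second (union and intersection) conditions; the helper names and the two test functions are illustrative assumptions, and the running time is exponential in the size of the ground set, so it is only meant for toy instances.

```python
from itertools import combinations

def powerset(ground):
    """Return every subset of `ground` as a frozenset."""
    items = sorted(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def satisfies_diminishing_returns(f, ground, tol=1e-12):
    """First condition: for X <= Y and x outside Y, the gain at X is at least the gain at Y."""
    ground = frozenset(ground)
    subsets = powerset(ground)
    for Y in subsets:
        for X in subsets:
            if not X <= Y:
                continue
            for x in ground - Y:
                gain_at_X = f(X | {x}) - f(X)
                gain_at_Y = f(Y | {x}) - f(Y)
                if gain_at_X + tol < gain_at_Y:
                    return False
    return True

def satisfies_union_intersection(f, ground, tol=1e-12):
    """Second condition: f(S) + f(T) >= f(S | T) + f(S & T) for all S, T."""
    subsets = powerset(ground)
    return all(f(S) + f(T) + tol >= f(S | T) + f(S & T)
               for S in subsets for T in subsets)

def root_of_size(S):
    """Concave function of |S|; submodular."""
    return len(S) ** 0.5

def square_of_size(S):
    """Convex function of |S|; marginal gains increase, so not submodular."""
    return len(S) ** 2

ground = {1, 2, 3, 4}
for f in (root_of_size, square_of_size):
    print(f.__name__,
          satisfies_diminishing_returns(f, ground),
          satisfies_union_intersection(f, ground))
```

Both checks agree on each test function, as expected from the equivalence of the conditions on a finite ground set.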

A nonnegative submodular function is also a subadditive function, but a subadditive function need not be submodular.

If

\Omega

is not assumed finite, then the above conditions are not equivalent. In particular, a function

f

defined by

f(S)=1

if

S

is finite and

f(S)=0

if

S

is infinite satisfies the first condition above, but the second condition fails when

S

and

T

are infinite sets with finite intersection.

Types and examples of submodular functions

Monotone

A set function

f

is monotone if for every

T\subseteq S

we have that

f(T)\leq f(S)

. Examples of monotone submodular functions include:

Linear (Modular) functions : Any function of the form

f(S)=\sum_{i\in S}w_i

is called a linear function. Additionally, if

\forall i,\ w_i\geq 0

then f is monotone.

Budget-additive functions : Any function of the form

f(S)=\min\left\{B,~\sum_{i\in S}w_i\right\}

for each

w_i\geq0

and

B\geq0

is called budget additive.

Coverage functions : Let

\Omega=\{E_1,E_2,\ldots,E_n\}

be a collection of subsets of some ground set

\Omega'

. The function

f(S)=\left|\bigcup_{E_i\in S}E_i\right|

for

S\subseteq\Omega

is called a coverage function. This can be generalized by adding non-negative weights to the elements.

Entropy : Let

\Omega=\{X_1,X_2,\ldots,X_n\}

be a set of random variables. Then for any

S\subseteq\Omega

we have that

H(S)

is a submodular function, where

H(S)

is the entropy of the set of random variables

S

, a fact known as Shannon's inequality.^[1] Further inequalities for the entropy function are known to hold; see entropic vector.

Matroid rank functions : Let

\Omega=\{e_1,e_2,...,e_n\}

be the ground set on which a matroid is defined. Then the rank function of the matroid is a submodular function.^[2]
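Two of the examples above, coverage functions and budget-additive functions, are easy to write down and test on a toy instance. The Python sketch below does so by brute force; the sets, weights and budget are made-up data chosen only for illustration.

```python
from itertools import combinations

# Hypothetical data: three named subsets of a ground set {1, ..., 6},
# plus item weights and a budget for the budget-additive function.
SETS = {"E1": {1, 2, 3}, "E2": {3, 4}, "E3": {4, 5, 6}}
WEIGHTS = {"a": 2.0, "b": 3.0, "c": 4.0}
BUDGET = 6.0

def coverage(S):
    """Size of the union of the chosen sets."""
    covered = set()
    for name in S:
        covered |= SETS[name]
    return len(covered)

def budget_additive(S):
    """Total weight of the chosen items, capped at BUDGET."""
    return min(BUDGET, sum(WEIGHTS[i] for i in S))

def subsets(ground):
    items = sorted(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_monotone(f, ground):
    all_subsets = subsets(ground)
    return all(f(T) <= f(S) for S in all_subsets for T in all_subsets if T <= S)

def is_submodular(f, ground):
    all_subsets = subsets(ground)
    return all(f(S) + f(T) >= f(S | T) + f(S & T)
               for S in all_subsets for T in all_subsets)

print(coverage({"E1", "E2"}), coverage(set(SETS)))
print(budget_additive({"a", "b"}), budget_additive(set(WEIGHTS)))
print(is_monotone(coverage, SETS), is_submodular(coverage, SETS))
print(is_monotone(budget_additive, WEIGHTS), is_submodular(budget_additive, WEIGHTS))
```

Here E1 and E2 overlap in element 3, so adding E2 after E1 contributes only one new element, which is the diminishing-returns effect in miniature.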

Non-monotone

A submodular function that is not monotone is called non-monotone.

Symmetric

A non-monotone submodular function

f

is called symmetric if for every

S\subseteq\Omega

we have that

f(S)=f(\Omega-S)

. Examples of symmetric non-monotone submodular functions include:

Graph cuts : Let

\Omega=\{v_1,v_2,...,v_n\}

be the vertices of a graph. For any set of vertices

S\subseteq\Omega

let

f(S)

denote the number of edges

e=(u,v)

such that

u\in S

and

v\in\Omega-S

. This can be generalized by adding non-negative weights to the edges.

Mutual information : Let

\Omega=\{X_1,X_2,\ldots,X_n\}

be a set of random variables. Then for any

S\subseteq\Omega

we have that

f(S)=I(S;\Omega-S)

is a submodular function, where

I(S;\Omega-S)

is the mutual information.
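The graph cut example can be verified directly on a small graph. The following Python sketch uses a hypothetical four-vertex path graph (an assumption made for illustration) and checks symmetry, submodularity and the failure of monotonicity by enumeration.

```python
from itertools import combinations

# Hypothetical undirected graph: the path 1 - 2 - 3 - 4.
VERTICES = frozenset({1, 2, 3, 4})
EDGES = [(1, 2), (2, 3), (3, 4)]

def cut(S):
    """Number of edges with exactly one endpoint in S."""
    return sum(1 for u, v in EDGES if (u in S) != (v in S))

SUBSETS = [frozenset(c) for r in range(len(VERTICES) + 1)
           for c in combinations(sorted(VERTICES), r)]

symmetric = all(cut(S) == cut(VERTICES - S) for S in SUBSETS)
submodular = all(cut(S) + cut(T) >= cut(S | T) + cut(S & T)
                 for S in SUBSETS for T in SUBSETS)
monotone = all(cut(T) <= cut(S) for S in SUBSETS for T in SUBSETS if T <= S)

print(symmetric, submodular, monotone)
print(cut({2}), cut(VERTICES))
```

The single vertex 2 cuts two edges while the full vertex set cuts none, so the value can drop as the set grows.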

Asymmetric

A non-monotone submodular function which is not symmetric is called asymmetric.

Directed cuts : Let

\Omega=\{v_1,v_2,...,v_n\}

be the vertices of a directed graph. For any set of vertices

S\subseteq\Omega

let

f(S)

denote the number of edges

e=(u,v)

such that

u\in S

and

v\in\Omega-S

. This can be generalized by adding non-negative weights to the directed edges.
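A directed cut can be checked the same way. In the Python sketch below, the three-vertex directed path is a hypothetical instance chosen for illustration; it shows that the function is submodular but neither symmetric nor monotone.

```python
from itertools import combinations

# Hypothetical directed graph: the directed path 1 -> 2 -> 3.
VERTICES = frozenset({1, 2, 3})
ARCS = [(1, 2), (2, 3)]

def directed_cut(S):
    """Number of arcs (u, v) with u in S and v outside S."""
    return sum(1 for u, v in ARCS if u in S and v not in S)

SUBSETS = [frozenset(c) for r in range(len(VERTICES) + 1)
           for c in combinations(sorted(VERTICES), r)]

submodular = all(directed_cut(S) + directed_cut(T)
                 >= directed_cut(S | T) + directed_cut(S & T)
                 for S in SUBSETS for T in SUBSETS)
symmetric = all(directed_cut(S) == directed_cut(VERTICES - S) for S in SUBSETS)
monotone = all(directed_cut(T) <= directed_cut(S)
               for S in SUBSETS for T in SUBSETS if T <= S)

print(submodular, symmetric, monotone)
print(directed_cut({1}), directed_cut({2, 3}))
```

The set {1} has one outgoing arc while its complement {2, 3} has none, which is what breaks symmetry.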

Continuous extensions of submodular set functions

Often, given a submodular set function that describes the values of various sets, we need to compute the values of fractional sets. For example: we know that the value of receiving house A and house B is V, and we want to know the value of receiving 40% of house A and 60% of house B. To this end, we need a continuous extension of the submodular set function.

Formally, a set function

f:2^\Omega\to\mathbb{R}

with

|\Omega|=n

can be represented as a function on

\{0,1\}^n

, by associating each

S\subseteq\Omega

with a binary vector

x^S\in\{0,1\}^n

such that

x^S_i=1

when

i\in S

, and

x^S_i=0

otherwise. A continuous extension of

f

is a continuous function

F:[0,1]^n\to\mathbb{R}

, that matches the value of

f

on

x\in\{0,1\}^n

, i.e.

F(x^S)=f(S)

.

Several kinds of continuous extensions of submodular functions are commonly used, which are described below.

Lovász extension

This extension is named after mathematician László Lovász. Consider any vector

x=\{x_1,x_2,...,x_n\}

such that each

0\leq x_i\leq 1

. Then the Lovász extension is defined as

f^L(x)=\mathbb{E}(f(\{i\mid x_i\geq\lambda\}))

where the expectation is over

\lambda

chosen from the uniform distribution on the interval

[0,1]

. The Lovász extension is a convex function if and only if

f

is a submodular function.
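Because the threshold is uniform on [0,1], the expectation reduces to a finite sum: sorting the coordinates in decreasing order, the level set is constant on each interval between consecutive coordinate values. The Python sketch below evaluates the Lovász extension this way; representing the vector as a dict and the two test functions are assumptions made for illustration.

```python
def lovasz_extension(f, x):
    """Exact E[f({i : x_i >= lam})] for lam uniform on [0, 1].

    `x` is a dict mapping each ground-set element to a value in [0, 1].
    """
    order = sorted(x, key=x.get, reverse=True)        # elements by decreasing x_i
    levels = [1.0] + [x[i] for i in order] + [0.0]    # interval endpoints for lam
    value = 0.0
    for k in range(len(order) + 1):
        # For lam in (levels[k + 1], levels[k]] the level set is the top-k elements.
        value += (levels[k] - levels[k + 1]) * f(frozenset(order[:k]))
    return value

def capped(S):
    """min(|S|, 1): submodular."""
    return min(len(S), 1)

def squared(S):
    """|S|^2: not submodular."""
    return len(S) ** 2

a, b = {1: 1.0, 2: 0.0}, {1: 0.0, 2: 1.0}   # indicator vectors of {1} and {2}
mid = {1: 0.5, 2: 0.5}                      # their midpoint

for f in (capped, squared):
    at_mid = lovasz_extension(f, mid)
    average = 0.5 * (lovasz_extension(f, a) + lovasz_extension(f, b))
    print(f.__name__, at_mid, average, at_mid <= average)
```

For the submodular function the value at the midpoint does not exceed the average of the endpoint values, as convexity requires, while the non-submodular function violates this.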

Multilinear extension

Consider any vector

x=\{x_1,x_2,\ldots,x_n\}

such that each

0\leq x_i\leq 1

. Then the multilinear extension is defined as ^[3] ^[4]

F(x)=\sum_{S\subseteq\Omega}f(S)\prod_{i\in S}x_i\prod_{i\notin S}(1-x_i)

Intuitively, x_i represents the probability that item i is chosen for the set. For every set S, the two inner products represent the probability that the chosen set is exactly S. Therefore, the sum represents the expected value of f for the set formed by choosing each item i at random with probability x_i, independently of the other items.
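The probabilistic reading suggests two ways of evaluating the multilinear extension: summing over all subsets exactly, or estimating the expectation by sampling. The Python sketch below shows both under illustrative assumptions (a dict for the vector, a fixed random seed, and a toy function); the exact sum has exponentially many terms, so in practice sampling is the usual route.

```python
import random
from itertools import combinations

def multilinear_extension(f, x):
    """Exact sum over all S of f(S) * prod_{i in S} x_i * prod_{i not in S} (1 - x_i)."""
    items = list(x)
    total = 0.0
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            S = frozenset(combo)
            prob = 1.0
            for i in items:
                prob *= x[i] if i in S else 1.0 - x[i]
            total += prob * f(S)
    return total

def sampled_estimate(f, x, samples=20000, seed=0):
    """Monte Carlo estimate: include each i independently with probability x_i."""
    rng = random.Random(seed)
    acc = 0.0
    for _ in range(samples):
        S = frozenset(i for i in x if rng.random() < x[i])
        acc += f(S)
    return acc / samples

def capped(S):
    """min(|S|, 1): a small submodular test function."""
    return min(len(S), 1)

x = {1: 0.5, 2: 0.5}
print(multilinear_extension(capped, x))   # exact value
print(sampled_estimate(capped, x))        # close to the exact value
```

With both coordinates at one half, each of the four subsets has probability one quarter, and three of them have value one, so the exact value is 3/4.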

Convex closure

Consider any vector

x=\{x_1,x_2,...,x_n\}

such that each

0\leq x_i\leq 1

. Then the convex closure is defined as

f^-(x)=\min\left(\sum_S\alpha_S f(S):\sum_S\alpha_S 1_S=x,\sum_S\alpha_S=1,\alpha_S\geq 0\right)

The convex closure of any set function is convex over

[0,1]^n

.

Concave closure

Consider any vector

x=\{x_1,x_2,...,x_n\}

such that each

0\leq x_i\leq 1

. Then the concave closure is defined as

f^+(x)=\max\left(\sum_S\alpha_S f(S):\sum_S\alpha_S 1_S=x,\sum_S\alpha_S=1,\alpha_S\geq 0\right)

Relations between continuous extensions

For the extensions discussed above, it can be shown that

f^+(x)\geq F(x)\geq f^-(x)=f^L(x)

when

f

is submodular.
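As a sanity check, the chain of inequalities can be worked out by hand on a two-element ground set. The instance below (the function min(|S|, 1) evaluated at the point (1/2, 1/2)) is chosen only for illustration.

```latex
% Ground set \Omega = \{1,2\} and f(S) = \min(|S|, 1), so that
% f(\varnothing) = 0 and f(\{1\}) = f(\{2\}) = f(\{1,2\}) = 1 (a submodular function).
% Every extension is evaluated at x = (1/2, 1/2).
%
% For the closures: the constraints force \alpha_\varnothing = \alpha_{\{1,2\}} = d
% with 0 \le d \le 1/2, and the objective equals 1 - d, so the minimum is
% attained at d = 1/2 and the maximum at d = 0.
\begin{align*}
f^L(x) &= \Pr[\lambda \le \tfrac12]\, f(\{1,2\}) + \Pr[\lambda > \tfrac12]\, f(\varnothing)
        = \tfrac12 \cdot 1 + \tfrac12 \cdot 0 = \tfrac12, \\
F(x)   &= \tfrac14 f(\varnothing) + \tfrac14 f(\{1\}) + \tfrac14 f(\{2\}) + \tfrac14 f(\{1,2\})
        = \tfrac34, \\
f^-(x) &= \tfrac12 f(\varnothing) + \tfrac12 f(\{1,2\}) = \tfrac12, \\
f^+(x) &= \tfrac12 f(\{1\}) + \tfrac12 f(\{2\}) = 1.
\end{align*}
% Hence f^+(x) = 1 \ge F(x) = 3/4 \ge f^-(x) = 1/2 = f^L(x).
```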

Properties

The class of submodular functions is closed under non-negative linear combinations. Consider any submodular functions

f_1,f_2,\ldots,f_k

and non-negative numbers

\alpha_1,\alpha_2,\ldots,\alpha_k

. Then the function

g

defined by

g(S)=\sum_{i=1}^k\alpha_i f_i(S)

is submodular.

For any submodular function

f

, the function defined by

g(S)=f(\Omega\setminus S)

is submodular.

The function

g(S)=\min(f(S),c)

, where

c

is a real number, is submodular whenever

f

is monotone submodular. More generally,

g(S)=h(f(S))

is submodular, for any non-decreasing concave function

h

.
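The closure properties listed so far can be spot-checked by enumeration. The Python sketch below builds a non-negative combination, a complement function and a truncation from small example functions, and also shows that truncation can fail when the underlying function is not monotone; all functions and constants are illustrative assumptions.

```python
from itertools import combinations

GROUND = frozenset({1, 2, 3, 4})
SUBSETS = [frozenset(c) for r in range(len(GROUND) + 1)
           for c in combinations(sorted(GROUND), r)]
EDGES = [(1, 2), (2, 3), (3, 4), (4, 1)]   # hypothetical 4-cycle

def is_submodular(f, tol=1e-9):
    return all(f(S) + f(T) + tol >= f(S | T) + f(S & T)
               for S in SUBSETS for T in SUBSETS)

def cut(S):
    """Cut function of the 4-cycle: submodular, not monotone."""
    return sum(1 for u, v in EDGES if (u in S) != (v in S))

def root_size(S):
    """sqrt(|S|): monotone submodular (concave, non-decreasing in |S|)."""
    return len(S) ** 0.5

def signed(S):
    """1 + [1 in S] - [2 in S]: modular, hence submodular, but not monotone."""
    return 1 + (1 in S) - (2 in S)

def combination(S):
    return 2.0 * cut(S) + 0.5 * root_size(S)   # non-negative linear combination

def mirrored(S):
    return root_size(GROUND - S)               # g(S) = f(ground set minus S)

def truncated(S):
    return min(root_size(S), 1.5)              # min(f, c) with f monotone

def bad_truncation(S):
    return min(signed(S), 1)                   # min(f, c) with f not monotone

for g in (combination, mirrored, truncated, bad_truncation):
    print(g.__name__, is_submodular(g))
```

The last function fails on S = {1} and T = {2}: the two values sum to 1, while the values on the union and the intersection sum to 2.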

Consider a random process where a set

T

is chosen with each element in

\Omega

being included in

T

independently with probability

p

. Then the following inequality is true

\mathbb{E}[f(T)]\geq pf(\Omega)+(1-p)f(\varnothing)

where

\varnothing

is the empty set. More generally consider the following random process where a set

S

is constructed as follows. For each of

1\leq i\leq l,A_i\subseteq\Omega

construct

S_i

by including each element in

A_i

independently into

S_i

with probability

p_i

. Furthermore let

S=\cup_{i=1}^l S_i

. Then the following inequality is true

\mathbb{E}[f(S)]\geq\sum_{R\subseteq[l]}\prod_{i\in R}p_i\prod_{i\notin R}(1-p_i)f(\cup_{i\in R}A_i)
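The simpler of the two sampling inequalities, in which every element of the ground set is kept independently with the same probability p, can be checked exactly on a small ground set by summing over all subsets. The Python sketch below does this for two example functions; the functions and the grid of probabilities are assumptions made for illustration.

```python
from itertools import combinations

GROUND = frozenset({1, 2, 3, 4})
EDGES = [(1, 2), (2, 3), (3, 4), (4, 1)]   # hypothetical 4-cycle

def cut(S):
    """Cut function of the 4-cycle (submodular, non-monotone)."""
    return sum(1 for u, v in EDGES if (u in S) != (v in S))

def capped(S):
    """min(|S|, 2) (monotone submodular)."""
    return min(len(S), 2)

def expected_value(f, ground, p):
    """Exact E[f(T)] when each element joins T independently with probability p."""
    items = sorted(ground)
    n = len(items)
    total = 0.0
    for r in range(n + 1):
        weight = p ** r * (1 - p) ** (n - r)   # probability of one fixed set of size r
        for combo in combinations(items, r):
            total += weight * f(frozenset(combo))
    return total

def lower_bound(f, ground, p):
    """Right-hand side: p * f(ground set) + (1 - p) * f(empty set)."""
    return p * f(frozenset(ground)) + (1 - p) * f(frozenset())

for f in (cut, capped):
    holds = all(expected_value(f, GROUND, i / 10) + 1e-12
                >= lower_bound(f, GROUND, i / 10) for i in range(11))
    print(f.__name__, expected_value(f, GROUND, 0.5), lower_bound(f, GROUND, 0.5), holds)
```

For the cut function both the full set and the empty set have value zero, so the bound is trivial there; the capped function gives a non-trivial comparison.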

Optimization problems

Submodular functions have properties which are very similar to convex and concave functions. For this reason, an optimization problem which concerns optimizing a convex or concave function can also be described as the problem of maximizing or minimizing a submodular function subject to some constraints.

Submodular set function minimization

The hardness of minimizing a submodular set function depends on constraints imposed on the problem.

The unconstrained problem of minimizing a submodular function is computable in polynomial time, and even in strongly-polynomial time. Computing the minimum cut in a graph is a special case of this minimization problem.
The problem of minimizing a submodular function with a cardinality lower bound is NP-hard, with polynomial factor lower bounds on the approximation factor.

Submodular set function maximization

Unlike the case of minimization, maximizing a generic submodular function is NP-hard even in the unconstrained setting. Thus, most work in this field concerns polynomial-time approximation algorithms, including greedy algorithms and local search algorithms.

The problem of maximizing a non-negative submodular function admits a 1/2 approximation algorithm. Computing the maximum cut of a graph is a special case of this problem.
The problem of maximizing a monotone submodular function subject to a cardinality constraint admits a

1-1/e

approximation algorithm.^[5] The maximum coverage problem is a special case of this problem.

The problem of maximizing a monotone submodular function subject to a matroid constraint (which subsumes the case above) also admits a

1-1/e

approximation algorithm.
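For the cardinality-constrained case, the classical greedy algorithm repeatedly adds the element with the largest marginal gain, and for monotone submodular functions it attains the 1-1/e guarantee mentioned above. The Python sketch below applies it to a small maximum coverage instance and compares the result with a brute-force optimum; the instance and helper names are hypothetical, and the brute-force search is included only for comparison on toy inputs.

```python
from itertools import combinations
from math import e

# Hypothetical maximum coverage instance.
SETS = {"A": {1, 2, 3, 4}, "B": {1, 2, 5}, "C": {3, 4, 6}}

def coverage(S):
    """Number of elements covered by the chosen sets."""
    covered = set()
    for name in S:
        covered |= SETS[name]
    return len(covered)

def greedy_max(f, ground, k):
    """Add up to k elements, each time taking the largest marginal gain."""
    chosen = frozenset()
    for _ in range(k):
        candidates = [c for c in sorted(ground) if c not in chosen]
        if not candidates:
            break
        best = max(candidates, key=lambda c: f(chosen | {c}) - f(chosen))
        if f(chosen | {best}) - f(chosen) <= 0:
            break   # stop early when nothing adds value
        chosen = chosen | {best}
    return chosen

def brute_force_max(f, ground, k):
    """Best value over all subsets of size at most k (exponential time)."""
    items = sorted(ground)
    return max(f(frozenset(c)) for r in range(k + 1)
               for c in combinations(items, r))

picked = greedy_max(coverage, SETS, 2)
optimum = brute_force_max(coverage, SETS, 2)
print(sorted(picked), coverage(picked), optimum)
print(coverage(picked) >= (1 - 1 / e) * optimum)
```

On this instance greedy first takes A, the largest set, and ends with 5 covered elements, whereas B and C together cover all 6; the greedy value is still well within the guaranteed factor.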

Many of these algorithms can be unified within a semi-differential based framework of algorithms.

Applications

Submodular functions naturally occur in several real world applications, in economics, game theory, machine learning and computer vision. Owing to the diminishing returns property, submodular functions naturally model costs of items, since there is often a larger discount, with an increase in the items one buys. Submodular functions model notions of complexity, similarity and cooperation when they appear in minimization problems. In maximization problems, on the other hand, they model notions of diversity, information and coverage.

External links

http://www.cs.berkeley.edu/~stefje/references.html has a longer bibliography
http://submodularity.org/ includes further material on the subject

Notes and References

[1] "Information Processing and Learning". cmu.
[2] Fujishige (2005), p. 22.
[3] Vondrák, Jan (2008-05-17). "Optimal approximation for the submodular welfare problem in the value oracle model". Proceedings of the fortieth annual ACM symposium on Theory of computing. STOC '08. New York, NY, USA: Association for Computing Machinery. pp. 67–74. doi:10.1145/1374376.1374389. ISBN 978-1-60558-047-0. S2CID 170510.
[4] Calinescu, Gruia; Chekuri, Chandra; Pál, Martin; Vondrák, Jan (January 2011). "Maximizing a Monotone Submodular Function Subject to a Matroid Constraint". SIAM Journal on Computing. 40 (6): 1740–1766. doi:10.1137/080733991. ISSN 0097-5397.
[5] Williamson, David P. "Bridging Continuous and Discrete Optimization: Lecture 23".
