Matroid oracle explained

In mathematics and computer science, a matroid oracle is a subroutine through which an algorithm may access a matroid, an abstract combinatorial structure that can be used to describe the linear dependencies between vectors in a vector space or the spanning trees of a graph, among other applications.

The most commonly used oracle of this type is an independence oracle, a subroutine for testing whether a set of matroid elements is independent. Several other types of oracle have also been used; some of them have been shown to be weaker than independence oracles, some stronger, and some equivalent in computational power.[1]

Many algorithms that perform computations on matroids have been designed to take an oracle as input, allowing them to run efficiently without change on many different kinds of matroids, and without additional assumptions about what kind of matroid they are using. For instance, given an independence oracle for any matroid, it is possible to find the minimum weight basis of the matroid by applying a greedy algorithm that adds elements to the basis in sorted order by weight, using the independence oracle to test whether each element can be added.[2]

In computational complexity theory, the oracle model has led to unconditional lower bounds proving that certain matroid problems cannot be solved in polynomial time, without invoking unproved assumptions such as the assumption that P ≠ NP. Problems that have been shown to be hard in this way include testing whether a matroid is binary or uniform, or testing whether it contains certain fixed minors.[3]

Why oracles?

Although some authors have experimented with computer representations of matroids that explicitly list all independent sets or all basis sets of the matroid,[4] these representations are not succinct: a matroid with

\scriptstylen

elements may expand into a representation that takes space exponential in

\scriptstylen

. Indeed, the number of distinct matroids on

\scriptstylen

elements grows doubly exponentially as
2nn-3/2+o(1)
2
[5] from which it follows that any explicit representation capable of handling all possible matroids would necessarily use exponential space.[6]

Instead, different types of matroids may be represented more efficiently from the other structures from which they are defined: uniform matroids from their two numeric parameters, graphic matroids, bicircular matroids, and gammoids from graphs, linear matroids from matrices, etc. However, an algorithm for performing computations on arbitrary matroids needs a uniform method of accessing its argument, rather than having to be redesigned for each of these matroid classes. The oracle model provides a convenient way of codifying and classifying the kinds of access that an algorithm might need.

History

Starting with, "independence functions" or "

\scriptstyleI

-functions" have been studied as one of many equivalent ways of axiomatizing matroids. An independence function maps a set of matroid elements to the number

\scriptstyle1

if the set is independent or

\scriptstyle0

if it is dependent; that is, it is the indicator function of the family of independent sets, essentially the same thing as an independence oracle.[7]

Matroid oracles have also been part of the earliest algorithmic work on matroids. Thus,, in studying matroid partition problems, assumed that the access to the given matroid was through a subroutine that takes as input an independent set

\scriptstyleI

and an element

\scriptstylex

, and either returns a circuit in

\scriptstyleI\cup\{x\}

(necessarily unique and containing

\scriptstylex

, if it exists) or determines that no such circuit exists. used a subroutine that tests whether a given set is independent (that is, in more modern terminology, an independence oracle), and observed that the information it provides is sufficient to find the minimum weight basis in polynomial time.

Beginning from the work of and, researchers began studying oracles from the point of view of proving lower bounds on algorithms for matroids and related structures. These two papers by Hausmann and Korte both concerned the problem of finding a maximum cardinality independent set, which is easy for matroids but (as they showed) harder to approximate or compute exactly for more general independence systems represented by an independence oracle. This work kicked off a flurry of papers in the late 1970s and early 1980s showing similar hardness results for problems on matroids[8] and comparing the power of different kinds of matroid oracles.[9]

Since that time, the independence oracle has become standard for most research on matroid algorithms.[10] There has also been continued research on lower bounds,[11] and comparisons of different types of oracle.[12]

Types of oracles

The following types of matroid oracles have been considered.

\scriptstyle+infty

if the given set is independent).[14]

\scriptstylex

of the matroid takes as its input a set of matroid elements, and returns as output a Boolean value, true if the given set contains a circuit that includes

\scriptstylex

and false otherwise.[15]

Relative power of different oracles

Although there are many known types of oracles, the choice of which to use can be simplified, because many of them are equivalent in computational power. An oracle

\scriptstyleX

is said to be polynomially reducible to another oracle

\scriptstyleY

if any call to

\scriptstyleX

may be simulated by an algorithm that accesses the matroid using only oracle

\scriptstyleY

and takes polynomial time as measured in terms of the number of elements of the matroid; in complexity-theoretic terms, this is a Turing reduction. Two oracles are said to be polynomially equivalent if they are polynomially reducible to each other. If

\scriptstyleX

and

\scriptstyleY

are polynomially equivalent, then every result that proves the existence or nonexistence of a polynomial time algorithm for a matroid problem using oracle

\scriptstyleX

also proves the same thing for oracle

\scriptstyleY

.

For instance, the independence oracle is polynomially equivalent to the circuit-finding oracle of . If a circuit-finding oracle is available, a set may be tested for independence using at most

\scriptstylen

calls to the oracle by starting from an empty set, adding elements of the given set one element at a time, and using the circuit-finding oracle to test whether each addition preserves the independence of the set that has been constructed so far. In the other direction, if an independence oracle is available, the circuit in a set

\scriptstyleI\cup\{x\}

may be found using at most

\scriptstylen

calls to the oracle by testing, for each element

\scriptstyley\inI

, whether

\scriptstyleI\setminus\{y\}\cup\{x\}

is independent and returning the elements for which the answer is no. The independence oracle is also polynomially equivalent to the rank oracle, the spanning oracle, the first two types of closure oracle, and the port oracle.[1]

The basis oracle, the circuit oracle, and the oracle that tests whether a given set is closed are all weaker than the independence oracle: they can be simulated in polynomial time by an algorithm that accesses the matroid using an independence oracle, but not vice versa. Additionally, none of these three oracles can simulate each other within polynomial time. The girth oracle is stronger than the independence oracle, in the same sense.[9]

As well as polynomial time Turing reductions, other types of reducibility have been considered as well. In particular, showed that,in parallel algorithms, the rank and independence oracles are significantly different in computational power. The rank oracle allows the construction of a minimum weight basis by

\scriptstylen

simultaneous queries, of the prefixes of the sorted order of the matroid elements: an element belongs to the optimal basis if and only if the rank of its prefix differs from the rank of the previous prefix. In contrast, finding a minimum basis with an independence oracle is much slower: it can be solved deterministically in

\scriptstyleO(\sqrtn)

time steps, and there is a lower bound of

\scriptstyle\Omega((n/logn)1/3)

even for randomized parallel algorithms.

Algorithms

Many problems on matroids are known to be solvable in polynomial time, by algorithms that access the matroid only through an independence oracle or another oracle of equivalent power, without need of any additional assumptions about what kind of matroid has been given to them. These polynomially-solvable problems include:

\scriptstylek

-connected (in the sense of) for

\scriptstylek\le3

.[17]

\scriptstylen

elements and rank

\scriptstyler

, with the additional assumption that the number of bases is within a polynomial factor of the number of

\scriptstyler

-element sets.[22]

Impossibility proofs

For many matroid problems, it is possible to show that an independence oracle does not provide enough power to allow the problem to be solved in polynomial time. The main idea of these proofs is to find two matroids

\scriptstyleM

and

\scriptstyleM'

on which the answer to the problem differs and which are difficult for an algorithm to tell apart. In particular, if

\scriptstyleM

has a high degree of symmetry, and differs from

\scriptstyleM'

only in the answers to a small number of queries, then it may take a very large number of queries for an algorithm to be sure of distinguishing an input of type

\scriptstyleM

from an input formed by using one of the symmetries of

\scriptstyleM

to permute

\scriptstyleM'

.[3]

A simple example of this approach can be used to show that it is difficult to test whether a matroid is uniform. For simplicity of exposition, let

\scriptstylen

be even, let

\scriptstyleM

be the uniform matroid

\scriptstyle

n/2
U{}
n
, and let

\scriptstyleM'

be a matroid formed from

\scriptstyleM

by making a single one of the

\scriptstylen/2

-element basis sets of

\scriptstyleM

dependent instead of independent. In order for an algorithm to correctly test whether its input is uniform, it must be able to distinguish

\scriptstyleM

from every possible permutation of

\scriptstyleM'

. But in order for a deterministic algorithm to do so, it must test every one of the

\scriptstylen/2

-element subsets of the elements: if it missed one set, it could be fooled by an oracle that chose that same set as the one to make dependent. Therefore, testing for whether a matroid is uniform may require
\binom{n}{n/2}=\Omega\left(2n
\sqrtn

\right)

independence queries, much higher than polynomial. Even a randomized algorithm must make nearly as many queries in order to be confident of distinguishing these two matroids.[23]

formalize this approach by proving that, whenever there exist two matroids

\scriptstyleM

and

\scriptstyleM'

on the same set of elements but with differing problem answers, an algorithm that correctly solves the given problem on those elements must use at least
|\operatorname{aut
(M)|}{\sum

i|\operatorname{fix}(M,Qi)|}

queries, where

\scriptstyle\operatorname{aut}(M)

denotes the automorphism group of

\scriptstyleM

,

\scriptstyleQi

denotes the family of sets whose independence differs from

\scriptstyleM

to

\scriptstyleM'

, and

\scriptstyle\operatorname{fix}(M,Qi)

denotes the subgroup of automorphisms that maps

\scriptstyleQi

to itself. For instance, the automorphism group of the uniform matroid is just the symmetric group, with size

\scriptstylen!

, and in the problem of testing uniform matroids there was only one set

\scriptstyleQi

with

\scriptstyle

2
|\operatorname{fix}(M,Q
i)|=(n/2)!
, smaller by an exponential factor than

\scriptstylen!

.[24]

Problems that have been proven to be impossible for a matroid oracle algorithm to compute in polynomial time include:

\scriptstyleH

as a minor, except in the special cases that

\scriptstyleH

is uniform with rank or corank at most one. More generally, if

\scriptstylel{H}

is a fixed finite set of matroids, and there is no uniform matroid in

\scriptstylel{H}

, then it is not possible to test in polynomial time whether a given matroid contains one or more of the matroids in

\scriptstylel{H}

as a minor.[25]

Among the set of all properties of

\scriptstylen

-element matroids, the fraction of the properties that do not require exponential time to test goes to zero, in the limit, as

\scriptstylen

goes to infinity.[6]

See also

References

Notes and References

  1. ; .
  2. .
  3. .
  4. .
  5. ; ; .
  6. .
  7. For additional research on matroids based on the independence function axiomatization, see e.g.,, and .
  8. ; ; ; .
  9. .
  10. E.g. see,,,, and .
  11. .
  12. .
  13. ; .
  14. .
  15. .
  16. .
  17. . A paper claiming a similar result for any fixed constant

    \scriptstylek

    was announced by Cunningham and Edmonds at roughly the same time, but appears not to have been published., pp. 186–187, writes "Locating

    \scriptstylek

    -sums for general

    \scriptstylek\ge4

    is much more difficult ... We do not know how this can be efficiently accomplished for binary matroids, let alone for general matroids."
  18. .
  19. .
  20. .
  21. .
  22. . In contrast, it is not possible for deterministic algorithms to approximate the number of bases of a matroid accurately in polynomial time .
  23. .
  24. As well as being in, this formalization is surveyed in . In most of the applications of this technique in,

    \scriptstyleM

    is uniform, but applies the same idea to a non-uniform but highly symmetric matroid.
  25. . Results of and give special cases of this for the problems of finding a

    \scriptstyle

    2
    U{}
    4
    minor and a Vámos matroid minor, respectively. Testing whether a matroid is graphic or regular may be expressed in terms of a finite set of forbidden minors, and may be solved in polynomial time, but the forbidden minors for these problems include the uniform matroid

    \scriptstyle

    2
    U{}
    4
    , so they do not contradict this impossibility result.
  26. showed this for binary matroids, for finite fields, for arbitrary fields, and for the existence of a field over which the matroid is representable.
  27. . However, the special case of this problem for bipartite graphs can be solved in polynomial time as a matroid intersection problem.