Weak component explained

In graph theory, the weak components of a directed graph partition the vertices of the graph into subsets that are totally ordered by reachability. They form the finest partition of the set of vertices that is totally ordered in this way.

Definition

The weak components were defined in a 1972 paper by Ronald Graham, Donald Knuth, and (posthumously) Theodore Motzkin, by analogy to the strongly connected components of a directed graph, which form the finest possible partition of the graph's vertices into subsets that are partially ordered by reachability. Instead, they defined the weak components to be the finest partition of the vertices into subsets that are totally ordered by

In more detail, defines the weak components through a combination of four symmetric relations on the vertices of any directed graph, denoted here as

u

and

v

of the graph,

u\Leftrightarrowv

if and only if each vertex is reachable from the other: there exist paths in the graph from

u

to

v

and from

v

The

\Leftrightarrow

relation is an equivalence relation, and its equivalence classes are used to define the strongly connected components of the graph.

u

and

v

of the graph,

u\parallelv

if and only if neither vertex is reachable from the other: there do not exist paths in the graph in either direction between

u

u

and

v

of the graph,

uv

if and only if either

u\Leftrightarrowv

That is, there may be a two-way connection between these vertices, or they may be mutually unreachable, but they may not have a one-way connection.

\asymp

is defined as the transitive closure That is,

u\asympv

when there is a sequence

u ≈ … ≈ v

of vertices, starting with

u

and ending such that each consecutive pair in the sequence is related Then

\asymp

is an equivalence relation: every vertex is related to itself by

\asymp

(because it can reach itself in both directions by paths of length zero), any two vertices that are related by

\asymp

can be swapped for each other without changing this relation (because

\asymp

is built out of the symmetric relations

\Leftrightarrow

and

\asymp

is a transitive relation (because it is a transitive closure). As with any equivalence relation, it can be used to partition the vertices of the graph into equivalence classes, subsets of the vertices such that two vertices are related by

\asymp

if and only if they belong to the same equivalence class. These equivalence classes are the weak components of the given

The original definition by Graham, Knuth, and Motzkin is equivalent but formulated somewhat differently. Given a directed they first construct another graph

\hatG

as the complement graph of the transitive closure As describes, the edges in

\hatG

represent, pairs of vertices that are not connected by a path Then, two vertices belong to the same weak component when either they belong to the same strongly connected component of

G

or As Graham, Knuth, and Motzkin show, this condition defines an equivalence the same one defined above

Corresponding to these definitions, a directed graph is called weakly connected if it has exactly one weak component. This means that its vertices cannot be partitioned into two subsets, such that all of the vertices in the first subset can reach all of the vertices in the second subset, but such that none of the vertices in the second subset can reach any of the vertices in the first subset. It differs from other notions of weak connectivity in the literature, such as connectivity and components in the underlying unconnected graph, for which Knuth suggests the alternative terminology

Properties

If

X

and

Y

are two weak components of a directed graph,then either all vertices in

X

can reach all vertices in

Y

by paths in the graph, or all vertices in

Y

can reach all vertices However, there cannot exist reachability relations in both directions between these two components. Therefore, we can define an ordering on the weak components, according to which

X<Y

when all vertices in

X

can reach all vertices By definition, This is an asymmetric relation (two elements can only be related in one direction, not the other) and it inherits the property of being a transitive relation from the transitivity of reachability. Therefore, it defines a total ordering on the weak components. It is the finest possible partition of the vertices into a totally ordered set of vertices consistent with

This ordering on the weak components can alternatively be interpreted as a weak ordering on the vertices themselves, with the property that when

u<v

in the weak ordering, there necessarily exists a path from

u

but not from

v

However, this is not a complete characterization of this weak ordering, because two vertices

u

and

v

could have this same reachability ordering while belonging to the same weak component as each

Every weak component is a union of strongly connected If the strongly connected components of any given graph are contracted to single vertices, producing a directed acyclic graph (the of the given graph), and then this condensation is topologically sorted, then each weak component necessarily appears as a consecutive subsequence of the topological order of the strong

Algorithms

An algorithm for computing the weak components of a given directed graph in linear time was described by, and subsequently simplified by and As Tarjan observes, Tarjan's strongly connected components algorithm based on depth-first search will output the strongly connected components in (the reverse of) a topologically sorted order. The algorithm for weak components generates the strongly connected components in this order, and maintains a partition of the components that have been generated so far into the weak components of their induced subgraph. After all components are generated, this partition will describe the weak components of the whole

It is convenient to maintain the current partition into weak components in a stack, with each weak component maintaining additionally a list of its, strongly connected components that have no incoming edges from other strongly connected components in the same weak component, with the most recently generated source first. Each newly generated strongly connected component may form a new weak component on its own, or may end up merged with some of the previously constructed weak components near the top of the stack, the ones for which it cannot reach all

Thus, the algorithm performs the following

S

is generated, perform the following steps with it:

S

has no edges to the top weak component of the stack, pop that component from the stack.

S

and all of the unhit sources from the top component that was popped, and push

W

onto the stack.Each test for whether any edges from

S

hit a weak component can be performed in constant time once we find an edge from

S

to the most recently generated earlier strongly connected component, by comparing the target component of that edge to the first source of the second-to-top component on the stack