In automata theory (a branch of theoretical computer science), DFA minimization is the task of transforming a given deterministic finite automaton (DFA) into an equivalent DFA that has a minimum number of states. Here, two DFAs are called equivalent if they recognize the same regular language. Several different algorithms accomplishing this task are known and described in standard textbooks on automata theory.[1]
For each regular language, there also exists a minimal automaton that accepts it, that is, a DFA with a minimum number of states and this DFA is unique (except that states can be given different names).[2] [3] The minimal DFA ensures minimal computational cost for tasks such as pattern matching.
There are three classes of states that can be removed or merged from the original DFA without affecting the language it accepts.
DFA minimization is usually done in three steps:
The state
p
M=(Q,\Sigma,\delta,q0,F)
w
\Sigma*
*(q | |
p=\delta | |
0,w) |
Q
\Sigma
\delta
\delta*
q0
F
do while (new_states ≠ the empty set)
unreachable_states := Q \ reachable_states
Assuming an efficient implementation of the state sets (e.g. new_states
) and operations on them (such as adding a state or checking whether it is present), this algorithm can be implemented with time complexity
O(n+m)
n
m
Unreachable states can be removed from the DFA without affecting the language that it accepts.
The following algorithms present various approaches to merging nondistinguishable states.
One algorithm for merging the nondistinguishable states of a DFA, due to, is based on partition refinement, partitioning the DFA states into groups by their behavior. These groups represent equivalence classes of the Nerode congruence, whereby every two states are equivalent if they have the same behavior for every input sequence. That is, for every two states and that belong to the same block of the partition, and every input word, the transitions determined by should always take states and to either states that both accept or states that both reject. It should not be possible for to take to an accepting state and to a rejecting state or vice versa.
The following pseudocode describes the form of the algorithm as given by Xu.[4] Alternative forms have also been presented.[5]
while (W is not empty) do choose and remove a set A from W for each c in Σ do let X be the set of states for which a transition on c leads to a state in A for each set Y in P for which X ∩ Y is nonempty and Y \ X is nonempty do replace Y in P by the two sets X ∩ Y and Y \ X if Y is in W replace Y in W by the same two sets else if |X ∩ Y| <= |Y \ X| add X ∩ Y to W else add Y \ X to W
The algorithm starts with a partition that is too coarse: every pair of states that are equivalent according to the Nerode congruence belong to the same set in the partition, but pairs that are inequivalent might also belong to the same set. It gradually refines the partition into a larger number of smaller sets, at each step splitting sets of states into pairs of subsets that are necessarily inequivalent.The initial partition is a separation of the states into two subsets of states that clearly do not have the same behavior as each other: the accepting states and the rejecting states. The algorithm then repeatedly chooses a set from the current partition and an input symbol, and splits each of the sets of the partition into two (possibly empty) subsets: the subset of states that lead to on input symbol, and the subset of states that do not lead to . Since is already known to have different behavior than the other sets of the partition, the subsets that lead to also have different behavior than the subsets that do not lead to . When no more splits of this type can be found, the algorithm terminates.
Lemma. Given a fixed character c and an equivalence class Y that splits into equivalence classes B and C, only one of B or C is necessary to refine the whole partition.[6]
Example: Suppose we have an equivalence class Y that splits into equivalence classes B and C. Suppose we also have classes D, E, and F; D and E have states with transitions into B on character c, while F has transitions into C on character c. By the Lemma, we can choose either B or C as the distinguisher, let's say B. Then the states of D and E are split by their transitions into B. But F, which doesn't point into B, simply doesn't split during the current iteration of the algorithm; it will be refined by other distinguisher(s).
Observation. All of B or C is necessary to split referring classes like D, E, and F correctly—subsets won't do.
The purpose of the outermost if
statement (if Y is in W
) is to patch up W, the set of distinguishers. We see in the previous statement in the algorithm that Y has just been split. If Y is in W, it has just become obsolete as a means to split classes in future iterations. So Y must be replaced by both splits because of the Observation above. If Y is not in W, however, only one of the two splits, not both, needs to be added to W because of the Lemma above. Choosing the smaller of the two splits guarantees that the new addition to W is no more than half the size of Y; this is the core of the Hopcroft algorithm: how it gets its speed, as explained in the next paragraph.
The worst case running time of this algorithm is, where is the number of states and is the size of the alphabet. This bound follows from the fact that, for each of the transitions of the automaton, the sets drawn from that contain the target state of the transition have sizes that decrease relative to each other by a factor of two or more, so each transition participates in of the splitting steps in the algorithm. The partition refinement data structure allows each splitting step to be performed in time proportional to the number of transitions that participate in it. This remains the most efficient algorithm known for solving the problem, and for certain distributions of inputs its average-case complexity is even better, .[7]
Once Hopcroft's algorithm has been used to group the states of the input DFA into equivalence classes, the minimum DFA can be constructed by forming one state for each equivalence class. If is a set of states in, is a state in, and is an input character, then the transition in the minimum DFA from the state for, on input, goes to the set containing the state that the input automaton would go to from state on input . The initial state of the minimum DFA is the one containing the initial state of the input DFA, and the accepting states of the minimum DFA are the ones whose members are accepting states of the input DFA.
Moore's algorithm for DFA minimization is due to . Like Hopcroft's algorithm, it maintains a partition that starts off separating the accepting from the rejecting states, and repeatedly refines the partition until no more refinements can be made. At each step, it replaces the current partition with the coarsest common refinement of partitions, one of which is the current one and the rest of which are the preimages of the current partition under the transition functions for each of the input symbols. The algorithm terminates when this replacement does not change the current partition. Its worst-case time complexity is : each step of the algorithm may be performed in time using a variant of radix sort to reorder the states so that states in the same set of the new partition are consecutive in the ordering, and there are at most steps since each one but the last increases the number of sets in the partition. The instances of the DFA minimization problem that cause the worst-case behavior are the same as for Hopcroft's algorithm. The number of steps that the algorithm performs can be much smaller than, so on average (for constant) its performance is or even depending on the random distribution on automata chosen to model the algorithm's average-case behavior.[5]
Reversing the transitions of a non-deterministic finite automaton (NFA)
M
MR
R | |
M | |
D |
The intuition behind the algorithm is this: determinizing the reverse automaton merges states that are nondistinguishable in the original automaton, but may merge also states that should not be merged (i.e., are not merged in the minimal DFA). In such case, after we reverse the automaton for the second time, it may not be deterministic. That is why we need to determinize it again, obtaining the minimal DFA.
After we determinize
MR
R | |
M | |
D |
R | |
M | |
D |
R | |
(M | |
D) |
=M'
M'
M
M'
R | |
M | |
D |
R | |
M | |
D |
M'
M'
l{R},l{S}
q
M'
q\inl{R}
q\not\inl{S}
q
l{R}
l{S}
q
M'
\overline{M}
M
If
M
R | |
M | |
D |
R | |
M | |
D |
\overline{M}
M
\overline{M}
The worst-case complexity of Brzozowski's algorithm is exponential in the number of states of the input automaton. This holds regardless of whether the input is a NFA or a DFA. In the case of DFA, the exponential explosion can happen during determinization of the reversal of the input automaton;[11] in the case of NFA, it can also happen during the initial determinization of the input automaton.[12] However, the algorithm frequently performs better than this worst case would suggest.
See main article: NFA minimization. While the above procedures work for DFAs, the method of partitioning does not work for non-deterministic finite automata (NFAs).[13] While an exhaustive search may minimize an NFA, there is no polynomial-time algorithm to minimize general NFAs unless, an unsolved conjecture in computational complexity theory that is widely believed to be false. However, there are methods of NFA minimization that may be more efficient than brute force search.
]