Evolution strategy explained
In computer science, an evolution strategy (ES) is an optimization technique based on ideas of evolution. It belongs to the general class of evolutionary computation or artificial evolution methodologies.
History
The 'evolution strategy' optimization technique was created in the early 1960s and developed further in the 1970s and later by Ingo Rechenberg, Hans-Paul Schwefel and their co-workers.
Methods
Evolution strategies use natural problem-dependent representations, so problem space and search space are identical. In common with evolutionary algorithms, the operators are applied in a loop. An iteration of the loop is called a generation. The sequence of generations is continued until a termination criterion is met.
The special feature of the ES is the self-adaptation of mutation step sizes and the coevolution associated with it. The ES is briefly presented using the standard form,[1] [2] pointing out that there are many variants.[3] [4] The real-valued chromosome contains, in addition to the $n$ decision variables, $n'$ mutation step sizes $\sigma_j$, where $1 \leq j \leq n' \leq n$. Often one mutation step size is used for all decision variables or each has its own step size. Mate selection to produce $\lambda$ offspring is random, i.e. independent of fitness. First, new mutation step sizes are generated per mating by intermediate recombination of the parental $\sigma_j$ with subsequent mutation as follows:
$$\sigma'_j = \sigma_j \cdot e^{(\mathcal{N}(0,1) - \mathcal{N}_j(0,1))}$$
where $\mathcal{N}(0,1)$ is a normally distributed random variable with mean $0$ and standard deviation $1$. The draw $\mathcal{N}(0,1)$ applies to all $\sigma'_j$, while $\mathcal{N}_j(0,1)$ is newly determined for each $\sigma'_j$. Next, discrete recombination of the decision variables is followed by a mutation using the new mutation step sizes as standard deviations of the normal distribution. The new decision variables $x'_j$ are calculated as follows:
$$x'_j = x_j + \mathcal{N}_j(0, \sigma'_j)$$
This results in an evolutionary search on two levels: first, at the problem level itself, and second, at the mutation step size level. In this way, the ES can search for its target in ever finer steps. However, it also means that larger invalid areas in the search space can be crossed only with difficulty.
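The recombination and mutation steps described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the function name `recombine_and_mutate`, the representation of an individual as a pair `(x, sigma)`, and the choice of one step size per decision variable are assumptions made for the example. The step-size update follows the formula as given in the text; practical implementations commonly scale the exponent terms by learning rates, which are omitted here.

```python
import math
import random


def recombine_and_mutate(parent_a, parent_b, rng):
    """Produce one offspring from two randomly chosen parents.

    Each parent is a pair (x, sigma): the decision variables and one
    mutation step size per decision variable (assumption: n' = n).
    """
    xa, sa = parent_a
    xb, sb = parent_b
    n = len(xa)

    # Intermediate recombination of the step sizes: arithmetic mean.
    sigma = [(sa[j] + sb[j]) / 2.0 for j in range(n)]

    # Mutation of the step sizes: one shared draw N(0,1) for all j,
    # plus a fresh draw N_j(0,1) for each j, as in the formula above.
    shared = rng.gauss(0.0, 1.0)
    new_sigma = [sigma[j] * math.exp(shared - rng.gauss(0.0, 1.0))
                 for j in range(n)]

    # Discrete recombination of the decision variables: each x_j is
    # copied from one of the two parents, chosen at random.
    x = [xa[j] if rng.random() < 0.5 else xb[j] for j in range(n)]

    # Mutation of the decision variables, using the new step sizes
    # as standard deviations of the normal distribution.
    new_x = [x[j] + rng.gauss(0.0, new_sigma[j]) for j in range(n)]
    return new_x, new_sigma


rng = random.Random(0)
child_x, child_sigma = recombine_and_mutate(
    ([0.0, 1.0], [0.5, 0.5]), ([2.0, 3.0], [1.5, 0.5]), rng
)
```

Because the step sizes are multiplied by an exponential factor, they stay strictly positive, and an offspring inherits the step sizes that produced it. This is what lets selection act on the step sizes as well as on the decision variables.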
The ES knows two variants of best selection for the generation of the next parent population ($\mu$: number of parents, $\lambda$: number of offspring): In the $(\mu,\lambda)$-ES, only the $\mu$ best offspring are used, whereas in the elitist $(\mu+\lambda)$-ES, the $\mu$ best are selected from parents and children.
Bäck and Schwefel recommend that the value of $\lambda$ should be seven times the population size $\mu$, whereby $\mu$ must not be chosen too small because of the strong selection pressure. Suitable values for $\mu$ are application-dependent and must be determined experimentally.
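The difference between the two selection variants can be illustrated with a short sketch. The function name `select_next_parents`, the `plus` flag, and the convention that a lower fitness value is better are assumptions made for this example. Individuals are plain numbers and the fitness is their absolute value, with one parent and seven offspring to match the recommended ratio.

```python
def select_next_parents(parents, offspring, mu, fitness, plus=False):
    """Return the mu best individuals (assumption: lower fitness is better).

    plus=False: comma selection, only the offspring compete.
    plus=True:  plus selection (elitist), parents and offspring compete.
    """
    pool = (offspring + parents) if plus else offspring
    return sorted(pool, key=fitness)[:mu]


parents = [0.1]                                   # mu = 1
offspring = [3.0, 1.0, 2.0, 4.0, 6.0, 5.0, 7.0]   # lambda = 7

comma = select_next_parents(parents, offspring, mu=1, fitness=abs)
plus = select_next_parents(parents, offspring, mu=1, fitness=abs, plus=True)

print(comma)  # [1.0]: the better parent 0.1 is forgotten
print(plus)   # [0.1]: the parent survives because it beats all offspring
```

Comma selection can lose a good parent, which helps the search leave local optima and keeps self-adapted step sizes from going stale. Plus selection never gets worse from one generation to the next.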
Individual step sizes for each coordinate, or correlations between coordinates, which are essentially defined by an underlying covariance matrix, are controlled in practice either by self-adaptation or by covariance matrix adaptation (CMA-ES). When the mutation step is drawn from a multivariate normal distribution using an evolving covariance matrix, it has been hypothesized that this adapted matrix approximates the inverse Hessian of the search landscape. This hypothesis has been proven for a static model relying on a quadratic approximation.[5]
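As a rough illustration of drawing a mutation step from a multivariate normal distribution with a given covariance matrix, the sketch below factors the matrix with a Cholesky decomposition and transforms independent standard normal draws. It shows only the sampling step for a fixed matrix. The adaptation of the matrix, as done in CMA-ES, is not shown, and the helper names `cholesky` and `correlated_step` and the example matrix are assumptions for this example.

```python
import math
import random


def cholesky(c):
    """Lower-triangular factor l with l * l^T = c, for a symmetric
    positive definite matrix c given as a list of rows."""
    n = len(c)
    l = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(l[i][k] * l[j][k] for k in range(j))
            if i == j:
                l[i][j] = math.sqrt(c[i][i] - s)
            else:
                l[i][j] = (c[i][j] - s) / l[j][j]
    return l


def correlated_step(l, rng):
    """Draw one mutation step from N(0, C), where C = l * l^T."""
    z = [rng.gauss(0.0, 1.0) for _ in l]  # independent N(0,1) draws
    return [sum(l[i][k] * z[k] for k in range(i + 1)) for i in range(len(l))]


# Hypothetical covariance: large variance along the first coordinate,
# positive correlation between the two coordinates.
c = [[4.0, 1.0], [1.0, 1.0]]
l = cholesky(c)
step = correlated_step(l, random.Random(1))
```

With a diagonal covariance matrix this reduces to individual step sizes per coordinate. The off-diagonal entries are what allow mutation steps to follow directions that are not aligned with the coordinate axes.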
The selection of the next generation in evolution strategies is deterministic and based only on the fitness rankings, not on the actual fitness values. The resulting algorithm is therefore invariant with respect to monotonic transformations of the objective function. The simplest evolution strategy operates on a population of size two: the current point (parent) and the result of its mutation. Only if the mutant's fitness is at least as good as the parent's does it become the parent of the next generation; otherwise the mutant is discarded. This is a $(1+1)$-ES. More generally, $\lambda$ mutants can be generated and compete with the parent, which is called a $(1+\lambda)$-ES. In a $(1,\lambda)$-ES, the best mutant becomes the parent of the next generation while the current parent is always discarded. For some of these variants, proofs of linear convergence (in a stochastic sense) have been derived on unimodal objective functions.[6] [7]
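The simplest variant, with one parent and one mutant per generation, can be sketched in a few lines. This is a minimal sketch under stated assumptions: the objective is minimized, the mutation step size `sigma` is kept fixed (no step size adaptation is shown), and the names `one_plus_one_es` and `sphere` are chosen for the example.

```python
import random


def one_plus_one_es(f, x0, sigma, generations, seed=0):
    """Minimize f with one parent and one mutant per generation.

    Assumptions for this sketch: fixed step size sigma, minimization.
    """
    rng = random.Random(seed)
    parent = list(x0)
    for _ in range(generations):
        mutant = [xj + rng.gauss(0.0, sigma) for xj in parent]
        # Only the ranking of mutant versus parent matters,
        # not the fitness values themselves.
        if f(mutant) <= f(parent):
            parent = mutant
    return parent


def sphere(x):
    """A simple unimodal test objective: sum of squares."""
    return sum(v * v for v in x)


start = [1.0, 1.0, 1.0]
result = one_plus_one_es(sphere, start, sigma=0.1, generations=200)

# A strictly increasing transformation of the objective leaves every
# comparison unchanged, so the same seed should give the same search path
# (barring floating-point ties).
result_cubed = one_plus_one_es(lambda x: sphere(x) ** 3, start,
                               sigma=0.1, generations=200)
```

Because the parent is only ever replaced by a mutant that is at least as good, the objective value of `result` cannot be worse than that of `start`. Replacing the acceptance test with "always take the mutant" would turn this into the non-elitist variant with a single offspring.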
Bibliography
- Ingo Rechenberg (1971): Evolutionsstrategie - Optimierung technischer Systeme nach Prinzipien der biologischen Evolution (PhD thesis). Reprinted by Frommann-Holzboog (1973).
- Hans-Paul Schwefel (1974): Numerische Optimierung von Computer-Modellen (PhD thesis). Reprinted by Birkhäuser (1977).
- Hans-Paul Schwefel: Evolution and Optimum Seeking. New York: Wiley & Sons 1995.
- H.-G. Beyer and H.-P. Schwefel. Evolution Strategies: A Comprehensive Introduction. Natural Computing, 1(1):3–52, 2002.
- Hans-Georg Beyer: The Theory of Evolution Strategies. Springer, April 27, 2001.
- Ingo Rechenberg: Evolutionsstrategie '94. Stuttgart: Frommann-Holzboog 1994.
- J. Klockgether and H. P. Schwefel (1970). Two-Phase Nozzle And Hollow Core Jet Experiments. AEG-Forschungsinstitut. MDH Staustrahlrohr Project Group. Berlin, Federal Republic of Germany. Proceedings of the 11th Symposium on Engineering Aspects of Magneto-Hydrodynamics, Caltech, Pasadena, Cal., 24. - 26.3. 1970.
- M. Emmerich, O.M. Shir, and H. Wang: Evolution Strategies. In: Handbook of Heuristics, 1-31. Springer International Publishing (2018).
Notes and References
- Schwefel, Hans-Paul (1995). Evolution and Optimum Seeking. Sixth-generation computer technology series. New York: Wiley. ISBN 978-0-471-57148-3.
- Bäck, Thomas; Schwefel, Hans-Paul (1993). An Overview of Evolutionary Algorithms for Parameter Optimization. Evolutionary Computation, 1(1): 1–23. doi:10.1162/evco.1993.1.1.1. ISSN 1063-6560.
- Coelho, V. N.; Coelho, I. M.; Souza, M. J. F.; Oliveira, T. A.; Cota, L. P.; Haddad, M. N.; Mladenovic, N.; Silva, R. C. P.; Guimarães, F. G. (2016). Hybrid Self-Adaptive Evolution Strategies Guided by Neighborhood Structures for Combinatorial Optimization Problems. Evolutionary Computation, 24(4): 637–666. doi:10.1162/EVCO_a_00187. ISSN 1063-6560.
- Hansen, Nikolaus; Ostermeier, Andreas (2001). Completely Derandomized Self-Adaptation in Evolution Strategies. Evolutionary Computation, 9(2): 159–195. doi:10.1162/106365601750190398. ISSN 1063-6560.
- Shir, O. M.; Yehudayoff, A. (2020). On the covariance-Hessian relation in evolution strategies. Theoretical Computer Science, 801: 157–174. Elsevier. doi:10.1016/j.tcs.2019.09.002. arXiv:1806.03674.
- Auger, A. (2005). Convergence results for the (1,λ)-SA-ES using the theory of φ-irreducible Markov chains. Theoretical Computer Science, 334(1–3): 35–69. Elsevier. doi:10.1016/j.tcs.2004.11.017.
- Jägersküpper, J. (2006). How the (1+1) ES using isotropic mutations minimizes positive definite quadratic forms. Theoretical Computer Science, 361(1): 38–56. Elsevier. doi:10.1016/j.tcs.2006.04.004.