Fitness function explained

A fitness function is a particular type of objective function that is used to summarise, as a single figure of merit, how close a given design solution is to achieving the set aims. Fitness functions are used in software architecture and in evolutionary algorithms (EA), such as genetic programming and genetic algorithms, to guide simulations towards optimal design solutions.^[1]

Software programmers know that they should not release bad code, but that priority competes with many others for busy developers. That is why they should use fitness functions to keep their software in check.^[2]

In the field of EAs, each design solution is commonly represented as a string of numbers (referred to as a chromosome). After each round of testing, or simulation, the idea is to delete the n worst design solutions, and to breed n new ones from the best design solutions. Each design solution, therefore, needs to be awarded a figure of merit, to indicate how close it came to meeting the overall specification, and this is generated by applying the fitness function to the test, or simulation, results obtained from that solution.^[3]
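The replacement scheme described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the fitness function (negative squared distance to a target vector), the one-point crossover, the Gaussian mutation and all parameter values are assumptions chosen only to make the example runnable.

```python
import random

def fitness(chromosome, target):
    """Hypothetical figure of merit: higher is better, 0 is a perfect match."""
    return -sum((g - t) ** 2 for g, t in zip(chromosome, target))

def breed(parent_a, parent_b, rng, mutation_scale=0.1):
    """One-point crossover followed by a small Gaussian mutation of each gene."""
    cut = rng.randrange(1, len(parent_a))
    child = parent_a[:cut] + parent_b[cut:]
    return [g + rng.gauss(0.0, mutation_scale) for g in child]

def evolve(target, pop_size=20, n_replace=5, generations=200, seed=0):
    """Each round: delete the n worst chromosomes, breed n new ones from the best."""
    rng = random.Random(seed)
    population = [[rng.uniform(-5, 5) for _ in target] for _ in range(pop_size)]
    for _ in range(generations):
        # Apply the fitness function to every chromosome and rank, best first.
        population.sort(key=lambda c: fitness(c, target), reverse=True)
        survivors = population[:pop_size - n_replace]  # the n worst are dropped
        parents = survivors[:n_replace]                # breed only from the best
        children = [breed(rng.choice(parents), rng.choice(parents), rng)
                    for _ in range(n_replace)]
        population = survivors + children
    return max(population, key=lambda c: fitness(c, target))

best = evolve([1.0, -2.0, 0.5])
```

Because the best chromosomes always survive a round, the best fitness in the population can never decrease from one generation to the next in this sketch.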

Two main classes of fitness functions exist: one where the fitness function does not change, as in optimizing a fixed function or testing with a fixed set of test cases; and one where the fitness function is mutable, as in niche differentiation or co-evolving the set of test cases.^[4] Another way of looking at fitness functions is in terms of a fitness landscape, which shows the fitness for each possible chromosome. In the following, it is assumed that the fitness is determined based on an evaluation that remains unchanged during an optimization run.

A fitness function does not necessarily have to be able to calculate an absolute value, as it is sometimes sufficient to compare candidates in order to select the better one. A relative indication of fitness (candidate a is better than b) is sufficient in some cases,^[5] such as tournament selection or Pareto optimization.
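As an illustration of selection that relies on relative fitness only, the following sketch implements a simple tournament. The function name, the comparator argument `is_better` and the tournament size `k` are assumptions made for this example; no absolute fitness value is ever computed.

```python
import random

def tournament_select(population, is_better, rng, k=3):
    """Return the winner among k randomly drawn candidates.

    Only pairwise comparisons are used: is_better(a, b) must return True
    when candidate a is preferable to candidate b.
    """
    contestants = rng.sample(population, k)
    winner = contestants[0]
    for challenger in contestants[1:]:
        if is_better(challenger, winner):
            winner = challenger
    return winner

# Example: candidates are costs, and a lower cost is considered better.
costs = [5, 3, 9, 1, 7]
chosen = tournament_select(costs, lambda a, b: a < b, random.Random(0))
```

When `k` equals the population size, the tournament degenerates into picking the overall best candidate; smaller values of `k` lower the selection pressure.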

Requirements of evaluation and fitness function

The quality of the evaluation and calculation of a fitness function is fundamental to the success of an EA optimisation. It implements Darwin's principle of "survival of the fittest". Without fitness-based selection mechanisms for mate selection and offspring acceptance, EA search would be blind and hardly distinguishable from the Monte Carlo method. When setting up a fitness function, one must always be aware that it is about more than just describing the desired target state. Rather, the evolutionary search on the way to the optimum should also be supported as much as possible (see also section on auxiliary objectives), if and insofar as this is not already done by the fitness function alone. If the fitness function is designed badly, the algorithm will either converge on an inappropriate solution, or will have difficulty converging at all.

Defining the fitness function is not straightforward in many cases and is often performed iteratively if the fittest solutions produced by an EA are not what is desired. Interactive genetic algorithms address this difficulty by outsourcing evaluation to external agents, which are normally humans.

Computational efficiency

The fitness function should not only correlate closely with the designer's goal, but it also should be computationally efficient. Speed of execution is very important, as a typical genetic algorithm must be iterated many times in order to produce a usable result for a non-trivial problem.

Fitness approximation^[6] ^[7] may be appropriate, especially in the following cases:

The fitness computation time of a single solution is extremely high.
A precise model for fitness computation is missing.
The fitness function is uncertain or noisy.^[8]

As an alternative or in addition to fitness approximation, the fitness calculations can be distributed across a parallel computer to reduce execution times. Depending on the population model of the EA used, both the EA itself and the fitness calculations of all offspring of one generation can be executed in parallel.^[9] ^[10]
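Evaluating all offspring of one generation in parallel might look like the sketch below. The helper name `evaluate_population` and the worker count are assumptions. A thread pool is used here, which suits fitness functions that mostly wait on an external simulator; for CPU-bound fitness functions written in pure Python, a process pool (`ProcessPoolExecutor`) would be the more likely choice.

```python
from concurrent.futures import ThreadPoolExecutor

def evaluate_population(offspring, fitness_fn, max_workers=4):
    """Apply fitness_fn to every offspring concurrently.

    The returned list has the same order as the offspring list, so
    fitness values can be matched back to their chromosomes by index.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fitness_fn, offspring))

# Example with a trivial stand-in fitness function (sum of the genes).
scores = evaluate_population([[1, 2], [3, 4], [5, 6]], sum)
```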

Multi-objective optimization

Practical applications usually aim at optimizing multiple and at least partially conflicting objectives. Two fundamentally different approaches are often used for this purpose: Pareto optimization and optimization based on fitness calculated using the weighted sum.^[11]

Weighted sum and penalty functions

When optimizing with the weighted sum, the single values of the $O$ objectives are first normalized so that they can be compared. This can be done with the help of costs or by specifying target values and determining the current value as the degree of fulfillment. Costs or degrees of fulfillment can then be compared with each other and, if required, can also be mapped to a uniform fitness scale. Without loss of generality, fitness is assumed to represent a value to be maximized. Each objective $o_i$ is assigned a weight $w_i$ in the form of a percentage value so that the overall raw fitness $f_{raw}$ can be calculated as a weighted sum:

$$f_{raw} = \sum_{i=1}^{O} o_i \cdot w_i \quad \text{with} \quad \sum_{i=1}^{O} w_i = 1$$

A violation of $R$ restrictions $r_j$ can be included in the fitness determined in this way in the form of penalty functions. For this purpose, a function $pf_j(r_j)$ can be defined for each restriction, which returns a value between 0 and 1 depending on the degree of violation, with the result being 1 if there is no violation. The previously determined raw fitness is multiplied by the penalty function(s), and the result is the final fitness $f_{final}$:

$$f_{final} = f_{raw} \cdot \prod_{j=1}^{R} pf_j(r_j) = \left( \sum_{i=1}^{O} o_i \cdot w_i \right) \cdot \prod_{j=1}^{R} pf_j(r_j)$$
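The calculation of raw and final fitness can be sketched in a few lines. The function names and the linear shape of the penalty function are assumptions made for illustration; the text only requires that a penalty function returns 1 when there is no violation and a value between 0 and 1 otherwise. Objective values are assumed to be already normalized.

```python
import math

def raw_fitness(objectives, weights):
    """Weighted sum of normalized objective values; weights must sum to 1."""
    if len(objectives) != len(weights):
        raise ValueError("one weight per objective is required")
    if not math.isclose(sum(weights), 1.0):
        raise ValueError("weights must sum to 1")
    return sum(o * w for o, w in zip(objectives, weights))

def linear_penalty(violation, max_violation):
    """Hypothetical penalty: 1.0 without violation, falling linearly to 0.0."""
    if violation <= 0:
        return 1.0
    return max(0.0, 1.0 - violation / max_violation)

def final_fitness(objectives, weights, penalties):
    """Raw fitness multiplied by the product of all penalty values."""
    return raw_fitness(objectives, weights) * math.prod(penalties)

# Two objectives weighted 75 % and 25 %, one restriction violated halfway.
penalties = [linear_penalty(0, 10), linear_penalty(5, 10)]  # [1.0, 0.5]
score = final_fitness([0.8, 0.5], [0.75, 0.25], penalties)  # about 0.3625
```

Because the penalties are multiplied, a single severe violation can reduce the final fitness to zero regardless of how well the objectives are met.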

This approach is simple and has the advantage of being able to combine any number of objectives and restrictions. The disadvantage is that different objectives can compensate for each other and that the weights have to be defined before the optimization. In addition, certain solutions may not be obtained; see the section on the comparison of both types of assessment.

Pareto optimization

A solution is called Pareto-optimal if the improvement of one objective is only possible with a deterioration of at least one other objective. The set of all Pareto-optimal solutions, also called Pareto set, represents the set of all optimal compromises between the objectives. The figure below on the right shows an example of the Pareto set of two objectives $f_1$ and $f_2$ to be maximized. The elements of the set form the Pareto front (green line). From this set, a human decision maker must subsequently select the desired compromise solution. Constraints are included in Pareto optimization in that solutions without constraint violations are per se better than those with violations. If two solutions to be compared each have constraint violations, the respective extent of the violations decides.^[12]
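The dominance relation and the constraint rule described above can be sketched as follows. All objectives are assumed to be maximized, and the extent of a constraint violation is assumed to be summarized as a single non-negative number per solution; both the function names and that representation are choices made for this example.

```python
def dominates(a, b):
    """True if a is at least as good as b everywhere and strictly better once."""
    return (all(x >= y for x, y in zip(a, b))
            and any(x > y for x, y in zip(a, b)))

def constrained_dominates(a, b, violation_a=0.0, violation_b=0.0):
    """Feasible beats infeasible; two infeasible solutions compare by violation."""
    if violation_a == 0 and violation_b > 0:
        return True
    if violation_a > 0 and violation_b == 0:
        return False
    if violation_a > 0 and violation_b > 0:
        return violation_a < violation_b
    return dominates(a, b)

def pareto_front(points):
    """Keep every point that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

points = [(1, 5), (2, 4), (3, 3), (2, 2), (1, 1), (4, 1)]
front = pareto_front(points)  # [(1, 5), (2, 4), (3, 3), (4, 1)]
```

The quadratic scan in `pareto_front` is adequate for small sets; algorithms such as NSGA-II use more elaborate non-dominated sorting.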

It was recognized early on that EAs with their simultaneously considered solution set are well suited to finding solutions in one run that cover the Pareto front sufficiently well.^[13] Besides the SPEA2,^[14] the NSGA-II^[15] and NSGA-III^[16] ^[17] have established themselves as standard methods.

The advantage of Pareto optimization is that, in contrast to the weighted sum, it provides all alternatives that are equivalent in terms of the objectives as an overall solution. The disadvantage is that a visualization of the alternatives becomes problematic or even impossible with four or more objectives. Furthermore, the effort increases exponentially with the number of objectives.^[18] If there are more than three or four objectives, some have to be combined using the weighted sum or other aggregation methods.

Comparison of both types of assessment

With the help of the weighted sum, the total Pareto front can be obtained by a suitable choice of weights, provided that it is convex.^[19] This is illustrated by the adjacent picture on the left. The point on the green Pareto front is reached by the weights $w_1$ and $w_2$, provided that the EA converges to the optimum. The direction with the largest fitness gain in the solution set is shown by the drawn arrows.

In the case of a non-convex front, however, the non-convex sections of the front cannot be reached by the weighted sum. In the adjacent image on the right, this is the section between two points on the front. This can be remedied to a limited extent by using an extension of the weighted sum, the cascaded weighted sum.
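The effect can be reproduced with a small numerical experiment. The two three-point fronts below are made-up examples with both objectives maximized: in the first, the middle point bulges outwards; in the second, it lies in a dent between the two extremes. The sketch sweeps the weights and records which point has the highest weighted sum.

```python
def weighted_sum_winners(front, steps=100):
    """Return the points that maximize w1*f1 + w2*f2 for some w1 + w2 = 1."""
    winners = set()
    for k in range(steps + 1):
        w1 = k / steps
        w2 = 1.0 - w1
        winners.add(max(front, key=lambda p: w1 * p[0] + w2 * p[1]))
    return winners

convex_front = [(1.0, 0.0), (0.6, 0.6), (0.0, 1.0)]
dented_front = [(1.0, 0.0), (0.4, 0.4), (0.0, 1.0)]

reachable_convex = weighted_sum_winners(convex_front)  # all three points
reachable_dented = weighted_sum_winners(dented_front)  # only the two extremes
```

In the dented front, the point (0.4, 0.4) is Pareto-optimal, yet its weighted sum is always 0.4, while one of the extremes always scores at least 0.5. No choice of weights can therefore select it.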

Comparing both assessment approaches, the use of Pareto optimization is certainly advantageous when little is known about the possible solutions of a task and when the number of optimization objectives can be narrowed down to three, at most four. However, in the case of repeated optimization of variations of one and the same task, the desired lines of compromise are usually known and the effort to determine the entire Pareto front is no longer justified. This is also true when no human decision is desired or possible after optimization, such as in automated decision processes.

Auxiliary objectives

In addition to the primary objectives resulting from the task itself, it may be necessary to include auxiliary objectives in the assessment to support the achievement of one or more primary objectives. A scheduling task serves as an illustration. The optimization goals include not only fast overall processing of all orders but also compliance with a latest completion time. The latter is especially necessary for the scheduling of rush orders. The second goal is not achieved by the exemplary initial schedule, as shown in the adjacent figure. A subsequent mutation does not change this, but it schedules work step d earlier, which is a necessary intermediate step towards an earlier start of the last work step e of the order. As long as only the latest completion time is evaluated, however, the fitness of the mutated schedule remains unchanged, even though it represents a relevant step towards the objective of timely completion of the order. This can be remedied, for example, by additionally evaluating the delay of work steps. The new objective is an auxiliary one, since it was introduced in addition to the actual optimization objectives to support their achievement. A more detailed description of this approach and another example are available in the literature.
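The scheduling example can be made concrete with a small sketch. Everything numeric here is invented for illustration: the start times and durations of work steps d and e, the deadline of 11 time units, the latest permissible end time per step, and the 1/(1 + x) scoring are assumptions, not values from the figure.

```python
def completion_time(schedule):
    """End time of the last work step; schedule maps step -> (start, duration)."""
    return max(start + duration for start, duration in schedule.values())

def primary_fitness(schedule, deadline):
    """Evaluates only the latest completion time of the whole order."""
    lateness = max(0, completion_time(schedule) - deadline)
    return 1.0 / (1.0 + lateness)

def auxiliary_fitness(schedule, latest_end):
    """Hypothetical auxiliary objective: summed delay of the individual steps."""
    delay = sum(max(0, start + duration - latest_end[step])
                for step, (start, duration) in schedule.items())
    return 1.0 / (1.0 + delay)

deadline = 11
latest_end = {"d": 8, "e": 11}          # d must end by 8 so that e can end by 11
initial = {"d": (8, 2), "e": (10, 3)}   # order finishes at 13, two units late
mutated = {"d": (6, 2), "e": (10, 3)}   # d moved earlier, e not yet moved

same = primary_fitness(initial, deadline) == primary_fitness(mutated, deadline)
better = auxiliary_fitness(mutated, latest_end) > auxiliary_fitness(initial, latest_end)
```

The primary objective cannot tell the two schedules apart, while the auxiliary objective rewards the mutation that moves step d earlier, giving the search a gradient towards schedules in which step e can also start sooner.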

External links

A Nice Introduction to Adaptive Fuzzy Fitness Granulation (AFFG) (PDF), a promising approach to accelerate the convergence rate of EAs.
The cyber shack of Adaptive Fuzzy Fitness Granulation (AFFG), which is designed to accelerate the convergence rate of EAs.
Fitness functions in evolutionary robotics: A survey and analysis (AFFG) (PDF), a review of fitness functions used in evolutionary robotics.
Ford, Neal; Richards, Mark; Sadalage, Pramod; Dehghani, Zhamak (2021). Software Architecture: The Hard Parts. O'Reilly Media, Inc.

References

1. Eiben, A.E.; Smith, J.E. (2015). "Evaluation Function (Fitness Function)". Introduction to Evolutionary Computing. Natural Computing Series (2nd ed.). Berlin, Heidelberg: Springer. p. 30. doi:10.1007/978-3-662-44874-8. ISBN 978-3-662-44873-1.
2. Fundamentals of Software Architecture: An Engineering Approach. O'Reilly Media. 2020. ISBN 978-1492043454.
3. Eiben, A.E.; Smith, J.E. (2015). "What Is an Evolutionary Algorithm?". Introduction to Evolutionary Computing. Natural Computing Series. Berlin, Heidelberg: Springer. pp. 25–48. doi:10.1007/978-3-662-44874-8. ISBN 978-3-662-44873-1.
4. Eiben, A.E.; Smith, J.E. (2015). "Coevolutionary Systems". Introduction to Evolutionary Computing. Natural Computing Series. Berlin, Heidelberg: Springer. pp. 223–230. doi:10.1007/978-3-662-44874-8. ISBN 978-3-662-44873-1.
5. Bäck, Thomas; Fogel, David; Michalewicz, Zbigniew (2000-11-20). Evolutionary Computation 2: Advanced Algorithms and Operators. Taylor & Francis. doi:10.1201/9781420034349. ISBN 978-0-7503-0665-2.
6. Jin, Y. (January 2005). "A comprehensive survey of fitness approximation in evolutionary computation". Soft Computing. 9 (1): 3–12. doi:10.1007/s00500-003-0328-5. ISSN 1432-7643.
7. Jin, Yaochu; Wang, Handing; Chugh, Tinkle; Miettinen, Kaisa (June 2019). "Data-Driven Evolutionary Optimization: An Overview and Case Studies". IEEE Transactions on Evolutionary Computation. 23 (3): 442–458. doi:10.1109/TEVC.2018.2869001. hdl:10871/34011. ISSN 1089-778X.
8. Eiben, A.E.; Smith, J.E. (2015). "Nonstationary and Noisy Function Optimisation". Introduction to Evolutionary Computing. Natural Computing Series (2nd ed.). Berlin, Heidelberg: Springer. pp. 185–194. doi:10.1007/978-3-662-44874-8. ISBN 978-3-662-44873-1.
9. Khalloof, Hatem; Mohammad, Mohammad; Shahoud, Shadi; Duepmeier, Clemens; Hagenmeyer, Veit (2020-11-02). "A Generic Flexible and Scalable Framework for Hierarchical Parallelization of Population-Based Metaheuristics". Proceedings of the 12th International Conference on Management of Digital EcoSystems. Virtual Event United Arab Emirates: ACM. pp. 124–131. doi:10.1145/3415958.3433041. ISBN 978-1-4503-8115-4. https://dl.acm.org/doi/10.1145/3415958.3433041
10. Jähne, Paul (2016). "Overview of the current state of research on parallelisation of evolutionary algorithms on graphic cards". In Mayr, Heinrich Christian; Pinzger, Martin (eds.). Informatik 2016 Tagung vom 26. - 30. September 2016. Bonn: Gesellschaft für Informatik, FRG. ISBN 978-3-88579-653-4.
11. Miettinen, Kaisa (2008). "Introduction to Multiobjective Optimization: Noninteractive Approaches". In Branke, Jürgen; Deb, Kalyanmoy; Miettinen, Kaisa; Słowiński, Roman (eds.). Multiobjective Optimization: Interactive and Evolutionary Approaches. Lecture Notes in Computer Science. Vol. 5252. Berlin, Heidelberg: Springer. pp. 1–26. doi:10.1007/978-3-540-88908-3. ISBN 978-3-540-88907-6.
12. Deb, Kalyanmoy (2008). "Introduction to Evolutionary Multiobjective Optimization". In Branke, Jürgen; Deb, Kalyanmoy; Miettinen, Kaisa; Słowiński, Roman (eds.). Multiobjective Optimization: Interactive and Evolutionary Approaches. Lecture Notes in Computer Science. Vol. 5252. Berlin, Heidelberg: Springer. pp. 58–96. doi:10.1007/978-3-540-88908-3. ISBN 978-3-540-88907-6.
13. Fonseca, Carlos M.; Fleming, Peter J. (1995). "An Overview of Evolutionary Algorithms in Multiobjective Optimization". Evolutionary Computation. 3 (1): 1–16. doi:10.1162/evco.1995.3.1.1. ISSN 1063-6560.
14. Zitzler, Eckart; Laumanns, Marco; Thiele, Lothar (2001). "SPEA2: Improving the strength pareto evolutionary algorithm". Technical Report, Nr. 103. Computer Engineering and Networks Laboratory (TIK), ETH Zürich. doi:10.3929/ethz-a-004284029.
15. Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. (2002). "A fast and elitist multiobjective genetic algorithm: NSGA-II". IEEE Transactions on Evolutionary Computation. 6 (2): 182–197. doi:10.1109/4235.996017.
16. Deb, Kalyanmoy; Jain, Himanshu (2014). "An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints". IEEE Transactions on Evolutionary Computation. 18 (4): 577–601. doi:10.1109/TEVC.2013.2281535. ISSN 1089-778X.
17. Jain, Himanshu; Deb, Kalyanmoy (2014). "An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point Based Nondominated Sorting Approach, Part II: Handling Constraints and Extending to an Adaptive Approach". IEEE Transactions on Evolutionary Computation. 18 (4): 602–622. doi:10.1109/TEVC.2013.2281534. ISSN 1089-778X.
18. Jakob, Wilfried; Blume, Christian (2014-03-21). "Pareto Optimization or Cascaded Weighted Sum: A Comparison of Concepts". Algorithms. 7 (1): 166–185. arXiv:2203.02697. doi:10.3390/a7010166. ISSN 1999-4893.
19. Miettinen, Kaisa (1998). Nonlinear Multiobjective Optimization. International Series in Operations Research & Management Science. Vol. 12. Boston, MA: Springer US. doi:10.1007/978-1-4615-5563-6. ISBN 978-1-4613-7544-9.