Multiple dispatch or multimethods is a feature of some programming languages in which a function or method can be dynamically dispatched based on the run-time (dynamic) type or, in the more general case, some other attribute of more than one of its arguments.[1] This is a generalization of single-dispatch polymorphism where a function or method call is dynamically dispatched based on the derived type of the object on which the method has been called. Multiple dispatch routes the dynamic dispatch to the implementing function or method using the combined characteristics of one or more arguments.
Developers of computer software typically organize source code into named blocks variously called subroutines, procedures, subprograms, functions, or methods. The code in the function is executed by calling it – executing a piece of code that references its name. This transfers control temporarily to the called function; when the function's execution has completed, control is typically transferred back to the instruction in the caller that follows the reference.
Function names are usually selected so as to be descriptive of the function's purpose. It is sometimes desirable to give several functions the same name, often because they perform conceptually similar tasks, but operate on different types of input data. In such cases, the name reference at the function call site is not sufficient for identifying the block of code to be executed. Instead, the number and type of the arguments to the function call are also used to select among several function implementations.
In more conventional, i.e., single-dispatch object-oriented programming languages, when invoking a method (sending a message in Smalltalk, calling a member function in C++), one of its arguments is treated specially and used to determine which of the (potentially many) classes of methods of that name is to be applied. In many languages, the special argument is indicated syntactically; for example, a number of programming languages put the special argument before a dot in making a method call: special.method(other, arguments, here)
, so that lion.sound
would produce a roar, whereas sparrow.sound
would produce a chirp.
In contrast, in languages with multiple dispatch, the selected method is simply the one whose arguments match the number and type of the function call. There is no special argument that owns the function/method carried out in a particular call.
The Common Lisp Object System (CLOS) is an early and well-known example of multiple dispatch. Another notable example of the use of multiple dispatch is the Julia programming language.
Multiple dispatch should be distinguished from function overloading, in which static typing information, such as a term's declared or inferred type (or base type in a language with subtyping) is used to determine which of several possibilities will be used at a given call site, and that determination is made at compile or link time (or some other time before program execution starts) and is thereafter invariant for a given deployment or run of the program. Many languages such as C++ offer robust function overloading but do not offer dynamic multiple dispatch (C++ only permits dynamic single dispatch through use of virtual functions).
When working with languages that can discriminate data types at compile time, selecting among the alternatives can occur then. The act of creating such alternative functions for compile time selection is usually referred to as overloading a function.
In programming languages that defer data type identification until run time (i.e., late binding), selection among alternative functions must occur then, based on the dynamically determined types of function arguments. Functions whose alternative implementations are selected in this manner are referred to most generally as multimethods.
There is some run-time cost associated with dynamically dispatching function calls. In some languages, the distinction between overloading and multimethods can be blurred, with the compiler determining whether compile time selection can be applied to a given function call, or whether slower run time dispatch is needed.
There are several known issues with dynamic-dispatch, both single and multiple. While many of these issues are solved for single-dispatch, which has been a standard feature in object-oriented programming languages for decades, these issues become more complicated in the multiple-dispatch case.
In most popular programming languages, source code is delivered and deployed in granules of functionality which we will here call packages; actual terminology for this concept varies between language. Each package may contain multiple type, value, and function definitions, packages are often compiled separately in languages with a compilation step, and a non-cyclical dependency relationship may exist. A complete program is a set of packages, with a main package which may depend on several other packages, and the whole program consisting of the transitive closure of the dependency relationship.
The so-called expression problem relates to the ability for code in a depending package to extend behaviors (functions or datatypes) defined in a base package from within an including package, without modifying the source to the base package. Traditional single-dispatch OO languages make it trivial to add new datatypes but not new functions; traditional functional languages tend to have the opposite effect, and multiple dispatch, if implemented correctly, allows both. It is desirable for an implementation of multiple dispatch to have the following properties:
It is generally desirable that for any given invocation of a multi-method, there be at most one "best" candidate among implementation cases of the multi-method, and/or that if there is not, that this be resolved in a predictable and deterministic fashion, including failure. Non-deterministic behavior is undesirable. Assuming a set of types with a non-circular subtyping relationship, one can define that one implementation of a multi-method is "better" (more specific) if all dynamically-dispatched arguments in the first are subtypes of all dynamically-dispatched arguments specified in the second, and at least one is a strict subtype. With single dispatch and in the absence of multiple inheritance, this condition is trivially satisfied, but with multiple dispatch, it is possible for two or more candidates to satisfy a given actual argument list, but neither is more specific than the other (one dynamic argument being the subtype in one case, another being the subtype in the other case). This particularly can happen if two different packages, neither depending on the other, both extend some multi-method with implementations concerning each package's types, and then a third package that includes both (possibly indirectly) then invokes the multi-method using arguments from both packages.
Possible resolutions include:
Efficient implementation of single-dispatch, including in programming languages that are separately compiled to object code and linked with a low-level (not-language-aware) linker, including dynamically at program load/start time or even under the direction of the application code, are well known. The "vtable" method developed in C++ and other early OO languages (where each class has an array of function pointers corresponding to that class's virtual functions) is nearly as fast as a static method call, requiring O(1) overhead and only one additional memory lookup even in the un-optimized case. However, the vtable method uses the function name and not the argument type as its lookup key, and does not scale to the multiple dispatch case. (It also depends on the object-oriented paradigm of methods being features of classes, not standalone entities independent of any particular datatype).
Efficient implementation of multiple-dispatch remains an ongoing research problem.
To estimate how often multiple dispatch is used in practice, Muschevici et al.[2] studied programs that use dynamic dispatch. They analyzed nine applications, mostly compilers, written in six different languages: Common Lisp Object System, Dylan, Cecil, MultiJava, Diesel, and Nice. Their results show that 13–32% of generic functions use the dynamic type of one argument, while 2.7–6.5% of them use the dynamic type of multiple arguments. The remaining 65–93% of generic functions have one concrete method (overrider), and thus are not considered to use the dynamic types of their arguments. Further, the study reports that 2–20% of generic functions had two and 3–6% had three concrete function implementations. The numbers decrease rapidly for functions with more concrete overriders.
Multiple dispatch is used much more heavily in Julia, where multiple dispatch was a central design concept from the origin of the language: collecting the same statistics as Muschevici on the average number of methods per generic function, it was found that the Julia standard library uses more than double the amount of overloading than in the other languages analyzed by Muschevici, and more than 10 times in the case of binary operators.
The data from these papers is summarized in the following table, where the dispatch ratio DR
is the average number of methods per generic function; the choice ratio CR
is the mean of the square of the number of methods (to better measure the frequency of functions with a large number of methods);[2] and the degree of specialization DoS
is the average number of type-specialized arguments per method (i.e., the number of arguments that are dispatched on):
Language | Average # methods (DR) | Choice ratio (CR) | Degree of specialization (DoS) |
---|---|---|---|
2.33 | 63.30 | 1.06 | |
Common Lisp (CMU) | 2.03 | 6.34 | 1.17 |
Common Lisp (McCLIM) | 2.32 | 15.43 | 1.17 |
Common Lisp (Steel Bank) | 2.37 | 26.57 | 1.11 |
Diesel | 2.07 | 31.65 | 0.71 |
Dylan (Gwydion) | 1.74 | 18.27 | 2.14 |
Dylan (OpenDylan) | 2.51 | 43.84 | 1.23 |
5.86 | 51.44 | 1.54 | |
Julia (operators only) | 28.13 | 78.06 | 2.01 |
MultiJava | 1.50 | 8.92 | 1.02 |
Nice | 1.36 | 3.46 | 0.33 |
The theory of multiple dispatching languages was first developed by Castagna et al., by defining a model for overloaded functions with late binding.[3] [4] It yielded the first formalization of the problem of covariance and contravariance of object-oriented languages[5] and a solution to the problem of binary methods.[6]
Distinguishing multiple and single dispatch may be made clearer by an example. Imagine a game that has, among its (user-visible) objects, spaceships and asteroids. When two objects collide, the program may need to do different things according to what has just hit what.
C# introduced support for dynamic multimethods in version 4[7] (April 2010) using the 'dynamic' keyword. The following example demonstrates multimethods. Like many other statically-typed languages, C# also supports static method overloading.[8] Microsoft expects that developers will choose static typing over dynamic typing in most scenarios.[9] The 'dynamic' keyword supports interoperability with COM objects and dynamically-typed .NET languages.
The example below uses features introduced in C# 9 and C# 10.
Console.WriteLine(Collide(new Asteroid(101), new Spaceship(300)));Console.WriteLine(Collide(new Asteroid(10), new Spaceship(10)));Console.WriteLine(Collide(new Spaceship(101), new Spaceship(10)));
string Collide(SpaceObject x, SpaceObject y) => x.Size > 100 && y.Size > 100 ? "Big boom!" : CollideWith(x as dynamic, y as dynamic); // Dynamic dispatch to CollideWith method
class ColliderLibrary
abstract record SpaceObject(int Size);record Asteroid(int Size) : SpaceObject(Size);record Spaceship(int Size) : SpaceObject(Size);
Output:
Groovy is a general purpose Java compatible/interusable JVM language, which, contrary to Java, uses late binding / multiple dispatch.[10]
class Program
class Collider
class SpaceObject
@InheritConstructors class Asteroid extends SpaceObject @InheritConstructors class Spaceship extends SpaceObject
In a language with multiple dispatch, such as Common Lisp, it might look more like this (Common Lisp example shown):
and similarly for the other methods. Explicit testing and "dynamic casting" are not used.
In the presence of multiple dispatch, the traditional idea of methods as being defined in classes and contained in objects becomes less appealing—each collide-with method above is attached to two different classes, not one. Hence, the special syntax for method invocation generally disappears, so that method invocation looks exactly like ordinary function invocation, and methods are grouped not in classes but in generic functions.
Julia has built-in multiple dispatch, and it is central to the language design.[11] The Julia version of the example above might look like:
struct Asteroid <: SpaceObject size::Int endstruct Spaceship <: SpaceObject size::Int end
collide_with(::Asteroid, ::Spaceship) = "a/s"collide_with(::Spaceship, ::Asteroid) = "s/a"collide_with(::Spaceship, ::Spaceship) = "s/s"collide_with(::Asteroid, ::Asteroid) = "a/a"
collide(x::SpaceObject, y::SpaceObject) = (x.size > 100 && y.size > 100) ? "Big boom!" : collide_with(x, y)
Output:
julia> collide(Asteroid(10), Spaceship(10))"a/s"
julia> collide(Spaceship(101), Spaceship(10))"s/s"
Raku, like Perl, uses proven ideas from other languages, and type systems have shown themselves to offer compelling advantages in compiler-side code analysis and powerful user-side semantics via multiple dispatch.
It has both multimethods, and multisubs. Since most operators are subroutines, it also has multiple dispatched operators.
Along with the usual type constraints, it also has where constraints that allow making very specialized subroutines.
multi sub infix:« <=> » (Stellar-Object:D $a, Stellar-Object:D $b) multi sub infix:« < » (Stellar-Object:D $a, Stellar-Object:D $b) multi sub infix:« > » (Stellar-Object:D $a, Stellar-Object:D $b) multi sub infix:«
proto sub collide (Stellar-Object:D $, Stellar-Object:D $)
multi sub collide ($a, $b where $a < $b) multi sub collide ($a, $b where $a > $b)
multi sub collide ($a, $b)
multi sub collide (Spaceship $a, Spaceship $b where $a
multi sub collide (Asteroid $ (:mass($a)), Asteroid $ (:mass($b)))
my Spaceship $Enterprise .= new(:mass(1),:name('The Enterprise'));collide Asteroid.new(:mass(.1)), $Enterprise;collide $Enterprise, Spaceship.new(:mass(.1));collide $Enterprise, Asteroid.new(:mass(1));collide $Enterprise, Spaceship.new(:mass(1));collide Asteroid.new(:mass(10)), Asteroid.new(:mass(5));
In languages that do not support multiple dispatch at the language definition or syntactic level, it is often possible to add multiple dispatch using a library extension. JavaScript and TypeScript do not support multimethods at the syntax level, but it is possible to add multiple dispatch via a library. For example, the multimethod package[12] provides an implementation of multiple dispatch, generic functions.
Dynamically-typed version in JavaScript:
class Asteroid class Spaceship
const collideWith = multi(method([Asteroid, Asteroid], (x, y) =>), method([Asteroid, Spaceship], (x, y) =>), method([Spaceship, Asteroid], (x, y) =>), method([Spaceship, Spaceship], (x, y) =>),)
Statically-typed version in TypeScript:
class Asteroid class Spaceship
type CollideWith = Multi &
const collideWith: CollideWith = multi(method([Asteroid, Asteroid], (x, y) =>), method([Asteroid, Spaceship], (x, y) =>), method([Spaceship, Asteroid], (x, y) =>), method([Spaceship, Spaceship], (x, y) =>),)
Multiple dispatch can be added to Python using a library extension. For example, using the module multimethod.py and also with the module multimethods.py[13] which provides CLOS-style multimethods for Python without changing the underlying syntax or keywords of the language.
collide = Dispatchcollide.add_rule((Asteroid, Spaceship), as_func)collide.add_rule((Spaceship, Spaceship), ss_func)collide.add_rule((Spaceship, Asteroid), sa_func)
def aa_func(a, b): """Behavior when asteroid hits asteroid.""" # ...define new behavior...
collide.add_rule((Asteroid, Asteroid), aa_func)
collide(thing1, thing2)
Functionally, this is very similar to the CLOS example, but the syntax is conventional Python.
Using Python 2.4 decorators, Guido van Rossum produced a sample implementation of multimethods[14] with a simplified syntax:
and then it goes on to define the multimethod decorator.
The PEAK-Rules package provides multiple dispatch with a syntax similar to the above example.[15] It was later replaced by PyProtocols.[16]
The Reg library also supports multiple and predicate dispatch.[17]
With the introduction of type hints, multiple dispatch is possible with even simpler syntax. For example, using plum-dispatch,
@dispatchdef collide(a: Asteroid, b: Asteroid): """Behavior when asteroid hits a asteroid.""" # ...define new behavior... @dispatchdef collide(a: Asteroid, b: Spaceship): """Behavior when asteroid hits a spaceship.""" # ...define new behavior...
C does not have dynamic dispatch, so it must be implemented manually in some form. Often an enum is used to identify the subtype of an object. Dynamic dispatch can be done by looking up this value in a function pointer branch table. Here is a simple example in C:
void collision_AA(void) ;void collision_AS(void) ;void collision_SA(void) ;void collision_SS(void) ;
typedef enum Thing;
CollisionCase collisionCases[THING_COUNT][THING_COUNT] = ;
void collide(Thing a, Thing b)
int main(void)
With the C Object System library,[18] C does support dynamic dispatch similar to CLOS. It is fully extensible and does not need any manual handling of the methods. Dynamic message (methods) are dispatched by the dispatcher of COS, which is faster than Objective-C. Here is an example in COS:
// classes
defclass (Asteroid)// data membersendclass
defclass (Spaceship)// data membersendclass
// generics
defgeneric (_Bool, collide_with, _1, _2);
// multimethods
defmethod (_Bool, collide_with, Asteroid, Asteroid) // deal with asteroid hitting asteroidendmethod
defmethod (_Bool, collide_with, Asteroid, Spaceship) // deal with asteroid hitting spaceshipendmethod
defmethod (_Bool, collide_with, Spaceship, Asteroid) // deal with spaceship hitting asteroidendmethod
defmethod (_Bool, collide_with, Spaceship, Spaceship) // deal with spaceship hitting spaceshipendmethod
// example of use
int main(void)
, C++ natively supports only single dispatch, though adding multi-methods (multiple dispatch) was proposed by Bjarne Stroustrup (and collaborators) in 2007.[19] The methods of working around this limit are analogous: use either the visitor pattern, dynamic cast or a library:
struct Thing ;
struct Asteroid : Thing ;
struct Spaceship : Thing ;
or pointer-to-method lookup table:
class Thing ;
class Asteroid: public Thing ;
class Spaceship: public Thing ;
Thing::CollisionHandlerMap Thing::collisionCases;const std::uint32_t Asteroid::cid = typeid(Asteroid).hash_code;const std::uint32_t Spaceship::cid = typeid(Spaceship).hash_code;
void Asteroid::initCases
void Spaceship::initCases
int main
The YOMM2 library[20] provides a fast, orthogonal implementation of open multimethods.
The syntax for declaring open methods is inspired by a proposal for a native C++ implementation. The library requires that the user registers all the classes used as virtual arguments (and their sub-classes), but does not require any modifications to existing code. Methods are implemented as ordinary inline C++ functions; they can be overloaded and they can be passed by pointer. There is no limit on the number of virtual arguments, and they can be arbitrarily mixed with non-virtual arguments.
The library uses a combination of techniques (compressed dispatch tables, collision free integer hash table) to implement method calls in constant time, while mitigating memory usage. Dispatching a call to an open method with a single virtual argument takes only 15–30% more time than calling an ordinary virtual member function, when a modern optimizing compiler is used.
The Asteroids example can be implemented as follows:
class Thing ;
class Asteroid : public Thing ;
class Spaceship : public Thing ;
register_classes(Thing, Spaceship, Asteroid);
declare_method(void, collideWith, (virtual_ define_method(void, collideWith, (Thing& left, Thing& right)) define_method(void, collideWith, (Asteroid& left, Asteroid& right)) define_method(void, collideWith, (Asteroid& left, Spaceship& right)) define_method(void, collideWith, (Spaceship& left, Asteroid& right)) define_method(void, collideWith, (Spaceship& left, Spaceship& right)) int main Stroustrup mentions in The Design and Evolution of C++ that he liked the concept of multimethods and considered implementing it in C++ but claims to have been unable to find an efficient sample implementation (comparable to virtual functions) and resolve some possible type ambiguity problems. He then states that although the feature would still be nice to have, that it can be approximately implemented using double dispatch or a type based lookup table as outlined in the C/C++ example above so is a low priority feature for future language revisions.[21] , as do many other object-oriented programming languages, D natively supports only single dispatch. However, it is possible to emulate open multimethods as a library function in D. The openmethods library[22] is an example. // The override for two DenseMatrix objects@methodMatrix _plus(DenseMatrix a, DenseMatrix b) // The override for two DiagonalMatrix objects@methodMatrix _plus(DiagonalMatrix a, DiagonalMatrix b) In a language with only single dispatch, such as Java, multiple dispatch can be emulated with multiple levels of single dispatch: class Asteroid implements Collideable class Spaceship implements Collideable Run time D
Java
instanceof
checks at one or both levels can also be used.Support in programming languages
Primary paradigm
Supporting general multimethods
Via extensions
See also
External links
Notes and References