Fixed-point combinator explained

In combinatory logic for computer science, a fixed-point combinator (or fixpoint combinator),[1] is a higher-order function (i.e. a function which takes a function as argument) that returns some fixed point (a value that is mapped to itself) of its argument function, if one exists.

Formally, if

rm{fix}

is a fixed-point combinator and the function

f

has one or more fixed points, then

rm{fix} f

is one of these fixed points, i.e.

f(rm{fix} f)=rm{fix} f .

Fixed-point combinators can be defined in the lambda calculus and in functional programming languages and provide a means to allow for recursive definitions.

Y combinator in lambda calculus

In the classical untyped lambda calculus, every function has a fixed point. A particular implementation of

rm{fix}

is Haskell Curry's paradoxical combinator Y, given by[2] [3]

Y=λf.(λx.f(xx))(λx.f(xx))

(Here we use the standard notations and conventions of lambda calculus: Y is a function that takes one argument f and returns the entire expression following the first period; the expression

λx.f(xx)

denotes a function that takes one argument x, thought of as a function, and returns the expression

f(xx)

, where

(xx)

denotes x applied to itself. Juxtaposition of expressions denotes function application, is left-associative, and has higher precedence than the period.)

Verification

The following calculation verifies that

Yg

is indeed a fixed point of the function

g

:

Yg

=(λf.(λx.f(xx))(λx.f(xx)))g   

by the definition of

Y

=(λx.g(xx))(λx.g(xx))

by β-reduction: replacing the formal argument f of Y with the actual argument g

=g((λx.g(xx))(λx.g(xx)))

by β-reduction: replacing the formal argument x of the first function with the actual argument

(λx.g(xx))

=g(Yg)

by second equality, above

The lambda term

g(Yg)

may not, in general, β-reduce to the term

Yg

. However, both terms β-reduce to the same term, as shown.

Uses

Applied to a function with one variable, the Y combinator usually does not terminate. More interesting results are obtained by applying the Y combinator to functions of two or more variables. The additional variables may be used as a counter, or index. The resulting function behaves like a while or a for loop in an imperative language.

Used in this way, the Y combinator implements simple recursion. The lambda calculus does not allow a function to appear as a term in its own definition as is possible in many programming languages, but a function can be passed as an argument to a higher-order function that applies it in a recursive manner.

The Y combinator may also be used in implementing Curry's paradox. The heart of Curry's paradox is that untyped lambda calculus is unsound as a deductive system, and the Y combinator demonstrates this by allowing an anonymous expression to represent zero, or even many values. This is inconsistent in mathematical logic.

Example implementations

An example implementation of the Y combinator in two languages is presented below.

  1. Y Combinator in Python

Y = lambda f: (lambda x: f(x(x)))(lambda x: f(x(x)))

Y(Y)

// Y Combinator in C++, using C++ 14 extensions

int main

Note that both of these programs, while formally correct, are useless in practice; they both loop indefinitely until they terminate through stack overflow. More generally, as both Python and C++ use strict evaluation, the Y combinator is generally useless in those languages; see below for the Z combinator, which can be used in strict programming languages.

Fixed-point combinator

The Y combinator is an implementation of a fixed-point combinator in lambda calculus. Fixed-point combinators may also be easily defined in other functional and imperative languages. The implementation in lambda calculus is more difficult due to limitations in lambda calculus.The fixed-point combinator may be used in a number of different areas:

Fixed-point combinators may be applied to a range of different functions, but normally will not terminate unless there is an extra parameter. When the function to be fixed refers to its parameter, another call to the function is invoked, so the calculation never gets started. Instead, the extra parameter is used to trigger the start of the calculation.

The type of the fixed point is the return type of the function being fixed. This may be a real or a function or any other type.

In the untyped lambda calculus, the function to apply the fixed-point combinator to may be expressed using an encoding, like Church encoding. In this case particular lambda terms (which define functions) are considered as values. "Running" (beta reducing) the fixed-point combinator on the encoding gives a lambda term for the result which may then be interpreted as fixed-point value.

Alternately, a function may be considered as a lambda term defined purely in lambda calculus.

These different approaches affect how a mathematician and a programmer may regard a fixed-point combinator. A mathematician may see the Y combinator applied to a function as being an expression satisfying the fixed-point equation, and therefore a solution.

In contrast, a person only wanting to apply a fixed-point combinator to some general programming task may see it only as a means of implementing recursion.

Values and domains

Many functions do not have any fixed points, for instance

f:\N\to\N

with

f(n)=n+1

. Using Church encoding, natural numbers can be represented in lambda calculus, and this function f can be defined in lambda calculus. However, its domain will now contain all lambda expression, not just those representing natural numbers. The Y combinator, applied to f, will yield a fixed-point for f, but this fixed-point won't represent a natural number. If trying to compute Y f in an actual programming language, an infinite loop will occur.

Function versus implementation

The fixed-point combinator may be defined in mathematics and then implemented in other languages. General mathematics defines a function based on its extensional properties.[4] That is, two functions are equal if they perform the same mapping. Lambda calculus and programming languages regard function identity as an intensional property. A function's identity is based on its implementation.

A lambda calculus function (or term) is an implementation of a mathematical function. In the lambda calculus there are a number of combinators (implementations) that satisfy the mathematical definition of a fixed-point combinator.

Definition of the term "combinator"

Combinatory logic is a higher-order functions theory. A combinator is a closed lambda expression, meaning that it has no free variables. The combinators may be combined to direct values to their correct places in the expression without ever naming them as variables.

Recursive definitions and fixed-point combinators

Fixed-point combinators can be used to implement recursive definition of functions. However, they are rarely used in practical programming.[5] Strongly normalizing type systems such as the simply typed lambda calculus disallow non-termination and hence fixed-point combinators often cannot be assigned a type or require complex type system features. Furthermore fixed-point combinators are often inefficient compared to other strategies for implementing recursion, as they require more function reductions and construct and take apart a tuple for each group of mutually recursive definitions.

The factorial function

The factorial function provides a good example of how a fixed-point combinator may be used to define recursive functions. The standard recursive definition of the factorial function in mathematics can be written as

\operatorname{fact}n=\begin{cases} 1&if~n=0\\ n x \operatorname{fact}(n-1)&otherwise. \end{cases}

where n is a non-negative integer.If we want to implement this in lambda calculus, where integers are represented using Church encoding, we run into the problem that the lambda calculus does not allow the name of a function ('fact') to be used in the function's definition. This can be circumvented using a fixed-point combinator

sf{fix}

as follows.

Define a function F of two arguments f and n:

Ffn=(\operatorname{IsZero}n) 1 (\operatorname{multiply}n(f(\operatorname{pred}n)))

(Here

(\operatorname{IsZero}n)

is a function that takes two arguments and returns its first argument if n=0, and its second argument otherwise;

\operatorname{pred}n

evaluates to n-1.)

Now define

\operatorname{fact}=sf{fix} F

. Then

\operatorname{fact}

is a fixed-point of F, which gives

\begin{align}\operatorname{fact}n&=F\operatorname{fact}n\\ &=(\operatorname{IsZero}n) 1 (\operatorname{multiply}n(\operatorname{fact}(\operatorname{pred}n))) \end{align}

as desired.

Fixed-point combinators in lambda calculus

The Y combinator, discovered by Haskell B. Curry, is defined as

Y=λf.(λx.f(xx))(λx.f(xx))

Other fixed-point combinators

In untyped lambda calculus fixed-point combinators are not especially rare. In fact there are infinitely many of them.[6] In 2005 Mayer Goldberg showed that the set of fixed-point combinators of untyped lambda calculus is recursively enumerable.[7]

The Y combinator can be expressed in the SKI-calculus as

Y=S(K(SII))(S(S(KS)K)(K(SII)))

Additional combinators (B, C, K, W system) allow for a much shorter definition. With

U=SII

the self-application combinator, since

S(Kx)yz=x(yz)=Bxyz

and

Sx(Ky)z=xzy=Cxyz

, the above becomes

Y=S(KU)(SB(KU))=BU(CBU)

The simplest fixed-point combinator in the SK-calculus, found by John Tromp, is

Y'=SSK(S(K(SS(S(SSK))))K)

although note that it is not in normal form, which is longer. This combinator corresponds to the lambda expression

Y'=(λxy.xyx)(λyx.y(xyx))

The following fixed-point combinator is simpler than the Y combinator, and β-reduces into the Y combinator; it is sometimes cited as the Y combinator itself:

X=λf.(λx.xx)(λx.f(xx))

Another common fixed-point combinator is the Turing fixed-point combinator (named after its discoverer, Alan Turing):[8] [2]

\Theta=(λxy.y(xxy))(λxy.y(xxy))

Its advantage over

Y

is that

\Thetaf

beta-reduces to

f(\Thetaf)

,whereas

Yf

and

f(Yf)

only beta-reduce to a common term.

\Theta

also has a simple call-by-value form:

\Thetav=(λxy.y(λz.xxyz))(λxy.y(λz.xxyz))

The analog for mutual recursion is a polyvariadic fix-point combinator,[9] [10] [11] which may be denoted Y*.

Strict fixed-point combinator

In a strict programming language the Y combinator will expand until stack overflow, or never halt in case of tail call optimization.[12] The Z combinator will work in strict languages (also called eager languages, where applicative evaluation order is applied). The Z combinator has the next argument defined explicitly, preventing the expansion of

Zg

in the right-hand side of the definition:[13]

Zgv=g(Zg)v.

and in lambda calculus it is an eta-expansion of the Y combinator:

Z=λf.(λx.f(λv.xxv))(λx.f(λv.xxv)).

Non-standard fixed-point combinators

If F is a fixed-point combinator in untyped lambda calculus, then we have

Fx.Fx=λx.x(Fx)=λx.x(x(Fx))=

Terms that have the same Böhm tree as a fixed-point combinator, i.e. have the same infinite extension

λx.x(x(x))

, are called non-standard fixed-point combinators. Any fixed-point combinator is also a non-standard one, but not all non-standard fixed-point combinators are fixed-point combinators because some of them fail to satisfy the fixed-point equation that defines the "standard" ones. These combinators are called strictly non-standard fixed-point combinators; an example is the following combinator:

N=BM(B(BM)B)

where

B=λxyz.x(yz)

M=λx.xx.

The set of non-standard fixed-point combinators is not recursively enumerable.[7]

Implementation in other languages

The Y combinator is a particular implementation of a fixed-point combinator in lambda calculus. Its structure is determined by the limitations of lambda calculus. It is not necessary or helpful to use this structure in implementing the fixed-point combinator in other languages.

Simple examples of fixed-point combinators implemented in some programming paradigms are given below.

Lazy functional implementation

In a language that supports lazy evaluation, like in Haskell, it is possible to define a fixed-point combinator using the defining equation of the fixed-point combinator which is conventionally named fix. Since Haskell has lazy datatypes, this combinator can also be used to define fixed points of data constructors (and not only to implement recursive functions). The definition is given here, followed by some usage examples. In Hackage, the original sample is: [14]

fix, fix' :: (a -> a) -> afix f = let x = f x in x -- Lambda dropped. Sharing. -- Original definition in Data.Function.-- alternative:fix' f = f (fix' f) -- Lambda lifted. Non-sharing.

fix (\x -> 9) -- this evaluates to 9

fix (\x -> 3:x) -- evaluates to the lazy infinite list [3,3,3,...]

fact = fix fac -- evaluates to the factorial function where fac f 0 = 1 fac f x = x * f (x-1)

fact 5 -- evaluates to 120

Strict functional implementation

In a strict functional language, as illustrated below with OCaml, the argument to f is expanded beforehand, yielding an infinite call sequence,

f(f...(f(fixf))...)x

.

This may be resolved by defining fix with an extra parameter.

let rec fix f x = f (fix f) x (* note the extra x; here fix f = \x-> f (fix f) x *)

let factabs fact = function (* factabs has extra level of lambda abstraction *) 0 -> 1 | x -> x * fact (x-1)

let _ = (fix factabs) 5 (* evaluates to "120" *)

In a multi-paradigm functional language (one decorated with imperative features), such as Lisp, Peter Landin suggested the use of a variable assignment to create a fixed-point combinator,[15] as in the below example using Scheme:

(define Y! (lambda (f) ((lambda (i) (set! i (f (lambda (x) (i x)))) ;; (set! i expr) assigns i the value of expr i) ;; replacing it in the present lexical scope #f)))

Using a lambda calculus with axioms for assignment statements, it can be shown that Y! satisfies the same fixed-point law as the call-by-value Y combinator:[16] [17]

(Y! λx.e)e'=(λx.e)(Y! λx.e)e'

In more idiomatic modern Lisp usage, this would typically be handled via a lexically scoped label (a let expression), as lexical scope was not introduced to Lisp until the 1970s:

(define Y* (lambda (f) ((lambda (i) (let ((i (f (lambda (x) (i x))))) ;; (let ((i expr)) i) locally defines i as expr i)) ;; non-recursively: thus i in expr is not expr #f)))

Or without the internal label:

(define Y (lambda (f) ((lambda (i) (i i)) (lambda (i) (f (lambda (x) (apply (i i) x)))))))

Imperative language implementation

This example is a slightly interpretive implementation of a fixed-point combinator. A class is used to contain the fix function, called fixer. The function to be fixed is contained in a class that inherits from fixer. The fix function accesses the function to be fixed as a virtual function. As for the strict functional definition, fix is explicitly given an extra parameter x, which means that lazy evaluation is not needed.

template class fixer

class fact : public fixer

long result = fact.fix(5);

Typing

In System F (polymorphic lambda calculus) a polymorphic fixed-point combinator has type[18]

∀a.(a → a) → awhere a is a type variable. That is, fix takes a function, which maps a → a and uses it to return a value of type a.

In the simply typed lambda calculus extended with recursive data types, fixed-point operators can be written, but the type of a "useful" fixed-point operator (one whose application always returns) may be restricted.

In the simply typed lambda calculus, the fixed-point combinator Y cannot be assigned a type[19] because at some point it would deal with the self-application sub-term

x~x

by the application rule:

{\Gamma\vdashx:t1\tot2   \Gamma\vdashx:t1}\over{\Gamma\vdashx~x:t2}

where

x

has the infinite type

t1=t1\tot2

. No fixed-point combinator can in fact be typed; in those systems, any support for recursion must be explicitly added to the language.

Type for the Y combinator

In programming languages that support recursive data types, it is possible to type the Y combinator by appropriately accounting for the recursion at the type level. The need to self-apply the variable x can be managed using a type (Rec a), which is defined so as to be isomorphic to (Rec a -> a).

For example, in the following Haskell code, we have In and out being the names of the two directions of the isomorphism, with types:[20] [21]

In :: (Rec a -> a) -> Rec aout :: Rec a -> (Rec a -> a)

which lets us write:

newtype Rec a = In

y :: (a -> a) -> ay = \f -> (\x -> f (out x x)) (In (\x -> f (out x x)))

Or equivalently in OCaml:

type 'a recc = In of ('a recc -> 'a)let out (In x) = x

let y f = (fun x a -> f (out x x) a) (In (fun x a -> f (out x x) a))

Alternatively:

let y f = (fun x -> f (fun z -> out x x z)) (In (fun x -> f (fun z -> out x x z)))

General information

Because fixed-point combinators can be used to implement recursion, it is possible to use them to describe specific types of recursive computations, such as those in fixed-point iteration, iterative methods, recursive join in relational databases, data-flow analysis, FIRST and FOLLOW sets of non-terminals in a context-free grammar, transitive closure, and other types of closure operations.

A function for which every input is a fixed point is called an identity function. Formally:

\forallx(fx=x)

In contrast to universal quantification over all

x

, a fixed-point combinator constructs one value that is a fixed point of

f

. The remarkable property of a fixed-point combinator is that it constructs a fixed point for an arbitrary given function

f

.

Other functions have the special property that, after being applied once, further applications don't have any effect. More formally:

\forallx(f(fx)=fx)

Such functions are called idempotent (see also Projection (mathematics)). An example of such a function is the function that returns 0 for all even integers, and 1 for all odd integers.

In lambda calculus, from a computational point of view, applying a fixed-point combinator to an identity function or an idempotent function typically results in non-terminating computation. For example, we obtain

(Y λx.x)=(λx.(xx) λx.(xx))

where the resulting term can only reduce to itself and represents an infinite loop.

Fixed-point combinators do not necessarily exist in more restrictive models of computation. For instance, they do not exist in simply typed lambda calculus.

The Y combinator allows recursion to be defined as a set of rewrite rules,[22] without requiring native recursion support in the language.[23]

In programming languages that support anonymous functions, fixed-point combinators allow the definition and use of anonymous recursive functions, i.e. without having to bind such functions to identifiers. In this setting, the use of fixed-point combinators is sometimes called anonymous recursion.[24] [25]

See also

References

External links

Notes and References

  1. Book: Peyton Jones, Simon L. . The Implementation of Functional Programming . 1987 . Prentice Hall International .
  2. Book: Henk Barendregt . Henk Barendregt . The Lambda Calculus  - Its Syntax and Semantics . Amsterdam . North-Holland . Studies in Logic and the Foundations of Mathematics . 103 . 1985 . 0444867481.
  3. Throughout this article, the syntax rules given in Lambda calculus#Notation are used, to save parentheses.
  4. Web site: Selinger . Peter . Lecture Notes on Lambda Calculus (PDF) . 6.
  5. Web site: For those of us who don't know what a Y-Combinator is or why it's useful, ... . Hacker News . 2 August 2020.
  6. Book: Bimbó, Katalin . Katalin Bimbó . Combinatory Logic: Pure, Applied and Typed . 27 July 2011 . 48 . CRC Press . 9781439800010 .
  7. Goldberg, 2005
  8. 2268281 . Alan Mathison Turing . The

    p

    -function in

    λ

    -

    K

    -conversion . . 2 . 4 . 164 . December 1937.
  9. Web site: Many faces of the fixed-point combinator . okmij.org.
  10. http://osdir.com/ml/lang.haskell.cafe/2003-10/msg00211.html Polyvariadic Y in pure Haskell98
  11. Web site: recursion - Fixed-point combinator for mutually recursive functions? . Stack Overflow.
  12. Web site: Bene . Adam . Fixed-Point Combinators in JavaScript . Bene Studio . Medium . 2 August 2020 . en . 17 August 2017.
  13. Web site: CS 6110 S17 Lecture 5. Recursion and Fixed-Point Combinators . Cornell University . 4.1 A CBV Fixed-Point Combinator.
  14. https://hackage.haskell.org/package/base-4.10.0.0/docs/src/Data.Function.html#fix The original definition in Data.Function
  15. Landin . P. J. . Peter Landin . January 1964 . 10.1093/comjnl/6.4.308 . 4 . The Computer Journal . 308–320 . The mechanical evaluation of expressions . 6.
  16. Book: Felleisen, Matthias . The Lambda-v-CS Calculus . 1987 . Indiana University .
  17. Book: Talcott, Carolyn . Carolyn Talcott . The Essence of Rum: A theory of the intensional and extensional aspects of Lisp-type computation . Ph.D. thesis . 1985 . Stanford University.
  18. Girard . Jean-Yves . 10.1016/0304-3975(86)90044-7 . 2 . Theoretical Computer Science . 867281 . 159–192 . The system of variable types, fifteen years later . 45 . 1986. See in particular p. 180.
  19. http://cs.anu.edu.au/courses/COMP3610/lectures/Lambda.pdf An Introduction to the Lambda Calculus
  20. Haskell mailing list thread on How to define Y combinator in Haskell, 15 Sep 2006
  21. Book: Geuvers . Herman . On Fixed point and Looping Combinators in Type Theory . Verkoelen . Joep . 10.1.1.158.1478.
  22. Book: The Little Lisper . . . 1986 . Chapter 9 - Lambda The Ultimate . 179. "In the chapter we have derived a Y-combinator which allows us to write recursive functions of one argument without using define."
  23. Web site: The Y Combinator (Slight Return) or: How to Succeed at Recursion Without Really Recursing . Mike Vanier . live . https://web.archive.org/web/20110822202348/http://mvanier.livejournal.com/2897.html . 2011-08-22. "More generally, Y gives us a way to get recursion in a programming language that supports first-class functions but that doesn't have recursion built in to it."
  24. This terminology appears to be largely folklore, but it does appear in the following:
    • Trey Nash, Accelerated C# 2008, Apress, 2007,, p. 462—463. Derived substantially from Wes Dyer's blog (see next item).
    • Wes Dyer Anonymous Recursion in C#, February 02, 2007, contains a substantially similar example found in the book above, but accompanied by more discussion.
  25. The If Works Deriving the Y combinator, January 10th, 2008