Mathematical morphology explained

Mathematical morphology (MM) is a theory and technique for the analysis and processing of geometrical structures, based on set theory, lattice theory, topology, and random functions. MM is most commonly applied to digital images, but it can be employed as well on graphs, surface meshes, solids, and many other spatial structures.

Topological and geometrical continuous-space concepts such as size, shape, convexity, connectivity, and geodesic distance, were introduced by MM on both continuous and discrete spaces. MM is also the foundation of morphological image processing, which consists of a set of operators that transform images according to the above characterizations.

The basic morphological operators are erosion, dilation, opening and closing.

MM was originally developed for binary images, and was later extended to grayscale functions and images. The subsequent generalization to complete lattices is widely accepted today as MM's theoretical foundation.

History

Mathematical Morphology was developed in 1964 by the collaborative work of Georges Matheron and Jean Serra, at the École des Mines de Paris, France. Matheron supervised the PhD thesis of Serra, devoted to the quantification of mineral characteristics from thin cross sections, and this work resulted in a novel practical approach, as well as theoretical advancements in integral geometry and topology.

In 1968, the Centre de Morphologie Mathématique was founded by the École des Mines de Paris in Fontainebleau, France, led by Matheron and Serra.

During the rest of the 1960s and most of the 1970s, MM dealt essentially with binary images, treated as sets, and generated a large number of binary operators and techniques: Hit-or-miss transform, dilation, erosion, opening, closing, granulometry, thinning, skeletonization, ultimate erosion, conditional bisector, and others. A random approach was also developed, based on novel image models. Most of the work in that period was developed in Fontainebleau.

From the mid-1970s to mid-1980s, MM was generalized to grayscale functions and images as well. Besides extending the main concepts (such as dilation, erosion, etc.) to functions, this generalization yielded new operators, such as morphological gradients, top-hat transform and the Watershed (MM's main segmentation approach).

In the 1980s and 1990s, MM gained a wider recognition, as research centers in several countries began to adopt and investigate the method. MM started to be applied to a large number of imaging problems and applications, especially in the field of non-linear filtering of noisy images.

In 1986, Serra further generalized MM, this time to a theoretical framework based on complete lattices. This generalization brought flexibility to the theory, enabling its application to a much larger number of structures, including color images, video, graphs, meshes, etc. At the same time, Matheron and Serra also formulated a theory for morphological filtering, based on the new lattice framework.

The 1990s and 2000s also saw further theoretical advancements, including the concepts of connections and levelings.

In 1993, the first International Symposium on Mathematical Morphology (ISMM) took place in Barcelona, Spain. Since then, ISMMs are organized every 2–3 years: Fontainebleau, France (1994); Atlanta, USA (1996); Amsterdam, Netherlands (1998); Palo Alto, CA, USA (2000); Sydney, Australia (2002); Paris, France (2005); Rio de Janeiro, Brazil (2007); Groningen, Netherlands (2009); Intra (Verbania), Italy (2011); Uppsala, Sweden (2013); Reykjavík, Iceland (2015); Fontainebleau, France (2017); and Saarbrücken, Germany (2019).[1]

References

Binary morphology

Rd

or the integer grid

Zd

, for some dimension d.

Structuring element

The basic idea in binary morphology is to probe an image with a simple, pre-defined shape, drawing conclusions on how this shape fits or misses the shapes in the image. This simple "probe" is called the structuring element, and is itself a binary image (i.e., a subset of the space or grid).

Here are some examples of widely used structuring elements (denoted by B):

E=R2

; B is an open disk of radius r, centered at the origin.

E=Z2

; B is a 3 × 3 square, that is, B = .

E=Z2

; B is the "cross" given by B = .

Basic operators

The basic operations are shift-invariant (translation invariant) operators strongly related to Minkowski addition.

Let E be a Euclidean space or an integer grid, and A a binary image in E.

Erosion

The erosion of the binary image A by the structuring element B is defined by

A\ominusB=\{z\inE|Bz\subseteqA\},

where Bz is the translation of B by the vector z, i.e.,

Bz=\{b+z\midb\inB\}

,

\forallz\inE

.

When the structuring element B has a center (e.g., B is a disk or a square), and this center is located on the origin of E, then the erosion of A by B can be understood as the locus of points reached by the center of B when B moves inside A. For example, the erosion of a square of side 10, centered at the origin, by a disc of radius 2, also centered at the origin, is a square of side 6 centered at the origin.

The erosion of A by B is also given by the expression

A\ominusB=capbA-b

.

Example application: Assume we have received a fax of a dark photocopy. Everything looks like it was written with a pen that is bleeding. Erosion process will allow thicker lines to get skinny and detect the hole inside the letter "o".

Dilation

The dilation of A by the structuring element B is defined by

AB=cupbAb.

The dilation is commutative, also given by

AB=BA=cupaBa

.

If B has a center on the origin, as before, then the dilation of A by B can be understood as the locus of the points covered by B when the center of B moves inside A. In the above example, the dilation of the square of side 10 by the disk of radius 2 is a square of side 14, with rounded corners, centered at the origin. The radius of the rounded corners is 2.

The dilation can also be obtained by

AB=\{z\inE\mid

s)
(B
z

\capA\varnothing\}

, where Bs denotes the symmetric of B, that is,

Bs=\{x\inE\mid-x\inB\}

.

Example application: dilation is the dual operation of the erosion. Figures that are very lightly drawn get thick when "dilated". Easiest way to describe it is to imagine the same fax/text is written with a thicker pen.

Opening

The opening of A by B is obtained by the erosion of A by B, followed by dilation of the resulting image by B:

A\circB=(A\ominusB)B.

The opening is also given by

A\circB=

cup
Bx\subseteqA

Bx

, which means that it is the locus of translations of the structuring element B inside the image A. In the case of the square of side 10, and a disc of radius 2 as the structuring element, the opening is a square of side 10 with rounded corners, where the corner radius is 2.

Example application: Let's assume someone has written a note on a non-soaking paper and that the writing looks as if it is growing tiny hairy roots all over. Opening essentially removes the outer tiny "hairline" leaks and restores the text. The side effect is that it rounds off things. The sharp edges start to disappear.

Closing

The closing of A by B is obtained by the dilation of A by B, followed by erosion of the resulting structure by B:

A\bulletB=(AB)\ominusB.

The closing can also be obtained by

A\bulletB=(Ac\circBs)c

, where Xc denotes the complement of X relative to E (that is,

Xc=\{x\inE\midx\notinX\}

). The above means that the closing is the complement of the locus of translations of the symmetric of the structuring element outside the image A.

Properties of the basic operators

Here are some properties of the basic binary morphological operators (dilation, erosion, opening and closing):

A\subseteqC

, then

AB\subseteqCB

, and

A\ominusB\subseteqC\ominusB

, etc.

AB=BA

.

A\ominusB\subseteqA\circB\subseteqA\subseteqA\bulletB\subseteqAB

.

(AB)C=A(BC)

. Moreover, the erosion satisfies

(A\ominusB)\ominusC=A\ominus(BC)

.

AB=(Ac\ominusBs)c

.

A\bulletB=(Ac\circBs)c

.

A\subseteq(C\ominusB)

if and only if

(AB)\subseteqC

.

A\circB\subseteqA

, whereas the closing is extensive, i.e.,

A\subseteqA\bulletB

.

Other operators and tools

Grayscale morphology

In grayscale morphology, images are functions mapping a Euclidean space or grid E into

R\cup\{infty,-infty\}

, where

R

is the set of reals,

infty

is an element larger than any real number, and

-infty

is an element smaller than any real number.

Grayscale structuring elements are also functions of the same format, called "structuring functions".

Denoting an image by f(x) the structuring function by b(x) and the support of b by B, the grayscale dilation of f by b is given by

(fb)(x)=\supy[f(x-y)+b(y)],

where "sup" denotes the supremum.

Similarly, the erosion of f by b is given by

(f\ominusb)(x)=infy[f(x+y)-b(y)],

where "inf" denotes the infimum.

Just like in binary morphology, the opening and closing are given respectively by

f\circb=(f\ominusb)b,

f\bulletb=(fb)\ominusb.

Flat structuring functions

It is common to use flat structuring elements in morphological applications. Flat structuring functions are functions b(x) in the form

b(x)=\begin{cases} 0,&x\inB,\\ -infty&otherwise, \end{cases}

where

B\subseteqE

.

In this case, the dilation and erosion are greatly simplified, and given respectively by

(fb)(x)=

\sup
z\inBs

f(x+z),

(f\ominusb)(x)=infzf(x+z).

In the bounded, discrete case (E is a grid and B is bounded), the supremum and infimum operators can be replaced by the maximum and minimum. Thus, dilation and erosion are particular cases of order statistics filters, with dilation returning the maximum value within a moving window (the symmetric of the structuring function support B), and the erosion returning the minimum value within the moving window B.

In the case of flat structuring element, the morphological operators depend only on the relative ordering of pixel values, regardless their numerical values, and therefore are especially suited to the processing of binary images and grayscale images whose light transfer function is not known.

Other operators and tools

By combining these operators one can obtain algorithms for many image processing tasks, such as feature detection, image segmentation, image sharpening, image filtering, and classification.Along this line one should also look into Continuous Morphology[2]

Mathematical morphology on complete lattices

Complete lattices are partially ordered sets, where every subset has an infimum and a supremum. In particular, it contains a least element and a greatest element (also denoted "universe").

Adjunctions (dilation and erosion)

Let

(L,\leq)

be a complete lattice, with infimum and supremum symbolized by

\wedge

and

\vee

, respectively. Its universe and least element are symbolized by U and

\emptyset

, respectively. Moreover, let

\{Xi\}

be a collection of elements from L.

A dilation is any operator

\delta\colonLL

that distributes over the supremum, and preserves the least element. I.e.:

veei\delta(Xi)=\delta\left(veeiXi\right)

,

\delta(\emptyset)=\emptyset

.

An erosion is any operator

\varepsilon\colonLL

that distributes over the infimum, and preserves the universe. I.e.:

wedgei\varepsilon(Xi)=\varepsilon\left(wedgeiXi\right)

,

\varepsilon(U)=U

.

Dilations and erosions form Galois connections. That is, for every dilation

\delta

there is one and only one erosion

\varepsilon

that satisfies

X\leq\varepsilon(Y)\Leftrightarrow\delta(X)\leqY

for all

X,Y\inL

.

Similarly, for every erosion there is one and only one dilation satisfying the above connection.

Furthermore, if two operators satisfy the connection, then

\delta

must be a dilation, and

\varepsilon

an erosion.

Pairs of erosions and dilations satisfying the above connection are called "adjunctions", and the erosion is said to be the adjoint erosion of the dilation, and vice versa.

Opening and closing

For every adjunction

(\varepsilon,\delta)

, the morphological opening

\gamma\colonL\toL

and morphological closing

\phi\colonL\toL

are defined as follows:

\gamma=\delta\varepsilon,

\phi=\varepsilon\delta.

The morphological opening and closing are particular cases of algebraic opening (or simply opening) and algebraic closing (or simply closing). Algebraic openings are operators in L that are idempotent, increasing, and anti-extensive. Algebraic closings are operators in L that are idempotent, increasing, and extensive.

Particular cases

Binary morphology is a particular case of lattice morphology, where L is the power set of E (Euclidean space or grid), that is, L is the set of all subsets of E, and

\leq

is the set inclusion. In this case, the infimum is set intersection, and the supremum is set union.

Similarly, grayscale morphology is another particular case, where L is the set of functions mapping E into

R\cup\{infty,-infty\}

, and

\leq

,

\vee

, and

\wedge

, are the point-wise order, supremum, and infimum, respectively. That is, is f and g are functions in L, then

f\leqg

if and only if

f(x)\leqg(x),\forallx\inE

; the infimum

f\wedgeg

is given by

(f\wedgeg)(x)=f(x)\wedgeg(x)

; and the supremum

f\veeg

is given by

(f\veeg)(x)=f(x)\veeg(x)

.

See also

References

External links

Notes and References

  1. Web site: International Symposium on Mathematical Morphology and Its Applications to Signal and Image Processing . 2024-05-17 . link.springer.com . en.
  2. G. Sapiro, R. Kimmel, D. Shaked, B. Kimia, and A. M. Bruckstein. Implementing continuous-scale morphology via curve evolution. Pattern Recognition, 26(9):1363–1372, 1993.