Chan's algorithm explained

In computational geometry, Chan's algorithm,^[1] named after Timothy M. Chan, is an optimal output-sensitive algorithm to compute the convex hull of a set

points, in 2- or 3-dimensional space. The algorithm takes

O(nlogh)

time, where

is the number of vertices of the output (the convex hull). In the planar case, the algorithm combines an

O(nlogn)

algorithm (Graham scan, for example) with Jarvis march (

O(nh)

), in order to obtain an optimal

O(nlogh)

time. Chan's algorithm is notable because it is much simpler than the Kirkpatrick–Seidel algorithm, and it naturally extends to 3-dimensional space. This paradigm^[2] has been independently developed by Frank Nielsen in his Ph.D. thesis.^[3]

Algorithm

Overview

A single pass of the algorithm requires a parameter

which is between 0 and

(number of points of our set

). Ideally,

m=h

but

, the number of vertices in the output convex hull, is not known at the start. Multiple passes with increasing values of

are done which then terminates when

m\geqh

(see below on choosing parameter

The algorithm starts by arbitrarily partitioning the set of points

into

K=\lceiln/m\rceil

subsets

(Q_k)_k=1,2,...K

with at most

points each; notice that

K=O(n/m)

For each subset

Q_k

, it computes the convex hull,

C_k

, using an

O(plogp)

algorithm (for example, Graham scan), where

is the number of points in the subset. As there are

subsets of

O(m)

points each, this phase takes

K ⋅ O(mlogm)=O(nlogm)

time.

During the second phase, Jarvis's march is executed, making use of the precomputed (mini) convex hulls,

(C_k)_k=1,2,...K

. At each step in this Jarvis's march algorithm, we have a point

p_i

in the convex hull (at the beginning,

p_i

may be the point in

with the lowest y coordinate, which is guaranteed to be in the convex hull of

), and need to find a point

p_i+1=f(p_i,P)

such that all other points of

are to the right of the line

p_ip_i+1

, where the notation

p_i+1=f(p_i,P)

simply means that the next point, that is

p_i+1

, is determined as a function of

p_i

and

. The convex hull of the set

Q_k

C_k

, is known and contains at most

points (listed in a clockwise or counter-clockwise order), which allows to compute

f(p_i,Q_k)

O(logm)

time by binary search. Hence, the computation of

f(p_i,Q_k)

for all the

subsets can be done in

O(Klogm)

time. Then, we can determine

f(p_i,P)

using the same technique as normally used in Jarvis's march, but only considering the points

(f(p_i,Q_k))_1\leq

(i.e. the points in the mini convex hulls) instead of the whole set

. For those points, one iteration of Jarvis's march is

O(K)

which is negligible compared to the computation for all subsets. Jarvis's march completes when the process has been repeated

O(h)

times (because, in the way Jarvis march works, after at most

iterations of its outermost loop, where

is the number of points in the convex hull of

, we must have found the convex hull), hence the second phase takes

O(Khlogm)

time, equivalent to

O(nlogh)

time if

is close to

(see below the description of a strategy to choose

such that this is the case).

By running the two phases described above, the convex hull of

points is computed in

O(nlogh)

time.

Choosing the parameter

If an arbitrary value is chosen for

, it may happen that

m<h

. In that case, after

steps in the second phase, we interrupt the Jarvis's march as running it to the end would take too much time.At that moment, a

O(nlogm)

time will have been spent, and the convex hull will not have been calculated.

The idea is to make multiple passes of the algorithm with increasing values of

; each pass terminates (successfully or unsuccessfully) in

O(nlogm)

time. If

increases too slowly between passes, the number of iterations may be large; on the other hand, if it rises too quickly, the first

for which the algorithm terminates successfully may be much larger than

, and produce a complexity

O(nlogm)>O(nlogh)

Squaring Strategy

One possible strategy is to square the value of

at each iteration, up to a maximum value of

(corresponding to a partition in singleton sets).^[4] Starting from a value of 2, at iteration

m=min

	2^t
\left(n,2

\right)

is chosen. In that case,

O(loglogh)

iterations are made, given that the algorithm terminates once we have

	2^t
2

\geqh\ifflog\left(

	2^t
2

\right)\geqlogh\iff2^t\geqlogh\ifflog{2^t}\geqlog{logh}\ifft\geqlog{logh},

with the logarithm taken in base

, and the total running time of the algorithm is

	\lceilloglogh\rceil
\sum
	t=0

O\left(nlog

	2^t
\left(2

\right)\right)=O(n)

	\lceilloglogh\rceil
\sum
	t=0

2^t=O\left(n ⋅ 2^1+\lceil\right)=O(nlogh).

In three dimensions

To generalize this construction for the 3-dimensional case, an

O(nlogn)

algorithm to compute the 3-dimensional convex hull by Preparata and Hong should be used instead of Graham scan, and a 3-dimensional version of Jarvis's march needs to be used. The time complexity remains

O(nlogh)

.^[1]

Pseudocode

In the following pseudocode, text between parentheses and in italic are comments. To fully understand the following pseudocode, it is recommended that the reader is already familiar with Graham scan and Jarvis march algorithms to compute the convex hull,

, of a set of points,

Input: Set

with

points .

Output: Set

with

points, the convex hull of

(Pick a point of

which is guaranteed to be in

: for instance, the point with the lowest y coordinate.)

(This operation takes

l{O}(n)

time: e.g., we can simply iterate through

p₁:=PICK\_START(P)

(

p₀

is used in the Jarvis march part of this Chan's algorithm,

so that to compute the second point,

p₂

, in the convex hull of

(Note:

p₀

is not a point of

(For more info, see the comments close to the corresponding part of the Chan's algorithm.)

p₀:=(-infty,0)

(Note:

, the number of points in the final convex hull of

, is not known.)

(These are the iterations needed to discover the value of

, which is an estimate of

(

h\leqm

is required for this Chan's algorithm to find the convex hull of

(More specifically, we want

h\leqm\leqh²

, so that not to perform too many unnecessary iterations

and so that the time complexity of this Chan's algorithm is

l{O}(nlogh)

(As explained above in this article, a strategy is used where at most

loglogn

iterations are required to find

(Note: the final

may not be equal to

, but it is never smaller than

and greater than

h²

(Nevertheless, this Chan's algorithm stops once

iterations of the outermost loop are performed,

that is, even if

m ≠ h

, it doesn't perform

iterations of the outermost loop.)

(For more info, see the Jarvis march part of this algorithm below, where

is returned if

p_i+1==p₁

for

1\leqt\leqloglogn

(Set parameter

for the current iteration. A "squaring scheme" is used as described above in this article.

There are other schemes: for example, the "doubling scheme", where

m=2^t

, for

t=1,...,\left\lceillogh\right\rceil

If the "doubling scheme" is used, though, the resulting time complexity of this Chan's algorithm is

l{O}(nlog²h)

	2^t
m:=2

(Initialize an empty list (or array) to store the points of the convex hull of

, as they are found.)

C:=

ADD(C,p₁₎

(Arbitrarily split set of points

into

K=\left\lceil

	n
	m

\right\rceil

subsets of roughly

elements each.)

Q_1,Q_2,...,Q_K:=SPLIT(P,m)

(Compute the convex hull of all

subsets of points,

Q_1,Q_2,...,Q_K

(It takes

l{O}(Kmlogm)=l{O}(nlogm)

time.)

m\leqh²

, then the time complexity is

l{O}(nlogh²⁾=l{O}(nlogh)

for

1\leqk\leqK

(Compute the convex hull of subset

Q_k

, using Graham scan, which takes

l{O}(mlogm)

time.)

(

C_k

is the convex hull of the subset of points

Q_k

C_k:=GRAHAM\_SCAN(Q_k)

(At this point, the convex hulls

C_1,C_2,...,C_K

of respectively the subsets of points

Q_1,Q_2,...,Q_K

have been computed.)

(Now, use a modified version of the Jarvis march algorithm to compute the convex hull of

(Jarvis march performs in

l{O}(nh)

time, where

is the number of input points and

is the number of points in the convex hull.)

(Given that Jarvis march is an output-sensitive algorithm, its running time depends on the size of the convex hull,

(In practice, it means that Jarvis march performs

iterations of its outermost loop.

At each of these iterations, it performs at most

iterations of its innermost loop.)

(We want

h\leqm\leqh²

, so we do not want to perform more than

iterations in the following outer loop.)

(If the current

is smaller than

, i.e.

m<h

, the convex hull of

cannot be found.)

(In this modified version of Jarvis march, we perform an operation inside the innermost loop which takes

l{O}(logm)

time.

Hence, the total time complexity of this modified version is

l{O}(mKlogm)=l{O}(m\left\lceil

	n
	m

\right\rceillogm)=l{O}(nlogm)=l{O}(nlog

	2^t
2

)=l{O}(n2^t).

m\leqh²

, then the time complexity is

l{O}(nlogh²⁾=l{O}(nlogh)

for

1\leqi\leqm

(Note: here, a point in the convex hull of

is already known, that is

p₁

(In this inner for loop,

possible next points to be on the convex hull of

q_i,1,q_i,2,...,q_i,K

, are computed.)

(Each of these

possible next points is from a different

C_k

that is,

q_i,k

is a possible next point on the convex hull of

which is part of the convex hull of

C_k

(Note:

q_i,1,q_i,2,...,q_i,K

depend on

: that is, for each iteration

, there are

possible next points to be on the convex hull of

(Note: at each iteration

, only one of the points among

q_i,1,q_i,2,...,q_i,K

is added to the convex hull of

for

1\leqk\leqK

(

JARVIS\_BINARY\_SEARCH

finds the point

d\inC_k

such that the angle

\measuredanglep_i-1p_id

is maximized,

where

\measuredanglep_i-1p_id

is the angle between the vectors

\overrightarrow{p_ip_i-1

} and

\overrightarrow{p_id}

. Such

is stored in

q_i,k

(Angles do not need to be calculated directly: the orientation test can be used .)

(

JARVIS\_BINARY\_SEARCH

can be performed in

l{O}(logm)

time.)

(Note: at the iteration

i=1

p_i-1=p₀=(-infty,0)

and

p₁

is known and is a point in the convex hull of

in this case, it is the point of

with the lowest y coordinate.)

q_i,k:=JARVIS\_BINARY\_SEARCH(p_i-1,p_i,C_k)

(Choose the point

z\in\{q_i,1,q_i,2,...,q_i,K\}

which maximizes the angle

\measuredanglep_i-1p_iz

to be the next point on the convex hull of

p_i+1:=JARVIS\_NEXT\_CH\_POINT(p_i-1,p_i,(q_i,1,q_i,2,...,q_i,K))

(Jarvis march terminates when the next selected point on the convext hull,

p_i+1

, is the initial point,

p₁

p_i+1==p₁

(Return the convex hull of

which contains

i=h

points.)

(Note: of course, no need to return

p_i+1

which is equal to

p₁

return

C:=(p_1,p_2,...,p_i)

else

ADD(C,p_i+1)

(If after

iterations a point

p_i+1

has not been found so that

p_i+1==p₁

, then

m<h

(We need to start over with a higher value for

Implementation

Chan's paper contains several suggestions that may improve the practical performance of the algorithm, for example:

When computing the convex hulls of the subsets, eliminate the points that are not in the convex hull from consideration in subsequent executions.
The convex hulls of larger point sets can be obtained by merging previously calculated convex hulls, instead of recomputing from scratch.
With the above idea, the dominant cost of algorithm lies in the pre-processing, i.e., the computation of the convex hulls of the groups. To reduce this cost, we may consider reusing hulls computed from the previous iteration and merging them as the group size is increased.

Extensions

Chan's paper contains some other problems whose known algorithms can be made optimal output sensitive using his technique, for example:

L(S)

of a set

line segments, which is defined as the lower boundary of the unbounded trapezoid of formed by the intersections.

Hershberger^[5] gave an

O(nlogn)

algorithm which can be sped up to

O(nlogh)

, where h is the number of edges in the envelope

Constructing output sensitive algorithms for higher dimensional convex hulls. With the use of grouping points and using efficient data structures,

O(nlogh)

complexity can be achieved provided h is of polynomial order in

Notes and References

Chan . Timothy M.. Optimal output-sensitive convex hull algorithms in two and three dimensions. 10.1007/BF02712873 . free. Discrete & Computational Geometry. 16. 361–368. 1996. 4.
Book: Nielsen . Frank. Grouping and Querying: A Paradigm to Get Output-Sensitive Algorithms. Discrete and Computational Geometry. Lecture Notes in Computer Science. 1763. 250–257. 2000. 10.1007/978-3-540-46515-7_21 . 978-3-540-67181-7. free.
Frank Nielsen. "Adaptive Computational Geometry".Ph.D. thesis, INRIA, 1996.
Chazelle . Bernard . Bernard Chazelle. Matoušek . Jiří . Jiří Matoušek (mathematician). Derandomizing an output-sensitive convex hull algorithm in three dimensions. Computational Geometry. 5. 27–32. 1995. 10.1016/0925-7721(94)00018-Q . free.
Hershberger . John. Finding the upper envelope of n line segments in O(n log n) time. Information Processing Letters. 33. 4. 169–174. 1989. 10.1016/0020-0190(89)90136-1.

Chan's algorithm explained

Algorithm

Overview

Choosing the parameter

Squaring Strategy

In three dimensions

Pseudocode

Implementation

Extensions

See also

Notes and References