Tjøstheim's coefficient explained

Tjøstheim's coefficient[1] is a measure of spatial association that attempts to quantify the degree to which two spatial data sets are related. Developed by the Norwegian statistician Dag Tjøstheim, it is similar to rank correlation coefficients such as Spearman's rank correlation coefficient and the Kendall rank correlation coefficient, but it also explicitly considers the spatial relationship between the variables.

Consider two variables, $F(x,y)$ and $G(x,y)$, observed at the same set of $N$ spatial locations with coordinates $(x_i, y_i)$, $i = 1, \dots, N$. The rank of $F$ at $(x_i,y_i)$ is

$$R_F(x_i,y_i) = \sum_{j=1}^{N} \theta\big(F(x_i,y_i) - F(x_j,y_j)\big)$$

with a similar definition for $G$. Here $\theta$ is a step function, equal to 1 for non-negative arguments and 0 otherwise, so this formula counts how many values $F(x_j,y_j)$ are less than or equal to the value at the target point $F(x_i,y_i)$.
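The rank definition can be illustrated with a minimal Python sketch. The function names `step` and `ranks` and the sample values are hypothetical, chosen only for this illustration; the step function is assumed to equal 1 at zero so that ties count as "less than or equal".

```python
def step(u):
    """Step function, assumed here to satisfy step(0) = 1."""
    return 1 if u >= 0 else 0


def ranks(values):
    """Rank of each value: the number of values less than or equal to it."""
    return [sum(step(vi - vj) for vj in values) for vi in values]


# Hypothetical F values observed at four locations.
f = [2.5, 0.3, 1.7, 4.1]
print(ranks(f))  # [3, 1, 2, 4]
```

With distinct values the ranks are a permutation of 1, ..., N; tied values share the same (highest) rank under this definition.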

Now define

$$X_F(i) = \sum_{j=1}^{N} x_j\,\delta\big(i, R_F(x_j,y_j)\big)$$

where $\delta$ is the Kronecker delta. This is the $x$ coordinate of the $i$th ranked $F$ value. The quantities $Y_F(i)$, $X_G(i)$ and $Y_G(i)$ can be defined similarly.
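As a sketch of this step, the Kronecker-delta sum can be written out literally in Python. The helper names and the four-point layout are hypothetical, and the values are assumed to be distinct so that each rank from 1 to N occurs exactly once.

```python
def ranks(values):
    """Rank of each value: how many values are less than or equal to it."""
    return [sum(1 for vj in values if vj <= vi) for vi in values]


def kronecker_delta(a, b):
    return 1 if a == b else 0


def coordinate_by_rank(coordinate, values):
    """Return [C(1), ..., C(N)]: the coordinate found at the i-th ranked value.

    Assumes all values are distinct (no tied ranks).
    """
    n = len(values)
    r = ranks(values)
    return [
        sum(coordinate[j] * kronecker_delta(i, r[j]) for j in range(n))
        for i in range(1, n + 1)
    ]


# Hypothetical layout: four locations on the corners of a unit square.
x = [0.0, 1.0, 0.0, 1.0]
y = [0.0, 0.0, 1.0, 1.0]
f = [2.5, 0.3, 1.7, 4.1]  # ranks are [3, 1, 2, 4]

print(coordinate_by_rank(x, f))  # [1.0, 0.0, 0.0, 1.0]
print(coordinate_by_rank(y, f))  # [0.0, 1.0, 0.0, 1.0]
```

In practice the same result is obtained more cheaply by sorting the locations by value; the delta sum is kept here only to mirror the formula.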

Tjøstheim's coefficient is defined by[2]

$$A = \frac{\sum_{i=1}^{N}\left[(X_F(i)-\bar{X}_F)(X_G(i)-\bar{X}_G) + (Y_F(i)-\bar{Y}_F)(Y_G(i)-\bar{Y}_G)\right]}{\left(\sum_{i=1}^{N}\left[(X_F(i)-\bar{X}_F)^2 + (Y_F(i)-\bar{Y}_F)^2\right]\,\sum_{i=1}^{N}\left[(X_G(i)-\bar{X}_G)^2 + (Y_G(i)-\bar{Y}_G)^2\right]\right)^{1/2}}$$
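A minimal Python sketch of the coefficient follows, under two stated assumptions: the values of each variable are distinct (no tied ranks), and the rank-ordered coordinates are obtained by sorting rather than by the Kronecker-delta sum. The function names and sample data are hypothetical and are not taken from any package.

```python
from math import sqrt


def coords_by_rank(coords, values):
    """Coordinates reordered so that entry i belongs to the (i+1)-th smallest value.

    Assumes all values are distinct.
    """
    order = sorted(range(len(values)), key=lambda j: values[j])
    return [coords[j] for j in order]


def tjostheim(coords, f, g):
    """Tjøstheim's coefficient A for variables f and g observed at coords."""
    pf = coords_by_rank(coords, f)
    pg = coords_by_rank(coords, g)
    n = len(coords)
    # Means of the rank-ordered coordinates (each is a permutation of coords).
    mxf = sum(x for x, _ in pf) / n
    myf = sum(y for _, y in pf) / n
    mxg = sum(x for x, _ in pg) / n
    myg = sum(y for _, y in pg) / n
    num = sum(
        (xf - mxf) * (xg - mxg) + (yf - myf) * (yg - myg)
        for (xf, yf), (xg, yg) in zip(pf, pg)
    )
    den_f = sum((x - mxf) ** 2 + (y - myf) ** 2 for x, y in pf)
    den_g = sum((x - mxg) ** 2 + (y - myg) ** 2 for x, y in pg)
    return num / sqrt(den_f * den_g)


# Hypothetical data on the corners of a unit square.
coords = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
f = [2.5, 0.3, 1.7, 4.1]
print(tjostheim(coords, f, f))  # identical rankings give A = 1
```

When the i-th ranked values of the two variables always fall at the same location, the numerator equals the denominator and A is 1; pairing each location with its mirror image through the centroid gives A = -1.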

Under the assumptions that $F$ and $G$ are independent and identically distributed random variables and are independent of each other, it can be shown that $E[A]=0$ and

$$\operatorname{var}(A) = \frac{\left(\sum_{i=1}^{N} x_i^2\right)^2 + 2\left(\sum_{i=1}^{N} x_i y_i\right)^2 + \left(\sum_{i=1}^{N} y_i^2\right)^2}{(N-1)\left(\sum_{i=1}^{N} x_i^2 + \sum_{i=1}^{N} y_i^2\right)^2}$$
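The variance under independence depends only on the layout of the locations, which the following Python sketch makes concrete. It assumes that the coordinates entering the formula are measured from the centroid of the locations (an assumption made here so that the result does not depend on the choice of origin); the function name and the sample grid are hypothetical.

```python
from math import sqrt


def variance_of_a(coords):
    """Variance of A under independence, computed from the locations alone.

    Assumption: coordinates are centred on their centroid before use.
    """
    n = len(coords)
    mx = sum(x for x, _ in coords) / n
    my = sum(y for _, y in coords) / n
    xs = [x - mx for x, _ in coords]
    ys = [y - my for _, y in coords]
    sxx = sum(x * x for x in xs)
    syy = sum(y * y for y in ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    return (sxx ** 2 + 2 * sxy ** 2 + syy ** 2) / ((n - 1) * (sxx + syy) ** 2)


# Hypothetical 3 x 2 grid of locations.
grid = [(float(i), float(j)) for i in range(3) for j in range(2)]
var_a = variance_of_a(grid)
print(var_a, sqrt(var_a))  # variance and standard deviation of A under independence
```

Because the expected value of A is zero under the same assumptions, the square root of this variance gives a natural scale against which an observed value of A can be compared.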

The maximum variance of $1/(N-1)$ occurs when all points are on a straight line, and the minimum variance of $1/(2(N-1))$ occurs for a symmetric cross pattern where $x_i y_i = 0$ and $\sum_{i=1}^{N} x_i^2 = \sum_{i=1}^{N} y_i^2$.[3]
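Both extremes can be checked by direct substitution into the variance expression. The sketch below assumes that coordinates are measured from the centroid, so that a straight line can be written as $y_i = a x_i$ (a vertical line follows by exchanging $x$ and $y$), and writes $S = \sum_{i=1}^{N} x_i^2$.

$$\text{Line } (y_i = a x_i):\quad \operatorname{var}(A) = \frac{S^2 + 2a^2 S^2 + a^4 S^2}{(N-1)\left(S + a^2 S\right)^2} = \frac{S^2 (1+a^2)^2}{(N-1)\, S^2 (1+a^2)^2} = \frac{1}{N-1}$$

$$\text{Cross } \Big(x_i y_i = 0,\ \sum_{i=1}^{N} y_i^2 = S\Big):\quad \operatorname{var}(A) = \frac{S^2 + 0 + S^2}{(N-1)\,(2S)^2} = \frac{2S^2}{4(N-1)\,S^2} = \frac{1}{2(N-1)}$$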

Tjøstheim's coefficient is implemented as cor.spatial in the R package SpatialPack.[4] Numerical simulations suggest that $A$ is an effective measure of correlation between variables but is sensitive to the degree of autocorrelation in $F$ and $G$.[3]

Notes and References

  1. Tjøstheim, D. (1978). A measure of association for spatial variables. Biometrika. 65 (1): 109–114. doi:10.1093/biomet/65.1.109. JSTOR 2335284.
  2. Vallejos, Ronny; Osorio, Felipe; Bevilacqua, Moreno (2020). Spatial relationships between two georeferenced variables: With applications in R. Springer Cham. ISBN 978-3-030-56681-4. doi:10.1007/978-3-030-56681-4.
  3. Glick, B.J. (1982). A Spatial Rank-Order Correlation Measure. Geographical Analysis. 14 (2): 177–181. doi:10.1111/j.1538-4632.1982.tb00066.x.
  4. http://spatialpack.mat.utfsm.cl