Getis–Ord statistics explained

Getis–Ord statistics, also known as Gi*, are used in spatial analysis to measure the local and global spatial autocorrelation. Developed by statisticians Arthur Getis and J. Keith Ord they are commonly used for Hot Spot Analysis[1] [2] to identify where features with high or low values are spatially clustered in a statistically significant way. Getis-Ord statistics are available in a number of software libraries such as CrimeStat, GeoDa, ArcGIS, PySAL[3] and R.[4] [5]

Local statistics

There are two different versions of the statistic, depending on whether the data point at the target location

i

is included or not[6]

Gi=

\sumjwijxj
\sumjxj

*
G
i

=

\sumjwijxj
\sumjxj

Here

xi

is the value observed at the

ith

spatial site and

wij

is the spatial weight matrix which constrains which sites are connected to one another. For
*
G
i
the denominator is constant across all observations.

A value larger (or smaller) than the mean suggests a hot (or cold) spot corresponding to a high-high (or low-low) cluster. Statistical significance can be estimated using analytical approximations as in the original work[7] [8] however in practice permutation testing is used to obtain more reliable estimates of significance for statistical inference.

Global statistics

The Getis-Ord statistics of overall spatial association are[9]

G=

\sumij,wijxixj
\sumij,xixj

G*=

\sumijwijxixj
\sumijxixj

The local and global

G*

statistics are related through the weighted average
\sumixi
*
G
i
\sumixi

=

\sumijxiwijxj
\sumixi\sumjxj

=G*

The relationship of the

G

and

Gi

statistics is more complicated due to the dependence of the denominator of

Gi

on

i

.

Relation to Moran's I

Moran's I is another commonly used measure of spatial association defined by

I=

N
W
\sumijwij(xi-\bar{x
)(x

j-\bar{x})}{\sumi(xi-\bar{x})2}

where

N

is the number of spatial sites and

W=\sumijwij

is the sum of the entries in the spatial weight matrix. Getis and Ord show[7] that

I=(K1/K2)G-K2\bar{x}\sumi(wi+w)xi+K2\bar{x}2W

Where

wi=\sumjwij

,

w=\sumjwji

,

K1=\left(\sumij,xixj\right)-1

and

K2=

W
N

\left(\sumi(xi-\bar{x})2\right)-1

. They are equal if

wij=w

is constant, but not in general.

Ord and Getis also show that Moran's I can be written in terms of

*
G
i

I=

1
W

\left(\sumiziVi

*
G
i

-N\right)

where

zi=(xi-\bar{x})/s

,

s

is the standard deviation of

x

and
2
V
i

=

1
N-1

\sumj\left(wij-

1
N

\sumkwik\right)2

is an estimate of the variance of

wij

.

See also

Notes and References

  1. Web site: RPubs - R Tutorial: Hotspot Analysis Using Getis Ord Gi .
  2. Web site: Hot Spot Analysis (Getis-Ord Gi*) (Spatial Statistics)—ArcGIS Pro | Documentation .
  3. https://pysal.org/
  4. Web site: R-spatial/Spdep . .
  5. 10.1007/s11749-018-0599-x . R.S. . Bivand . D.W. . Wong . 2018 . Comparing implementations of global and local indicators of spatial association. . Test . 27 . 3 . 716–748. 11250/2565494 . free .
  6. Web site: Local Spatial Autocorrelation (2) .
  7. 10.1111/j.1538-4632.1992.tb00261.x . A. . Getis . J.K. . Ord . 1992 . The analysis of spatial association by use of distance statistics . . 24 . 3 . 189–206.
  8. 10.1111/j.1538-4632.1995.tb00912.x . A. . Getis . J.K. . Ord . 1995 . Local spatial autocorrelation statistics: distributional issues and an application . . 27 . 4 . 286–306.
  9. Web site: How High/Low Clustering (Getis-Ord General G) works—ArcGIS Pro | Documentation .