Katz centrality explained

In graph theory, the Katz centrality or alpha centrality of a node is a measure of centrality in a network. It was introduced by Leo Katz in 1953 and is used to measure the relative degree of influence of an actor (or node) within a social network. Unlike typical centrality measures which consider only the shortest path (the geodesic) between a pair of actors, Katz centrality measures influence by taking into account the total number of walks between a pair of actors.

It is similar to Google's PageRank and to the eigenvector centrality.

Measurement

Katz centrality computes the relative influence of a node within a network by measuring the number of the immediate neighbors (first degree nodes) and also all other nodes in the network that connect to the node under consideration through these immediate neighbors. Connections made with distant neighbors are, however, penalized by an attenuation factor $\alpha$. Each path or connection between a pair of nodes is assigned a weight determined by $\alpha$ and the distance between nodes as $\alpha^d$.

For example, in the figure on the right, assume that John's centrality is being measured and that $\alpha = 0.5$. The weight assigned to each link that connects John with his immediate neighbors Jane and Bob will be $(0.5)^1 = 0.5$. Since Jose connects to John indirectly through Bob, the weight assigned to this connection (composed of two links) will be $(0.5)^2 = 0.25$. Similarly, the weight assigned to the connection between Agneta and John through Aziz and Jane will be $(0.5)^3 = 0.125$ and the weight assigned to the connection between Agneta and John through Diego, Jose and Bob will be $(0.5)^4 = 0.0625$.
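
The arithmetic in this example can be reproduced with a minimal Python sketch. The function name `walk_weight` is an illustrative choice, not something taken from the source; it only evaluates $\alpha^d$ for a connection made of $d$ links.

```python
def walk_weight(alpha: float, distance: int) -> float:
    """Return alpha**distance, the weight of a connection made of `distance` links."""
    if distance < 1:
        raise ValueError("distance must be a positive integer")
    return alpha ** distance


# Weights for connections of one to four links with alpha = 0.5,
# matching the John / Jane / Bob / Jose / Agneta example.
weights = {d: walk_weight(0.5, d) for d in range(1, 5)}
print(weights)  # {1: 0.5, 2: 0.25, 3: 0.125, 4: 0.0625}
```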

Mathematical formulation

Let A be the adjacency matrix of a network under consideration. Elements $(a_{ij})$ of A are variables that take a value 1 if a node i is connected to node j and 0 otherwise. The powers of A indicate the presence (or absence) of links between two nodes through intermediaries. For instance, in matrix $A^3$, if element $(a_{2,12}) = 1$, it indicates that node 2 and node 12 are connected through some walk of length 3. If $C_{\mathrm{Katz}}(i)$ denotes Katz centrality of a node i, then, given a value $\alpha \in (0,1)$, mathematically:

$$C_{\mathrm{Katz}}(i) = \sum_{k=1}^{\infty} \sum_{j=1}^{n} \alpha^k (A^k)_{ji}$$

Note that the above definition uses the fact that the element at location $(i,j)$ of $A^k$ reflects the total number of $k$-degree connections between nodes $i$ and $j$. The value of the attenuation factor $\alpha$ has to be chosen such that it is smaller than the reciprocal of the absolute value of the largest eigenvalue of A. In this case the following expression can be used to calculate Katz centrality:

$$\overrightarrow{C}_{\mathrm{Katz}} = ((I - \alpha A^T)^{-1} - I)\overrightarrow{I}$$

Here $I$ is the identity matrix, $\overrightarrow{I}$ is a vector of size n (n is the number of nodes) consisting of ones. $A^T$ denotes the transposed matrix of A and $(I - \alpha A^T)^{-1}$ denotes matrix inversion of the term $(I - \alpha A^T)$.
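
As a rough check that the series definition and the closed-form expression agree, the following Python sketch evaluates both on a small hypothetical network: an undirected path on three nodes with $\alpha = 1/2$, which is below $1/\sqrt{2}$, the reciprocal of that graph's largest eigenvalue. The helper names (`solve`, `katz_closed_form`, `katz_series`) and the use of exact `Fraction` arithmetic are assumptions made for illustration. The closed form is obtained here by solving $(I - \alpha A^T)y = \overrightarrow{I}$ and subtracting one, rather than by forming the inverse explicitly.

```python
from fractions import Fraction


def transpose(m):
    return [list(row) for row in zip(*m)]


def mat_vec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def solve(m, b):
    """Solve m x = b by Gauss-Jordan elimination (assumes m is invertible)."""
    n = len(m)
    aug = [list(row) + [rhs] for row, rhs in zip(m, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def katz_closed_form(adj, alpha):
    """((I - alpha A^T)^{-1} - I) 1, computed as y - 1 where (I - alpha A^T) y = 1."""
    n = len(adj)
    at = transpose(adj)
    m = [[(1 if i == j else 0) - alpha * at[i][j] for j in range(n)]
         for i in range(n)]
    y = solve(m, [Fraction(1)] * n)
    return [yi - 1 for yi in y]


def katz_series(adj, alpha, terms):
    """Partial sum over k = 1..terms of alpha^k * sum_j (A^k)_{ji}."""
    n = len(adj)
    at = transpose(adj)
    walk_counts = [Fraction(1)] * n      # (A^T)^0 applied to the ones vector
    total = [Fraction(0)] * n
    weight = Fraction(1)
    for _ in range(terms):
        walk_counts = mat_vec(at, walk_counts)   # now (A^T)^k applied to ones
        weight *= alpha                          # now alpha^k
        total = [t + weight * w for t, w in zip(total, walk_counts)]
    return total


# Hypothetical example: undirected path 0 - 1 - 2.
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
alpha = Fraction(1, 2)

exact = katz_closed_form(adj, alpha)
approx = katz_series(adj, alpha, terms=40)
print([str(x) for x in exact])
print([float(x) for x in approx])
```

For this graph the closed form gives centralities 2, 3 and 2, so the middle node ranks highest, and the 40-term partial sum differs from these values by less than $10^{-5}$.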

An extension of this framework allows walks to be computed in a dynamical setting. Taking a time-dependent series of adjacency snapshots of the transient edges shows how walks that pass through successive snapshots contribute towards a cumulative effect. The arrow of time is preserved, so that the contribution of activity is asymmetric in the direction of information propagation.

Consider a network producing data of the form:

$$\left\{A^{[k]} \in \mathbb{R}^{N \times N}\right\} \qquad \text{for} \quad k = 0, 1, 2, \ldots, M,$$

representing the adjacency matrix at each time $t_k$. Hence:

$$\left(A^{[k]}\right)_{ij} = \begin{cases} 1 & \text{there is an edge from node } i \text{ to node } j \text{ at time } t_k \\ 0 & \text{otherwise} \end{cases}$$

The time points $t_0 < t_1 < \cdots < t_M$ are ordered but not necessarily equally spaced.

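Let $\mathcal{Q} \in \mathbb{R}^{N \times N}$ be the matrix for which $(\mathcal{Q})_{ij}$ is a weighted count of the number of dynamic walks from node $i$ to node $j$, with each walk of length $w$ weighted by $\alpha^w$. The form for the dynamic communicability between participating nodes is:

$$\mathcal{Q} = \left(I - \alpha A^{[0]}\right)^{-1} \cdots \left(I - \alpha A^{[M]}\right)^{-1}.$$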

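This can be normalized via:

$$\hat{\mathcal{Q}}^{[k]} = \frac{\hat{\mathcal{Q}}^{[k-1]} \left(I - \alpha A^{[k]}\right)^{-1}}{\left\| \hat{\mathcal{Q}}^{[k-1]} \left(I - \alpha A^{[k]}\right)^{-1} \right\|}.$$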

Centrality measures that quantify how effectively node $n$ can 'broadcast' and 'receive' dynamic messages across the network are therefore:

$$C_n^{\mathrm{broadcast}} := \sum_{k=1}^{N} \mathcal{Q}_{nk} \quad \text{and} \quad C_n^{\mathrm{receive}} := \sum_{k=1}^{N} \mathcal{Q}_{kn}.$$
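
The dynamic communicability matrix and the broadcast and receive centralities can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the helper names, the exact `Fraction` arithmetic, and the two-snapshot example (an edge from node 0 to node 1 at the first time point, then an edge from node 1 to node 2 at the second) are all hypothetical. The normalization step is omitted.

```python
from fractions import Fraction


def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]


def mat_mul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def inverse(m):
    """Invert a square matrix by Gauss-Jordan elimination (assumes invertibility)."""
    n = len(m)
    aug = [list(row) + unit for row, unit in zip(m, identity(n))]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def dynamic_communicability(snapshots, alpha):
    """Product of the resolvents (I - alpha A[k])^{-1}, taken in time order."""
    n = len(snapshots[0])
    q = identity(n)
    for adj in snapshots:
        shifted = [[Fraction(int(i == j)) - alpha * adj[i][j] for j in range(n)]
                   for i in range(n)]
        q = mat_mul(q, inverse(shifted))
    return q


def broadcast(q):
    """Row sums of Q: how effectively each node sends dynamic messages."""
    return [sum(row) for row in q]


def receive(q):
    """Column sums of Q: how effectively each node receives dynamic messages."""
    return [sum(col) for col in zip(*q)]


# Hypothetical snapshots: edge 0 -> 1 at t_0, then edge 1 -> 2 at t_1.
a0 = [[0, 1, 0], [0, 0, 0], [0, 0, 0]]
a1 = [[0, 0, 0], [0, 0, 1], [0, 0, 0]]
alpha = Fraction(1, 2)

q = dynamic_communicability([a0, a1], alpha)
q_reversed = dynamic_communicability([a1, a0], alpha)

print([str(x) for x in broadcast(q)])
print([str(x) for x in receive(q)])
print(q[0][2], q_reversed[0][2])
```

In this example node 0 reaches node 2 through a dynamic walk of length two, so the corresponding entry of the matrix is $\alpha^2 = 1/4$. With the snapshots in reverse order that entry is zero, which illustrates the asymmetry in time: the edge from node 1 to node 2 would then occur before the message could arrive at node 1.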

Alpha centrality

Given a graph with adjacency matrix $A_{i,j}$, Katz centrality is defined as follows:

$$\vec{x} = (I - \alpha A^T)^{-1}\vec{e} - \vec{e}$$

where $e_j$ is the external importance given to node $j$, and $\alpha$ is a nonnegative attenuation factor which must be smaller than the inverse of the spectral radius of $A$. The original definition by Katz[1] used a constant vector $\vec{e}$. Hubbell[2] introduced the usage of a general $\vec{e}$.

Half a century later, Bonacich and Lloyd[3] defined alpha centrality as:

$$\vec{x} = (I - \alpha A^T)^{-1}\vec{e}$$

which is essentially identical to Katz centrality. More precisely, the score of a node $j$ differs exactly by $e_j$, so if $\vec{e}$ is constant the order induced on the nodes is identical.
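
The relationship between the two definitions can be illustrated with a short Python sketch. Instead of inverting the matrix, it approximates $(I - \alpha A^T)^{-1}\vec{e}$ with the fixed-point iteration $\vec{x} \leftarrow \alpha A^T \vec{x} + \vec{e}$. This is a substituted technique chosen for brevity, and it converges only when $\alpha$ is below the inverse of the spectral radius of $A$. The function names, the three-node path graph and the non-constant vector $\vec{e}$ are hypothetical.

```python
def alpha_centrality(adj, alpha, e, iterations=200):
    """Approximate (I - alpha A^T)^{-1} e by iterating x <- alpha A^T x + e."""
    n = len(adj)
    x = list(e)
    for _ in range(iterations):
        x = [alpha * sum(adj[j][i] * x[j] for j in range(n)) + e[i]
             for i in range(n)]
    return x


def katz_centrality(adj, alpha, e, iterations=200):
    """Katz score: the alpha centrality score minus the external importance."""
    x = alpha_centrality(adj, alpha, e, iterations)
    return [xi - ei for xi, ei in zip(x, e)]


# Hypothetical example: undirected path 0 - 1 - 2, with all external
# importance placed on node 0.
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
e = [3.0, 0.0, 0.0]

alpha_scores = alpha_centrality(adj, 0.5, e)
katz_scores = katz_centrality(adj, 0.5, e)
print(alpha_scores)
print(katz_scores)
```

Here the scores converge to approximately (4.5, 3, 1.5) for alpha centrality and (1.5, 3, 1.5) for Katz centrality, so the two differ exactly by $\vec{e}$. Because $\vec{e}$ is deliberately non-constant in this example, the induced rankings differ: node 0 ranks first under alpha centrality and node 1 under Katz centrality.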

Applications

Katz centrality can be used to compute centrality in directed networks such as citation networks and the World Wide Web.

Katz centrality is more suitable in the analysis of directed acyclic graphs where traditionally used measures like eigenvector centrality are rendered useless.
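
One way to see the difficulty with eigenvector centrality is that the adjacency matrix of a directed acyclic graph is nilpotent, so all of its eigenvalues are zero and repeated multiplication by $A^T$ collapses to the zero vector. The Katz series, by contrast, becomes a finite sum. The Python sketch below illustrates both effects on a hypothetical three-node graph with edges 0 → 1, 0 → 2 and 1 → 2; the function names are illustrative assumptions.

```python
from fractions import Fraction


def mat_vec_transposed(adj, v):
    """Return A^T v for a square adjacency matrix given as nested lists."""
    n = len(adj)
    return [sum(adj[j][i] * v[j] for j in range(n)) for i in range(n)]


def katz_on_dag(adj, alpha):
    """Exact Katz scores on a DAG: walks have at most n - 1 edges, so the series is finite."""
    n = len(adj)
    walks = [Fraction(1)] * n
    scores = [Fraction(0)] * n
    weight = Fraction(1)
    for _ in range(n - 1):
        walks = mat_vec_transposed(adj, walks)   # incoming walks of length k
        weight *= alpha                          # alpha^k
        scores = [s + weight * w for s, w in zip(scores, walks)]
    return scores


def power_iteration_steps(adj, steps):
    """Unnormalized power iteration v <- A^T v, the basis of eigenvector centrality."""
    v = [1] * len(adj)
    for _ in range(steps):
        v = mat_vec_transposed(adj, v)
    return v


# Hypothetical DAG: 0 -> 1, 0 -> 2, 1 -> 2.
dag = [[0, 1, 1],
       [0, 0, 1],
       [0, 0, 0]]

katz_scores = katz_on_dag(dag, Fraction(1, 2))
collapsed = power_iteration_steps(dag, 3)
print([str(s) for s in katz_scores])
print(collapsed)
```

With $\alpha = 1/2$ the Katz scores are 0, 1/2 and 5/4, which still distinguishes the three nodes, while three steps of the power iteration already yield the zero vector.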

Katz centrality can also be used in estimating the relative status or influence of actors in a social network. One case study applied a dynamic version of Katz centrality to data from Twitter, focusing on particular brands that have stable discussion leaders. The application allowed the methodology to be compared with the judgment of human experts, and the results were in agreement with a panel of social media experts.

In neuroscience, Katz centrality has been found to correlate with the relative firing rate of neurons in a neural network. The temporal extension of Katz centrality has been applied to fMRI data obtained from a musical learning experiment in which data were collected from the subjects before and after the learning process. The results show that the changes to the network structure over the musical exposure yielded, in each session, a quantification of the cross communicability that produced clusters in line with the success of learning.

A generalized form of Katz centrality can be used as an intuitive ranking system for sports teams, such as in college football.[4]

Alpha centrality is implemented in the igraph library for network analysis and visualization.[5]

Notes and References

  1. Katz, Leo (1953). "A new status index derived from sociometric analysis". Psychometrika. 18 (1): 39–43. doi:10.1007/BF02289026. S2CID 121768822.
  2. Hubbell, Charles H. (1965). "An input-output approach to clique identification". Sociometry. 28 (4): 377–399. doi:10.2307/2785990. JSTOR 2785990.
  3. Bonacich, P.; Lloyd, P. (2001). "Eigenvector-like measures of centrality for asymmetric relations". Social Networks. 23 (3): 191–201. doi:10.1016/S0378-8733(01)00038-7. CiteSeerX 10.1.1.226.2113.
  4. Park, Juyong; Newman, M. E. J. (31 October 2005). "A network-based ranking system for American college football". Journal of Statistical Mechanics: Theory and Experiment. 2005 (10): P10014. doi:10.1088/1742-5468/2005/10/P10014. ISSN 1742-5468. arXiv:physics/0505169. S2CID 15120571.
  5. "Welcome to igraph's new home" (web site).