Sparse network explained

In network science, a sparse network has much fewer links than the possible maximum number of links within that network (the opposite is a dense network). The study of sparse networks is a relatively new area primarily stimulated by the study of real networks, such as social and computer networks.^[1]

The notion of much fewer links is, of course, colloquial and informal. While a threshold for a particular network may be invented, there is no universal threshold that defines what much fewer actually means. As a result, there is no formal sense of sparsity for any finite network, despite widespread agreement that most empirical networks are indeed sparse. There is, however, a formal sense of sparsity in the case of infinite network models, determined by the behavior of the number of edges (M) and/or the average degree ((k)) as the number of nodes (N) goes to infinity.^[2]

Definitions

A simple unweighted network of size

is called sparse if the number of links

in it is much smaller than the maximum possible number of links

M_max

:^[1]

M\llM_max={N\choose2}

In any given (real) network, the number of nodes N and links M are just two numbers, therefore the meaning of the much smaller sign (

\ll

above) is purely colloquial and informal, and so are statements like "many real networks are sparse."

However, if we deal with a synthetic graph sequence

G_N

, or a network model that is well defined for networks

G_N

of any size N = 1,2,...,

infty

, then the

\ll

attains its usual formal meaning:

M\llM_max\iffM=o(M_max)\iff\lim_{N → infty}

	M
	M_max

In other words, a network sequence or model

G_N

is called dense or sparse depending on whether the (expected) average degree

\langlek\rangle=2M/N

G_N

scales linearly or sublinearly with N:^[2] ^[3]

G_N

is dense if

\langlek\rangle=O(N)

;

G_N

is sparse if

\langlek\rangle=o(N)

An important subclass of sparse networks are networks whose average degree is either constant or converges to a constant. Some authors call only such networks sparse, while others reserve special names for them: ^[4]

G_N

is truly sparse or extremely sparse or ultrasparse if

\langlek\rangle=O(1)

There also exist alternative, stricter definitions of network sparsity requiring the convergence of the degree distribution in

G_N

to a well defined limit at

N → infty

.^[5] According to this definition, the N-star graph

S_N

, for example, is not sparse.

Node degree distribution

The node degree distribution changes with the increasing connectivity. Different link densities in the complex networks have different node-degree distribution, as Flickr Network Analysis suggests. The sparsely connected networks have a scale free, power law distribution. With increasing connectivity, the networks show increasing divergence from power law. One of the main factors, influencing on the network connectivity is the node similarity. For instance, in social networks, people are likely to be linked to each other if they share common social background, interests, tastes, beliefs, etc. In context of biological networks, proteins or other molecules are linked if they have exact or complementary fit of their complex surfaces.^[6]

Common terminology

If the nodes in the networks are not weighted, the structural components of the network can be shown through adjacency matrix. If the most elements in the matrix are zero, such matrix is referred as sparse matrix. In contrast, if most of the elements are nonzero, then the matrix is dense. The sparsity or density of the matrix is identified by the fraction of the zero element to the total number of the elements in the matrix. Similarly, in the context of graph theory, if the number of links is close to its maximum, then the graph would be known as dense graph. If the number of links is lower than the maximum number of links, this type of graphs are referred as sparse graph.^[7]

Applications

Sparse Network can be found in social, computer and biological networks, as well as, its applications can be found in transportation, power-line, citation networks, etc. Since most real networks are large and sparse, there were several models developed to understand and analyze them.^[8] These networks have inspired sparse network-on-chip design in multiprocessor embedded computer engineering.

Sparse networks also induce cheaper computations by making it efficient to store the network as an Adjacency list, rather than an Adjacency matrix. For example, when using an adjacency list, iterating over a node's neighbors can be achieved in O(M/N), whereas it is achieved in O(N) with an adjacency matrix.^[2]

Notes and References

Book: Barabási. Albert-László. Network Science. 2015. Cambridge University Press. 25 May 2015.
Book: Newman. Mark. Networks 2nd Edition. 14 Feb 2021.
Book: Bollobás. Béla. Random Graphs. 1985. Academic Press.
Janson. Svante. On Edge Exchangeable Random Graphs. 2018. J Stat Phys. 173. 3–4. 448–484. 10.1007/s10955-017-1832-9. 30930480. 6405020. 1702.06396 . 2018JSP...173..448J .
Book: van der Hofstad. Remco. Random Graphs and Complex Networks. 2017. Cambridge University Press. 10.1017/9781316779422. 9781316779422.
Scholz. Matthias. Node similarity as a basic principle behind connectivity in complex networks. Journal of Data Mining and Digital Humanities. 7 January 2015. 2015 . 77. 10.46298/jdmdh.33 . 221799 . 25 May 2015. free. 1010.0803.
Web site: Nykamp. Duane Q.. An introduction to networks. Math Insight. 25 May 2015.
Web site: Gribonval. Rémi. Sparse Models, Algorithms and Learning for Large-scale data. SMALL. 25 May 2015.