## Abstract

A binary state on a graph means an assignment of binary values to its vertices. A time-dependent sequence of binary states is referred to as binary dynamics. We describe a method for the classification of binary dynamics of digraphs, using particular choices of closed neighbourhoods. Our motivation and application comes from neuroscience, where a directed graph is an abstraction of neurons and their connections, and where the simplification of large amounts of data is key to any computation. We present a topological/graph theoretic method for extracting information out of binary dynamics on a graph, based on a selection of a relatively small number of vertices and their neighbourhoods. We consider existing and introduce new real-valued functions on closed neighbourhoods, comparing them by their ability to accurately classify different binary dynamics. We describe a classification algorithm that uses two parameters and sets up a machine learning pipeline. We demonstrate the effectiveness of the method on simulated activity on a digital reconstruction of cortical tissue of a rat, and on a nonbiological random graph with similar density.

## Author Summary

We explore the mathematical concept of a closed neighbourhood in a digraph in relation to classifying binary dynamics on a digraph, with particular emphasis on dynamics on a neuronal network. Using methodology based on selecting neighbourhoods and vectorising them by combinatorial and topological parameters, we experimented with a dataset implemented on the Blue Brain Project reconstruction of a neocortical column, and on an artificial neural network with random underlying graph implemented on the NEST simulator. In both cases the outcome was run through a support vector machine algorithm reaching classification accuracy of up to 88% for the Blue Brain Project data and up to 81% for the NEST data. This work is open to generalisation to other types of networks and the dynamics on them.

## INTRODUCTION

A *binary state* on a graph means an assignment of binary values to its vertices. A motivating example in this
article appears in the context of neuroscience. If one encodes the connectivity of a
neuronal network as a directed graph, then the spikes produced by the neurons at an
instant of time is a binary state on the encoding graph. Allowing time to vary and
recording the spiking patterns of the neurons in the network produces an example of
a *binary dynamics* on the
encoding graph, namely a one-parameter
family of binary states on its vertices. A network of neurons that
receives external signals and responds to those signals thus generates a binary
dynamics. Binary dynamics appear in other contexts as well (Gleeson, 2008; Samuelsson
& Socolar, 2006), but in this paper we use networks of spiking
neurons as a primary example.

The *signal classification problem*, that is, the task of correctly
pairing a signal injected into a neuronal network with the response of the network,
or in other words, identifying the incoming signal from the response, is generally
very challenging. This paper proposes a methodology by which this task can be
approached and provides scenarios in which this methodology is successful.

Considering raw binary states on a large graph is generally quite problematic for a
number of reasons. First, the sheer number of theoretically possible states makes
analysing a collection of them a daunting task (Churchland & Abbott, 2016; Fan
& Markram, 2019). Moreover, natural systems such as neuronal
networks tend to be very noisy, in the sense that the emerging dynamics from the
same stimulus may take a rather large variety of forms (Cunningham & Yu, 2014; Stein, Gossen, & Jones, 2005). Finally, it is a
general working hypothesis in studying network dynamics that the network structure
affects its function (Bargmann & Marder,
2013; Chambers & MacLean,
2016; Curto & Morrison,
2019; Rubinov & Sporns,
2010). This paradigm in neuroscience is often encapsulated by the slogan
“*neurons that fire together tend to wire
together*.” Hence, when studying dynamics on a neuronal network, it
makes sense to examine assemblies of vertices, or subgraphs, and the way in which
they behave as dynamical subunits, instead of considering individual vertices in the
network (Babichev, Ji, Mémoli, &
Dabaghian, 2016; Curto & Itskov,
2008; Milo et al., 2002).

In previous studies we considered cliques in a directed graph, with various orientations of the connections between nodes, as basic units from which one could extract information about binary dynamics (Govc, Levi, & Smith, 2021; M. W. Reimann et al., 2017). However, the results in these papers fell short of suggesting an efficient classifier of binary dynamics (Govc et al., 2021, Sections 4.1–4.2). Indeed, when we applied the methods of Govc et al. (2021) and M. W. Reimann et al. (2017) to the main dataset used in this paper, we obtained unsatisfactory classification accuracy. This suggests that in a graph that models a natural system, cliques may be too small to carry the amount of information required for classification of a noisy signal. This motivates us to build our classification strategy on neuron assemblies, where the richer structure serves a dual purpose of amalgamating dynamical information and regulating the noise inherent in single neurons or cliques.

The guiding hypothesis of this paper is that a collection of vertex assemblies,
forming a subgraph of the ambient connectivity graph encoding a network, can be used
in classification of binary dynamics on the network. A network of spiking neurons is
our primary example. Taking this hypothesis as a guideline, we introduce a very
flexible feature generation methodology that takes as input binary dynamics on a digraph 𝒢 induced on a
preselected collection of subgraphs of 𝒢, and turns it into a feature
vector, which an then be used in machine learning classification. The neighbourhood
of a vertex *v* in the graph 𝒢, namely the subgraph of
𝒢 that is induced by *v* and all its neighbours in 𝒢,
suggests itself naturally as a type of subgraph to be considered in this procedure,
and is a central object of study in this paper. Vertex neighbourhoods have been
studied extensively in graph theory and its applications (Kartun-Giles & Bianconi, 2019). An outline is given
below and a full description in Methods.

The way we apply the method can be summarised as follows. Given a directed graph
𝒢, we use a variety of real valued vertex functions that we refer to as *selection parameters* and are derived from the neighbourhood of
each vertex, to create a sorted list of the vertices. With respect to each such
parameter, we pick the “top performing” vertices and select their
neighbourhoods. To that collection of subgraphs we apply our feature generation
method, which is based again on applying the same parameters to the selected
neighbourhoods, now in the role of *feature parameters*. All the
parameters we use are invariant under isomorphism of directed graphs, that is, graph
properties that remain unchanged when the vertices are permuted while leaving their
connectivity intact. Therefore we occasionally refer to certain parameters as
“graph invariants.”

The choice of parameters is related to measures of network connectivity and
architecture. For instance, the parameters **fcc** and **tcc** (see Table 1) are examples of measures of
functional segregation (Rubinov & Sporns,
2010). The parameters we refer to as *spectral parameters* arise in spectral graph theory (Chung, 2005)
and are prevalent in many applications, including in neuroscience. For instance, the
paper de Lange, de Reus, and van den Heuvel
(2014) studies the Laplacian spectrum of the macroscopic anatomical neural networks of macaques and cats, and the
microscopic network of the *C. elegans*. The topological parameters,
such as the Euler characteristic **ec** and Betti numbers are classical topological invariants. In M. W. Reimann et al. (2017) these were
used in various ways to extract information on structure and function and their
interaction in the Blue Brain Project reconstruction on the neocortical column. The
parameter **size** is a natural parameter associated to any graph and is
closely related to firing rate in neuroscience. However, most of the parameters we
tested were never examined in a neuroscientific context. Our aim was to investigate
which parameters may prove useful in classification of binary dynamics without
making any assumptions about their relevance. It is exactly this approach that
allowed us to discover that certain spectral parameters perform strongly as
selection parameters, while others do not. At the same time a newly introduced
topological parameter, “normalised Betti coefficient” **nbc** shows strong performance as a feature parameter when tested on neighbourhoods with
low selection parameter values, but not on high selection values.

**Table 1.**

Abbreviation
. | Short Description
. |
---|---|

fcc | Clustering coefficient (Fagiolo) |

tcc | Transitive clustering coefficient |

ec | Euler characteristic |

nbc | Normalised Betti coefficient |

size | Number of vertices in the graph |

asg | Adjacency spectral gap |

asr | Adjacency spectral radius |

blsg | Bauer Laplacian spectral gap |

blsr | Bauer Laplacian spectral radius |

clsg | Chung Laplacian spectral gap |

clsr | Chung Laplacian spectral radius |

tpsg | Transition probability spectral gap |

tpsr | Transition probability spectral radius |

Abbreviation
. | Short Description
. |
---|---|

fcc | Clustering coefficient (Fagiolo) |

tcc | Transitive clustering coefficient |

ec | Euler characteristic |

nbc | Normalised Betti coefficient |

size | Number of vertices in the graph |

asg | Adjacency spectral gap |

asr | Adjacency spectral radius |

blsg | Bauer Laplacian spectral gap |

blsr | Bauer Laplacian spectral radius |

clsg | Chung Laplacian spectral gap |

clsr | Chung Laplacian spectral radius |

tpsg | Transition probability spectral gap |

tpsr | Transition probability spectral radius |

*Note*. See Supporting
Information Table S1 for additional parameters.

The primary test of our methods in this paper is done on data generated by the Blue Brain Project that was also used in M. Reimann et al. (2021) for signal classification by established neuroscience methodology. The data consists of eight families of neuronal stimuli that are injected in a random sequence to the digital reconstruction of the neocortical column of a young rat. This reconstructed microcircuit consists of approximately 31,000 neurons and 8,000,000 synaptic connections, and is capable of receiving neuronal signals and responding to them in a biologically accurate manner (Markram et al., 2015). We used 60% of the data to train a support vector machine, and the remaining 40% for classification. With our methods we are able to achieve classification accuracy of up to 88%.

In this article we did not attempt to explain the relevance of any of the mathematical concepts we use to neuroscience, as our main aim was to discover and investigate the utility of various concepts. However, in M. Reimann et al. (2021) the same dataset is studied by standard techniques of computational neuroscience combined with the ideas presented in this paper. In particular, it is shown that an informed choice of neighbourhood improves classification accuracy when compared to traditional methods. Interestingly, the selection of neighbourhoods that improved performance with the technique presented in M. Reimann et al. (2021) show reduced performance with the techniques presented in this article, and vice versa. In both projects a classification accuracy of nearly 90% was achievable, but with different selection parameters (see Results). This suggests that considering vertex neighbourhoods as computational units can be beneficial in more than one way.

To further test our methods in different settings, we used the NEST neural simulation tool (Jordan et al., 2019) to generate neuronal networks. This software package simulates network models of spiking neurons by using simplified neuron models to allow more flexibility and faster processing speed. We created a collection of eight families of stimuli, but on random graphs with varying densities, and applied our machinery to that dataset. Here again we obtained classification accuracy of up to 81%.

Important work on (open) vertex neighbourhoods was reported recently in Kartun-Giles and Bianconi (2019). Our approach is independent of this work and is different from it in a number of ways. Most significantly, we do not study the structure of the entire graph and its dynamical properties by means of its full neighbourhood structure. Instead, we aim to infer dynamical properties of the graph from a relatively small collection of vertices, selected by certain graph theoretic and topological properties, and their neighbourhoods.

High-resolution figures and supplementary material is available at the Aberdeen Neurotopology Group webpage. In particular, we included a comprehensive visualization of spectral graph invariants of the Blue Brain Project graph, as well as other types of stochastically generated graphs, animations of some of the background work for this project, and a list of links to software implementing the methodology described in this paper.

## RESULTS

We start with a brief description of the mathematical formalism used in this article and our approach to classification tasks. This is intended to make the section accessible to readers without a strong mathematical background. We then proceed by describing our main data source and the setup and implementation of our experiments. Following this preparation we present our results, validation experiments, and an application of the same techniques in a different setup.

### A Brief Introduction to the Mathematical Formalism

In this article a *digraph* will always mean a finite collection
of vertices (nodes) *V* and a finite collection of oriented edges
(arcs) *E*. Reciprocal edges between a pair of vertices are
allowed, but multiple edges in the same orientation between a fixed pair of
vertices and self-loops are not allowed.

The fundamental mathematical concept essential for our discussion is that of the
neighbourhood of a vertex in a digraph (Figure 1). Let 𝒢 be a digraph, and let *v*_{0} be any vertex in 𝒢. The *neighbours* of *v*_{0} in 𝒢
are all vertices that are “one step away” from *v*_{0}, in either direction. The *neighbourhood* of *v*_{0} in
𝒢 is the subgraph of 𝒢 induced by *v*_{0} and all its neighbours, which we denote by *N*_{𝒢}(*v*_{0}). The
vertex *v*_{0} is referred to as the *centre* of its neighbourhood.

Numerical invariants of digraphs can be found in pure and applied graph theory literature, many of those found their uses in theoretical neuroscience (see Rubinov & Sporns, 2010, for a good survey). Some such invariants are used in this article, and a few are introduced here for the first time (e.g., transitive clustering coefficient). Other parameters we used are defined by using topological constructions that arise from digraphs. Such constructions are typically invariant under digraph isomorphism. Standard tools of algebraic topology can then be used to extract numerical invariants of graphs in ways that take emerging higher dimensional structure into account.

There are many ways in which one can associate a topological space with a
digraph. In this article we use the *directed flag complex*. It
is a topological space made out of gluing together *simplices* in
different dimensions, starting at 0-simplices (points), 1-simplices (edges),
2-simplices (triangles), 3-simplices (tetrahedra), etc. The *n*-simplices in a directed flag complex associated to a digraph
are its directed (*n* + 1)-cliques, namely the ordered subsets of
vertices {*v*_{0}, *v*_{1},
…, *v*_{n}}, such that there is
an edge from *v*_{i} to *v*_{j} for all *i* < *j*. Figure 2 shows
the directed flag complex associated to a small digraph. The directed flag
complex was introduced and used for topologically analysing structural and
functional properties of the Blue Brain Project reconstruction of the
neocortical columns of a rat (M. W. Reimann et
al., 2017). The interested reader may find a comprehensive survey of
directed flag complexes and other topological concepts in the Materials and
Methods section of M. W. Reimann et al.
(2017). If *v*_{0} is a vertex in 𝒢,
we denote by TR_{𝒢}(*v*_{0}) the directed
flag complex of *N*_{𝒢}(*v*_{0}).

### The Classification Method

We now describe briefly our approach to classification of binary dynamics. For a precise mathematical definition of what we mean by binary dynamics see Methods. The task at hand can be described as follows. We are given a large set of instantiations of binary dynamics on a fixed digraph 𝒢, each of which is labelled by a symbol from some relatively small set. The label of each binary dynamics is unique and known. The aim is to produce a machine learning compatible topological summary for each binary dynamics, so that when the summaries are introduced in a random order, one can train on part of the data with known labels and predict the unknown labels of the remaining part.

The *first step* is selection of neighbourhoods. For each vertex *v* in the digraph 𝒢 we consider its neighbourhood *N*_{𝒢}(*v*) and the
associated directed flag complex Tr_{𝒢}(*v*). We
then compute a variety of numerical graph
parameters of *N*_{𝒢}(*v*) and topological
parameters of Tr_{𝒢}(*v*). These parameters are
used to create a ranked list of vertices in 𝒢. We then select for each
parameter 50 vertices that obtained the top (or bottom) values with respect to
that parameter. We now have a set of 50 neighbourhoods corresponding to each
parameter. A parameter that is used in this step is referred to as a *selection parameter*, and we denote it by *P*. A short summary of the main parameters we used with their
abbreviations is in Table 1. A detailed
description of the parameters is given in Methods.

In the *second step* we introduce binary dynamics in 𝒢.
Each instantiation of the dynamics consists of several consecutive time bins (in
our experiments we used two, but there is no limitation). For each time bin we
consider the neurons that were active and the subgraph that they induce in each
of the neighbourhoods we preselected. This gives us, for each selection
parameter and each time bin, a set of 50 subgraphs that correspond to a
particular instantiation of binary dynamics on 𝒢.

The *third step* is vectorising the data, that is, a computation
of the same graph parameters and topological parameters for each of the
subgraphs resulting from the second step. When we use our parameters in the
vectorisation process they are referred to as *feature
parameters*, and are denoted by *Q*. This now gives a
vector corresponding to each instantiation of the dynamics, and the pair
(*P*, *Q*) of selection and feature
parameters.

The *fourth and final step* is injecting the data into a support
vector machine. In this project we used 60% of the data for training and
the remaining for testing. See Figure 3 for
a schematic summary of the process.

We note that the method described here is an example of a much more general methodology that is described in detail in the Methods section of this article. In particular, the graph and topological parameters that we chose to work with are selected from within the abundance of mathematical concepts that arise in graph theory, combinatorics, and topology. We do not attempt in this article to associate a neuroscientific meaning to these parameters.

### The Data

Our main source of data is a simulation that was run on a Blue Brain Project
reconstruction of the microcircuitry of the somatosensory cortex in the brain of
a rat (Markram et al., 2015). From this
model we extract the connectivity of the microcircuit in the form of a digraph
whose vertices correspond to neurons, and with an edge from *v* to *u* if there is a synaptic connection from the neuron
corresponding to *v* to the one corresponding to *u*. We denote the Blue Brain Project digraph by 𝒢.
The digraph consists of 31,346 vertices and 7,803,528 edges. The connectivity
matrix of this specific circuit, as well as 41 other instantiations of the
reconstruction, is accessible on the Digital Reconstruction of Neocortical
Microcircuitry website.

The binary dynamics we experimented with consists of eight stimuli families labelled 0–7. For each stimulus a random subset (10%) of afferent neurons is activated. The stimuli differ with respect to which subset of afferent neurons is activated, where afferents can be shared between stimuli. The probability of a given afferent being associated with two given stimuli is 1%. In each stimulation time window one and only one stimulus is presented. The stimuli were injected into the circuit in a random sequence of 200 milliseconds per stimulus, and 557 repeats for each stimulus label. The dataset thus consists of 4,456 binary dynamics functions. The task is to determine the label of that stimulus, that is, the expected output is an integer from 0 to 7. Thus, the chance level performance would be 12.5%. More detail on the source of data, biological analysis, and an alternative approach to classification of the same data is in M. Reimann et al. (2021).

### Setup

We computed all the graph parameters listed in Table 1, as well as additional parameters listed in the Supporting Information, for all
neighbourhoods in the digraph (see Supporting
Information, Data and Code, for a brief description of computational
methods and links to software). We fixed a positive integer *M*,
and for each selection parameter *P* we selected the vertices *v*_{1}, *v*_{2}, …, *v*_{M}, whose neighbourhoods *N*_{𝒢}(*v*_{1}),
…, *N*_{𝒢}(*v*_{M})
obtained the top (or bottom) *M* values of the parameter *P* (see Step II in Methods). We experimented with *M* = 20, 50, 100, and
200. Here we report on the results we obtained for *M* = 50,
which provided the highest classification accuracy. For *M* = 20
performance was strong as well, but for *M* = 100 and 200 the
improvement compared to *M* = 50 was relatively minor, and not
worth the additional time and computation needed.

### Vector Summaries

Each binary dynamics in our dataset has time parameter *t* between
0 and 200 milliseconds. The subinterval [0, 60] is where almost all the spiking
activity is concentrated across the interval. Furthermore, the bulk of the
stimulus is injected in the first 10 ms. Since we aimed to classify the response
to the stimulus rather than the stimulus itself, we chose Δ = [10, 60]
and divided that interval into two 25 ms subintervals, as experimentation showed
that these choices provide the highest classification accuracy (see Step I in Methods).

We denote each instantiation of binary dynamics on 𝒢 by *B*^{n}, for *n* = 1,
…, 4,456. Each instantiation consists of two binary states $B1n$, $B2n$ corresponding to the neurons that fired in each of the 25 ms subintervals. For
each selection parameter *P*, and each of the corresponding
neighbourhoods *N*_{𝒢}(*v*_{m}), *m* = 1, …, 50, we computed the subgraphs *N*_{m,k} of *N*_{𝒢}(*v*_{m})
induced by the binary state $Bkn$,
that is, the subgraph induced by the neurons that fired in the given interval.
This gave us, for each binary dynamics *B*^{n} and each graph parameter *P*, a 2 × 50 matrix $UnP$ of subgraphs of 𝒢, whose (*m*, *k*) entry
is $Nm,kn$ (see Step II in Methods).

Finally, for each graph parameter *Q* (from the same list of
parameters) we applied *Q* to the entries of the matrix $UnP$ to obtain a numerical feature matrix $UnP,Q$ corresponding to the binary dynamics function *B*^{n}, the selection parameter *P*, and the feature parameter *Q*. The matrix $UnP,Q$ is a vector summary of the binary dynamics *B*^{n} (see Step III in Methods).

### Classification

For each pair of graph parameters (*P*, *Q*) the
vector summaries {$UnP,Q$}
were fed into a support vector machine (SVM) algorithm. Our classification
pipeline was implemented in Python using the scikit-learn package and the SVC implementation therein. The SVC was initialised with default
settings and we used a 60/40 train/test split. The kernel used was Radial Basis
function. We used one-versus-one approach for multiclass classification. For
cross-validation we used standard fivefold cross-validation in scikit-learn, https://scikit-learn.org/stable/about.html. The results are
presented in Figure 4.

For each of the selection parameters we tested, we considered both the
neighbourhoods that obtained the top 50 values and those that obtained the
bottom 50 values. In all the experiments, four parameters gave markedly better
performance when used as feature parameters than all other parameters: Euler
characteristic (**ec**), normalised Betti coefficient
(**nbc**), **size**, and Bauer Laplacian spectral radius
(**blsr**). All four perform significantly better than other
feature parameters when the neighbourhoods were selected by bottom value
parameters. With respect to top value selection parameters, **ec** and **size** performed well, while **nbc** and **blsr** were significantly weaker as feature parameters, except
when coupled with Chung Laplacian spectral gap (**clsg**). The
neighbourhoods selected by top values of selection parameters gave best results
when the selection parameter was one of the spectral graph invariants, while
selecting by bottom value of selection parameters, the two types of clustering
coefficients (**cc** and **tcc**) and Euler characteristic
(**ec**) performed best.

Interestingly, the two best performing feature parameters, Euler characteristic
and size, gave good results across all selection parameters, and performed
almost equally well, regardless of whether the neighbourhoods were selected by
top or bottom selection parameter value. This suggests that, at least in this
particular network, the choice of feature parameter plays a much more important
role in classification accuracy than any specific selection parameter. On the
other hand, examining the rows of the best performing feature parameters, in Figure 4, we see a difference of up to
27% (top **ec**), 40% (top **nbc**), and
18% (top **size**) in classification accuracy, depending on
which selection parameter is used, suggesting that, within a fixed choice of a
feature parameter, the selection parameter may play an important role in the
capability of the respective neighbourhoods to encode binary dynamics. Note that
randomly classifying the eight stimuli gives an accuracy of 12.5%.

### Validation

In order to validate our methods, we created five experiments, the results of which we then compared to a subset of the original tests. In each case we retrained the SVM algorithm and then retested.

A motivating idea in neuroscience in general, and in this work in particular, is that structure is strongly related to function. Our approach, using neighbourhoods sorted by graph parameters and using the same graph parameters as feature parameters is proposed in this article as a useful way of discovering combinations of parameters that achieve good classification results of binary dynamics. To test the validity of this proposal, we challenged our assumptions in five different ways, as described below.

#### Random selection.

In this simple control experiment we test the significance of the selection parameter by comparing the results to a random choice of 50 vertices and performing the same vector summary procedure on their neighbourhoods. Twenty iterations of this experiment were performed, and the results for each feature parameter were compared to the outcome for the same feature parameter and the selection parameter with respect to which this feature parameter performed best. The results are described in Figure 5.

We observe that in almost all cases reported here, a choice of neighbourhoods determined by a selection parameter outperforms a random choice (in some cases marginally). We also note that in all those cases the performance of a choice informed by one of these selection parameters exhibits a more consistent behaviour in terms of classification accuracy. This can be seen from the considerably larger error bars in the case neighbourhoods are selected at random. On the other hand, for some feature parameters a random choice does not seem to be a disadvantage, even compared to the best selection parameter with respect to this feature parameter (Supporting Information Figure S3). This suggests that while selection and generation of vector summary by objective parameters are advantageous, experimentation is generally necessary in order to decide which parameters best fit the classification task.

#### Neighbourhood versus centre.

A working hypothesis in this paper is that neighbourhoods carry more
information about a binary dynamics than individual vertices. We examined
for each selection of 50 neighbourhoods by a graph parameter, as described
above, the classification capability of the centres of these neighbourhoods.
Specifically, this experiment is identical to the original classification
experiment, except for each selection parameter *P* the two
rows of the corresponding feature matrix have binary values, where the *j*-th entry in row *i* is set to be 1 if
the *j*-th neuron in the sorted list fired in the *i*-th time bin at least once and 0 otherwise. These
feature vectors were then used in the classification task by using the same
train and test methodology. For each of the selection parameters we tested,
we considered both the top 50 and the bottom 50 neurons in the corresponding
sorted list.

The results of this experiment were compared with the original experiments
and are shown in Figure 6. We note that
in all cases a very significant drop in performance occurs. Interestingly,
some vertices in the top 50 of a sorted list show classification accuracy
that is far better than random, while the bottom 50 give performance
comparable to random (for example, **fcc**). In some cases,
however, the bottom 50 vertices give better performance than the top 50.
This suggests that the selection parameters play a role in classification
accuracy even before considering the activity in the neighbourhood of a
vertex.

We also note that for almost all top valued selection parameters recorded in Figure 6 and some of the bottom valued ones, the classification performance using the centre alone is significantly better than random. This observation reinforces the idea that selection parameters inform on the capability of neurons to inform on activity.

#### Neighbourhoods versus arbitrary subgraphs.

For each selection parameter we considered the degrees of the 50 selected
centres. For a centre *v*_{i} of
degree *d*_{i} we then selected at
random *d*_{i} vertices in the
ambient graph and considered the subgraph induced by those vertices and the
centre *v*_{i}. We used these 50
subgraphs in place of the original neighbourhoods. In this way we create for
each centre a new subgraph with the same vertex count as the original
neighbourhoods that is unrelated to the centres in any other controllable
way. We extracted feature vectors by using these subgraphs for each of the
selection parameters and repeated the classification experiment. The results
were compared to the original results with respect to the strongest
performing feature parameter. Notice that these are always either **ec** or **size**, both of which can be applied to an
arbitrary digraph, not necessarily a neighbourhood.

The results of this experiment were compared with the original experiments,
and are shown in Figure 7. There is a
clear drop in performance for all selection parameters except **fcc** (Fagiolo’s clustering coefficient; see Methods). Furthermore, classification
using these subgraphs shows considerably larger error bars. This suggests
that using neighbourhoods with our methodology is advantageous. One
explanation for this may be the tighter correlation of activity among
neurons in a neighbourhood, compared to an arbitrary subgraph of the same
size in the network, but we did not attempt to verify this hypothesis.

#### Fake neighbourhoods.

In this experiment we considered for each centre its degree and selected at random the corresponding number of vertices from the ambient graph. We then modified the adjacency matrix of the ambient graph so that the centre is connected to each of the vertices selected in the appropriate direction, so as to preserve the centre’s in- and out-degree. Computationally, this amounts to applying a random permutation to the row and the column of each of the centres. The result is a new ambient graph, where the old centres are now centres of new neighbourhoods. We extracted feature vectors using these “fake neighbourhoods” and repeated the classification experiment. The results were compared with the original classification. The outcome is illustrated in Figure 8.

We note that with respect to almost all selection parameters there is a
significant drop in performance resulting from this modification. The one
exception is **fcc**, where **ec** as a feature parameter
actually sometimes gives slightly better results, but with a large error
bar. It is interesting that the results are similar for some of the
parameters to those observed in previous experiment (Figure 7), but quite different for others.
However, the drop in performance is similar in both cases. We make no
hypothesis attempting to explain these observations.

#### Shuffled activity.

In this experiment we applied a random permutation *σ* of the neuron indices in the Blue Brain Project microcircuit, so that neuron *σ*(*i*) now receives the spike
train (sequence of spikes) of neuron *i* for each stimulus.
That is, we precompose the binary dynamics with *σ* to
get a new binary dynamics, which still appears in eight varieties, since the
operation of permuting the neuron indices is bijective. In other words, we
can reconstruct the original activity from the shuffled activity by applying
the inverse permutation *σ*^{−1}. The
same selection and feature parameters were used and the resulting data was
used for training and testing. The results are shown in Figure 9.

We observe again that there is a significant drop in performance resulting from this shuffling. This is quite surprising since the shuffled activity spike train should give eight families of stimuli that carry some sort of internal resemblance, and since we retrained and tested with these stimuli, one could expect that the classification results will be comparable to those of the original experiments. That not being the case suggests that structure and function in the Blue Brain Project reconstruction are indeed tightly related.

### Testing the Method on an Artificial Neuronal Network

To test our methods in a nonbiological binary dynamics setting, we conducted a set of experiments with the NEST simulator (Jordan et al., 2019). The NEST software simulates spiking neuronal network models and offers a vast simplification of neuronal networks that are based on the exact morphology of neurons (such as the Blue Brain Project reconstructions). It also provides great flexibility in the sense that it allows any connectivity graph to be implemented in it and any initial stimulation to be injected into the system with the response modulated by various flexible parameters.

To move as far as possible from a strict biological setup, we generated a number of Erdős–Rényi random digraphs on 1,000 vertices, which we implemented on NEST. We then created eight distinct stimuli, each enervating a random selection of 100 vertices of the graph. A random sequence of stimuli was then created, with each stimulus type repeated 500 times. Our experiment consisted of injecting the sequence of stimuli into the simulator, for a duration of 5 ms, one every 200 ms, to reduce the influence of one stimulus on the next. To introduce some randomness, the start time of each stimulus is randomly selected from the first 10 ms, the strength of each stimulus is multiplied by a random number between 1 and 2, and background noise is included (using NEST’s noise_generator device with strength 3). For each 200-ms interval, the first 10 ms were not included in the classification. As a result some of the input may be included in the classified data, but never more than 4 ms, and for approximately 60% of the 4,000 stimuli the input is completely excluded from classification. The code used to create these experiments is available in Lazovskis (2021), and the experiments are presented visually in Figure 10.

The spikes from this simulation were then extracted and were run through the same pipeline as the Blue Brain Project data. We experimented with graph densities of 0.08, 0.01, and 0.005, and with selections of 10, 20, and 50 neighbourhoods. Figure 11 shows the performance by the selection parameters from Table 1. Size was used in all cases as a feature parameter. The best performance was obtained with 50 neighbourhoods, with graph density of 0.01 in almost all selection parameters. The results of experiments with all parameters can be seen in Supporting Information Figure S5.

Interestingly, the middle graph density of 0.01 consistently performed equally as
well or better than both the denser 0.08 and less dense 0.005 across all feature
parameters, except neighbourhood size (**size**) and adjacency spectral
gap (**asg**). Another interesting observation is that the strongest
selection parameter in this experiment turns out to be normalised Betti
coefficient (**nbc**), or transitive clustering coefficient
(**tcc**), depending on if “strongest” is taken to
mean with the highest individual accuracy or with the highest average accuracy
from cross-validation, respectively. Both of these selection parameters in the
Blue Brain Project experiments exhibited rather mediocre performance (see Figure 4, left). This suggests that different
networks and binary dynamics on them may require experimentation with a
collection of selection (and feature) parameters, in order to optimise the
classification accuracy.

## DISCUSSION

In this paper we examined the concept of a closed neighbourhood in relation to the classification of binary dynamics on a digraph. Regardless of what the source of the binary dynamics is, but with the assumption that it is given in a time series of labelled instantiations, we ask how can the dynamics be read off and classified. In the context of neuroscience, which is our primary motivation for this study, this is a question on the boundary between computational neuroscience and machine learning. Our methods provide a method of addressing this question.

We proposed a methodology that will take as input binary dynamics on a digraph and produce a vector summary of the dynamics by means of combinatorial and/or topological parameters of a relatively small number of neighbourhoods. Using this methodology we experimented with a dataset implemented on the Blue Brain Project reconstruction of the neocortical column of a rat, and on an artificial neural network with random underlying graph implemented on the NEST simulator. In both cases the vector summaries were then run through a support vector machine algorithm that was able to achieve a classification accuracy of up to 88% for the Blue Brain Project data and up to 81% for the NEST data.

We used the same parameters both for selecting neighbourhoods and for the creation of
feature vectors. We saw that certain spectral graph parameters used as selection
parameters perform significantly better than more classical parameters such as
degree and clustering coefficients. We also observed that the parameters that
performed best as feature parameters were the simplest ones, namely *size* and *Euler characteristic*. Comparison to
randomly selected neighbourhoods showed that the methodology works reasonably well
even without selecting the neighbourhoods in an informed way, but that
neighbourhoods selected in a way informed by graph parameters gives in general a
better performance with a much smaller error range.

Our aim was to demonstrate that certain selections of subgraphs, informed by objective structural parameters, carry enough information to allow classification of noisy signals in a network of spiking neurons. In this article the subgraphs selected are closed neighbourhoods, and the selection criteria are our chosen selection parameters. We did not, however show, or attempt to demonstrate, that the use of neighbourhoods as a concept, or graph parameters as a selection mechanism, are the best methodology. The same techniques could be applied to other subgraph selections and other vectorisation methods, which can be analysed by our pipeline with relatively small modifications.

Another aspect of our ideas that was not exploited at all in this project is the use of more than a single graph parameter in the selection procedure. We did show that different parameters are distributed differently in the Blue Brain Project graph, and hence one may hypothesise that optimising neighbourhood selection by two or more parameters may give improved classification accuracy.

As our aim was not to obtain the best classification, but rather to provide a good methodology for ingesting binary dynamics on a digraph and a producing machine learning digestible data stream, we did not experiment with other more sophisticated machine learning algorithms. It is conceivable that doing so may produce even better classification accuracy than what is achieved here.

Finally, our approach is closely related to graph neural networks where convolution is performed by aggregating information from neighbourhoods, that is, for every vertex, features are learned from all the adjacent vertices. The pipeline presented in this paper also takes as input sequences of neural firings and sequences of neuron assemblies that turn the firing patterns into feature values. The interaction of our work and the modelling perspectives from graph neural networks and sequence-to-sequence learning might thus pose an interesting future research question.

## METHODS

### Mathematical Concepts and Definitions

We introduce the basic concepts and notation that are used throughout this
article. By a *digraph* we mean a *finite*, *directed simple graph*, that is, where reciprocal edges
between a pair of vertices are allowed, but multiple edges in the same
orientation between a fixed pair of vertices and self-loops are not allowed.

Topology is the study of topological
spaces—a vast generalisation of geometric objects. In this
paper we only consider spaces that are built out of simplices. Simplices occur
in any dimension *n* ≥ 0, where a 0-simplex is a point, a 1-simplex is a line segment, a
2-simplex is a triangle, a 3-simplex a tetrahedron, and so forth in higher
dimensions. Simplices can be glued together to form a topological space. A good
survey for this material intended primarily for readers with a neuroscience
background can be found in the Materials and Methods section of M. W. Reimann et al. (2017).

We now describe a general setup that associates a family of topological objects with a digraph. A particular case of this setup is the main object of study in this paper.

**Definition 1**. *A* topological operator on digraphs *is an algorithm that associates with a digraph* 𝒢 *a topological space* Γ(𝒢)*, such that
if* 𝓗 ⊆ 𝒢 *is a subgraph then* Γ(𝓗) ⊆ Γ(𝒢) *as a closed
subspace.*

That is, a topological operator on digraphs is a functor from the category of digraphs and digraph inclusions to the category of topological spaces and inclusions. The flag complex of 𝒢 (ignoring orientation), the directed flag complex (Lütgehetmann, Govc, Smith, & Levi, 2020), and the flag tournaplex (Govc et al., 2021) are examples of such operators.

**Definition 2**. *Let* 𝒢 = (*V*, *E*) *be a digraph, and let v*_{0} ∈ *V be any vertex.*

- ▪
*The*neighbours of*v*_{0}in 𝒢*are all vertices v*_{0}≠*v*∈*V that are incident to v*_{0}*.* - ▪
*The*open neighbourhood of*v*_{0}*is the subgraph of*𝒢*induced by the neighbours of v*_{0}*in*𝒢*. The*closed neighbourhood of*v*_{0}in 𝒢*is the subgraph induced by the neighbours of v*_{0}*and v*_{0}*itself.*

We denote the open and closed neighbourhoods of *v*_{0} in
𝒢 by $N\mathcal{G}\xb0$(*v*_{0})
and *N*_{𝒢}(*v*_{0}),
respectively (Figure 12). More
generally:

- ▪
Let

*S*⊆*V*be a subset of vertices. Then $N\mathcal{G}\xb0$(*S*) denotes the union of open neighbourhoods of all*v*∈*S*. Similarly*N*_{𝒢}(*S*) is the union of all closed neighbourhoods of vertices*v*∈*S*.

*S*= {

*v*

_{0},

*v*

_{1}}, and

*v*

_{0}and

*v*

_{1}are incident in 𝒢, then $N\mathcal{G}\xb0$(

*S*) =

*N*

_{𝒢}(

*S*). In this article we will mostly consider closed neighbourhoods. Neighbourhoods are also used in the paper M. Reimann et al. (2021), which is closely related to this article.

**Terminology 1**. Let 𝒢 be a digraph and let *S* be a subset of vertices in 𝒢. Unless explicitly stated otherwise, we
shall from now on refer to the closed neighbourhood of *S* in
𝒢 simply as the *neighbourhood* of *S* in
𝒢. In the case where *S* contains a single vertex *v*_{0}, we will refer to *v*_{0} as the *centre* of *N*_{𝒢}(*v*_{0}).

The topological operator we consider in this article is the directed flag complex of a digraph which we recall next. See Figure 2 for an example.

**Definition 3**. A *directed n-clique* is a digraph,
whose underlying undirected graph is an *n*-clique, and such that
the orientation of its edges determines a linear order on its vertices. An *ordered**simplicial complex* is
a collection *X* of finite ordered sets that is closed under
subsets. The *n*-simplices of an ordered simplicial complex *X* are the sets of cardinality *n* + 1. If
𝒢 is a digraph, then the *directed flag complex* associated to 𝒢 is the ordered simplicial complex whose *n*-simplices are the directed (*n* +
1)-cliques in 𝒢. We denote the directed flag complex of a digraph
𝒢 by |𝒢|.

### Encoding Binary Dynamics on Neighbourhoods

We now describe our approach to classification of binary dynamics on a graph in general terms.

**Definition 4**. Let 𝒢 = (*V*, *E*) be a graph (directed or undirected). A *binary
state* on 𝒢 is a function *β* : *V* → {0, 1}. Equivalently, a binary state on
𝒢 is a partition of *V* into two disjoint subsets that
correspond to *β*^{−1}(0) and *β*^{−1}(1), or alternatively as a
choice of an element of the power set 𝒫(*V*) of *V*. *A binary dynamics* on 𝒢 is a
function *B* : ℝ_{≥0} →
𝒫(*V*) that satisfies the following condition:

- ▪
There is a partition of ℝ

_{≥0}into finitely many half open intervals {[*a*_{i},*b*_{i})$}i=1P$ for some*P*≥ 1, such that*B*is constant on [*a*_{i},*b*_{i}), for all*i*= 1, …,*P*.

Activity in a network of neurons, both natural and artificial, is a canonical example of a binary dynamics on a directed network.

#### Setup.

The task we address in this section is a general classification methodology for binary dynamics functions. Namely, suppose one is given

- ▪
a set of binary dynamics functions {

*B*_{i}|*i*≥ 1} on a fixed ambient graph 𝒢, - ▪
a set of labels 𝓛 = {

*L*_{1},*L*_{2}, …,*L*_{n}}, and - ▪
a labelling function

*L*: {*B*_{i}|*i*≥ 1} → 𝓛.

*functions labeled by the same label are variants of the same event*(without specifying what the event is, or in what way its variants are similar). The aim is to produce a topological summary for each

*B*

_{i}in a way that will make the outcome applicable to standard machine learning algorithms. We next describe our proposed mechanism.

#### Creation of vector summary.

Fix a graph 𝒢 and a real-valued graph parameter *Q*,
that is, a real-valued function taking digraphs as input and whose values
are invariant under graph isomorphisms. Suppose that a set of labeled binary
dynamics functions
{*B*^{n}$}n=1N$ on 𝒢 is given. Select an *M*-tuple
(𝓗_{1}, 𝓗_{2}, …,
𝓗_{M}) of subgraphs of 𝒢,
for some fixed positive integer *M*.

Fix a time interval and divide it into time bins. In each bin, record the
vertex set that showed the value 1, that is, was *active* at
some point during that time bin. For each 1 ≤ *m* ≤ *M*, restrict that set to
𝓗_{m} and record the subgraph
induced by the active vertices. Apply *Q* to obtain a
numerical *M*-tuple, and concatenate the vectors into a long
vector, which encodes all time bins corresponding to the given dynamics.

We now describe the procedure more accurately in three steps.

- I) Interval partition uniformising. Fix an interval
*I*= [*a*,*b*] ⊂ ℝ_{≥0}and a positive integer*K*. Let Δ = $b\u2212aK$. For 1 ≤*k*≤*K*, let*I*_{k}denote the subinterval$Ik=defa+k\u22121\Delta a+k\Delta \u2286ab.$ - II) Subgraph extraction. For 1 ≤
*n*≤*N*and each 1 ≤*m*≤*M*, let $\beta m,kn$ denote the binary state on 𝓗_{m}defined byLet $\U0001d4d7m,kn$ ⊆ 𝓗$\beta m,kn=defv\u2208\U0001d4d7m\u2203t\u2208Iksuchthatv\u2208Bnt.$_{m}be the subgraph induced by all vertices in the set $\beta m,kn$. We refer to $\U0001d4d7m,kn$ as the*active subgraph*of 𝓗_{m}with respect to the binary dynamics function*B*^{n}. - III)
Numerical featurisation. For each 1 ≤

*n*≤*N*, let $qm,kn$ denote the value of*Q*applied to $\U0001d4d7m,kn$. Let*F*^{n}denote the*M*×*K*matrix corresponding to the binary dynamics function*B*^{n}, that is (*F*^{n})_{m,k}= $qm,kn$.

*vector summary of the collection*{

*B*

^{n}$}n=1N$ (Figure 3). It allows great flexibility as its outcome depends on a number of important choices:

- ▪
the ambient graph 𝒢,

- ▪
the selection procedure of subgraphs,

- ▪
the interval

*I*and the binning factor*K*, and - ▪
the graph parameter

*Q*.

All these choices may be critical to the task of classifying binary dynamics functions, as our use case shows, and have to be determined by experimentation with the data.

### Selection and Feature Parameters

In this section we describe the graph parameters used in this article. Some of these parameters are well known in the literature. All of them are invariant under digraph isomorphism. The parameters presented in this section are the primary parameters used for both selection and generation of vector summaries. We chose these particular parameters either because of their prevalence in the literature, or for their strong performance as either selection or feature parameters in classification tasks. Other parameters we examined are mentioned in the Supporting Information.

Throughout this section, we let 𝒢 = (*V*, *E*) denote a locally finite digraph (that is, such that
every vertex is of finite degree). For *k* ≥ 1 and *v*_{0} ∈ *V* , we let
S_{k}(*v*_{0}) denote
the number of directed (*k* + 1)-cliques that contain *v*_{0}. In particular
S_{1}(*v*_{0}) =
deg(*v*_{0}).

#### Clustering coefficients.

Watts and Strogatz (1998) introduced
an invariant for undirected graphs they called *clustering
coefficient*. For each vertex *v*_{0} in
the graph 𝒢, one considers the quotient of the number *tv*_{0} of triangles in 𝒢 that contain *v*_{0} as a vertex by the number $degv02$ of triangles in the complete graph on *v*_{0} and its
neighbourhood in 𝒢. The clustering coefficient of 𝒢 is then
defined as the average across all *v*_{0} ∈
𝒢 of that number. Clustering coefficients are used in applied graph
theory as measures of segregation (Rubinov
& Sporns, 2010).

#### Clustering coefficient for digraphs.

*v*

_{0}every possible 3-clique that contains

*v*

_{0}, and then identifies pairs of them according to the role played by

*v*

_{0}, as a source, a sink, or an intermediate vertex (see Figure 13A–C) and (C). Fagiolo also considers cyclical triangles at

*v*

_{0}and identifies the two possible cases of such triangles (see Figure 13D). The Fagiolo clustering coefficient at

*v*

_{0}is thus the quotient of the number of equivalence classes of directed triangles at

*v*

_{0}, denoted by $t\u2192$

_{v0}, by the number of such classes in the complete graph on

*v*

_{0}and all its neighbours in 𝒢. Thus, if

*v*

_{0}is the

*i*-th vertex in 𝒢 with respect to some fixed ordering on the vertices, and

*A*= (

*a*

_{i,j}) is the adjacency matrix for 𝒢, then

*v*

_{0}is defined by

#### Transitive clustering coefficient.

A directed 3-clique is also known in the literature as a *transitive
3-tournament*. Our variation on the clustering coefficient, the *transitive clustering coefficient* of a vertex *v*_{0} in a digraph 𝒢, is the quotient
of the number of directed 3-cliques in 𝒢 that contain *v*_{0} as a vertex by the number of
theoretically possible such 3-cliques.

*v*

_{0}) and oud(

*v*

_{0}) denote the in-degree and out-degree of

*v*

_{0}. Let

*I*

_{v0},

*O*

_{v0}, and

*R*

_{v0}denote the number of in-neighbours (that are not out-neighbours), out-neighbours (that are not in-neighbours), and reciprocal neighbours of

*v*

_{0}, respectively. Notice that

We introduce our variation on Fagiolo’s clustering coefficient.

**Definition 5**. Define the transitive clustering coefficient at

*v*

_{0}by

*A*= (

*a*

_{i,j}) denote the adjacency matrix for 𝒢 with respect to some fixed ordering on its vertices. Then for each vertex

*v*

_{0}∈ 𝒢 that is the

*i*-th vertex in this ordering, S

_{2}(

*v*

_{0}) can be computed by the formula

#### Euler characteristic and normalised Betti coefficient.

The Betti numbers of the various topological constructions one can associate to a digraph have been shown in many works to give information about structure and function in a graph. A particular example, using Blue Brain Project data is M. W. Reimann et al. (2017).

#### Euler characteristic.

*X*

_{n}| is the number of

*n*-simplices in

*X*. Alternatively, the Euler characteristic can be defined using the homology of

*X*by

#### Normalised Betti coefficient.

The *normalised Betti coefficient* is based on a similar idea
to the Euler characteristic. It is invariant under graph isomorphism, but is
not a homotopy invariant. Also, unlike the Euler characteristic, it is not
independent of the chosen field of coefficients. We view the normalised
Betti coefficient as a measure of how “efficient” a digraph is
in generating homology, without reference to any particular dimension, but
with giving increasing weight to higher dimensional Betti numbers.

Let 𝒢 be a digraph, and for each *k* ≥ 0, let *s*_{k}(𝒢) denote the
number of *k*-simplices in the directed flag complex
|𝒢|. Fix some field 𝔽. By the *Betti number
β*_{i} of 𝒢 we mean the
dimension of the homology vector space *H*_{i}(|𝒢|,
𝔽).

**Definition 6**. Let 𝒢 be a locally finite digraph. Define the

*normalised Betti coefficient*of 𝒢 to be

Normalised Betti coefficients can be defined by any linear combination of Betti numbers, and also in a much more general context (simplicial posets), which we did not explore. Both the Euler characteristic and the normalised Betti coefficients are invariants of digraphs, and to use them as vertex functions we consider their value on the neighbourhood of a vertex.

#### Size (vertex count).

The *size* of a digraph can be interpreted in a number of
ways. One standard way to do so is for a fixed simplicial object associated
to a digraph, one counts the number of simplices in each dimension. This
will typically produce a vector of positive integers, the (euclidean) size
of which one can consider as the size of the digraph. Alternatively, the
simplex count in any dimension can also be considered as a measure of size.
In this article we interpret size as the number of vertices in the digraph.
Thus by *size* of a vertex *v*_{0} ∈ 𝒢 we mean the vertex count in *N*_{𝒢}(*v*_{0}).
When working with binary states on a digraph, neighbourhood size means the
number of vertices that obtain the value 1 in *N*_{𝒢}(*v*_{0}).

#### Spectral invariants.

The *spectrum* of a (real valued) square matrix or a linear
operator *A* is the collection of its eigenvalues. *Spectral graph theory* is the study of spectra of
matrices associated to graphs. It is a well developed part of combinatorial
graph theory and one that finds many applications in network theory,
computer science, chemistry, and many other subjects (see a collection of
web links on Applications of Spectral Graph Theory website: https://sites.google.com/site/spectralgraphtheory/home?authuser=0).
The various versions of the Laplacian matrix associated to a graph plays a
particularly important role. An interesting work relating neuroscience and
the Laplacian spectrum is de Lange et al.
(2014).

The *spectral gap* is generally defined as the difference
between the two largest moduli of eigenvalues of *A*. In some
situations, for instance in the case of the Laplacian matrix, the spectral
gap is defined to be the smallest modulus of nonzero eigenvalues. Given a
matrix and its spectrum, either number can be computed. As a standard in
this article, spectral gaps are considered as the first type described
above, except for the Chung Laplacian spectrum, where the spectral gap is
defined to be the value of the minimal nonzero eigenvalue. However, in
several cases we considered both options. To emphasise which option is taken
we decorated the parameter codes from Table 1 with a subscript “high” (referring to the
difference between the two largest moduli) or “low” (referring
to the smallest modulus of a nonzero eigenvalue). For example, Figures 7, 8, and 9 have **bls**_{low} as a parameter, indicating
the lowest nonzero value in the Bauer Laplacian spectrum (that is, the
minimal nonzero eigenvalue of the Bauer Laplacian matrix). Another variant
of the standard concepts of spectra is what we call the *reversed* spectral gap (Definitions 7 and 9).

Yet another common invariant we considered is the *spectral
radius*, which is the largest eigenvalue modulus of the matrix
in question. We consider here four matrices associated to digraphs: the
adjacency matrix, the transition probability matrix, the Chung Laplacian,
and the Bauer Laplacian, with details to follow.

#### The adjacency and transition probability matrices.

*V*,

*E*) be a weighted directed graph with weights

*w*

_{u,v}on the edge (

*u*,

*v*) in 𝒢, where

*w*

_{u,v}= 0 if (

*u*,

*v*) is not an edge in 𝒢. Let

*W*

_{𝒢}= (

*w*

_{u,v}) denote the weighted adjacency matrix of 𝒢. Let oud(

*u*) denote the out-degree of a vertex

*u*. The

*transition probability matrix*for 𝒢 is defined, up to an ordering of the vertex set

*V*, to be the matrix

*P*

_{𝒢}, with

*D*

^{−1}(𝒢) is the diagonal matrix with the reciprocal out-degree 1/out(

*u*) as the (

*u*,

*u*) entry, if out(

*u*) ≠ 0, else the (

*u*,

*u*) entry is 0.

**Definition 7**. Let 𝒢 be a digraph with adjacency matrix *A*_{𝒢} and transition probability matrix *P*_{𝒢}. The *adjacency spectral
gap* and the *transition probability spectral
gap* of 𝒢 are defined in each case to be the difference
between the two largest moduli of its eigenvalues.

If we replace in the definition of *P*_{𝒢} the
matrix *D*_{out}(𝒢) by *D*_{in}(𝒢) of in-degrees, we obtain a
variant of the transition probability matrix, which we denote by $P\mathcal{G}rev$,
and its spectral gap is referred to as the *reversed transition
probability spectral gap*.

For our specific application we considered the ordinary (as opposed to
weighted) adjacency matrix, namely, where all weights *w*_{u,v} are binary. We considered as parameters the spectral radius of the adjacency
and transition probability matrices.

#### The Chung Laplacian.

Chung defined the directed Laplacian for a weighted directed graph in Chung (2005). The
Perron–Frobenius theorem (Horn
& Johnson, 1990) states that any real valued irreducible
square matrix *M* with nonnegative entries admits a unique
eigenvector, all of whose entries are positive. The eigenvalue for this
eigenvector is routinely denoted by *ρ*, and it is an
upper bound for any other eigenvalue of *M*.

If 𝒢 is strongly connected (that is, when there is a directed path
between any two vertices in 𝒢), then its transition probability
matrix is irreducible, and hence satisfies the conditions of the
Perron–Frobenius theorem. Thus *P*_{𝒢} has an eigenvector, all of whose entries are positive. The *Perron
vector* is such an eigenvector *ϕ* that is
normalised in the sense that
∑_{v∈V}*ϕ*(*v*) = 1. Let Φ denote the
diagonal matrix with the *v*-th diagonal entry given by *ϕ*(*v*), and let *P* denote the transition probability matrix *P*_{𝒢}.

**Definition 8**. Let 𝒢 be a strongly connected digraph. The

*Chung Laplacian matrix*for 𝒢 is defined by

*P** denotes the Hermitian transpose of a matrix

*P*. The

*Chung Laplacian spectral gap λ*for a digraph 𝒢 is defined to be the smallest nonzero eigenvalue of the Laplacian matrix.

*λ*of a strongly connected digraph 𝒢 is related to the spectrum of its transition probability matrix

*P*by Chung (2005, Theorem 4.3), which states that the inequalities

*P*. The theory in Chung (2005) applies for strongly connected graphs and we therefore defined the Laplacian spectral gap of a neighbourhood to be that of its largest strongly connected component.

We use the spectral gap of the Chung Laplacian for the largest strongly connected component of a neighbourhood as a selection parameter. When used as a feature parameter we consider the spectral gap of the largest strongly connected component of the active subgraph of the neighbourhood. We also use the spectral radius of the Chung Laplacian, both as selection and feature parameter.

#### The Bauer Laplacian.

*C*(

*V*) denote the vector space of complex valued functions on

*V*. The Bauer Laplacian for 𝒢 is the transformation Δ

_{𝒢}:

*C*(

*V*) →

*C*(

*V*) defined by

*v*) ≠ 0 for all

*v*∈

*V*, then Δ

_{𝒢}corresponds to the matrix Δ

_{𝒢}=

*I*− $Din\u22121$(𝒢) ·

*W*

_{𝒢}, where $Din\u22121$(𝒢) is defined analogously to $Dout\u22121$(𝒢) in Definition 7, and

*W*

_{𝒢}is the weighted adjacency matrix. In our case

*W*is again taken to be the ordinary binary adjacency matrix.

**Definition 9**. The *Bauer Laplacian spectral gap* is the difference between the two largest moduli of eigenvalues in the
spectrum.

We also considered the spectral radius of the Bauer Laplacian. Both are used
as selection as well as feature parameters. If we replace in the definition *D*_{in}(𝒢) by *D*_{out}(𝒢), we obtain a matrix $\Delta \mathcal{G}rev$,
whose spectral gap we refer to as the *reversed Bauer Laplacian
spectral gap*.

## ACKNOWLEDGMENTS

The authors wish to thank Michael Reimann of the Blue Brain Project for supporting this project and sharing his wisdom and knowledge with us, and Daniela Egas Santander for suggestions to advance our ideas.

## SUPPORTING INFORMATION

Supporting information for this article is available at https://doi.org/10.1162/netn_a_00228.

## AUTHOR CONTRIBUTIONS

Ran Levi: Conceptualization; Funding acquisition; Methodology; Supervision; Writing – original draft; Writing – review & editing. Pedro Conceição: Conceptualization; Data curation; Formal analysis; Investigation; Validation. Dejan Govc: Data curation; Formal analysis; Investigation; Validation. Jānis Lazovskis: Conceptualization; Data curation; Formal analysis; Investigation; Software; Validation; Visualization; Writing – original draft. Henri Riihimäki: Conceptualization; Data curation; Formal analysis; Investigation; Software; Validation; Writing – original draft. Jason P. Smith: Conceptualization; Data curation; Formal analysis; Investigation; Software; Validation; Writing – original draft.

## FUNDING INFORMATION

Ran Levi, Engineering and Physical Sciences Research Council (https://dx.doi.org/10.13039/501100000266), Award ID: EP/P025072/. Dejan Govc, Javna Agencija za Raziskovalno Dejavnost RS (https://dx.doi.org/10.13039/501100004329), Award ID: P1-0292-0083.

## TECHNICAL TERMS

- Binary state on a graph:
An assignment of binary values to the vertices of the graph.

- Binary dynamics on a graph:
A one-parameter family of binary states on the graph.

- One-parameter family:
A family of objects or numbers that can be indexed by a single parameter, for example, time.

- Digraph:
A finite collection of points connected to each other by directed edges, without loops or multiple edges in the same direction.

- Laplacian:
A matrix associated to a digraph that informs on various qualitative properties of the digraph. In this article two versions of the Laplacian matrix are used.

- Betti numbers and homology:
A sequence of algebraic objects associated to a topological space that measures, figuratively speaking, for each

*n*the number of*n*-dimensional cavities or holes in the space. That number is called the*n*-th Betti number.- Algebraic topology:
The mathematical discipline of studying certain properties of topological spaces by means of associated algebraic objects.

- Graph parameter:
A real or complex number that can be associated to any given graph by some mathematical procedure.

- Closed neighbourhood:
In a graph

*G*, the closed neighbourhood of a vertex*v*refers to the set consisting of*v*and all its immediate neighbours, as well as the subgraph that this set induces in*G*.- Topological space:
A generalisation of a geometric object. In this article topological spaces are objects built out of simplices.

- Simplex (plural: simplices):
A basic building block for certain types of geometric objects. A 0-simplex is a point, a 1-simplex is a line segment, a 2-simplex is a triangle, and so on for all dimensions.

- Simplicial complex:
A topological space made out of gluing simplices together.

## REFERENCES

## Author notes

Competing Interests: The authors have declared that no competing interests exist.

Handling Editor: Gustavo Deco