Python package for the dynamic community detection in temporal networks implementing the flow stability framework described in
Alexandre Bovet, Jean-Charles Delvenne & Renaud Lambiotte, Flow stability for dynamic community detection, Sci. Adv., 8 (19), eabj3063. DOI: 10.1126/sciadv.abj3063
- Python3 (>=3.9)
- pandas
- scipy
- numpy
- Cython (optional but highly recommended)
- sparse_dot_mkl (optional, allows to perform multithreaded sparse matrix multiplication)
You can pip install flowstab
directly from this repository into your virtual
environment. Simply run:
pip install git+https://github.com/alexbovet/flow_stability.git
After installation you can access the relevant classes and methods by importing
flowstab
into your python scripts or via command line (see CLI scripts for details).
The easiest approach to use flowstab
is with the FlowStability
class that will provide you with contextual inforamtion, if needed:
from flowstab.flow_stability import FlowStability
fs = FlowStability()
# check what to do next
fs.state.next
# Out[3]: ([], 'set_temporal_network')
# > you need to run `set_temporal_network` - how should this be done:
fs.state.howto['set_temporal_network']
# print(fs.state.howto['set_temporal_network'])
#
# Set the temporal network for the flow stability analysis.
#
# Parameters
# ----------
# **kwargs : dict
# Arguments to initialize a ContTempNetwork instance.
# ...
fs.set_temporal_network(events_table="my_contacts.csv")
fs_mice.state.next
# Out[6]: ([], 'compute_laplacian_matrices')
fs.compute_laplacian_matrices()
fs_mice.state.next
# Out[8]: (['time_scale'], 'compute_inter_transition_matrices')
# So we need to set `time_scale` next, but how?
print(fs_mice.state.howto['time_scale'])
# Set the time scale(s) for the random walk's transition rate.
#
# .. note::
# You might also use `set_time_scale` to directly create a range of
# time scales.
#
# Parameters
# ----------
# time_scale : None, int, float, or iterator of float
# If None, a default value is used.
# If an int or float, a single time scale is set.
# If an iterator, it must yield float or int values.
#
# ...
fs.set_time_scale(10)
fs_mice.state.next
# Out[11]: ([], 'compute_inter_transition_matrices')
fs.compute_inter_transition_matrices()
fs.time_direction = 0 # perform both forward and backard
fs.set_flow_clustering()
fs.find_louvain_clustering()
fs.flow_clusetering_backward
# Out[17]: {10: <flowstab.network_clustering.FlowIntegralClustering at 0x7fef57a7fd00>}
Alternatively, you can use individual elements form the flowstab
packacke directly.
For exaple if you want to use the FlowIntegralClustering
class, for example, you would
want to add the following line in your script:
from flowstab.netowrk_clustering import FlowIntegralClustering
# forw_flow = FlowIntegralClustering(...
Refer to the examples folder more detailed usage examples.
flowstab
provides also several command line hooks that can be run
directly in the terminal after installation without the need of opening a
python shell:
run_clusterings
This command requires
sparse_dot_mkl which relies
on the closed-source libmkl_rt.so
library. In Ubuntu you might need to fetch
this library with apt-get install libmkl-rt
.
run_cov_integrals
run_laplacians_transmats
The main classes are:
ContTempNetwork
in the moduletemporal_network
which is used to store and save temporal networks and to compute inter-event transition matrices.FlowIntegralClustering
in the sub-modulenetwork_clustering
which is used to computed the flow stability (integral of covariance) and to find the best forward and backward partition using the Louvain algorithm.
Additional interesting classes and functions are:
Clustering
andSparseClustering
in thenetwork_clustering
sub-module can be used to directly cluster covariances or integrals of covariances.static_clustering
innetwork_clustering
is an helper function to cluster static networks using Markov Stability.run_multi_louvain
innetwork_clustering
helper function to run the Louvain multiple times on the same covariance in order to check the robustness of the partition.avg_norm_var_information
innetwork_clustering
computes the average Normalized Variation of Information of list of cluster lists obtained withrun_multi_louvain
.compute_parallel_clustering
inparallel_clustering
, same thanrun_multi_louvain
but in parallel.- the
parallel_expm
module contains functions to compute the matrix exponential of very large matrices using different strategies.
A jupyter notebook reproducing the example from Fig. 2 of the paper is available in asymmetric_example.ipynb
.