Communicated by Dr Xin Luo
Accepted Manuscript
Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks
Pengfei Jiao, Wei Yu, Wenjun Wang, Xiaoming Li, Yueheng Sun
PII: S0925-2312(18)30480-6
DOI: 10.1016/j.neucom.2018.03.065
Reference: NEUCOM 19513
To appear in: Neurocomputing
Received date: 7 December 2017
Revised date: 7 March 2018
Accepted date: 26 March 2018
Please cite this article as: Pengfei Jiao, Wei Yu, Wenjun Wang, Xiaoming Li, Yueheng Sun, Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks, Neurocomputing (2018), doi: 10.1016/j.neucom.2018.03.065
Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks
Pengfei Jiao a,b, Wei Yu a,b, Wenjun Wang a,b, Xiaoming Li a,c, Yueheng Sun a,b,∗
a School of Computer Science and Technology, Tianjin University, Tianjin, 300350, China
b Tianjin Key Laboratory of Advanced Networking (TANK), Tianjin, 300350, China
c Corps 12th Division Network Information Center, Xinjiang, 830011, China
Abstract
Detecting temporal communities and analyzing their evolution in dynamic networks is an important question for understanding the structure and function of complex networks. Most existing methods treat temporal community detection and evolution analysis as a two-step process and apply only to unweighted and undirected dynamic networks. In this paper, we propose a new clustering method based on non-negative matrix factorization, formulated from a fully probabilistic perspective, to explore temporal and constant community structure as well as the importance of nodes in any type of dynamic network simultaneously. In detail, we first define the importance matrix of nodes in dynamic networks, the community membership matrix, the similarity matrix at each snapshot and the probability transition matrix of communities between two consecutive snapshots. Second, we construct the temporal community detection method from the viewpoint of network generation. Third, we introduce a gradient descent algorithm to optimize the objective function of the proposed model. Experimental results on both artificial and real dynamic networks demonstrate that the proposed method outperforms several widely used methods.
Keywords: Non-negative matrix factorization (NMF), Temporal community detection, Evolutionary pattern mining, Nodes' importance identifying
∗ Corresponding author. Email address: [email protected] (Yueheng Sun)
Preprint submitted to Journal of LaTeX Templates, May 3, 2018
1. Introduction
Community detection, a research hotspot in complex networks, has attracted great attention from scholars in different disciplines and plays an essential role in finding meaningful structures and functions hidden in complex networks [1, 2]. For example, communities in protein-protein interaction networks may be groups of proteins that perform specific biological functions [3, 4]; in the World Wide Web they may correspond to webpages with related topics [5, 6]; in metabolic networks they may be related to functional modules such as cycles and pathways [7, 8]; and in food webs they may identify compartments [9, 10]. An increasing number of community detection methods have been proposed [11], such as spectral clustering [12, 13], statistical inference [14, 15], modularity optimization [16, 17], and random walks [18, 19]. However, these methods are designed only for static networks and are not suitable for dynamic networks, in which the nodes and edges vary with time.
In fact, many social, biological, bibliographic, communication and computer systems can be modeled as dynamic networks [20]. Usually, a dynamic network is represented as a series of snapshots, in which communities may grow or contract, merge or split over time [21]; these different evolutions make community detection in dynamic networks more challenging. For example, in a dynamic collaboration network [22], nodes denote researchers, links represent collaboration relations, and a community corresponds to a group of authors in the same research field; changes in research hotspots and in the interests of researchers give the communities complex dynamics. Compared with community detection in static networks, analyzing evolving communities requires methods that not only detect the communities in each snapshot but also model the evolution of the communities and identify the importance of nodes in the dynamic network.
The temporal community structure and its evolution characteristics exist in dynamic networks simultaneously, so a method should detect the temporal communities and obtain the evolutionary pattern simultaneously. However, most existing methods are designed for either detecting communities at each snapshot or analyzing their evolution, but not both. Some heuristic approaches [21, 23], which first obtain the community structure in dynamic networks and then analyze its evolution over time, are based on independent clustering. These two-step methods tend to lose structural information. To avoid this loss, it is more interesting, and more challenging, to detect temporal communities and obtain the evolutionary pattern simultaneously.
In addition, there exist different kinds of dynamic networks, which differ in the direction and weight properties that co-occur in a real-world network. For example, in a communication network, a link runs from caller to receiver, and the weight of the link can be the number or duration of calls, so it is a weighted and directed dynamic network. However, most existing methods are designed for one specific kind of complex network. It is therefore meaningful to propose a general approach that handles different types of time-varying networks and these subtleties in community structure.
As discussed above, a general approach is needed that handles different types of time-varying networks and that detects temporal communities and obtains the evolutionary pattern simultaneously. In this paper, we propose an evolutionary clustering method based on non-negative matrix factorization (NMF) for dynamic networks. It can detect the temporal community structure, obtain the constant evolutionary patterns and identify the importance of nodes at the community level simultaneously. Unlike general NMF, the proposed model can be interpreted from a fully probabilistic perspective. We use three matrices to describe the temporal network: the connecting probability matrix, the permanent probability matrix of common structure among snapshots, and the probabilistic community membership matrix at each snapshot of the dynamic network. In addition, we assume that there are constant evolutionary patterns across the snapshots of temporal networks, and denote them as a probability transition matrix among the temporal communities, with a corresponding penalty term in our model. Standard NMF tends to estimate sparse components. Taking this limitation into consideration, we constrain the probability matrix of common structure with an $\ell_1$ penalty [24], which clearly expresses the importance of nodes in a community. We also introduce a gradient descent algorithm to optimize the objective function of the proposed model, and analyze its complexity. In addition, regardless of the type of dynamic network (weighted or unweighted, directed or undirected), all of its snapshots can be denoted as connecting probability matrices, and our model is suitable for any such matrices, symmetric or asymmetric. Therefore, our model is easy to extend to many kinds of complex networks with different direction and weight properties. Also of note, our model is suitable for both non-overlapping and overlapping community detection because it obtains the probabilistic community membership of each node.
In summary, we propose a novel evolutionary clustering model based on NMF to explore temporal and constant community structure as well as the importance of nodes in dynamic networks simultaneously. Our model is suitable for different kinds of dynamic networks, weighted or unweighted and directed or undirected, and can readily detect both non-overlapping and overlapping community structure. The importance of nodes in a community can be determined by comparing the probabilities that the community includes each node. More importantly, with the probability transition matrix, we can detect the constant evolutionary pattern across snapshots. Experimental results on both synthetic and real datasets demonstrate that our model performs better than five other widely used approaches.
2. Related work
Recently, temporal community detection has been receiving increasing attention; surveys can be found in [25, 26]. Existing methods for community detection in dynamic networks can be classified into three main categories: two-step strategies, generative models, and evolutionary clustering.
Firstly, the two-step strategy detects the community structure of each snapshot network with methods designed for static networks, and analyzes the community evolution among snapshots with related principles [21, 27]. For example, the simplest two-step method divides the network into discrete time steps and performs static symmetric NMF (SNMF) [28] on each snapshot separately. In other words, these approaches process each snapshot as a static network and then analyze the evolving relationship of the communities at successive snapshots. However, they ignore the connection between successive snapshots when detecting communities.
Secondly, generative model methods construct a generative model of the dynamic network based on a dynamic generating process, and then optimize it by parameter estimation [29]. For example, Yang et al. [30] propose a dynamic stochastic block model (DSBM) based on the classic stochastic block model. The DSBM captures the evolution of communities by explicitly modeling the transition of community memberships for individual nodes, and its parameters are estimated with a probabilistic simulated annealing algorithm combined with Gibbs sampling. Methods based on generative models are highly descriptive, but they tend to suffer from inaccuracy in dynamic community detection.
Thirdly, evolutionary clustering methods, the most popular type of approach, take the clustering results of the previous snapshot into consideration when analyzing the current snapshot. Specifically, the first evolutionary clustering framework used the classic k-means and hierarchical clustering methods to cluster dynamic data with a temporal smoothness constraint [31]. A classic and widely used method, FaceNet [32], is a framework for analyzing communities and their evolution through a robust unified process, which considers both the evolution of communities and the temporal smoothness of that evolution. This method improves on SNMF by defining the snapshot cost and the temporal cost with the Kullback-Leibler divergence. Qin et al. [13] propose multi-similarity spectral clustering (MSSC), which follows the evolutionary clustering strategy; this evolutionary spectral clustering algorithm considers multiple similarity matrices simultaneously. However, most methods of this type are designed for either detecting communities at each snapshot or analyzing their evolution, or they take these two aspects into consideration in two steps. Therefore, we propose a general approach that handles different types of time-varying networks and that detects temporal communities and obtains the evolutionary pattern simultaneously.
3. Methods
In this section, we introduce the basic notation and the proposed model, including its optimization algorithm, for dynamic networks.
3.1. NMF with Constant Evolutionary Regularization in Dynamic Network
In dynamic networks, we consider that the connecting probability of a node pair depends on a permanent prior probability and a time-varying connecting probability. Meanwhile, the time-varying community structure obeys a constant evolutionary pattern over time. We therefore introduce an NMF-based generative model to explore the temporal and constant community structure.
Firstly, we define a temporal network as $G = (V, E_t)$, where $t = 1, 2, \ldots, T$. $V$ and $E_t$ denote the nodes of the network and the edges at snapshot $t$, respectively. $N = |V|$ and $M_t = |E_t|$ are the number of nodes and the number of edges of the network at snapshot $t$, respectively. $T$ and $K$ are the number of snapshots and the number of communities of the dynamic network, respectively; here, both are treated as constants. The $t$th snapshot of a dynamic network is denoted by a connecting probability matrix $\mathbf{X}_t$ ($t = 1, 2, \ldots, T$), whose element $X_{ij,t}$ ($i, j = 1, 2, \ldots, N$) represents the connecting probability between node $i$ and node $j$ at the $t$th snapshot. The connecting probability matrices are obtained by column normalization of the adjacency matrices of the snapshots, which is possible for most kinds of dynamic networks; thus $\mathbf{X}_t \in [0, 1]^{N \times N}$ and $\sum_{i=1}^{N} X_{ij,t} = 1$. Consequently, our model is suitable not only for weighted or unweighted dynamic networks but also for directed or undirected ones. For example, let
\[
\mathbf{X}_t = \begin{pmatrix} X_{11,t} & X_{12,t} & X_{13,t} \\ X_{21,t} & X_{22,t} & X_{23,t} \\ X_{31,t} & X_{32,t} & X_{33,t} \end{pmatrix}
\]
be the connecting probability matrix of snapshot $t$; then $X_{11,t} + X_{21,t} + X_{31,t} = 1$, $X_{12,t} + X_{22,t} + X_{32,t} = 1$, and $X_{13,t} + X_{23,t} + X_{33,t} = 1$. In other words, the probabilities in each column, that is, the probabilities that a node links to the others, are restricted to sum to 1.
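The column normalization described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the function name `connecting_probability`, the example adjacency matrix, and the choice to leave all-zero columns (isolated nodes) unchanged are assumptions made for this sketch.

```python
import numpy as np

def connecting_probability(adjacency):
    """Column-normalize an adjacency matrix so that each non-empty column sums to 1."""
    a = np.asarray(adjacency, dtype=float)
    col_sums = a.sum(axis=0, keepdims=True)
    # Assumption: a column that sums to 0 (isolated node) is left as all zeros.
    safe = np.where(col_sums > 0, col_sums, 1.0)
    return a / safe

# Hypothetical weighted, directed 3-node snapshot.
A_t = np.array([[0, 2, 1],
                [1, 0, 3],
                [1, 2, 0]])
X_t = connecting_probability(A_t)
print(X_t.sum(axis=0))  # every column sums to 1
```

Because only the column sums are used, the same function applies to weighted or unweighted and to symmetric or asymmetric adjacency matrices.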
Secondly, we assume that there exists a permanent probability matrix of common structure $\mathbf{W}$ over time, whose element $W_{ik}$ ($i = 1, 2, \ldots, N$; $k = 1, 2, \ldots, K$) represents the prior probability that community $k$ includes node $i$; thus $\mathbf{W} \in [0, 1]^{N \times K}$ and $\sum_{i=1}^{N} W_{ik} = 1$. In fact, the prior probability reflects the importance of a node in a community. For example, if
\[
\mathbf{W} = \begin{pmatrix} W_{11} & W_{12} \\ W_{21} & W_{22} \\ W_{31} & W_{32} \end{pmatrix},
\]
then $W_{11} + W_{21} + W_{31} = 1$ and $W_{12} + W_{22} + W_{32} = 1$, and the first column holds the prior probabilities that community 1 includes nodes 1, 2 and 3, respectively. Moreover, the node with the maximum value in the first column is the most important node in community 1.
Thirdly, there exists a probabilistic community membership matrix $\mathbf{H}_t$ ($t = 1, 2, \ldots, T$), whose element $H_{jk,t}$ ($j = 1, 2, \ldots, N$) represents the time-varying probability that node $j$ connects to community $k$ in the $t$th snapshot. Correspondingly, $\mathbf{H}_t \in [0, 1]^{N \times K}$ and $\sum_{k=1}^{K} H_{jk,t} = 1$. The community label of a node can be determined by taking the maximum of its probabilistic community memberships over the communities. For example, if
\[
\mathbf{H}_t = \begin{pmatrix} H_{11,t} & H_{12,t} \\ H_{21,t} & H_{22,t} \\ H_{31,t} & H_{32,t} \end{pmatrix},
\]
then $H_{11,t} + H_{12,t} = 1$, $H_{21,t} + H_{22,t} = 1$, and $H_{31,t} + H_{32,t} = 1$. Naturally, the community label of node 1 corresponds to the column index with the maximum value in the first row.
The connecting probability $X_{ij,t}$ can be approximated by the product of the prior probability $W_{ik}$ and the time-varying connecting probability $H_{jk,t}$, summed over $k$: $X_{ij,t} \approx \sum_{k=1}^{K} W_{ik} H_{jk,t}$, so we have $\mathbf{X}_t \approx \mathbf{W}\mathbf{H}_t^T$. Following the core idea of standard NMF [33], the objective function can be constructed with the Euclidean distance [34, 35, 36], that is, the square of the Frobenius norm of the difference between two matrices [37]. The objective function is
\[
O_1 = \sum_{t=1}^{T} \|\mathbf{X}_t - \mathbf{W}\mathbf{H}_t^T\|_F^2. \tag{1}
\]
Standard NMF tends to estimate sparse components. Taking this limitation into consideration, a sparsity penalty is introduced in the form of an $\ell_1$ penalty [24]. The objective function becomes
\[
O_2 = \sum_{t=1}^{T} \|\mathbf{X}_t - \mathbf{W}\mathbf{H}_t^T\|_F^2 + \lambda \sum_{k=1}^{K} \|\mathbf{W}_k\|_1, \tag{2}
\]
where $\lambda$ is a balance parameter. To explore the community evolutionary pattern of a temporal network, we assume that there exists a constant evolutionary pattern of communities over time, which can be denoted by a probability transition matrix $\mathbf{Z}$, whose element $Z_{lk}$ ($l, k = 1, 2, \ldots, K$) represents the transition probability from community $l$ to community $k$ between each two adjacent snapshots. This means that the connecting probability $H_{jk,t}$ is approximated by the product of the connecting probability $H_{jl,t-1}$ and the transition probability $Z_{lk}$, summed over $l$: $H_{jk,t} \approx \sum_{l=1}^{K} H_{jl,t-1} Z_{lk}$, so we have $\mathbf{H}_t \approx \mathbf{H}_{t-1}\mathbf{Z}$. We also have $\mathbf{Z} \in [0, 1]^{K \times K}$ and $\sum_{k=1}^{K} Z_{lk} = 1$ for $l = 1, 2, \ldots, K$. For example, if
\[
\mathbf{Z} = \begin{pmatrix} Z_{11} & Z_{12} \\ Z_{21} & Z_{22} \end{pmatrix},
\]
then $Z_{11} + Z_{12} = 1$ and $Z_{21} + Z_{22} = 1$. The element $Z_{12}$ represents the transition probability from community 1 to community 2 in the next snapshot, so the evolutionary pattern of communities can be visualized with the probability transition matrix $\mathbf{Z}$. To reveal the community evolutionary pattern in our model, another penalty term is introduced as the squared Frobenius norm of the difference between $\mathbf{H}_{t-1}\mathbf{Z}$ and $\mathbf{H}_t$, with balance parameter $\gamma$. The objective function of our model is
\[
O_3 = \sum_{t=1}^{T} \|\mathbf{X}_t - \mathbf{W}\mathbf{H}_t^T\|_F^2 + \lambda \sum_{k=1}^{K} \|\mathbf{W}_k\|_1 + \gamma \sum_{t=2}^{T} \|\mathbf{H}_{t-1}\mathbf{Z} - \mathbf{H}_t\|_F^2. \tag{3}
\]
3.2. Optimization algorithm
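The objective function $O_3$ in Eq. (3) is not convex in $\mathbf{W}$ and $\mathbf{H}$ together. Therefore, we minimize $O_3$ with a gradient-descent estimation algorithm. $O_3$ can be rewritten as
\[
\begin{aligned}
O_3 = {} & \sum_{t=1}^{T} \mathrm{Tr}\left(\mathbf{H}_t\mathbf{W}^T\mathbf{W}\mathbf{H}_t^T - \mathbf{X}_t^T\mathbf{W}\mathbf{H}_t^T - \mathbf{H}_t\mathbf{W}^T\mathbf{X}_t\right) + \lambda \sum_{i,k} W_{ik} \\
& + \gamma \sum_{t=2}^{T} \mathrm{Tr}\left(\mathbf{Z}^T\mathbf{H}_{t-1}^T\mathbf{H}_{t-1}\mathbf{Z} - \mathbf{Z}^T\mathbf{H}_{t-1}^T\mathbf{H}_t - \mathbf{H}_t^T\mathbf{H}_{t-1}\mathbf{Z} + \mathbf{H}_t^T\mathbf{H}_t\right) + \mathrm{const},
\end{aligned} \tag{4}
\]
where the right-hand side follows from the matrix properties $\mathrm{Tr}(\mathbf{A}\mathbf{B}) = \mathrm{Tr}(\mathbf{B}\mathbf{A})$ and $\mathrm{Tr}(\mathbf{A}) = \mathrm{Tr}(\mathbf{A}^T)$ and, because $\mathbf{W}$ is non-negative, the $\ell_1$ penalty of Eq. (3) is written as the sum of the entries of $\mathbf{W}$. Let $\psi_{ik}$, $\phi_{jk}$ and $\xi_{lk}$ be the Lagrange multipliers for the constraints $W_{ik} \geq 0$, $H_{jk} \geq 0$ and $Z_{lk} \geq 0$, respectively, and let $\Psi = [\psi_{ik}]$, $\Phi = [\phi_{jk}]$, $\Xi = [\xi_{lk}]$. The Lagrangian $L$ is
\[
\begin{aligned}
L = {} & \sum_{t=1}^{T} \mathrm{Tr}\left(\mathbf{H}_t\mathbf{W}^T\mathbf{W}\mathbf{H}_t^T - \mathbf{X}_t^T\mathbf{W}\mathbf{H}_t^T - \mathbf{H}_t\mathbf{W}^T\mathbf{X}_t\right) + \lambda \sum_{i,k} W_{ik} \\
& + \gamma \sum_{t=2}^{T} \mathrm{Tr}\left(\mathbf{Z}^T\mathbf{H}_{t-1}^T\mathbf{H}_{t-1}\mathbf{Z} - \mathbf{Z}^T\mathbf{H}_{t-1}^T\mathbf{H}_t - \mathbf{H}_t^T\mathbf{H}_{t-1}\mathbf{Z} + \mathbf{H}_t^T\mathbf{H}_t\right) \\
& + \mathrm{Tr}(\Psi\mathbf{W}^T) + \mathrm{Tr}(\Phi\mathbf{H}^T) + \mathrm{Tr}(\Xi\mathbf{Z}^T) + \mathrm{const}.
\end{aligned} \tag{5}
\]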
The partial derivatives of L with respect to W is T
X ∂L = (2WHTt Ht − 2Xt Ht ) + λ + Ψ, ∂W t=1
(6)
the partial derivatives of L with respect to H is t = 1,
∂L = 2H1 WT W − 2XT1 W + 2γH1 ZZT − 2γH2 ZT + Φ, ∂H1 9
(7)
ACCEPTED MANUSCRIPT
2 ≤ t ≤ T − 1, ∂L = 2Ht WT W − 2XTt W − 2γHt−1 Z + 2γHt + 2γHt ZZT − 2γHt+1 ZT + Φ, (8) ∂Ht
215
and the partial derivatives of L with respect to Z is T
CR IP T
∂L = 2HT WT W − 2XTT W + 2γHT − 2γHT −1 Z + Φ, ∂HT
t = T,
∂L X = (2HTt−1 Ht−1 Z − 2HTt−1 Ht ) + Ξ. ∂Z t=2
(9)
(10)
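Using the KKT conditions $\psi_{ik}W_{ik} = 0$, $\phi_{jk}H_{jk} = 0$, and $\xi_{lk}Z_{lk} = 0$, we get the following equations for $W_{ik}$, $H_{jk}$, and $Z_{lk}$:
\[
\Big(\sum_{t=1}^{T} \left(2\mathbf{W}\mathbf{H}_t^T\mathbf{H}_t - 2\mathbf{X}_t\mathbf{H}_t\right)\Big)_{ik} W_{ik} + \lambda W_{ik} = 0. \tag{11}
\]
For $t = 1$,
\[
(\mathbf{H}_1\mathbf{W}^T\mathbf{W})_{jk} H_{jk,1} - (\mathbf{X}_1^T\mathbf{W})_{jk} H_{jk,1} + \gamma(\mathbf{H}_1\mathbf{Z}\mathbf{Z}^T)_{jk} H_{jk,1} - \gamma(\mathbf{H}_2\mathbf{Z}^T)_{jk} H_{jk,1} = 0. \tag{12}
\]
For $2 \leq t \leq T - 1$,
\[
\begin{aligned}
& (\mathbf{H}_t\mathbf{W}^T\mathbf{W})_{jk} H_{jk,t} - (\mathbf{X}_t^T\mathbf{W})_{jk} H_{jk,t} - \gamma(\mathbf{H}_{t-1}\mathbf{Z})_{jk} H_{jk,t} \\
& + \gamma(\mathbf{H}_t)_{jk} H_{jk,t} + \gamma(\mathbf{H}_t\mathbf{Z}\mathbf{Z}^T)_{jk} H_{jk,t} - \gamma(\mathbf{H}_{t+1}\mathbf{Z}^T)_{jk} H_{jk,t} = 0.
\end{aligned} \tag{13}
\]
For $t = T$,
\[
(\mathbf{H}_T\mathbf{W}^T\mathbf{W})_{jk} H_{jk,T} - (\mathbf{X}_T^T\mathbf{W})_{jk} H_{jk,T} + \gamma(\mathbf{H}_T)_{jk} H_{jk,T} - \gamma(\mathbf{H}_{T-1}\mathbf{Z})_{jk} H_{jk,T} = 0. \tag{14}
\]
The updating rules can be obtained as follows:
\[
W_{ik} \leftarrow W_{ik} \frac{\left(\sum_{t=1}^{T} \mathbf{X}_t\mathbf{H}_t\right)_{ik}}{\left(\sum_{t=1}^{T} \mathbf{W}\mathbf{H}_t^T\mathbf{H}_t\right)_{ik} + \lambda}, \tag{15}
\]
\[
H_{jk,1} \leftarrow H_{jk,1} \frac{(\mathbf{X}_1^T\mathbf{W} + \gamma\mathbf{H}_2\mathbf{Z}^T)_{jk}}{(\mathbf{H}_1\mathbf{W}^T\mathbf{W} + \gamma\mathbf{H}_1\mathbf{Z}\mathbf{Z}^T)_{jk}}, \tag{16}
\]
\[
H_{jk,t} \leftarrow H_{jk,t} \frac{(\mathbf{X}_t^T\mathbf{W} + \gamma\mathbf{H}_{t-1}\mathbf{Z} + \gamma\mathbf{H}_{t+1}\mathbf{Z}^T)_{jk}}{(\mathbf{H}_t\mathbf{W}^T\mathbf{W} + \gamma\mathbf{H}_t + \gamma\mathbf{H}_t\mathbf{Z}\mathbf{Z}^T)_{jk}}, \tag{17}
\]
\[
H_{jk,T} \leftarrow H_{jk,T} \frac{(\mathbf{X}_T^T\mathbf{W} + \gamma\mathbf{H}_{T-1}\mathbf{Z})_{jk}}{(\mathbf{H}_T\mathbf{W}^T\mathbf{W} + \gamma\mathbf{H}_T)_{jk}}, \tag{18}
\]
\[
Z_{lk} \leftarrow Z_{lk} \frac{\left(\sum_{t=2}^{T} \mathbf{H}_{t-1}^T\mathbf{H}_t\right)_{lk}}{\left(\sum_{t=2}^{T} \mathbf{H}_{t-1}^T\mathbf{H}_{t-1}\mathbf{Z}\right)_{lk}}. \tag{19}
\]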
The optimization procedure for our model is presented as the Optimization Algorithm (Table 1). The input of the algorithm is the set of connecting probability matrices $\mathbf{X} = \{\mathbf{X}_1, \mathbf{X}_2, \ldots, \mathbf{X}_T\}$, the number of communities $K$ and the balance parameters $\lambda$ and $\gamma$. We iteratively update $\mathbf{W}$, $\mathbf{H}_t$ and $\mathbf{Z}$ until convergence. The output is the probability matrix of common structure $\mathbf{W}$, the community membership matrices $\mathbf{H}_t$, and the probability transition matrix $\mathbf{Z}$. Our model can detect non-overlapping community structure by assigning each node the community label with the maximum community membership value, and can detect overlapping community structure by assigning each node every community label whose membership is greater than a certain threshold. The threshold can be set between 0 and 1; the smaller the value, the greater the extent of overlap.
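The two labeling rules just described can be illustrated with a short NumPy sketch. The function names, the example membership matrix and the threshold value 0.35 are assumptions chosen for illustration; community indices are 0-based here, whereas the text numbers communities from 1.

```python
import numpy as np

def hard_labels(H_t):
    """Non-overlapping case: index of the largest membership in each row."""
    return np.argmax(H_t, axis=1)

def overlapping_labels(H_t, threshold):
    """Overlapping case: every community whose membership exceeds the threshold."""
    return [np.flatnonzero(row > threshold).tolist() for row in H_t]

# Hypothetical membership matrix for 3 nodes and 3 communities (rows sum to 1).
H_t = np.array([[0.7, 0.2, 0.1],
                [0.4, 0.5, 0.1],
                [0.1, 0.3, 0.6]])

print(hard_labels(H_t))               # one label per node
print(overlapping_labels(H_t, 0.35))  # a list of labels per node
```

With this threshold the second node belongs to two communities, while a threshold above 0.5 would leave it with a single label, which matches the remark that a smaller threshold yields more overlap.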
From the Optimization Algorithm (Table 1), the most time-consuming part is the update of $\mathbf{H}_t$. The time cost of updating $\mathbf{H}_1$ is $O(3n_{iter}(N^2K + NK^2))$, where $n_{iter}$ denotes the number of iterations. For the snapshots $t > 1$, the time cost is $O(n_{iter}(T - 1)(3N^2K + 4NK^2))$, where $T$ is the number of snapshots. The time complexity of the whole algorithm is therefore $O(n_{iter}T(N^2K + NK^2))$. In fact, real temporal networks are very sparse. When we denote the edges of the temporal network as $e_t$, $t = 1, 2, \ldots, T$, $N^2$ can be replaced approximately by $n_e$, where $n_e$ denotes the average number of edges over all snapshots of the temporal network. In addition, $K$ is far less than $N$ and $n_e$, so $K$ can be ignored in the time complexity of the algorithm. Therefore, the time complexity reduces to $O(n_{iter}T(n_e + N))$, which is linear. Furthermore, parallel computation or sampling techniques can help boost the computing performance of our algorithm. Accordingly, our algorithm is easy to extend to large-scale networks.
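Table 1: The optimization algorithm of the proposed model

Optimization Algorithm
Input: temporal network $\mathbf{X} = \{\mathbf{X}_1, \mathbf{X}_2, \ldots, \mathbf{X}_T\}$; initial $K$; balance parameters $\lambda$, $\gamma$
Output: $\mathbf{W}$, $\mathbf{H}_t$ and $\mathbf{Z}$
Define: $\mathbf{W}$ is the probability matrix of common structure, $\mathbf{H}_t$ is the community membership matrix, and $\mathbf{Z}$ is the probability transition matrix of the communities.
Initialize:
    $\mathbf{W}$ = rand($N$, $K$);
    for $t = 1, \ldots, T$
        $\mathbf{H}_t$ = rand($N$, $K$);
    end for
    $\mathbf{Z}$ = rand($K$, $K$);
repeat
    $Z_{lk} \leftarrow Z_{lk} \dfrac{(\sum_{t=2}^{T} \mathbf{H}_{t-1}^T\mathbf{H}_t)_{lk}}{(\sum_{t=2}^{T} \mathbf{H}_{t-1}^T\mathbf{H}_{t-1}\mathbf{Z})_{lk}}$;
    $W_{ik} \leftarrow W_{ik} \dfrac{(\sum_{t=1}^{T} \mathbf{X}_t\mathbf{H}_t)_{ik}}{(\sum_{t=1}^{T} \mathbf{W}\mathbf{H}_t^T\mathbf{H}_t)_{ik} + \lambda}$;
    $H_{jk,1} \leftarrow H_{jk,1} \dfrac{(\mathbf{X}_1^T\mathbf{W} + \gamma\mathbf{H}_2\mathbf{Z}^T)_{jk}}{(\mathbf{H}_1\mathbf{W}^T\mathbf{W} + \gamma\mathbf{H}_1\mathbf{Z}\mathbf{Z}^T)_{jk}}$;
    for $t = 2, \ldots, T - 1$
        $H_{jk,t} \leftarrow H_{jk,t} \dfrac{(\mathbf{X}_t^T\mathbf{W} + \gamma\mathbf{H}_{t-1}\mathbf{Z} + \gamma\mathbf{H}_{t+1}\mathbf{Z}^T)_{jk}}{(\mathbf{H}_t\mathbf{W}^T\mathbf{W} + \gamma\mathbf{H}_t + \gamma\mathbf{H}_t\mathbf{Z}\mathbf{Z}^T)_{jk}}$;
    end for
    $H_{jk,T} \leftarrow H_{jk,T} \dfrac{(\mathbf{X}_T^T\mathbf{W} + \gamma\mathbf{H}_{T-1}\mathbf{Z})_{jk}}{(\mathbf{H}_T\mathbf{W}^T\mathbf{W} + \gamma\mathbf{H}_T)_{jk}}$;
until convergence;
Return $\mathbf{W}$, $\mathbf{H}_t$ and $\mathbf{Z}$.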
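The update loop of Table 1 can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions rather than the authors' implementation: the function name `fit_temporal_nmf`, the fixed iteration count in place of a convergence test, the small constant `eps` that guards against division by zero, and the random test data are all choices made for this sketch. The sketch applies the multiplicative rules as written and does not add any normalization step for the sum-to-one constraints, since the table does not specify one.

```python
import numpy as np

def fit_temporal_nmf(X, K, lam=0.1, gamma=1.0, n_iter=200, eps=1e-12, seed=0):
    """Multiplicative updates of Table 1; X is a list of T >= 2 arrays of shape (N, N)."""
    rng = np.random.default_rng(seed)
    T, N = len(X), X[0].shape[0]
    W = rng.random((N, K))
    H = [rng.random((N, K)) for _ in range(T)]
    Z = rng.random((K, K))
    for _ in range(n_iter):  # assumption: fixed number of sweeps instead of a convergence check
        # Transition matrix Z.
        num = sum(H[t - 1].T @ H[t] for t in range(1, T))
        den = sum(H[t - 1].T @ H[t - 1] for t in range(1, T)) @ Z
        Z *= num / (den + eps)
        # Common-structure matrix W.
        num = sum(X[t] @ H[t] for t in range(T))
        den = W @ sum(H[t].T @ H[t] for t in range(T)) + lam
        W *= num / (den + eps)
        # Membership matrices H_t; first, middle and last snapshots differ
        # only in which temporal coupling terms are present.
        WtW = W.T @ W
        ZZt = Z @ Z.T
        for t in range(T):
            num = X[t].T @ W
            den = H[t] @ WtW
            if t > 0:        # coupling with the previous snapshot
                num = num + gamma * (H[t - 1] @ Z)
                den = den + gamma * H[t]
            if t < T - 1:    # coupling with the next snapshot
                num = num + gamma * (H[t + 1] @ Z.T)
                den = den + gamma * (H[t] @ ZZt)
            H[t] = H[t] * num / (den + eps)
    return W, H, Z

# Hypothetical input: three random column-normalized 8-node snapshots.
rng = np.random.default_rng(1)
X = []
for _ in range(3):
    A = rng.random((8, 8))
    X.append(A / A.sum(axis=0, keepdims=True))
W, H, Z = fit_temporal_nmf(X, K=2, n_iter=50)
```

Because every update multiplies a non-negative entry by a ratio of non-negative quantities, the factors stay non-negative throughout, which is the usual reason for preferring multiplicative rules in NMF.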
4. Experiments
In this section, we introduce the evaluation metrics and five previously published approaches used as comparison groups for our model, and present the experiments on both synthetic and real-world dynamic networks.
4.1. Evaluation Metrics and Comparison Algorithm
To evaluate the performance of our algorithm, extensive experiments are designed on both synthetic networks and frequently used real networks. Specifically, we compare our method with five baselines on two widely used evaluation metrics, the Normalized Mutual Information (NMI) [38] and the error rate (CA) [39]. NMI, a well-known entropy measure, is widely used to measure the similarity of two clusterings. Let $\hat{G}$ be the community structure obtained by our method, and $G$ the ground truth community structure. NMI is defined as the normalization of the mutual information $I(\hat{G}; G)$ by the average of the two entropies $H(\hat{G})$ and $H(G)$:
\[
NMI = \frac{2\,I(\hat{G}; G)}{H(\hat{G}) + H(G)}.
\]
In detail,
\[
I(\hat{G}; G) = \sum_{i=1}^{K^r}\sum_{j=1}^{K^s} N_{ij} \log\frac{N \cdot N_{ij}}{N_i^r \cdot N_j^s}, \quad
H(G) = -\sum_{i=1}^{K^r} N_i^r \log\frac{N_i^r}{N}, \quad
H(\hat{G}) = -\sum_{j=1}^{K^s} N_j^s \log\frac{N_j^s}{N},
\]
where $N$ is the number of nodes in the network; $K^r$ and $K^s$ are the number of communities in the ground truth structure and in the structure obtained by our method, respectively; and $N_i^r$, $N_j^s$ and $N_{ij}$ are the number of nodes in the $i$th community of the ground truth, the number of nodes in the $j$th community obtained by our method, and the number of nodes common to the $i$th and $j$th communities, respectively. The NMI indicates the consistency between $\hat{G}$ and $G$, and its value ranges from 0 to 1. CA is defined as $CA = \|\hat{G}\hat{G}^T - GG^T\|_F^2$; it measures the distance between the community structure represented by $\hat{G}$ and that represented by $G$. The CA value usually increases with the size of the network, and a larger CA value means a poorer result.
We choose five previously published approaches for detecting communities as comparison groups for our method.
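Both metrics can be computed from two label vectors, as in the following NumPy sketch. It is an illustration under stated assumptions: the function names are hypothetical, entropies are taken with a leading minus sign so that they are non-negative, $G$ and $\hat{G}$ are assumed to be one-hot community indicator matrices when computing CA, and the value returned when both partitions consist of a single community is a convention chosen here.

```python
import numpy as np

def nmi(true_labels, pred_labels):
    """Normalized mutual information between two hard partitions."""
    r_ids, r_idx = np.unique(np.asarray(true_labels).ravel(), return_inverse=True)
    s_ids, s_idx = np.unique(np.asarray(pred_labels).ravel(), return_inverse=True)
    n = r_idx.size
    # Contingency table N_ij: nodes shared by true community i and found community j.
    n_ij = np.zeros((r_ids.size, s_ids.size))
    np.add.at(n_ij, (r_idx, s_idx), 1)
    n_r = n_ij.sum(axis=1)
    n_s = n_ij.sum(axis=0)
    nz = n_ij > 0
    mutual = (n_ij[nz] * np.log(n * n_ij[nz] / np.outer(n_r, n_s)[nz])).sum()
    h_r = -(n_r * np.log(n_r / n)).sum()
    h_s = -(n_s * np.log(n_s / n)).sum()
    if h_r + h_s == 0:
        return 1.0  # assumption: two single-community partitions count as identical
    return 2.0 * mutual / (h_r + h_s)

def ca_error(true_labels, pred_labels):
    """Squared Frobenius distance between the two co-membership matrices."""
    def indicator(labels):
        ids, idx = np.unique(np.asarray(labels).ravel(), return_inverse=True)
        return np.eye(ids.size)[idx]  # assumption: one-hot indicator matrix
    g, g_hat = indicator(true_labels), indicator(pred_labels)
    return np.linalg.norm(g_hat @ g_hat.T - g @ g.T, "fro") ** 2

truth = [0, 0, 1, 1, 2, 2]
relabeled = [2, 2, 0, 0, 1, 1]  # same partition with permuted label names
print(nmi(truth, relabeled), ca_error(truth, relabeled))
```

Both measures depend only on which nodes are grouped together, so permuting the community names leaves them unchanged.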
- a traditional two-step strategy, which divides the network into discrete time steps and performs static symmetric NMF (SNMF) [28] on each snapshot separately.
- the structural and functional discovery with NMF method (SFNMF) [24], an NMF-based model that can display underlying structure and its evolution over time.
- the FaceNet method [32], a framework for analyzing communities and their evolution through a robust unified process, which considers the evolution of communities and the temporal smoothness of that evolution; we set the parameter λ = 0.8.
- the GenLouvain algorithm [23], which optimizes temporal, multiscale and multiplex modularity with a greedy heuristic method. Here, we set the resolution parameter γ = 1 and the coupling parameter ω = 0.5, which are commonly used settings in related work.
- the multi-similarity spectral clustering method (MSSC) [13], an evolutionary spectral clustering algorithm that considers multiple similarity matrices simultaneously.
4.2. Illustrative example
To clarify the working principle of our model, we use an illustrative example to explain each term on a small synthetic temporal network, generated according to the description by Greene et al. [40]. Here, we generate a small synthetic network sample based on the embedding of switch events, which occur when nodes move among communities. The small switch network involves 100 nodes and 7 embedded dynamic communities, and 20% of node memberships are randomly permuted at each snapshot to simulate the natural movement of users among communities over time. All snapshot graphs share the following parameters: the nodes have a mean degree of 15 and a maximum degree of 50, and the mixing parameter is µ = 0.7, which controls the overlap between communities.
[Figure 1 plot: NMI (y-axis) against snapshot 1 to 10 (x-axis) for SNMF, SFNMF, FaceNet, This Work, Genlouvain and MSSC.]
Figure 1: The illustrative example on a small switch network: NMI value of different methods on the switch synthetic network. The network has 10 snapshots, 100 nodes and 7 embedded dynamic communities; the nodes have a mean degree of 15 and a maximum degree of 50, and the mixing parameter µ is 0.7.
[Figure 2 plot: seven histograms, one per community C1 to C7, of probability (y-axis) against node label (x-axis).]
Figure 2: The illustrative example on a small switch network: the prior probability that each community includes each node of the small switch synthetic network. The network is the same as the network in Fig. 1. The x-axes show the node label, and the pink numbers represent the labels of important nodes.
The NMI values of the different methods on the switch synthetic network are shown in Fig. 1; the average NMI of our method is more than 35% higher than that of the others, which indicates that our method has superior performance. Moreover, the prior probability that a specific community includes each node is shown in Fig. 2. As noted above, the prior probability reflects the importance of a node in a community. From each histogram in Fig. 2, we can see in the prior probability distribution that the most important nodes of communities 1 to 7 are node 94, node 98, node 85, node 89, node 99, node 95 and node 80, respectively. Our model can thus automatically identify the importance of nodes in the communities of dynamic networks, which can be broadly applied to viral marketing or information diffusion.
[Figure 3 plot: 100 histograms, one per node, of the probability (y-axis, 0 to 1) of connecting to communities C1 to C7 at each of the 10 snapshots.]
Figure 3: The illustrative example on a small switch network: the probability that each node connects to each community at each snapshot of the small switch synthetic network. The network is the same as the network in Fig. 1. Each histogram shows the probability that a node connects to the 7 communities at the 10 snapshots, and the different colours indicate the different communities. For example, blue (C1) indicates community 1.
Fig.3 shows the probability that a specific node connects to each community at each snapshot; the 7 colors indicate communities 1 ∼ 7. The 100 histograms correspond to nodes 1 ∼ 100 in horizontal order. From each histogram in Fig.3, we can intuitively see how the probability distribution of a specific node over the 7 communities evolves across snapshots; the x-axes show the snapshots t. Furthermore, to quantify how mixed this distribution is, we compute its information entropy and show how the information entropy of all nodes changes across snapshots in Fig.4. For example, the probability distribution of node 41 (the histogram in the 5th row and the first column of Fig.3) is almost unchanged over the first several snapshots: the probability that node 41 connects to community 2 is close to 1. However, this probability becomes smaller and smaller over time, while the probability that node 41 connects to community 4 becomes bigger and bigger. Correspondingly, the information entropy of node 41 clearly grows over time in Fig.4. In contrast, the probabilities that node 1 (the first histogram in Fig.3) connects to the communities do not change noticeably over time, and the information entropy of node 1 correspondingly stays stable in Fig.4.
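The entropy computation described above can be sketched as follows. This is a minimal illustration, assuming the node-to-community probabilities are stored in a NumPy array `P` of shape (snapshots, nodes, communities) whose last axis sums to 1. The array layout, the function name `membership_entropy`, and the use of the natural logarithm are assumptions, since the text does not state them.

```python
import numpy as np

def membership_entropy(P, eps=1e-12):
    """Shannon entropy (natural log) of each node's community distribution.

    P: array of shape (T, N, K); P[t, n, k] is the probability that node n
       connects to community k at snapshot t (assumed layout).
    Returns an array of shape (T, N).
    """
    P = np.asarray(P, dtype=float)
    # Renormalise defensively so that each distribution sums to 1.
    P = P / np.clip(P.sum(axis=-1, keepdims=True), eps, None)
    # Treat 0 * log(0) as 0 by masking the zero entries.
    logP = np.where(P > 0, np.log(np.clip(P, eps, None)), 0.0)
    return -(P * logP).sum(axis=-1)

# Toy example: one node drifting from community 2 towards community 4.
P = np.zeros((3, 1, 7))
P[0, 0, 1] = 1.0                      # fully concentrated on community 2
P[1, 0, 1], P[1, 0, 3] = 0.7, 0.3
P[2, 0, 1], P[2, 0, 3] = 0.5, 0.5     # evenly split between two communities
H = membership_entropy(P)
```

In this toy example the entropy of the single node grows from one snapshot to the next, which is the same qualitative behaviour that the text reports for node 41.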
Figure 4: The illustrative example on a small switch network: the information entropy of the probability that a node connects to each community at each snapshot of the small switch synthetic network. The network is the same as the network in Fig.1. The different colours indicate the value of the information entropy (colour bar from 0 to 2.5), the x-axes show the node label n, and the y-axes show the snapshots t.
However, how do the nodes transfer among communities over time in a dynamic network? To answer this question, we visualize the node transitions among communities across snapshots in Fig.5, which shows how the nodes transfer and how the sizes of the communities evolve. The x-axes represent the snapshots t, and the y-axes represent the community indexes. The colored dots represent communities, and their sizes indicate the number of nodes in each community.

Figure 5: The illustrative example on a small switch network: the visualization of the node transitions among communities at the snapshots of the small switch synthetic network. The network is the same as the network in Fig.1. The size of the dots represents the size of the communities, the arrows represent the direction of the node transitions, the x-axes show the snapshots, and the y-axes show the communities.

Figure 6: The illustrative example on a small switch network: the transition probability from community l to community k between each two adjacent snapshots of the small switch synthetic network. The network is the same as the network in Fig.1. As shown in the legend, the different colours represent the transition probability from community l to community k, whose values correspond to the elements of the transition probability matrix Z. The x-axes (community k) and the y-axes (community l) show the community label at the current snapshot and at the last snapshot, respectively.

Correspondingly, Fig.6 shows the transition probability from community l to community k between each two adjacent snapshots. As shown in the legend, the different colours represent the transition probability from community l to community k, whose values correspond to the elements of the transition probability matrix Z. In other words, Fig.6 shows the probability that a node transfers from community l to community k. As shown in the figure, the diagonal entries are the largest, meaning that a node is most likely to stay in its current community. For example, many nodes transfer from community 7 to communities 1, 3 and 6, and the size of community 7 becomes smaller and smaller over time in Fig.5. Correspondingly, we can clearly see in Fig.6 that the transition probabilities from community 7 to communities 1, 3 and 6 are bigger than the others.
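To make the reading of Fig.6 concrete, the following sketch estimates a row-normalized transition matrix by counting how often nodes move from community l to community k between adjacent snapshots. It is only an empirical, count-based illustration computed from hard labels. It is not the optimization through which the model learns Z, and the function name `empirical_transitions`, the 0-based labels, and the `labels` layout are assumptions.

```python
import numpy as np

def empirical_transitions(labels, n_communities):
    """Count-based estimate of community-to-community transition probabilities.

    labels: integer array of shape (T, N); labels[t, n] is the 0-based
            community of node n at snapshot t (assumed hard assignment).
    Returns a (K, K) matrix whose entry [l, k] is the fraction of nodes in
    community l at snapshot t that are in community k at snapshot t + 1.
    """
    labels = np.asarray(labels)
    counts = np.zeros((n_communities, n_communities))
    for prev, curr in zip(labels[:-1], labels[1:]):
        # Accumulate one count per node for the pair (previous, current).
        np.add.at(counts, (prev, curr), 1)
    row_sums = counts.sum(axis=1, keepdims=True)
    # Rows of communities that never occur stay all-zero (no division by 0).
    return np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)

# Toy example: 4 nodes, 3 snapshots, 2 communities.
labels = np.array([[0, 0, 1, 1],
                   [0, 0, 1, 0],
                   [0, 0, 0, 0]])
Z_hat = empirical_transitions(labels, n_communities=2)
```

Here community 1 loses its nodes to community 0 over time, so the second row of `Z_hat` puts most of its mass off the diagonal, analogous to what the text describes for community 7.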
4.3. Parameter analysis
Figure 7: Parameter analysis on the small switch network. The NMI values of our method with the parameters λ (range 0 ∼ 10) and γ (range 0 ∼ 6) on the small switch synthetic network, which is the same as the network in Fig.1. As shown in the legend, the different colors represent the NMI values from 0 to 1.
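The parameter sweep summarised in Fig.7 amounts to a grid search over (λ, γ) with a step length of 0.1. The sketch below shows the bookkeeping only, assuming a user-supplied callable `score(lam, gamma)` that runs the method and returns an NMI value. That callable, the function name `sweep`, and the arbitrary toy scoring surface used in the demonstration are hypothetical and do not reproduce the reported results.

```python
import numpy as np

def sweep(score, lam_values, gamma_values):
    """Evaluate score(lam, gamma) on a grid; return the grid and its best cell."""
    grid = np.empty((len(gamma_values), len(lam_values)))
    for i, gamma in enumerate(gamma_values):
        for j, lam in enumerate(lam_values):
            grid[i, j] = score(lam, gamma)
    i_best, j_best = np.unravel_index(np.argmax(grid), grid.shape)
    return grid, lam_values[j_best], gamma_values[i_best]

# Integer steps scaled by 0.1 avoid floating-point drift in the grid values.
lam_values = np.arange(0, 101) / 10.0    # 0, 0.1, ..., 10
gamma_values = np.arange(0, 61) / 10.0   # 0, 0.1, ..., 6

def toy_score(lam, gamma):
    # Hypothetical stand-in for "run the method and compute NMI";
    # its peak at (2.0, 1.0) is arbitrary and unrelated to the paper.
    return 1.0 / (1.0 + (lam - 2.0) ** 2 + (gamma - 1.0) ** 2)

grid, best_lam, best_gamma = sweep(toy_score, lam_values, gamma_values)
```

Storing the full `grid` rather than only the best cell makes it possible to draw a heat map of the same kind as Fig.7 and to judge how flat the score is around the chosen setting.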
To analyze the sensitivity of the parameters, we compute the NMI values of our method with the balance parameters λ (range 0 ∼ 10 with a step length of 0.1) and γ (range 0 ∼ 6 with a step length of 0.1) on the small switch synthetic network. As shown in Fig. 7, the performance of our method becomes nearly stable when the parameter λ is bigger than 5, and it is much better when the parameter γ is between 0 and 1. Therefore, it is a good choice to set λ = 5 and γ = 0.3, which are the values used in all experiments with our method in this work. Following the parameter settings commonly used in the related works, we set the parameter λ = 0.8 for the FaceNet method, and the resolution parameter γ = 1 and the coupling parameter ω = 0.5 for the Genlouvain method. In addition, SNMF, SFNMF and MSSC do not require setting parameters.

4.4. Results comparison
The NMI and CA of our approach are compared against those of five other existing approaches for detecting communities in dynamic networks, on switch synthetic networks and on real-world networks from the KIT e-mail data¹. The switch networks are also generated according to the description by Greene et al.[40].
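Since NMI is the main accuracy measure in this comparison, a minimal sketch of its computation from two hard partitions is given here. It uses the common normalization 2·I(A;B)/(H(A)+H(B)), as in Danon et al.[38]. The function name `nmi` and the convention of returning 1 when both partitions are trivial are assumptions of this sketch.

```python
import numpy as np

def nmi(labels_a, labels_b):
    """Normalized mutual information 2*I(A;B) / (H(A) + H(B)) of two partitions."""
    # Map arbitrary labels to 0..K-1 so they can index a confusion matrix.
    a = np.unique(np.asarray(labels_a), return_inverse=True)[1]
    b = np.unique(np.asarray(labels_b), return_inverse=True)[1]
    n = a.size
    # joint[i, j] = number of nodes in cluster i of A and cluster j of B.
    joint = np.zeros((a.max() + 1, b.max() + 1))
    np.add.at(joint, (a, b), 1)
    p_ab = joint / n
    p_a = p_ab.sum(axis=1)
    p_b = p_ab.sum(axis=0)
    nz = p_ab > 0
    mutual_info = (p_ab[nz] * np.log(p_ab[nz] / np.outer(p_a, p_b)[nz])).sum()
    h_a = -(p_a * np.log(p_a)).sum()
    h_b = -(p_b * np.log(p_b)).sum()
    if h_a + h_b == 0:
        return 1.0  # both partitions put every node in a single cluster
    return 2.0 * mutual_info / (h_a + h_b)

truth = [0, 0, 0, 1, 1, 1]
same_up_to_relabeling = [1, 1, 1, 0, 0, 0]
independent = [0, 1, 2, 0, 1, 2]
```

The measure is invariant to how clusters are numbered, so `truth` and `same_up_to_relabeling` agree perfectly, while `independent` carries no information about `truth`.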
In order to demonstrate the performance of our method, comparison experiments were designed on the switch synthetic networks with three parameter settings. In Fig. 8, we show the performance on three switch networks, which share the following parameters: the number of snapshots is 10, the number of nodes is 1000 at each snapshot, and the maximum degree of the nodes is 50. They differ in the mixing parameter µ (0.7 or 0.8) and in the average degree of the nodes (3 or 15). Specifically, Fig. 8(a, b) show the NMI and CA values of the different methods at the 10 snapshots in the switch network where the mixing parameter µ is 0.7 and the average degree is 15. The average NMI of our method is improved by 11.49% and its CA is reduced by 29.25% compared to FaceNet, which is the best performance among the compared methods in Fig.8(a, b). Similarly, Fig. 8(c, d) show the NMI and CA values of the different methods at the 10 snapshots in the switch network where the mixing parameter µ is 0.8 and the average degree is 15. The average NMI of our method is improved by 46.82% and its CA is reduced by 5.33% compared to FaceNet. Finally, Fig. 8(e, f) show the NMI and CA values of the different methods at the 10 snapshots in the sparse switch network, where the mixing parameter µ is 0.7 and the average degree is 3. The average NMI of our method is improved by 34.3% and its CA is reduced by 66.41% compared to MSSC. Overall, Fig. 8 shows that the NMI of our method is always higher than that of the other five methods, and its CA is mostly lower. These results indicate that our method performs better.

¹ http://i11www.iti.uni-karlsruhe.de/en/projects/spp1307/emaildata

Figure 8: Performance of different methods in synthetic networks. (a, b) NMI and CA values of different methods at 10 snapshots in synthetic networks, where the number of nodes is 1000 at each snapshot, the mixing parameter µ is 0.7, the average degree of the nodes is 15, and the maximum degree of the nodes is 50. (c, d) NMI and CA values of different methods at 10 snapshots in synthetic networks, where the number of nodes is 1000 at each snapshot, the mixing parameter µ is 0.8, the average degree of the nodes is 15, and the maximum degree of the nodes is 50. (e, f) NMI and CA values of different methods at 10 snapshots in synthetic networks, where the number of nodes is 1000 at each snapshot, the mixing parameter µ is 0.7, the average degree of the nodes is 3, and the maximum degree of the nodes is 50. Legend: SNMF, SFNMF, FaceNet, This Work, Genlouvain, MSSC.

Figure 9: Performance of different methods in real-world networks. NMI (a, c) and CA (b, d) values of different methods at each snapshot in the real-world network of e-mail contacts at the department of computer science at KIT. (a, b) The results are from the e-mail networks taking 2 months as a snapshot. (c, d) The results are from the e-mail networks taking 6 months as a snapshot. Legend: SNMF, SFNMF, FaceNet, This Work, Genlouvain, MSSC.
In addition, we design some experiments on a real-world network (an e-mail communication network) whose graph changes continuously over 48 consecutive months, from September 2006 to August 2010, collected by the Department of Informatics at KIT. In the e-mail network, the vertices are the members of the department of computer science at KIT, the weights of the edges correspond to the number of e-mails sent between two individuals, and the clusters represent different groups in the department. The number of months aggregated into a snapshot affects the number of clusters in each interval. For example, the shorter the interval, the more data points will be treated as isolated points. Due to limited space, we report the results for two snapshot settings in Fig.9. Specifically, Fig.9(a, b) and Fig.9(c, d) show the NMI and CA values for the 24-snapshot setting (each snapshot is 2 months long) and the 8-snapshot setting (each snapshot is 6 months long), respectively. As shown in Fig.9(a, b), the average NMI of our method is improved by 4.01% and its CA is reduced by 5.78% compared to FaceNet, which is the best performance among the compared methods. Meanwhile, Fig.9(c, d) shows that the average NMI of our method is improved by 4.20% and its CA is reduced by 1.30% compared to FaceNet. From the comparison in Fig.9, we find that our method outperforms the five baseline methods.
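The snapshot construction described above (aggregating a fixed number of months into one weighted network) can be sketched as follows. The record format `(sender, receiver, month)` with 0-based integer ids and month indexes, the function name `build_snapshots`, the symmetrisation of the counts, and the skipping of self-addressed e-mails are assumptions made for illustration. The text does not specify how the KIT data were preprocessed.

```python
import numpy as np

def build_snapshots(records, n_nodes, n_months, months_per_snapshot, symmetric=True):
    """Aggregate timestamped e-mails into weighted adjacency matrices.

    records: iterable of (sender, receiver, month) with 0-based integers
             (hypothetical input format).
    Returns an array of shape (T, n_nodes, n_nodes) with
    T = n_months // months_per_snapshot; entry [t, i, j] counts the e-mails
    between i and j that fall into window t.
    """
    n_snapshots = n_months // months_per_snapshot
    A = np.zeros((n_snapshots, n_nodes, n_nodes))
    for sender, receiver, month in records:
        if sender == receiver:
            continue                  # assumption: ignore self-addressed e-mails
        t = month // months_per_snapshot
        if t < n_snapshots:           # drop months beyond the last full window
            A[t, sender, receiver] += 1
    if symmetric:
        # Edge weight = e-mails exchanged in either direction.
        A = A + A.transpose(0, 2, 1)
    return A

# 48 months give 24 two-month snapshots or 8 six-month snapshots.
records = [(0, 1, 0), (1, 0, 1), (0, 2, 5), (2, 1, 47)]
two_month = build_snapshots(records, n_nodes=3, n_months=48, months_per_snapshot=2)
six_month = build_snapshots(records, n_nodes=3, n_months=48, months_per_snapshot=6)
```

With shorter windows the same e-mails are spread over more matrices, so each snapshot is sparser and more nodes end up isolated, which matches the remark in the text about short intervals.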
5. Discussion

In this work, we propose a novel method based on NMF to explore temporal and constant community structure in dynamic networks synchronously. Our proposed model can be interpreted completely from the probabilistic perspective. Firstly, we describe the temporal networks with the connecting probability matrix, the permanent probability matrix of common structure and the probabilistic community membership matrix at each snapshot of the temporal networks. Secondly, the permanent probability matrix of common structure can be treated as prior information of nodes at snapshots of temporal networks, which denotes the importance of nodes in communities. Thirdly, we introduce a probability transition matrix among the temporal communities as a penalty term in our model to obtain the constant evolutionary patterns between the snapshots of temporal networks. Fourthly, we constrain the probability matrix of common structure in the form of an l1 penalty. Lastly, a simple but effective gradient descent algorithm is introduced to optimize the objective function in our model, which can converge to a stationary point.

For dynamic networks, the core purpose of our method can be summarized in three aspects: detecting the temporal community structure, exploring the constant evolutionary pattern of communities over time, and identifying the importance of nodes in communities. Our model is suitable for different kinds of dynamic networks, including weighted/unweighted and directed/undirected ones, and can easily detect both non-overlapping and overlapping community structure. The importance of nodes in a community is determined by comparing the probabilities with which that community includes each node. More importantly, with the probability transition matrix, we can detect the constant evolutionary pattern across snapshots. We also adopt a gradient descent algorithm to optimize the objective function of the proposed model, and analyze its complexity.

In Section 4, we designed several experiments as illustrative examples on a synthetic network to clarify the working principle of our model. The illustrative examples include observing the performance of non-overlapping community detection, identifying the importance of nodes in communities, and visualizing the change of the community membership of each node over time and the constant evolutionary patterns of communities. Furthermore, we have evaluated the performance of our method on both synthetic and real-world networks, and compared it with five widely-used methods. The experimental results show that our method outperforms these methods in detecting communities in dynamic networks.
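As a reminder of how communities and node importance can be read off probabilistic matrices like those recapped above, here is a minimal sketch. It assumes a node-by-community membership matrix `H` for one snapshot and a node-by-community matrix `W` of per-community node probabilities. The names, the shapes, and the overlap threshold `tau` are hypothetical choices for illustration, not the paper's definitions.

```python
import numpy as np

def hard_communities(H):
    """Non-overlapping assignment: each node joins its most probable community."""
    return np.argmax(H, axis=1)

def overlapping_communities(H, tau=0.3):
    """Overlapping assignment: a node joins every community with probability >= tau."""
    return [np.flatnonzero(row >= tau) for row in H]

def rank_nodes(W, community, top=3):
    """Nodes ordered by decreasing probability within one community."""
    return np.argsort(-W[:, community], kind="stable")[:top]

# Toy matrices: 3 nodes, 2 communities (rows of H sum to 1).
H = np.array([[0.9, 0.1],
              [0.5, 0.5],
              [0.2, 0.8]])
W = np.array([[0.6, 0.1],
              [0.3, 0.2],
              [0.1, 0.7]])
hard = hard_communities(H)
soft = overlapping_communities(H)
top_nodes = rank_nodes(W, community=1, top=2)
```

The middle node is split evenly between both communities: the hard rule forces it into one of them, whereas the thresholded rule keeps it in both, which is the practical difference between the non-overlapping and overlapping readings.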
However, some problems in our method remain to be studied further. In this work, we explore the constant evolutionary pattern of communities with a probability transition matrix that stays fixed over time, but the evolutionary pattern may be time-varying in dynamic networks. Thus, studying the time-varying evolutionary pattern will be our future work. Furthermore, the balance parameters λ and γ of our model cannot be tuned automatically; they are only determined by experiments. How to determine the balance parameters automatically may also be our future work. Another important problem is that our model is not suitable for a changing number of nodes or communities over time in temporal networks, whereas those numbers usually vary over time in the real world. We believe that solving the above problems will further improve the effectiveness of our method.
Acknowledgements
This work was supported by the Major Project of National Social Science Fund (14ZDB153), the Project of National Social Science Fund (15BTQ056), the major research plan of the National Natural Science Foundation (91224009, 51438009, 91746205, 91746107), and the fundamental research of Xinjiang Corps (2016AC015).
References

[1] V. Spirin, L. A. Mirny, Protein complexes and functional modules in molecular networks, Proceedings of the National Academy of Sciences 100 (21) (2003) 12123–12128.
[2] A.-L. Barabási, H. Jeong, Z. Néda, E. Ravasz, A. Schubert, T. Vicsek, Evolution of the social network of scientific collaborations, Physica A: Statistical Mechanics and its Applications 311 (3) (2002) 590–614.
[3] J. Chen, B. Yuan, Detecting functional modules in the yeast protein–protein interaction network, Bioinformatics 22 (18) (2006) 2283–2290.
[4] A. W. Rives, T. Galitski, Modular organization of cellular networks, Proceedings of the National Academy of Sciences 100 (3) (2003) 1128–33.
[5] Y. Dourisboure, F. Geraci, M. Pellegrini, Extraction and classification of dense communities in the web, in: Proceedings of the 16th international conference on World Wide Web, ACM, 2007, pp. 461–470.
[6] G. W. Flake, S. Lawrence, C. L. Giles, F. M. Coetzee, Self-organization and identification of web communities, in: IEEE Computer, 2002, pp. 66–71.
[7] R. Guimerà, L. A. Nunes Amaral, Functional cartography of complex metabolic networks, Nature 433 (7028) (2005) 895–900.
[8] G. Palla, I. Derényi, I. Farkas, T. Vicsek, Uncovering the overlapping community structure of complex networks in nature and society, Nature 435 (7043) (2005) 814–818.
[9] A. E. Krause, K. A. Frank, D. M. Mason, R. E. Ulanowicz, W. W. Taylor, Compartments revealed in food-web structure, Nature 426 (6964) (2003) 282–285.
[10] S. L. Pimm, The structure of food webs, Theoretical Population Biology 16 (2) (1979) 144–158.
[11] S. Fortunato, D. Hric, Community detection in networks: A user guide, Physics Reports 659 (2016) 1–44.
[12] U. Von Luxburg, A tutorial on spectral clustering, Statistics and Computing 17 (4) (2007) 395–416.
[13] X. Qin, W. Dai, P. Jiao, W. Wang, N. Yuan, A multi-similarity spectral clustering method for community detection in dynamic networks, Scientific Reports 6 (31454) (2016) 1–12.
[14] M. E. Newman, E. A. Leicht, Mixture models and exploratory analysis in networks, Proceedings of the National Academy of Sciences 104 (23) (2007) 9564–9569.
[15] T. P. Peixoto, Hierarchical block structures and high-resolution model selection in large networks, Physical Review X 4 (1) (2014) 011047.
[16] M. E. Newman, M. Girvan, Finding and evaluating community structure in networks, Physical Review E 69 (2) (2004) 026113.
[17] B. H. Good, Y.-A. de Montjoye, A. Clauset, Performance of modularity maximization in practical contexts, Physical Review E 81 (4) (2010) 046106.
[18] L. G. Jeub, P. Balachandran, M. A. Porter, P. J. Mucha, M. W. Mahoney, Think locally, act locally: Detection of small, medium-sized, and large communities in large networks, Physical Review E 91 (1) (2015) 012821.
[19] U. N. Raghavan, R. Albert, S. Kumara, Near linear time algorithm to detect community structures in large-scale networks, Physical Review E 76 (3) (2007) 036106.
[20] P. Holme, J. Saramäki, Temporal networks, Physics Reports 519 (3) (2012) 97–125.
[21] G. Palla, A. L. Barabási, T. Vicsek, Quantifying social group evolution, Nature 446 (446) (2007) 664–667.
[22] M. E. Newman, The structure of scientific collaboration networks, Proceedings of the National Academy of Sciences of the United States of America 98 (2) (2001) 404.
[23] P. J. Mucha, T. Richardson, K. Macon, M. A. Porter, J.-P. Onnela, Community structure in time-dependent, multiscale, and multiplex networks, Science 328 (5980) (2010) 876–878.
[24] S. Mankad, G. Michailidis, Structural and functional discovery in dynamic networks with non-negative matrix factorization, Physical Review E 88 (4) (2013) 042812.
[25] T. Hartmann, A. Kappes, D. Wagner, Clustering evolving networks, in: Algorithm Engineering, Springer, 2016, pp. 280–329.
[26] G. Rossetti, R. Cazabet, Community discovery in dynamic networks: a survey, arXiv preprint arXiv:1707.03186.
[27] S. Asur, S. Parthasarathy, D. Ucar, An event-based framework for characterizing the evolutionary behavior of interaction graphs, ACM Transactions on Knowledge Discovery from Data 3 (4) (2009) 913–921.
[28] F. Wang, T. Li, X. Wang, S. Zhu, C. Ding, Community discovery using nonnegative matrix factorization, Data Mining and Knowledge Discovery 22 (3) (2011) 493–521.
[29] L. Akoglu, M. Mcglohon, C. Faloutsos, Rtm: Laws and a recursive generator for weighted time-evolving graphs, in: Eighth IEEE International Conference on Data Mining, 2008, pp. 701–706.
[30] T. Yang, Y. Chi, S. Zhu, Y. Gong, R. Jin, Detecting communities and their evolutions in dynamic social networks–a bayesian approach, Machine Learning 82 (2) (2011) 157–189.
[31] D. Chakrabarti, R. Kumar, A. Tomkins, Evolutionary clustering, in: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, 2006, pp. 554–560.
[32] Y.-R. Lin, Y. Chi, S. Zhu, H. Sundaram, B. L. Tseng, Facetnet: a framework for analyzing communities and their evolutions in dynamic networks, in: Proceedings of the 17th international conference on World Wide Web, ACM, 2008, pp. 685–694.
[33] D. D. Lee, H. S. Seung, Algorithms for non-negative matrix factorization, in: Advances in neural information processing systems, 2001, pp. 556–562.
[34] X. Luo, M. Zhou, Y. Xia, Q. Zhu, A. C. Ammari, A. Alabdulwahab, Generating highly accurate predictions for missing qos data via aggregating nonnegative latent factor models, IEEE Transactions on Neural Networks and Learning Systems 27 (3) (2016) 524–537.
[35] X. Luo, M. Zhou, S. Li, Z. You, Y. Xia, Q. Zhu, A nonnegative latent factor model for large-scale sparse matrices in recommender systems via alternating direction method, IEEE Transactions on Neural Networks and Learning Systems 27 (3) (2016) 579–592.
[36] X. Luo, J. Sun, Z. Wang, S. Li, M. Shang, Symmetric and nonnegative latent factor models for undirected, high-dimensional, and sparse networks in industrial applications, IEEE Transactions on Industrial Informatics 13 (6) (2017) 3098–3107.
[37] P. Paatero, U. Tapper, Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values, Environmetrics 5 (2) (1994) 111–126.
[38] L. Danon, A. Diaz-Guilera, J. Duch, A. Arenas, Comparing community structure identification, Journal of Statistical Mechanics: Theory and Experiment 2005 (09) (2005) P09008.
[39] Y. R. Lin, Y. Chi, S. Zhu, H. Sundaram, B. L. Tseng, Analyzing communities and their evolutions in dynamic social networks, ACM Transactions on Knowledge Discovery from Data 3 (2) (2009) 8.
[40] D. Greene, D. Doyle, P. Cunningham, Tracking the evolution of communities in dynamic social networks, in: Advances in social networks analysis and mining (ASONAM), 2010 international conference on, IEEE, 2010, pp. 176–183.
Pengfei Jiao received the B.S. degree from Hainan University, Haikou, China, in 2012. He is currently pursuing the Ph.D. degree at the School of Computer Science and Technology, Tianjin University, Tianjin, China. His current research interests include dynamic complex network analysis, data mining, and machine learning.

Wei Yu received the B.S. degree from Jinggangshan University, Ji’an, China, in 2012, and the M.S. degree from Henan Normal University, Xinxiang, China, in 2015. He is currently pursuing the Ph.D. degree at the School of Computer Science and Technology, Tianjin University, Tianjin, China. His research interests include dynamic complex network analysis, large-scale data mining and machine learning.

Wenjun Wang received the Ph.D. degree from Peking University, Beijing, China, in 2004. He is currently a Professor with the School of Computer Science and Technology, Tianjin University, Tianjin, China. His research interests include computational social science, emergency management, large-scale data mining and network science.

Xiaoming Li is a Ph.D. student in the School of Computer Science and Technology at Tianjin University, China. Before that, he received his Master’s degree from Xinjiang University in 2008. Currently, he is a deputy researcher at the network information center of the 12th division of the Corps. His research interests include human behavior dynamics and data mining.

Yueheng Sun received the Ph.D. degree from Tianjin University, Tianjin, China, in 2005. He is currently a lecturer with the School of Computer Science and Technology, Tianjin University, Tianjin, China. His research interests include social computing, data mining and machine learning.