Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks

Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks

Communicated by Dr Xin Luo Accepted Manuscript Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks P...

3MB Sizes 0 Downloads 19 Views

Communicated by Dr Xin Luo

Accepted Manuscript

Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks Pengfei Jiao, Wei Yu, Wenjun Wang, Xiaoming Li, Yueheng Sun PII: DOI: Reference:

S0925-2312(18)30480-6 10.1016/j.neucom.2018.03.065 NEUCOM 19513

To appear in:

Neurocomputing

Received date: Revised date: Accepted date:

7 December 2017 7 March 2018 26 March 2018

Please cite this article as: Pengfei Jiao, Wei Yu, Wenjun Wang, Xiaoming Li, Yueheng Sun, Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks, Neurocomputing (2018), doi: 10.1016/j.neucom.2018.03.065

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Exploring temporal community structure and constant evolutionary pattern hiding in dynamic networks

a School

CR IP T

Pengfei Jiaoa,b , Wei Yua,b , Wenjun Wanga,b , Xiaoming Lia,c , Yueheng Suna,b,∗ of Computer Science and Technology, Tianjin University, Tianjin, 300350, China Key Laboratory of Advanced Networking (TANK), Tianjin, 300350, China. c Corps 12th division network information center. Xinjiang ,830011 China

b Tianjin

Abstract

AN US

Detecting the temporal communities and analyzing their evolution in dynamic

networks is an important question for understanding the structure and function of complex networks. Most existing methods deal the temporal community detection and evolution as a two-step processes and only apply to unweighted and undirected dynamic networks. In this paper, we proposed a new clustering

M

method based on non-negative matrix factorization from a fully probabilistic perspective, to explore temporal and constant community structure as well as the importance of nodes in any type dynamic networks synchronously. In de-

ED

tailed, we first denote the important matrix of node in dynamic networks, the community membership matrix, the similarity matrix at each snapshot and the probability transition matrix of community between the two consecutive

PT

snapshots. Second, we constitute the temporal community detection method from the view of generating networks. Third, we introduce a gradient descent

CE

algorithm to optimize the objection function of the proposed model. The experimental results on both artificial and real dynamic networks demonstrate that the superior performance of our proposed method is over some widely-used

AC

methods.

Keywords: Non-negative matrix factorization (NMF), Temporal community detection, Evolutionary pattern mining, Nodes’ importance identifing ∗ Corresponding

author Email address: [email protected] (Yueheng Sun)

Preprint submitted to Journal of LATEX Templates

May 3, 2018

ACCEPTED MANUSCRIPT

1. Introduction

CR IP T

Community detection, as a research hotspot of complex networks, has attracted great attention of scholars from different disciplines, which plays an essential role in finding meaningful structures and functions hidden in complex 5

networks [1, 2]. For example, communities, in the protein-protein interaction networks may be groups of proteins that perform specific biological functions

[3, 4], in the World Wide Web network may correspond to webpages with related

AN US

topics [5, 6], in metabolic networks may be related to functional modules such

as cycles and pathways [7, 8], and in food webs may be the identify compart10

ments [9, 10]. An increasing number of community detection methods have been proposed [11], such as spectral clustering [12, 13], statistical inference [14, 15], modularity optimization [16, 17], and random walk [18, 19]. However, these methods are only designed for static networks and not suitable for dynamic

M

networks, of which the nodes and edges are varying with time.

In fact, many social, biological, bibliographic, communication and computer

15

ED

systems can be modeled as dynamic networks [20]. Usually, we represent a dynamic network as a series of snapshots, the communities in which may grow or contract, merge or split over the snapshots [21], different evolutions make com-

20

PT

munity detection in dynamic networks more challenging. Such as in a dynamic collaboration network [22], nodes are denoted as researchers, the links represent

CE

collaboration relations, one community is corresponding to group of authors with same research field, changes in hotspots and in interests of researchers make the community have complex dynamics. Compared to community detection in

AC

static networks, analyzing the evolving communities demands the methods not

25

only detect the communities in each snapshot but also model the evolution of communities with identifying the importance of nodes in the dynamic networks. The temporal community structure and the evolution characteristics exist in

dynamic networks synchronously. It demands the methods detect the temporal communities and obtain the evolutionary pattern synchronously. However, most 2

ACCEPTED MANUSCRIPT

30

existing methods are just unilaterally designed for either detecting communities at each snapshot or analyzing their evolution. And some heuristic approaches [21, 23], which first obtain the community structure in dynamic networks, and

CR IP T

then analyze their evolution over time, are based on independent clustering. These methods aforementioned, which are designed for detecting communities 35

at each snapshot and then analyzing their evolution in two steps, tend to cause

structural information loss. To avoid this phenomenon, it is more interesting

and challenging for detecting temporal communities and obtaining evolutionary pattern synchronously.

40

AN US

In addition, it exists different kinds of dynamic networks, whose performance depends on the directional and weighted properties that co-occur in a real-world network. For example, in a communication network, the link is from caller to receiver, and the weight of the link can be the number or time of calls, so it

is a weighted and directed dynamic network. However, most exiting methods are designed generally for a specific kind of complex networks. In this case, it is meaningful to propose a general approach to handle different types of time-

M

45

varying networks and those subtleties in community structure.

ED

As we have discussed, it is necessary to propose a general approach to handle different types of time-varying networks for detecting temporal communities and obtaining evolutionary pattern synchronously. In this paper, we propose an evolutionary clustering method based on non-negative matrix factorization

PT

50

(NMF) in dynamic networks. It can detect the temporal community struc-

CE

ture, obtain the constant evolutionary patterns and identify the importance of nodes in community level synchronously. Differing from the general NMF, the proposed model can be interpreted from a fully probabilistic perspective. We take advantage of three matrices to describe the temporal networks, including

AC

55

the connecting probability matrix, the permanent probability matrix of common structure among snapshots and the probabilistic community membership matrix at each snapshot of the dynamic network. In addition, we assume that there are constant evolutionary patterns across the snapshots of temporal networks, and

60

denote them as a probability transition matrix among the temporal communi3

ACCEPTED MANUSCRIPT

ties with a penalty term in our model. Standard NMF method tends to estimate sparse components. Taking this limitation into consideration, we constraint the probability matrix of common structure in the form of an l1 penalty [24], which

65

CR IP T

clearly expresses the importance of nodes in a community. We also introduce a gradient descent algorithm to optimize the objective function in the proposed

model, and analysis its complexity. In addition, regardless of the types of the

dynamic networks including weighted/unweighted or directed/undirected, all

the snapshots of them can be denoted as connecting probability matrices, and our model is suitable for any matrices including symmetrical and asymmetrical

matrices. Therefore, our model is easy to extend to kinds of complex networks,

AN US

70

whose performance depends on the directional and weight properties. Also of note, our model is suitable for both non-overlapping and overlapping community detection because it can obtain the probabilistic community membership of each node.

In summary, we propose a novel evolutionary clustering model based on

75

M

NMF to explore temporal and constant community structure as well as the importance of nodes in dynamic networks synchronously. Our model is suitable

ED

for different kinds of dynamic networks including weighted/unweighted and directed/undirected, and can facilely detect both the non-overlapping and the 80

overlapping community structure. The importance of nodes in communities

PT

can be determined by comparing the probability of a specific community including each node. More importantly, with the probability transition matrix,

CE

we can detect the constant evolutionary pattern at snapshots. We also introduce a gradient descent algorithm to optimize the objective function in the proposed model, and analysis its complexity. Experimental results on both synthetic datasets and real datasets demonstrate that our model performs better than

AC

85

other five widely-used approaches.

4

ACCEPTED MANUSCRIPT

2. Related work Recently, temporal community detection has been receiving increasing at90

tention, surveys about which can be seen in [25, 26]. The existing methods for

CR IP T

community detection in dynamic networks can be mainly classified into three categories: two-steps based strategy, generative model, and evolutionary clustering.

Firstly, the two-steps strategy detects community structure of each snapshot 95

network with methods designed for static networks, and analyzes the community evolution among snapshots with related principles [21, 27]. For example,

AN US

the simplest two-steps method divides the network into discrete time steps and performs the static symmetric NMF (SNMF) [28] on each snapshot, respectively. In the other words, these approaches process each snapshot network as a 100

static network and then analyze the involving relationship of the communities at successive snapshot networks. However, they ignore the connection between

M

successive snapshots when detecting communities.

Secondly, the generative model methods are based on a dynamically generating process to construct a generative model of dynamic network, and then optimizing it with parameter estimation [29]. For example, Yang et al. [30] pro-

ED

105

pose a dynamic stochastic block model (DSBM) based on classic stochastic block

PT

model. The DSBM captures the evolution of communities by explicitly modeling the transition of community memberships for individual nodes. And they proposed a probabilistic simulated annealing algorithm combined with Gibbs Sampling to estimate parameters. The methods based on generative model is

CE

110

highly descriptive, but it tends to suffer from inaccuracy in dynamical commu-

AC

nity detection. Thirdly, the evolutionary clustering methods, which are most popular type

of approaches, take the clustering results of the previous snapshot network into

115

consideration when analyzing the current snapshot network. Specifically, the first evolutionary clustering framework is proposed by using the classic k-means and hierarchical clustering methods to cluster dynamic data with a temporal

5

ACCEPTED MANUSCRIPT

smoothness constraint [31]. A classic and most widely used method, FaceNet method [32], is a framework for analyzing Communities and their evolutions 120

through a robust unified process, which consider the evolutions of communities

CR IP T

and the temporal smoothness of evolutions. This method is an improvement of SNMF by defining the snapshot cost and the temporal cost with KullbackLeibler divergence. Qin et al. [13] propose a multi-similarity spectral clustering

(MSSC), which follows the evolutionary clustering strategy. The evolutionary 125

spectral clustering algorithm simultaneously considers multiple similarity ma-

trices. However, most of this type of methods are just unilaterally designed for

AN US

either detecting communities at each snapshot or analyzing their evolution, or take these two aspects into consideration in two steps. Therefore, we propose a general approach to handle different types of time-varying networks for detecting 130

temporal communities and obtaining evolutionary pattern synchronously.

M

3. Methods

In this section, we introduce the basic notations and the proposed model

ED

including optimization algorithm for dynamic networks. 3.1. NMF with Constant Evolutionary Regularization in Dynamic Network In dynamic networks, we consider the connecting probability of node pairs

135

PT

depend on the permanent prior probability and the time-varying connecting probability over time. Simultaneously, the time-varying community structure

CE

obeys a constant evolutionary pattern over time. In this case, we introduce a NMF-based generative model to explore the temporal and constant community structure.

AC

140

Firstly, we define a temporal network as G = (V, Et ), where t = 1, 2, ..., T .

V and Et denote the nodes of the network and edges at the snapshot t, re-

spectively. N = |V | and Mt = |Et | are the number of nodes and number of edges of the network at each t, repectively. T and K are the number of

145

snapshots and the number of communities of the dynamic network, repectively. Here, we set T and K as constants at snapshots. The tth snapshot 6

ACCEPTED MANUSCRIPT

of a dynamic network can be denoted with a connecting probability matrix Xt (t = 1, 2, ..., T ), the element Xij,t (i, j = 1, 2, ..., N , where N is the number of nodes of each snapshot) represents the connect probability between node i and node j of the tth snapshot. The connecting probability matrices is from the

CR IP T

150

column normalization of adjacent matrices at snapshots, which can be obtain PN from most kinds of dynamic networks, thus Xij,t ∈ {0, 1}N ×N , i=1 Xij,t = 1 Spontaneously, our model is not only suitable for weighted/unweighted dy-

AN US

155

namicnetworks but also directed/undirected dynamic networks. For example,  X X12,t X13,t  11,t    Xt = X21,t X22,t X23,t  , where Xt represents the connecting probability   X31,t X32,t X33,t matrix of snapshot t, then X11,t + X21,t + X31,t = 1, X12,t + X22,t + X32,t = 1, and X13,t + X23,t + X33,t = 1. In other words, the summations of probabilities that any node links to others are restricted to equal 1.

Secondly, we assume that it exists a permanent probability matrix of common structure W over time, of which element Wik (i = 1, 2, ..., N, k = 1, 2, ..., K,

M

160

where K is the number of community of each snapshot) represents the prior

ED

probability that community k includes node i, thus we have Wik ∈ {0, 1}N ×K , PN i=1 Wik = 1. In fact, the prior probability  can incarnate  the importance of a

PT

165

W W12  11    node in a community. For example, W = W21 W22  , then W11 + W21 +   W31 W32 W31 = 1 and W12 + W22 + W32 = 1, the first column is the prior probability that

CE

community 1 includes node 1, 2, 3, respectively. Moreover, the node with the maximum value in the first column corresponds to the most important nodes in

AC

community 1.

170

Thirdly, it exists a probabilistic community membership matrix Ht (t =

1, 2, ..., T ), of which element Hjk,t (j = 1, 2, ..., N ) represents the time-varying connecting probability that node j connects community k in the tth snapPK shot. Correspondingly, Hik,t ∈ {0, 1}N ×K , k=1 Hik,t = 1. The community label of nodes can be determined by obtaining the maximum value of their

7

ACCEPTED MANUSCRIPT

the column index with maximum value in the first row.

CR IP T

175

probabilistic community  membership in different communities. For example,  H11,t H12,t     Ht = H21,t H22,t  , then H11,t + H12,t = 1, H21,t + H22,t = 1, and   H31,t H32,t H31,t + H32,t = 1. Naturally, the community label of node 1 corresponds to

Obviously, the connecting probability Xij,t can approximate to the product

180

of the prior probability Wik and the time-varying connecting probability Hjk,t PK on the sum of k, Xij,t ≈ k=1 Wik Hjk,t , so we have Xt ≈ WHTt . According to

AN US

the core idea of standard NMF [33], the objective function can be constructed

by Euclidean distance[34, 35, 36], which is the square of the Frobenius norm of two matrices difference [37]. The object function can be constructed as follows: O1 =

T X t=1

||Xt − WHTt ||2F ,

(1)

Standard NMF method tends to estimate sparse components. Taking this limitation into consideration, a sparsity penalty is introduced in the form of an l1

M

185

penalty [24]. The object function can be constructed as follows: T X

||Xt − WHTt ||2F + λ

ED O2 =

t=1

K X

k=1

||Wk ||1 ,

(2)

where λ is a balance parameter. To explore the community evolutionary pat-

PT

tern of temporal network, we think that it exists a constant evolutionary pattern of communities over time, which can be denoted with a probability transition matrix Z, of which element Zlk (l, k = 1, 2, ..., K) represents the tran-

CE

190

sition probability from community l to community k between each two ad-

jacent snapshots. It means that the connect probability Hjk,t approximates

AC

to the product of the connect probability Hjl,t−1 and the transition probPK ability Zlk , Hjk,t ≈ And we l=1 Hjl,t−1 Zlk , so we have Ht ≈ Ht−1 Z. PK K×K also  have Zlk ∈ {0, 1} , k=0 Zlk = 1, l, k = 1, 2, ..., K. For example, Z11 Z12  , then Z11 + Z12 = 1, and Z21 + Z22 = 1. The element Z12 Z= Z21 Z22 represents the transition probability from community 1 to community 2 in next

195

8

ACCEPTED MANUSCRIPT

snapshot, so the evolutionary pattern of communities can be visualized with the probability transition matrix Z . To reveal the community evolutionary pattern 200

in our model, another penalty term is introduced with the Frobenjus norm of

CR IP T

the difference between Ht−1 Z and Ht , where the balance parameter is γ. The object function of our model can be constructed as follows: T X

O3 =

t=1

||Xt − WHTt ||2F + λ

K X

k=1

||Wk ||1 + γ

T X t=2

3.2. Optimization algorithm

||Ht−1 Z − Ht ||2F ,

(3)

The object function O3 in equation (3) is not convex in both W and H together. Therefore we minimize the object function O3 with a gradient-descent

AN US

205

estimation algorithm, which can be rewritten as: O3 =

T X t=1



T r(Ht WT WHTt − XTt WHTt − Ht WT Xt ) + λtr(W)

T X

T r(Z

T

HTt−1 Ht−1 Z

−Z

HTt−1 Ht



HTt Ht−1 Z

+

HTt Ht )

(4) + const,

M

t=2

T

where the right of the equality applies the matrix properties T r(AB) = T r(BA)

ED

and T r(A) = T r(AT ). Let Ψik , Φjk and Ξlk be the lagrange multiplier for constraint Wik ≥ 0, Hjk ≥ 0 and Zlk ≥ 0, respectively, and Ψ = [ψik ], Φ = [φjk ], 210

Ξ = [ξlk ], the Lagrange L is T X

T r(Ht WT WHTt − XTt WHTt − Ht WT Xt ) + λtr(W)

PT

L=

t=1

T X

CE



t=2

T r(ZT HTt−1 Ht−1 Z − ZT HTt−1 Ht − HTt Ht−1 Z + HTt Ht )

(5)

+ T r(Ψ WT ) + T r(ΦHT ) + T r(ΞZT ) + const.

AC

The partial derivatives of L with respect to W is T

X ∂L = (2WHTt Ht − 2Xt Ht ) + λ + Ψ, ∂W t=1

(6)

the partial derivatives of L with respect to H is t = 1,

∂L = 2H1 WT W − 2XT1 W + 2γH1 ZZT − 2γH2 ZT + Φ, ∂H1 9

(7)

ACCEPTED MANUSCRIPT

2 ≤ t ≤ T − 1, ∂L = 2Ht WT W − 2XTt W − 2γHt−1 Z + 2γHt + 2γHt ZZT − 2γHt+1 ZT + Φ, (8) ∂Ht

215

and the partial derivatives of L with respect to Z is T

CR IP T

∂L = 2HT WT W − 2XTT W + 2γHT − 2γHT −1 Z + Φ, ∂HT

t = T,

∂L X = (2HTt−1 Ht−1 Z − 2HTt−1 Ht ) + Ξ. ∂Z t=2

(9)

(10)

AN US

Using the KKT conditions ψik Wik = 0, φjk Hjk = 0, and ξlk Zlk = 0, we get the following equations for Wik , Hjk , and Zlk : (

T X (2WHTt Ht − 2Xt Ht ))ik Ψik + λΨik = 0. t=1

(H1 WT W)jk Φjk − (XT1 W)jk Φjk + γ(H1 ZZT )jk Φjk − γ(H2 ZT )jk Φjk = 0. (12)

M

t = 1,

2 ≤ t ≤ T − 1, (Ht WT W)jk Φjk − (XTt W)jk Φjk − γ(Ht−1 Z)jk Φjk 220

(HT WT W)jk Φjk − (XTT W)jk Φjk + γ(HT )jk Φjk − γ(HT −1 Z)jk Φjk = 0. (14)

CE

PT

The updating rules can be obtained as follows: PT ( t=1 Xt Ht )ik Wik ← Wik PT , ( t=1 (WHTt Ht ))ik + λ

AC

(13)

ED

+ γ(Ht )jk Φjk + γ(Ht ZZT )jk Φjk − γ(Ht+1 ZT )jk Φlk = 0.

t = T,

(11)

(XT1 W + γH2 ZT )jk,1 , (H1 WT W + γH1 ZZT )jk,1

(16)

(XTt W + γHt−1 Z + γHt+1 ZT )jk,t , (Ht WT W + γHt + γHt ZZT )jk,t

(17)

(XTT W + γHT −1 Z)jk,T , (HT WT W + γHT )jk,T

(18)

Hjk,1 ← Hjk,1

Hjk,t ← Hjk,t

(15)

Hjk,T ← Hjk,T

10

ACCEPTED MANUSCRIPT

225

Zlk

PT ( t=2 HTt−1 Ht )lk . ← Zlk PT ( t=2 HTt−1 Ht−1 Z)lk

(19)

CR IP T

The optimization algorithm is presented for our model as Optimization Algorithm (Table 1). The input of our algorithm is the connecting probability matrices X = X1 , X2 , ...XT , the number of communities K and the balance

parameter λ and γ. We use the algorithm to iteratively update W, Ht and 230

Z until convergence. The output is the probability of common structure W,

the community membership matrices Ht , and the probability transition matrix

AN US

Z.Our model can detect non-overlapping community structure by obtaining the community label with the maximum value of community membership about each node, and detect overlapping community structure by finding the each 235

node’s community labels, whose community membership is greater than a certain threshold value. The threshold value can be set in 0 ∼ 1, and the smaller the value, the bigger the extent of overlapping.

M

From the Optimization Algorithm (Table 1), the most time-consuming part is the updating of Ht . The time cost of H1 is O(3niter (N 2 K + N K 2 )), where niter denotes the number of iterations. And for the snapshot networks t > 1, the

ED

240

time cost is O(niter (T −1)(3N 2 K+4N K 2 )), where T is the number of snapshots. Then the time complexity of the whole algorithm is O(niter T (N 2 K + N K 2 )).

PT

In fact, the real temporal networks are very sparse. When we denote the edges of the temporal networks as et , t = 1, 2, ..., T , the N 2 can be replaced with ne approximatively, where ne denotes the average edges for all snapshots of

CE

245

the temporal network. In addition, K is far less than N and ne , so that K can be ignored for the time complexity in the algorithm. Therefore, the time

AC

complexity will degrade to O(niter T (ne + N )), which is linear. Synchronously, using parallel computation or sampling technology can help boost computing

250

performance of our algorithm. Accordingly, our algorithm is easy to be extended to large-scale network.

11

CR IP T

ACCEPTED MANUSCRIPT

Table 1: The optimization Algorithm of the proposed model

Optimization Algorithm

Input: temporal network X = X1 , X2 , ... XT ; initial K, and balance parameter λ,γ Output: W, HTt and Z

Define: W is the probability of common structure, Ht is the community membership matrix,

Initialize: W = rand(N, K); for t in range (1,T) Ht = rand(N, K);

Z = rand(K, K); repeat

M

end for

AN US

and Z represents the probability transition matrix of the representing matrix.

P T ( T t=2 Ht−1 Ht )lk ; T H t=2 t−1 Ht−1 Z)lk P T ( t=1 Xt Ht )ik Wik ← Wik (PT (WH T H )) +λ ; t ik t t=1 (XT W+γH ZT ) 1 Hjk,1 ← Hjk,1 (H1 WT W+γH2 1 ZZjk,1 ; T) jk,1

PT

ED

Zlk ← Zlk (PT

for t in range(2, T-1) Hjk,t ← Hjk,t

T (XT t W+γHt−1 Z+γHt+1 Z )jk,t ; (Ht WT W+γHt +γHt ZZT )jk,t

CE

end for

(XT W+γH

Z)

T −1 jk,T ; Hjk,T ← Hjk,T (HTT WT W+γH T )jk,T

untill convergence;

AC

Return W, HTt and Z.

12

ACCEPTED MANUSCRIPT

4. Experiments In this section, we introduce evaluation metrics, five previously published

255

on both the synthetic and real world dynamic networks. 4.1. Evaluation Metrics and Comparison Algorithm

CR IP T

approaches as comparing groups with our model, and present the experiments

To evaluate the performance of our algorithm, extensive experiments are designed on both synthetic networks and frequently-used real networks. Specif-

ically, we compare the values of two widely-used evaluation metrics, the Normalized Mutual Information (N M I) [38] and the error rate (CA) [39], in our method

AN US

260

and other five base-lines. N M I, a well-known entropy measure, is widely used ˆ be the community structo measure the similarity of two clusterings. Let G tures obtained from our method, and G the ground truth of the community structures. N M I can be defined as the normcalization of mutual information ˆ

ˆ G) by the average of two entropies H(G) ˆ and H(G), N M I = 2 I(G;G) . I(G; ˆ H(G)+H(G) PK r r PK r PK s Nr N ·Nij ˆ ˆ In detail, I(G; G) = i=1 j=1 Nij log( N r ·N s ), H(G) = i=1 Ni log Ni , and i j PK s Ns H(G) = j=1 Njs log Nj , where N is the number of nodes in the network; K r

M

265

ED

and K s are the number of communities of the ground truth structure and that

of the structure obtained by our method, respectively; Nir , Njs and Nij are 270

the number of nodes in the ith community of the ground truth, the number

PT

of nodes in the jth community obtained by our method, and the number of common nodes in ith and jth communities, respectively. The N M I indicates

CE

ˆ and G, whose the value is a quantity from 0 to 1. The consistency between G ˆG ˆ T − GGT ||2 , it measures the distance between CA is defined as CA = ||G F

275

ˆ and that represented by G. The the community structure represented by G

AC

CA value usually increases with the size of the network, and a larger CA value means a poorer result. We choose five previously published approaches for detecting communities

as compareing groups with our method.

280

- a traditional two-steps strategy, which divides the network into discrete 13

ACCEPTED MANUSCRIPT

time steps and performs the static symmetric NMF (SNMF)[28] on each snapshot, respectively. - the structural and functional discovery with NMF method (SFNMF) [24],

CR IP T

which is a model based on NMF, which can display underlying structure and its evolution over time.

285

- the FaceNet method [32]which is a framework for analyzing Communities and their evolutions through a robust unified process, which consider the evolutions of communities and the temporal smoothness of evolutions, and

AN US

we set the parameter λ = 0.8.

- the Genluovain algorithm [23], optimize the temporal, multiscale and mul-

290

tiplex modularity with a greedy heuristic method. Here, we set the resolution parameter γ = 1 and couple parameter ω = 0.5 for this method, which are the commonly used parameter settings in the related works.

M

- the multi-similarity spectral clustering method (MCSS), which is an evolutionary spectral clustering algorithm with considering multiple similarity

295

ED

matrices simultaneously. 4.2. Illustrative example

PT

To clarify the working principle of our model, we make an illustrative example to explain each term in a small synthetic temporal network, which is 300

generated according to the description by Greene et al.[40]. Here, we generate a

CE

small synthetic network sample based on the embedding of switch events, which occurs when the nodes move among the communities. The small switch net-

AC

work involves 100 nodes, 7 embedded dynamic communities, and 20% of node memberships are randomly permuted at each snapshot to simulate the natural

305

movement of users among communities over time. Each snapshot graphs share the following parameters: the nodes have a mean degree of 15, a maximum degree of 50, and a mixing parameter value of µ = 0.7, which controls the overlap between communities.

14

ACCEPTED MANUSCRIPT

SNMF

0.45

SFNMF

0.4

FaceNet

CR IP T

This Work

0.35

Genlouvain

NMI

0.3

MSSC

0.25 0.2 0.15 0.1 0.05

1

2

3

4

5

6

7

8

9

10

AN US

snapshot

Figure 1: The illustrative example on a small switch network: NMI value of different methods on the switch synthetic network. The network has 10 snapshots, 100 nodes and 7 embedded dynamic communities, the nodes have a mean degree of 15, a maximum

M

degree of 50, and the mixing parameter µ is 0.7.

ED 0.15

0.1

0.05

0

50

C5

0.2

99

0.15

CE

0.1

0.05

0

100

50

100

0

50

100

95 0.15 0.1

0.15 80

0.1 0.05 0

0

50

100

AC

100

89

0.1 0.05

C7

0.2

0 50

0.15

0 0

C6

0.2

85

0 50

0.05

0

0

0.1

0

100

C4

0.2

0.15

0.05

Posibility

PT

0

C3

0.2

98

Posibility

Posibility

94

0.1

Posibility

Posibility

0.15

0.05

Posibility

C2

0.2

Posibility

C1

0.2

0

50

100

Figure 2: The illustrative example on a small switch network: the prior probability that each community include each node of the small switch synthetic network. The network is the same as the network in Fig.1. The x-axes show the node label, and the pink numbers represent the label of important nodes.

15

ACCEPTED MANUSCRIPT

The NMI value of different methods on the switch synthetic network is 310

showed in Fig.1, and the average level of our method has improved more than 35% than that of others, which indicates that our method has a superior per-

CR IP T

formance. Moreover, the prior probability that a specific community includes each node is showed in Fig.2. In fact, the prior probability can incarnate the

importance of a node in a community. Intuitively, from each histogram in Fig.2, 315

we can see in the prior probability distribution that the most important nodes

of community 1 ∼ 7 are node 94, node 98, node 85, node 89, node 99, node 95, node 80, respectively. Naturally, our model can automatically identify the

AN US

importance of the nodes in communities of dynamic networks automatically, which can be broadly applied to virtual marketing or diffusion of information. C1 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

C2 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

C3

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

C4

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

M

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

ED

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

C5

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

C6

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

C7

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0

PT

Figure 3: The illustrative example on a small switch network: the probability that each nodes connect each communities at each snapshot of the small switch synthetic network. The network is the same as the network in Fig.1. Each histogram shows

CE

the probability of a node connects 7 communities at 10 snapshots, respectively, the different colours indicate the different communities. For example, the blue (C1) indicates community

AC

1.

320

In Fig.3, it shows the probability that a specific node connects each com-

munity at each snapshot, and the 7 colors indicate community 1 ∼ 7. The 100 histograms correspond to nodes 1 ∼ 100 in horizontal order, respectively. From each histogram in Fig.3, we can intuitively see the evolution of the probability distribution that a specific node connects the 7 communities at snapshots, and

16

ACCEPTED MANUSCRIPT

325

the x-axes show the snapshots t. Furthermore, to demonstrate the mixing degree of the probability that a specific node connects different communities, we compute its information entropy and demonstrate the changes of the informa-

CR IP T

tion entropy of all nodes at snapshots in Fig.4. For example, the probability distribution that node 41 (the histogram in the 5th row and the first column 330

of Fig.3) is almost uniform at the first several snapshots: the probability that node 41 connects community 2 approaches to 1. However, the above probability becomes smaller and smaller, while the probability connects community 4 is

bigger and bigger over time. Correspondingly, we can clearly see that the infor-

335

AN US

mation entropy of node 41 is getting bigger over time in Fig.4. The probabilities

that node 1 (the first histogram in Fig.3) connects the communities don’t have obvious changes over time. Correspondingly, the information entropy of node 1 keep stable over time from Fig.4.

Information entropy

10 9

6 5 4 3 2 1

10

20

PT

0

ED

Snapshots

7

0

2.5

M

8

30

2

1.5

1

0.5

0

40

50

60

70

80

90

100

Nodes

CE

Figure 4: The illustrative example on a small switch network: the information entropy of the probability that a node connect each communities at each snapshot of the small switch synthetic network. The network is the same as the network in Fig.1. The different colours indicate the value of information entropy, the x-axes show the nodes

AC

label n, and the y-axes show the snapshots t.

However, how do the nodes transfer among communities over time in dy-

namic network? To demonstrate this problem, we visualize the nodes transition

340

among communities at snapshots in Fig.5, which shows how the nodes transfer and the size of communities evolve at snapshots. The x-axes represent snap17

ACCEPTED MANUSCRIPT

7 6

4 3 2 1 1

2

3

4

5

6

7

8

9

10

AN US

Snapshot

CR IP T

Community

5

Figure 5: The illustrative example on a small switch network: the visualization of the nodes transition among communities at snapshots of the small switch synthetic network. The network is the same as the network in Fig.1. The size of dots represents the size of communities, the arrows represent the direction of the nodes transition, the x-axes

M

show the snapshots, and the y-axes show the communities.

7

0.9 0.8

ED

6

0.7 0.6

4

0.5

3

PT

Community l

5

0.4 0.3

2

0.2

1

CE

0

0

0.1 0

1

2

3

4

5

6

7

Community k

Figure 6: The illustrative example on a small switch network: the transition prob-

AC

ability from community l to community k at each two adjacent snapshots of the

small switch synthetic network. The network is the same as the network in Fig.1. As

shown in the legend, the different colours represent the transition probability from community l to community k, whose value correspond to the values of the elements in the transition

probability matrix Z respectively. The x-axes and the y-axes show the community label at current snapshot and last snapshot respectively.

18

ACCEPTED MANUSCRIPT

shots t, and the y-axes represent community indexes. The colorful dots represent communities, and their sizes demonstrate the number of nodes in communities. Correspondingly, Fig.6 shows the transition probability from community l to community k at each two adjacent snapshots. As shown in the legend, the

CR IP T

345

different colours represent the transition probability from community l to community k, whose value correspond to the values of the elements in the transition probability matrix Z respectively. In the other words, Fig.6 show the possibil-

ity that a node transfers from community l to community k. As shown in the 350

figure, the diagonal line is biggest, meaning that there is a bigger probability

AN US

that a node will keep in its current community. For example, there are many

nodes transfer from community 7 to community 1, 3 and 6, and the size of community 7 becomes smaller and smaller over time in Fig.5. Correspondingly, we can clearly see the probability from community 7 to community 1, 3 and 6 355

is bigger than others in Fig.6.

M

4.3. Parameter analysis

5 4

0.25

0.2

PT

3

0.3

ED

6

0.15

0.1

1

0.05

0 0

1

2

3

4

5

6

7

8

9

10

AC

CE

2

Figure 7: Parameter analysis on the small switch network. The NMI values of our

method with the parameters λ (range 0 ∼ 10) and γ (range 0 ∼ 6) on the small switch

synthetic network, which is the same as the network in Fig.1. As shown in the legend, the different colors represent the NMI values from 0 to 1.

To analyze the sensitivity of the parameters, we compute the NMI values 19

ACCEPTED MANUSCRIPT

of our method with the balance parameters λ (range 0 ∼ 10 with steplength of 0.1) and γ (range 0 ∼ 6 with steplength of 0.1) in the small switch synthetic 360

network. As shown in Fig. 7, the performance of our method almost tends

CR IP T

to stable when parameter λ is bigger than 5. Nevertheless, the performance is much better when parameter γ is between 0 ∼ 1. Therefore, it is a good

choice to set the parameters λ = 5 and γ = 0.3, which is used in all experiments

about our method in this work. According to the commonly used parameter 365

settings in the related works, we set the parameter λ = 0.8 for the FaceNet

method, the resolution parameter γ = 1 and couple parameter ω = 0.5 for the

AN US

Genluovain method. In addition, SNMF, SFNMF and MCSS do not require setting parameters. 4.4. Results comparison

The NMI and CA of our approach are compared against other five existing

370

approaches for detecting communities in dynamic networks: switch synthetic

M

networks and real-world networks from KIT e-mail data 1 . The switch network is also generated according to the description by Greene et al.[40].

375

ED

In order to demonstrate the performance of our method, comparison experiments were designed on the switch synthetic networks with three parametric cases. In Fig. 8, we intuitively demonstrate the performance on three switch

PT

networks, which share the following parameters: the number of snapshots is 10, the number of nodes is 1000 at each snapshot, and the maximum degree of the nodes is 50. The difference is that the mixing parameter µ is 0.7 and 0.8, and the average degree of the nodes is 5 and 15, respectively. Specifically, Fig. 8(a,

CE 380

b) show NMI and CA values of different methods at 10 snapshots in the switch

AC

network, where the mixing parameter µ is 0.7 and the average degree is 15. The average value of NMI from our method has improved by 11.49% and CA has reduced by 29.25% compared to FaceNet, showing the best performance among

385

these five methods in Fig.8(a, b). Similarly, Fig. 8(c, d) show NMI and CA 1 http://i11www.iti.uni-karlsruhe.de/en/projects/spp1307/emaildata

20

ACCEPTED MANUSCRIPT

CA

0.5 0 1

2

3

4

5

6

7

8

9

5

0

10

1

2

(a)

3

4

1

0.2

1

2

3

4

5

6

7

8

9

0

10

1

(c)

2

3

4

CA

10

0.5

5 0

0 1

2

3

4

5

6

7

8

9

1

10

7

8

9

10

5

6

7

8

9

10

(d)

104

15

1

6

(b)

0.4

0

5

105

2

CA

NMI

0.6

NMI

SNMF SFNMF FaceNet This Work Genlouvain MSSC

CR IP T

NMI

104

10

1

2

3

4

(e)

5

6

7

8

9

10

(f)

AN US

Figure 8: Performance for different methods in synthetic networks. (a,b)NMI and

CA values of different methods at 10 snapshots in synthetic networks, where the number of nodes is 1000 at each snapshot, the mixing parameter µ is 0.7, the average degree of the nodes is 15, and the maximum degree of the nodes is 50. (c,d) NMI and CA values of different menthods at 10 snashots in synthetic networks, where the number of nodes is 1000 at each snapshot, the mixing parameter µ is 0.8, the average degree of the nodes is 15, and the maximum degree of the nodes is 50. (e,f )NMI and CA values of different methods at 10

M

snapshots in synthetic networks, where the number of nodes is 1000 at each snapshot, the mixing parameter µ is 0.7, the average degree of the nodes is 3, and the maximum degree of

ED

the nodes is 50.

0.6

SNMF

2500

CA

0.8

PT

NMI

1

This Work

9

13

17

MSSC

21

24

1

5

9

13

(a)

17

21

24

7

8

(b)

0.8

6000

CA

8000

0.6 0.4

4000 2000

0.2

0 2

3

4

5

6

7

8

1

2

3

4

5

6

(d)

(c)

AC

1

Genlouvain

500

1

CE

NMI

5

FaceNet

1500 1000

0.4

1

SFNMF

2000

Figure 9: Performance for different method in real-world networks. NMI (a, c) and

CA (b, d) values of different methods at 10 snapshots in the real world network of e-mail contacts at the department of computer science at KIT. (a, b) The results is from the e-mail

networks taking 2 months as a snapshot. (c, d) The results is from the e-mail networks taking 6 months as a snapshot.

21

ACCEPTED MANUSCRIPT

values of different methods at 10 snapshots in the switch network, where the mixing parameter µ is 0.8 and the average degree is 15. The average value of NMI from our method has improved by 46.82% and CA has reduced by 5.33%

390

CR IP T

compared to FaceNet. And then, Fig. 8(e, f ) show NMI and CA values of different methods at 10 snapshots in the sparse switch network, where the mixing parameter µ is 0.7 and the average degree is 3. The average value of NMI from

our method has improved by 34.3% and CA has reduced by 66.41% compared

to MSSC. In Fig. 8 shows that NMI value of our method always outperforms that of other five methods, and CA value is mostly lower. From the above, these data indicate our method has a better performance.

AN US

395

In addition, we design some experiments on a real-world network (e-mail communication network) with an ever-changing graph during 48 consecutive months from September 2006 to August 2010 collected by the Department of Informatics at KIT. In the e-mail network, the vertices are the members of the 400

department of computer science at KIT, the weight of the edges correspond to

M

the number of e-mails sent between two individuals, and the clusters represent different groups in the department of computer science at KIT. The different

ED

number of months as a snapshot leads to the different number of clusters in each interval. For example, the shorter the interval, the more the data points will be 405

treated as isolated points. Due to limited space, we choose the results for two

PT

kinds of snapshot situations in Fig.9. Specifically, Fig.9(a, b) and Fig.9(c, d) correspond to the NMI and CA values for the 24 snapshots situation (each snap-

CE

shot is 2 months long) and the 8 snapshots situation (each snapshot is 6 months long). As shown in Fig.9(a, b), the average value of NMI from our method

has improved by 4.01% and CA has reduced by 5.78% compared to FaceNet,

showing the best performance among these five methods. Meanwhile, Fig.9(c,

AC

410

d) shows the average value of NMI from our method has improved by 4.20% and CA has reduced by 1.30% compared to FaceNet. With the comparison in Fig.9, we can find that our method outperforms the five baseline methods.

22

ACCEPTED MANUSCRIPT

415

5. Discussion In this work, we propose a novel method based on NMF to explore temporal and constant community structure in dynamic networks synchronously. Our

CR IP T

proposed model can be interpreted completely from the probabilistic perspec-

tive. Firstly, we describe the temporal networks with the connecting proba420

bility matrix, the permanent probability matrix of common structure and the probabilistic community membership matrix at each snapshot of the temporal

networks. Secondly, the permanent probability matrix of common structure can be treated as prior information of nodes at snapshots of temporal networks,

425

AN US

which denotes the importance of nodes in communities. Thirdly, we introduce

a probability transition matrix among the temporal communities as a penalty term in our model to obtain the constant evolutionary patterns between the snapshots of temporal networks. Fourthly, we constraint the probability matrix of common structure in the form of an l1 penalty. Lastly, a simple but effective

430

M

gradient descent algorithm is introduced to optimize the objective function in our model, which can converge to a stationary. Aiming at the dynamic networks, the core purpose of our method can

ED

be summarized as three aspects: detecting the temporal community structure, exploring the constant evolutionary pattern of communities over time,

435

PT

and identifying the importance of nodes in community. Our model is suitable for different kinds of dynamic networks including weighted/unweighted and directed/undirected, and can facilely detect both the non-overlapping and the

CE

overlapping community structure. The importance of nodes in communities is determined by comparing the probability of a specific community including each node. More importantly, with the probability transition matrix, we can detect the constant evolutionary pattern at snapshots. We also adopt a gradient de-

AC 440

scent algorithm to optimize the objective function in the proposed model, and analysis its complexity. In Section 4, we have designed several experiments as the illustrative examples on a synthetic network to clarify the working principle of our model. The

23

ACCEPTED MANUSCRIPT

445

illustrative examples include observing the performance of non-overlapping community detecting, identifying the importance of nodes in communities, and visualizing the changing of community membership about each node over time and

CR IP T

the constant evolutionary patterns of communities. Furthermore, we have evaluated the performance of our method on both synthetic and real-world networks, 450

and compared it with five widely-used methods. The results of experiments show that our method have remarkable performance on detecting communities in dynamic networks.

However, there are also some problems in our method to be further stud-

455

AN US

ied. In this work, we explore the constant evolutionary pattern of communities with a persistent probability transition matrix over time, but the evolutionary

pattern may be time-varying in dynamic networks. Thus, studying the timevarying evolutionary pattern will be our future work. Furthermore, the balance parameters λ and γ of our model can’t be modified automatically, which are just confirmed by the experiments. How to determine the balance parameters automatically may also be our future work. Another important problem is that

M

460

our model is not suitable for the changing number of nodes or communities over

ED

time in temporal networks, but those numbers are always time-varying in the real world. We believe that if we can optimize our method to solve the above

465

PT

problems, the effectiveness of our method will be further promoted.

Acknowledgements

CE

This work was supported by the Major Project of National Social Science

Fund (14ZDB153), the Project of National Social Science Fund (15BTQ056), the major research plan of the National Natural Science Foundation(91224009,

AC

51438009, 91746205, 91746107), the fundamental research of Xinjiang Corps

470

(2016AC015).

24

ACCEPTED MANUSCRIPT

References [1] V. Spirin, L. A. Mirny, Protein complexes and functional modules in molecular networks, Proceedings of the National Academy of Sciences 100 (21)

475

CR IP T

(2003) 12123–12128.

[2] A.-L. Barabˆ asi, H. Jeong, Z. N´eda, E. Ravasz, A. Schubert, T. Vicsek, Evolution of the social network of scientific collaborations, Physica A: Statistical mechanics and its applications 311 (3) (2002) 590–614.

[3] J. Chen, B. Yuan, Detecting functional modules in the yeast protein–

480

AN US

protein interaction network, Bioinformatics 22 (18) (2006) 2283–2290.

[4] A. W. Rives, T. Galitski, Modular organization of cellular networks, Proceedings of the National Academy of Sciences 100 (3) (2003) 1128–33. [5] Y. Dourisboure, F. Geraci, M. Pellegrini, Extraction and classification of

M

dense communities in the web, in: Proceedings of the 16th international conference on World Wide Web, ACM, 2007, pp. 461–470. [6] G. W. Flake, S. Lawrence, C. L. Giles, F. M. Coetzee, Self-organization and

ED

485

identification of web communities, in: IEEE Computer, 2002, pp. 66–71. [7] R. Guimer, L. A. Nunes Amaral, Functional cartography of complex

PT

metabolic networks., Nature 433 (7028) (2005) 895–900. [8] G. Palla, I. Der´enyi, I. Farkas, T. Vicsek, Uncovering the overlapping community structure of complex networks in nature and society, Nature

CE

490

435 (7043) (2005) 814–818.

AC

[9] A. E. Krause, K. A. Frank, D. M. Mason, R. E. Ulanowicz, W. W. Taylor,

495

Compartments revealed in food-web structure, Nature 426 (6964) (2003) 282–285.

[10] S. L. Pimm, The structure of food webs, Theoretical population biology 16 (2) (1979) 144–158.

25

ACCEPTED MANUSCRIPT

[11] S. Fortunato, D. Hric, Community detection in networks: A user guide, Physics Reports 659 (2016) 1–44. [12] U. Von Luxburg, A tutorial on spectral clustering, Statistics and computing

CR IP T

17 (4) (2007) 395–416.

500

[13] X. Qin, W. Dai, P. Jiao, W. Wang, N. Yuan, A multi-similarity spectral clustering method for community detection in dynamic networks, Scientific Reports 6 (31454) (2016) 1–12.

[14] M. E. Newman, E. A. Leicht, Mixture models and exploratory analysis in

AN US

networks, Proceedings of the National Academy of Sciences 104 (23) (2007)

505

9564–9569.

[15] T. P. Peixoto, Hierarchical block structures and high-resolution model selection in large networks, Physical Review X 4 (1) (2014) 011047. [16] M. E. Newman, M. Girvan, Finding and evaluating community structure in networks, Physical review E 69 (2) (2004) 026113.

M

510

[17] B. H. Good, Y.-A. de Montjoye, A. Clauset, Performance of modular-

046106.

ED

ity maximization in practical contexts, Physical Review E 81 (4) (2010)

PT

[18] L. G. Jeub, P. Balachandran, M. A. Porter, P. J. Mucha, M. W. Mahoney, Think locally, act locally: Detection of small, medium-sized, and large

515

CE

communities in large networks, Physical Review E 91 (1) (2015) 012821. [19] U. N. Raghavan, R. Albert, S. Kumara, Near linear time algorithm to

AC

detect community structures in large-scale networks, Physical review E

520

76 (3) (2007) 036106.

[20] P. Holme, J. Saramki, Temporal networks, Physics Reports 519 (3) (2012) 97–125.

[21] G. Palla, A. L. Barabsi, T. Vicsek, Quantifying social group evolution., Nature 446 (446) (2007) 664–667. 26

ACCEPTED MANUSCRIPT

[22] M. E. Newman, The structure of scientific collaboration networks., Proceedings of the National Academy of Sciences of the United States of Amer-

525

ica 98 (2) (2001) 404.

CR IP T

[23] P. J. Mucha, T. Richardson, K. Macon, M. A. Porter, J.-P. Onnela, Community structure in time-dependent, multiscale, and multiplex networks, science 328 (5980) (2010) 876–878. 530

[24] S. Mankad, G. Michailidis, Structural and functional discovery in dynamic

networks with non-negative matrix factorization, Physical Review E 88 (4)

AN US

(2013) 042812.

[25] T. Hartmann, A. Kappes, D. Wagner, Clustering evolving networks, in: Algorithm Engineering, Springer, 2016, pp. 280–329. 535

[26] G. Rossetti, R. Cazabet, Community discovery in dynamic networks: a survey, arXiv preprint arXiv:1707.03186.

M

[27] S. Asur, S. Parthasarathy, D. Ucar, An event-based framework for characterizing the evolutionary behavior of interaction graphs, Acm Transactions

540

ED

on Knowledge Discovery from Data 3 (4) (2009) 913–921. [28] F. Wang, T. Li, X. Wang, S. Zhu, C. Ding, Community discovery using nonnegative matrix factorization, Data Mining and Knowledge Discovery

PT

22 (3) (2011) 493–521.

[29] L. Akoglu, M. Mcglohon, C. Faloutsos, Rtm: Laws and a recursive gen-

CE

erator for weighted time-evolving graphs, in: Eighth IEEE International Conference on Data Mining, 2008, pp. 701–706.

545

AC

[30] T. Yang, Y. Chi, S. Zhu, Y. Gong, R. Jin, Detecting communities and their evolutions in dynamic social networks–a bayesian approach, Machine Learning 82 (2) (2011) 157–189.

[31] D. Chakrabarti, R. Kumar, A. Tomkins, Evolutionary clustering, in: Pro-

550

ceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, 2006, pp. 554–560. 27

ACCEPTED MANUSCRIPT

[32] Y.-R. Lin, Y. Chi, S. Zhu, H. Sundaram, B. L. Tseng, Facetnet: a framework for analyzing communities and their evolutions in dynamic networks, in: Proceedings of the 17th international conference on World Wide Web, ACM, 2008, pp. 685–694.

CR IP T

555

[33] D. D. Lee, H. S. Seung, Algorithms for non-negative matrix factorization, in: Advances in neural information processing systems, 2001, pp. 556–562.

[34] X. Luo, M. Zhou, Y. Xia, Q. Zhu, A. C. Ammari, A. Alabdulwahab, Generating highly accurate predictions for missing qos data via aggregating

nonnegative latent factor models, IEEE transactions on neural networks

AN US

560

and learning systems 27 (3) (2016) 524–537.

[35] X. Luo, M. Zhou, S. Li, Z. You, Y. Xia, Q. Zhu, A nonnegative latent factor model for large-scale sparse matrices in recommender systems via alternating direction method, IEEE transactions on neural networks and learning systems 27 (3) (2016) 579–592.

M

565

[36] X. Luo, J. Sun, Z. Wang, S. Li, M. Shang, Symmetric and nonnegative la-

ED

tent factor models for undirected, high-dimensional, and sparse networks in industrial applications, IEEE Transactions on Industrial Informatics 13 (6) (2017) 3098–3107.

[37] P. Paatero, U. Tapper, Positive matrix factorization: A non-negative factor

PT

570

model with optimal utilization of error estimates of data values, Environ-

CE

metrics 5 (2) (1994) 111–126. [38] L. Danon, A. Diaz-Guilera, J. Duch, A. Arenas, Comparing community

AC

structure identification, Journal of Statistical Mechanics: Theory and Ex-

575

periment 2005 (09) (2005) P09008.

[39] Y. R. Lin, Y. Chi, S. Zhu, H. Sundaram, B. L. Tseng, Analyzing communities and their evolutions in dynamic social networks, Acm Transactions on Knowledge Discovery from Data 3 (2) (2009) 8.

28

ACCEPTED MANUSCRIPT

[40] D. Greene, D. Doyle, P. Cunningham, Tracking the evolution of communi580

ties in dynamic social networks, in: Advances in social networks analysis and mining (ASONAM), 2010 international conference on, IEEE, 2010, pp.

AC

CE

PT

ED

M

AN US

CR IP T

176–183.

29

ACCEPTED MANUSCRIPT

AN US

CR IP T

Pengfei Jiao received the B.S. degree from the Hainan University, Haikou, China, in 2012. He is currently pursuing the Ph.D. degree from the School of Computer Science and Technology, Tianjin University, Tianjin, China. His current research interests include dynamic complex network analysis, data mining, and machine learning.

PT

ED

M

Wei Yu received the B.S. degree from the Jinggangshan University, Ji’an, China, in 2012, and M.S. degree from Henan Normal University, Xinxiang, China, in 2015. He is currently pursuing the Ph.D. degree from the School of Computer Science and Technology, Tianjin University, Tianjin, China. His research interests include dynamic complex network analysis, large-scale data mining and machine learning.

AC

CE

Wenjun Wang received the Ph.D. degree from Peking University, Beijing, China, in 2004. He is currently a Professor with the School of Computer Science and Technology, Tianjin University, Tianjin, China. His research interests include computational social science, emergency management, large-scale data mining and network science.

Xiaoming Li is a PH.D student in School of Computer Science and Technology at Tianjin University, China. Before that, he received his Master’s degree from Xinjiang university in 2008. Currently, he is a deputy researcher of the network information

ACCEPTED MANUSCRIPT

CR IP T

center of the twelve divisions of the corps. His research interests include human behavior dynamics and data mining.

AC

CE

PT

ED

M

AN US

Yueheng Sun received the Ph.D. degree from Tianjin University, Tianjin, China, in 2005. He is currently a lecturer with the School of Computer Science and Technology, Tianjin University, Tianjin, China. His research interests include social computing, data minning and machine learning.