HQCA-WSN: High-quality clustering algorithm and optimal cluster head selection using fuzzy logic in wireless sensor networks



JID:FSS AID:7775 /FLA

[m3SC+; v1.304; Prn:5/12/2019; 13:31] P.1 (1-30)

Available online at www.sciencedirect.com

ScienceDirect


Fuzzy Sets and Systems ••• (••••) •••–•••


www.elsevier.com/locate/fss


HQCA-WSN: High-quality clustering algorithm and optimal cluster head selection using fuzzy logic in wireless sensor networks


Amir Abbas Baradaran a,∗, Keivan Navi b

a Department of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran
b Faculty of Computer Science and Engineering, Shahid Beheshti University G.C., Tehran, Iran

Received 14 May 2018; received in revised form 26 February 2019; accepted 29 November 2019


Abstract

Reducing energy consumption and extending the network lifetime are the main challenges in wireless sensor networks (WSNs). High-quality clustering is one of the most important approaches for reducing energy consumption in WSNs. Various criteria can be used to assess the quality of clusters, and considering all of these criteria can lead to high-quality clustering. In this study, we propose a method called the high-quality clustering algorithm (HQCA) for generating high-quality clusters. The HQCA method uses a criterion for measuring cluster quality that improves the inter-cluster and intra-cluster distances and reduces the error rate during clustering. The optimal cluster head (CH) is selected using fuzzy logic according to criteria such as the residual energy, the minimum and maximum energy in each cluster, and the minimum and maximum distances between the nodes in each cluster and the base station. The main advantages of this method are its high reliability, low error rate during the clustering process, independence from key CHs, better scalability, and good performance in large-scale networks with many nodes. The validity of the clustering quality is also measured using external and internal criteria. Simulation results demonstrate that the HQCA-WSN method can significantly reduce energy consumption and extend the network lifetime. The proposed method also significantly improves the first node dies and last node dies metrics compared with similar methods. © 2019 Elsevier B.V. All rights reserved.

Keywords: Clustering; Energy consumption; Fuzzy logic; Quality; Wireless sensor network

1. Introduction


Wireless sensor networks (WSNs) comprise a set of nodes for monitoring and recording environmental data [1]. Nodes in a WSN are deployed either randomly or deterministically [2]. In places that are difficult for humans to access, the nodes are distributed randomly (e.g., by throwing them onto the site). The number of nodes is high and each node is charged with only a small amount of initial energy. Therefore, energy consumption and the network lifetime are the main challenges that affect these networks [3–5]. The data collected by the nodes are transmitted to a base station (BS) for processing


* Corresponding author.

E-mail addresses: [email protected] (A.A. Baradaran), [email protected] (K. Navi). https://doi.org/10.1016/j.fss.2019.11.015 0165-0114/© 2019 Elsevier B.V. All rights reserved.


Fig. 1. Single-hop and multi-hop data transfer in WSNs.


Fig. 2. Clustering in a WSN.


[6]. Data transfer can be conducted in a single-hop or multi-hop manner [7,8]. Fig. 1 illustrates the single-hop and multi-hop methods. Major applications of WSNs include monitoring forest fires, monitoring the condition of vulnerable patients, and controlling military areas and traffic [9–12]. In general, the key factors that affect the design of WSNs are fault tolerance, scalability, cost, hardware limitations, reliability, the WSN topology, the transmission environment, and energy consumption [13–15].

The two approaches used to increase the lifetime of WSNs are clustering and routing [16–20]. During clustering, a group of sensors is placed in a category called a cluster according to a set of common attributes. A highly qualified node in each cluster is selected as the cluster head (CH). The role of the CH is to collect the data received from the cluster members and to transfer them to the BS or to a higher-level CH, depending on the type of transmission (single-hop or multi-hop). In single-hop transmission, the CHs transfer the received data directly to the BS, whereas in multi-hop transmission, the CHs transfer the received data to higher-level CHs, which then transfer the data to the BS. Multi-hop transmission is always used in large-scale networks. The cluster members are generally divided into two groups comprising common nodes and CHs. Fig. 2 illustrates the clustering process in a WSN.

One of the key challenges in clustering is ensuring the quality of the clusters [21]. Many criteria can be used to assess cluster quality, where density and separation (inter-cluster and intra-cluster distances) are among the basic indices utilized for this purpose [22,23]. The data belonging to each cluster should be sufficiently close together and the clusters formed should be sufficiently separate to minimize their overlap.
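As a rough illustration of density and separation, the following Python sketch computes one intra-cluster measure (the mean distance of the points to their cluster centroid) and three inter-cluster distances (nearest pair, farthest pair, and centroid to centroid). The function names and the toy coordinates are assumptions made for this example; they are not taken from the HQCA method.

```python
import math
from itertools import product


def centroid(points):
    """Arithmetic mean of a list of (x, y) points."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def intra_cluster_distance(points):
    """Mean distance from each point to the cluster centroid (compactness)."""
    c = centroid(points)
    return sum(math.dist(p, c) for p in points) / len(points)


def separation(cluster_a, cluster_b):
    """Inter-cluster distances: nearest pair, farthest pair, and centroids."""
    pair_distances = [math.dist(a, b) for a, b in product(cluster_a, cluster_b)]
    return {
        "nearest": min(pair_distances),
        "farthest": max(pair_distances),
        "centroid": math.dist(centroid(cluster_a), centroid(cluster_b)),
    }


# Toy data (hypothetical): two small clusters of sensor positions.
cluster_a = [(0.0, 0.0), (0.0, 2.0)]
cluster_b = [(4.0, 0.0), (4.0, 2.0)]
compactness_a = intra_cluster_distance(cluster_a)
distances_ab = separation(cluster_a, cluster_b)
```

Smaller intra-cluster values together with larger inter-cluster values indicate compact, well-separated clusters.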
Separation is defined according to three parameters: the distance between the nearest data points in two clusters, the distance between the farthest data points in two clusters, and the distance between the centers of two clusters. The other criteria used for measuring the quality of clusters can be divided into three categories: external, internal, and relative criteria. External criteria such as the normalized mutual information (NMI) and entropy depend on predefined structures. Thus, these criteria can be used when correct clustering results are available for comparison with our own results [24,25]. Lower entropy values and higher NMI values indicate better-quality clustering. Unlike the external criteria,


Table 1. Features of clustering methods.

Partition-based clustering
Advantages:
• Simplicity and scalability (fairly good)
• Suitable for relatively low node numbers and spherical clusters
• High performance with split clusters
Disadvantages:
• Low performance in large-scale networks
• Number of clusters determined by the user
• High dependence on the initial phase (setup)
• Sensitive to noise
• Inefficient in harsh environments
• Unsuitable for convex clusters with different sizes
• Inefficient with crowded and non-uniform clusters

Hierarchical clustering
Advantages:
• Access all data and obtain the best solution
• High flexibility
• Suitable for point-to-point communications (e.g., tree structures)
Disadvantages:
• Unsuitable (high complexity) for the integration and separation of data
• Lack of proper solutions to prove the quality of the clusters
• Uncertainty regarding the criteria for terminating clustering
• High costs when applied to large-scale and high-density networks

Density-based clustering
Advantages:
• Dynamic clustering (number of clusters not fixed)
• Suitable for clusters with different shapes, sizes, and properties
• Robust to noise and good performance in harsh environments
Disadvantages:
• High dependence on input parameter settings
• Fails to provide a proper solution to confirm or reject the quality of the cluster
• Inefficient in dense networks

Spectral clustering
Advantages:
• Suitable for image processing
• Low processing time
• Uses a similarity matrix for clustering
• Simple implementation
• Suitable for low number of nodes
Disadvantages:
• High complexity when generating the similarity matrix
• Sensitive to input parameters selected for clustering
• High complexity during large-scale data clustering

Grid-based clustering
Advantages:
• Dependent on the number of input data
• Low processing time
• High performance with irregular data distributions
• Low computational complexity
Disadvantages:
• Probability of some grids being empty in random distributions
• Imbalanced cluster formation
• Low scalability to large-scale networks


internal criteria such as the sum of squared errors or the mean squared error do not depend on prior knowledge and can directly measure the quality of clusters based on the original data [26]. Relative criteria involve comparisons between different clusters, e.g., a clustering method may be applied to different data and the results are then compared.

Clustering can be conducted with several methods such as hierarchical clustering [27], partition-based clustering [28], spectral clustering [29], grid-based clustering [30], and density-based clustering [31,32]. A tree structure is often used in hierarchical clustering, where greedy algorithms and stepwise optimization are employed. Hierarchical clustering uses either a top-down or a bottom-up approach. In partition-based clustering, the data are divided into several partitions and each partition represents a cluster. In spectral clustering, a similarity matrix is used for clustering. In grid-based clustering, the grid is divided into multiple regions and nodes are added to these regions based on certain features. In density-based clustering, the clusters are areas with higher density separated by regions with lower density. Fig. 3 shows simple graphical examples of the different types of clustering methods. Table 1 provides a general comparison of the different clustering methods.

The aim of routing is to find the optimal route from the clusters to the BS in order to transfer the collected data. The most important parameters that affect routing are the network dynamics, energy constraints, data aggregation, and data transfer. The network dynamics depend on the type of application, which can generally be divided into applications with fixed sensors and applications with mobile sensors. In applications such as traffic control or fire control, the sensors are usually fixed and motionless. In applications such as tracking targets, the sensors are mobile.


Fig. 3. Simple graphical example of clustering methods: a, partition-based clustering; b, hierarchical clustering; c, spectral clustering; d, grid-based clustering; and e, density-based clustering.


Table 2. Features of clustering methods.

Protocol names: Flooding, Directed Diffusion, SPIN, EAR
Features: Request from BS
Protocol type: Data-centric

Protocol names: LEACH, TEEN, APTEEN, PEGASIS
Features: Setup and steady phase. The clusters are formed in the first phase and the CHs are then selected. Information is transmitted in the second phase.
Protocol type: Hierarchical

Protocol names: GEAR, PGR, GAF
Features: Global positioning system needed to determine the positions of the nodes
Protocol type: Based on geographic locations

Protocol names: SAR
Features: Based on tree structure. BS is usually rooted. Edges shaped according to quality of service, energy, and prioritization.
Protocol type: Quality of service-based


Energy constraints affect multi-hop and single-hop transmission approaches. In the multi-hop approach, the power consumption and transmission range are lower. In general, many factors can determine energy losses but the most important are collisions, overheating, and idle listening.

The data transfer model is also divided into three categories: time-driven [33], event-driven [34], and query-driven [35]. In the time-driven model, data are collected periodically and sent by each node. In the event-driven model, nodes report when an event occurs. In the query-driven model, a request is sent by the BS and the nodes then respond to this request. The routing protocols are divided into four general categories and Table 2 shows these general categories of routing protocols together with the protocol names and types.

The remainder of this article is organized as follows. In Section 2, we summarize previous research into clustering as well as discussing the strengths and weaknesses of various methods. In Section 3, we explain the details of the proposed algorithm, network model, radio model, and energy model. In Section 4, we present the clustering process as well as the criteria comprising α, , and μ for measuring the cluster quality. These criteria ensure the quality of clustering. In Section 5, we explain how to select the CH using a fuzzy logic method. Finally, the simulation results are presented in Section 6.


2. Related work


One of the main methods for WSN clustering is the LEACH protocol [36]. LEACH is a hierarchical, probabilistic, distributed, single-hop protocol that improves the network lifetime by distributing the power consumption across the nodes. In this method, data aggregation is conducted on the nodes, which reduces the number of messages sent and thus the energy consumption. In LEACH, the nodes are organized into local clusters and one node in each cluster is selected as the CH. The CH is selected randomly using a probability threshold called T(n). Equation (1) shows the threshold used in this protocol.

$$
T(n) = \begin{cases} \dfrac{p}{1 - p\left(r \bmod \frac{1}{p}\right)} & \text{if } n \in G \\ 0 & \text{otherwise} \end{cases} \tag{1}
$$
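The threshold in equation (1) can be sketched in Python as follows. The sketch assumes the usual LEACH definitions, which are not spelled out in the text above: p is the desired fraction of CHs, r is the current round, and G is the set of nodes that have not served as a CH during the current cycle of 1/p rounds. The function names are hypothetical, and each node draws a uniform random number in [0, 1) that is compared with T(n).

```python
import random


def leach_threshold(p, r, in_G):
    """Threshold T(n) from equation (1); in_G says whether node n belongs to G."""
    if not in_G:
        return 0.0
    cycle = round(1 / p)  # number of rounds in one rotation cycle (assumed integer)
    return p / (1 - p * (r % cycle))


def elect_cluster_heads(node_ids, eligible, p, r, rng):
    """Return the nodes whose random draw falls below their threshold T(n)."""
    return [
        n for n in node_ids
        if rng.random() < leach_threshold(p, r, n in eligible)
    ]


# Hypothetical example: 5 nodes, of which nodes 1 and 3 are still in G.
heads_last_round = elect_cluster_heads(range(5), {1, 3}, 0.1, 9, random.Random(0))
```

With p = 0.1, the threshold grows from 0.1 in round 0 to about 1 in round 9, so every node that is still in G becomes a CH by the end of the cycle, which is how the CH role rotates among the nodes.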


According to equation (1), a node decides to be a CH with probability p and broadcasts this decision. After the CHs are specified, they send acknowledgment messages to the other nodes and each node that is not a CH selects a cluster to join. This selection is based on two criteria: the minimum energy required for connection and the received signal strength indicator (RSSI). An important advantage of LEACH is that the CH role alternates among all of the nodes to achieve load balancing. This alternation is achieved by selecting a random number between zero and one for each sensor. A node is a CH in the current round if its random number is less than T(n). LEACH also provides fully integrated load distribution. By randomly changing the CH, LEACH can balance the energy consumption. The local synchronization scheme in LEACH provides better scalability during cluster formation. Despite its benefits, LEACH has three major


Fig. 4. PEGASIS protocol.


problems. First, the selection of a CH and its alternation are probabilistic, so low-energy nodes may still be selected as CHs. In addition, the selected CHs may be concentrated in a specific area of the network. Therefore, a suitable distribution of CHs is not guaranteed by this protocol and some areas may have no CH. Second, this protocol assumes that a CH has a wide communication range and that the data can be sent directly to the BS. These are not realistic assumptions because a CH is a common node and the BS is not directly reachable by all nodes in most cases. Third, LEACH uses single-hop transmission for intra-cluster and inter-cluster communication, which is not sufficiently effective in large-scale networks.

Due to the major problems with LEACH, a centralized approach called the LEACH-C protocol was developed [37]. The LEACH-C protocol is based on LEACH but each node sends information to the BS concerning its current position and energy level. In this protocol, the clusters are formed centrally by the BS. Thus, in each round and after determining the CHs, the BS sends a message containing the CH identifier to all of the nodes. If the CH identifier matches a node's own identifier, then that node is a CH; otherwise, it is considered a common node and it stays in sleep mode as long as it has nothing to transmit. Selecting an optimal CH is an NP-hard problem in this protocol. Simulation results have shown that this method delivers over 40% more data per unit of energy than LEACH.

The PEGASIS protocol [38] is an improved version of LEACH, which creates a connection chain between all of the sensors in the network instead of forming different clusters. In this protocol, each sensor is associated with its neighbor and only one sensor in the entire network is selected to send the data to the BS, which is referred to as the vital sensor (leader). Fig. 4 shows the PEGASIS protocol.
The major benefits of PEGASIS include an extended network lifetime, improved first node dies (FND) and last node dies (LND) metrics, and lower overheads during the clustering process. However, the existence of a vital sensor in this protocol is a fundamental challenge because all of the network information will be lost if this node fails, and thus this method has low reliability.

TEEN is a hierarchical clustering protocol that is also considered an energy-efficient clustering protocol [39]. The TEEN protocol uses multi-hop transmission and it can improve the network lifetime. However, this method does not distribute the energy consumption appropriately among the clusters. In addition to the protocols introduced above, other protocols such as EECF [40], LLACA [41], CACH [42], BARC [43], LEACH-SWDN [44], O-LEACH [45], PSO-C [46], CBL [47], SONS [48], PECRP [49], and HUCL [50] can reduce the power consumption and increase the network lifetime. Table 3 compares many of the previously proposed protocols.

Tarachand et al. proposed an algorithm called ERA [51], which forms clusters based on the remaining energy of the node and the inter-cluster distance. In this method, each sensor node performs self-organized and independent CH selection, so no messages are exchanged. ERA employs two parameters, the remaining energy and the distance, and each node decides directly whether to be a CH.

Bhatia et al. proposed a method called GADA-LEACH where CH selection is performed by a genetic algorithm [52]. This protocol uses a relay node that serves as an interface between a CH and the BS. The main task of the relay node is to facilitate communication between a CH and the BS. Despite the benefits of GADA-LEACH, low-density regions cannot be differentiated from high-density regions in this method. In addition, due to the genetic algorithm used in this method and the random selection of CHs, the number


Table 3. Comparison of previously proposed protocols for WSN clustering.

EECF
Features:
• Uses the SAB message
• Partition-based clustering
• CH selection using RCRA messages
• Best complexity: O(1)
• Worst complexity: O(n)
Protocol type: Distributed
CH selection: Based on a three-step message, remaining energy, and degree of each node

LLACA
Features:
• Clustering algorithm based on machine learning
• Cluster formation based on two phases (initial clustering and re-clustering)
Protocol type: Hierarchical
CH selection: Based on local information about neighbors

CACH
Features:
• Based on two phases (setup and steady)
• Uses TDMA scheduling in the transmission phase
• Selects CHs rotationally
• Single-hop transmission
Protocol type: Centralized
CH selection: Based on the highest data received at the start

BARC
Features:
• Based on two phases (setup and steady)
• Efficient load balancing
• Introduces a new energy model for the steady phase
• Uses Z-MAC and CSMA/CA protocols for intra-cluster communication
Protocol type: Hierarchical
CH selection: Based on the proposed energy model, residual energy, and adjacency

LEACH-SWDN
Features:
• Uses a sliding window
• FND and LND improvements
Protocol type: Hierarchical
CH selection: Based on threshold, probability, and residual energy

PSO-C
Features:
• For homogeneous and heterogeneous networks
• High scalability
• Based on particle swarm optimization (PSO)
• Uses three phases: cluster formation, choosing the optimal CH, and data transfer
• Intra-cluster communication organization
Protocol type: Hierarchical
CH selection: Based on fitness function, PSO, quality of clusters, energy, and network coverage

CBL
Features:
• Combination of LEACH and PEGASIS protocols
• For large-scale networks
• Uses TDMA
Protocol type: Hierarchical
CH selection: Based on threshold, probabilities, and rotational

SONS
Features:
• Multi-hop protocol
• For large-scale networks
• Routing by spanning tree
• Based on three phases: pre-setup, process, and startup
Protocol type: Hierarchical
CH selection: Based on the spanning tree and the remaining energy level of the node

PECRP
Features:
• Combination of LEACH and HEED protocols
• Multi-hop protocol
• Good aggregation
• High scalability
Protocol type: Distributed
CH selection: Based on residual energy and distance to the neighboring node

HUCL
Features:
• Static clustering and dynamic routing
• Multi-hop protocol
• Low overheads during clustering
• High overheads during routing
Protocol type: Centralized
CH selection: Based on residual energy, distance to the BS, and number of neighbors

of CHs may be higher in some parts of the network and load balancing can be inadequate. Thus, low-density areas may have more CHs than high-density areas. The CBCCP [53] method employs a new hierarchical clustering protocol that focuses on reducing the data transmission time and energy consumption by using a multi-hop data aggregation approach, where routing is performed via a predetermined path called the new transmission algorithm.

Clustering is an efficient approach for organizing a network because it increases the network lifetime, error tolerance, scalability, data aggregation, and load balancing, and it yields a better topology. One problem that affects high-quality


clustering is how to consider all of the useful parameters in the clustering process. Fuzzy logic is one of the most effective approaches for addressing this type of problem.

In the FUZZY-TOPSIS protocol [54], a multi-criteria fuzzy decision-making technique is used to select CHs. This method selects CHs based on the remaining energy, energy consumption of the nodes, number of neighboring nodes, mean distance between neighboring nodes, and distance from the BS. After applying these five criteria in FUZZY-TOPSIS, an index is created for each node, which allows the common nodes to decide whether to be a CH or not. After a CH is determined, a message is sent to its neighbors. Common nodes join the corresponding CHs according to the maximum RSSI value and the smallest distance. In this method, CH selection is performed in six phases: random distribution of nodes, detection of neighboring nodes, CH selection, CH arrangement, a mechanism for organizing intra-cluster communication and inter-cluster multi-hop communication, and node mobility using a predictable weighted decision matrix and random paths.

Another method based on fuzzy logic was proposed by Azad et al. This protocol is a centralized clustering approach with a CH selection criterion based on multiple attribute fuzzy decision making (MADM). ECPF [55] is a protocol that uses three techniques: selecting tentative CHs based on the remaining energy, using fuzzy logic to select the final CHs, and demand-based clustering. The OCM-FCM [56] method uses a C-means-based fuzzy clustering algorithm, where the optimal CHs are selected based on the node density in the network. Clustering is conducted using the C-means algorithm in this method, and CHs are selected based on the distance and degree of membership for each node. However, selecting the C parameter (number of clusters) is a fundamental challenge for the C-means algorithm. Baranidharan et al. proposed a method called DUCF [57] where the CHs are selected using fuzzy logic with an unequal clustering approach based on three criteria: remaining energy of the node, distance to the BS, and degree of each node. The final CHs are selected using the fuzzy inference system. Many other fuzzy logic methods have been proposed, including CHEF [58] and EAUCF [59].

These methods obtain remarkable improvements compared with other previously developed methods, but no algorithm has yet been proposed that considers the cluster density and the inter-cluster and intra-cluster distances simultaneously in addition to the criteria defined above. Thus, the routing process can only begin after ensuring the accuracy and quality of the clusters. Therefore, in order to optimize the energy consumption, increase the network lifetime, and improve the cluster quality, we propose a high-quality clustering algorithm based on a multi-dimensional fuzzy approach for selecting the optimal CHs. In general, the advantages of the proposed method are as follows.

• Reduced energy consumption, increased network lifetime, and improved FND and LND parameters.
• A standard is created to ensure the accuracy and quality of the clusters.
• Relationships are established between the number of nodes, number of clusters, and quality of the clusters.
• Improved inter-cluster and intra-cluster distances.
• Improved C-means clustering and selection of better cluster centers.
• A new criterion is established for differentiating clusters in high-density areas and clusters in low-density areas. This criterion can be used to determine whether the node distribution in a cluster is balanced or uniform, and whether the energy consumption is less in a cluster where the node distribution is balanced.
• CH selection is conducted according to the remaining energy on the node, cluster quality criterion, node distribution in the cluster, distance of the node to the BS, mean nodes in the cluster, and cluster density.
• The protocol can be applied to large-scale networks with many nodes (better scalability).
• The proposed protocol is independent of the vital nodes.

3. Details of the proposed algorithm

Before explaining the details of the proposed algorithm, we define the assumptions considered in our method.

3.1. Network model


The high-quality clustering algorithm (HQCA)-WSN has N sensor nodes, all of which are distributed randomly in a square area measuring n × n. The sensors have different initial energies and they cannot be added or removed after deployment. Sensors cannot be recharged after their energy has drained. Sensors know their positions and their positions are also known to the BS. It is also assumed that a media access control (MAC) layer prevents scheduling interruptions during the transmission of messages. Furthermore, the following assumptions are considered.


– The BS is located outside the area of the nodes and its location is variable in different scenarios.
– Wireless communication between nodes is symmetrical.
– The energy, memory, and power of the BS are infinite.
– The positions of the nodes are determined by positioning algorithms or a global positioning system.


3.2. Radio model


The proposed method uses a radio model based on the distance between the transmitter and receiver, where the shortest distance is considered the crossover distance. The transmission power is defined as follows:

$$
P_{tr} = \frac{P_t G_t G_r \lambda^2}{(4\pi d)^2}, \tag{2}
$$

where $P_t$, $G_t$, $G_r$, and $\lambda$ are the transmission power, transmission antenna gain, receiver antenna gain, and wavelength of the signal, respectively, and $d$ is the distance between the transmitter and receiver. When the distance of the receiver is greater than the crossover distance, the transmission power is equal to:

$$
P_{tr} = \frac{P_t G_t G_r h_t^2 h_r^2}{d^4}, \tag{3}
$$

where $h_t$ and $h_r$ represent the transmitter and receiver antenna heights, respectively.

3.3. Energy model

The Heinzelman energy model is used, where the energy consumption required to transmit an m-bit message over a distance of d meters is defined as follows:

$$
E_c(m, d) = \begin{cases} m E_{elect} + m \varepsilon_{fs} d^2 & d < d_c \\ m E_{elect} + m \varepsilon_{mp} d^4 & d \ge d_c \end{cases} \tag{4}
$$

where $\varepsilon_{fs}$ and $\varepsilon_{mp}$ depend on the sender's sensitivity and the shape of the noise, $E_{elect}$ is the electrical energy required to set up the electronic equipment, which depends on digital encoding, modulation, and filtering, and $d_c$ is the crossover distance between the transmitter and receiver. The radio energy required to receive an m-bit message is defined as follows.

$$
E_c = m E_{elect} \tag{5}
$$

3.4. Proposed method

One of the most important factors that affect high-quality clustering is considering the intra-cluster and inter-cluster distances simultaneously. Most previously proposed methods consider only one of these two criteria, or neither. In methods where these criteria are considered, the accuracy and quality of the clusters are not measured at the end of the clustering process. The error rate during cluster formation is another important issue that affects the formation of high-quality clusters.

Most clustering methods use a fixed number of clusters at the beginning of the clustering process, e.g., the C-means and K-means methods. Methods based on these algorithms are also problematic because the cluster quality depends on the number of clusters and the points considered as cluster centers. Clusters may also be formed with no members (empty clusters).

In the following, we propose several criteria for measuring the cluster quality. The first criterion is based on the inter-cluster and intra-cluster densities. We show that a lower value of this criterion indicates a better-quality cluster, with better inter-cluster and intra-cluster distances. We also introduce another criterion that indicates whether an error has occurred in the clustering process. One of our high-quality clustering criteria is the arrangement of the nodes in a cluster: if the nodes cover the cluster surface more uniformly, then the clusters will be more balanced and the energy consumption by the nodes will decrease. Finally, another criterion represents the balance of the nodes in the clusters. The accuracy of this criterion is proved based on statistical assumptions and the measurement error.

Before introducing these criteria, we explain the clustering process in the following. In the proposed method, CH selection is performed using fuzzy logic and the clusters formed are of the highest quality.


4. Clustering process


In the proposed method, the region where the sensors are distributed is divided into grids of equal size. The centers of gravity are then calculated for each grid and for the sensors in each grid. Two types of clustering are performed, one based on the center of gravity of each grid and the other based on the center of gravity of the sensors in each grid, where these centers of gravity are taken as the initial cluster centers. The process is repeated until the new cluster centers no longer differ from the previous cluster centers. If a grid initially contains no sensors or fewer than three sensors, then its sensors are added to the nearest adjacent grid. This avoids empty clusters and clusters with very few sensors, and the resulting clusters have the lowest intra-cluster distance. The distance between the center of gravity of the grid and that of the sensors in each grid is called the confidence interval and is denoted by μ0. After both clusterings have been performed, the quality of the clusters is measured using the three criteria and the higher-quality clustering is selected. In this approach, measuring the cluster quality is more important than the clustering process itself. These criteria can also be used to measure the quality of the clusters produced by previously proposed methods or by future methods.
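The grid step described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `grid_clusters`, the square monitoring area, and the choice of the nearest sufficiently populated cell (measured between grid centers) as the "nearest adjacent grid" are all hypothetical.

```python
import math
from collections import defaultdict


def grid_clusters(sensors, area, cell, min_size=3):
    """Group sensors by square grid cell and merge sparse cells into a neighbour.

    sensors: list of (x, y) positions; area: side length of the square region;
    cell: side length of one grid cell. Cells with fewer than min_size sensors
    hand their sensors to the nearest cell that has at least min_size sensors
    (an assumed reading of "nearest adjacent grid").
    """
    per_side = max(1, math.ceil(area / cell))
    cells = defaultdict(list)
    for x, y in sensors:
        col = min(int(x // cell), per_side - 1)
        row = min(int(y // cell), per_side - 1)
        cells[(col, row)].append((x, y))  # empty cells never appear here

    def grid_center(key):
        return ((key[0] + 0.5) * cell, (key[1] + 0.5) * cell)

    dense = {k: list(v) for k, v in cells.items() if len(v) >= min_size}
    if not dense:
        raise ValueError("no grid cell holds at least min_size sensors")
    for key, pts in cells.items():
        if key not in dense:
            nearest = min(dense, key=lambda d: math.dist(grid_center(key), grid_center(d)))
            dense[nearest].extend(pts)

    result = {}
    for key, pts in dense.items():
        sensor_center = (sum(p[0] for p in pts) / len(pts),
                         sum(p[1] for p in pts) / len(pts))
        gc = grid_center(key)
        result[key] = {
            "grid_center": gc,              # center of gravity of the grid
            "sensor_center": sensor_center,  # center of gravity of the sensors
            "mu0": math.dist(gc, sensor_center),
            "members": pts,
        }
    return result


example = grid_clusters([(1, 1), (2, 2), (3, 3), (11, 1)], area=20, cell=10)
```

In the small example, the single sensor at (11, 1) sits alone in its cell and is therefore merged into the neighbouring cell, so only one cluster remains.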


4.1. First criterion: intra-cluster and inter-cluster densities


The set of cluster centers is denoted by $\bar{O} = \{\partial^{n} \mid n = 1, 2, \ldots, k\}$, where $k$ is the number of clusters and $\partial^{n}$ is the center of the $n$th cluster. The set of sensors that are not cluster centers is denoted by $O = \{\partial_i \mid i = 1, 2, \ldots, N - |\bar{O}|\}$, where $N$ is the total number of clustered sensors and $N - |\bar{O}|$ is the total number of nodes excluding the cluster centers. For example, if we have 100 sensors and five clusters, then $k = 5$ and $N = 100$. In addition, $\bar{O}$ and $O$ are defined as follows.

$$\bar{O} = \{\partial^{n} \mid n = 1, 2, 3, 4, 5\} = \{\partial^{1}, \partial^{2}, \partial^{3}, \partial^{4}, \partial^{5}\}$$

$$O = \{\partial_i \mid i = 1, 2, \ldots, 95\} = \{\partial_1, \partial_2, \ldots, \partial_{95}\}$$

The cluster quality is defined as follows:

$$\Delta(k) = \frac{1}{k} \sum_{n=1}^{k} \min_{\substack{1 \le m \le k \\ n \ne m}} \left( \frac{\delta_m + \delta_n}{\delta_{mn}} \right), \qquad (6)$$

where $\delta_n$ indicates the average similarity between the cluster center $\partial^{n}$ and all members of the cluster $O^{n}$, $\delta_m$ indicates the average similarity between the cluster center $\partial^{m}$ and all members of the cluster $O^{m}$, and $\delta_{mn}$ is the similarity between the $\partial^{n}$ and $\partial^{m}$ clusters. The terms $\delta_n$, $\delta_m$, and $\delta_{mn}$ are defined as follows:

$$\delta_n = \frac{1}{|O^{n}|} \sum_{\partial_i \in O^{n}} d(\partial_i, \partial^{n}) \qquad (7)$$

$$\delta_m = \frac{1}{|O^{m}|} \sum_{\partial_j \in O^{m}} d(\partial_j, \partial^{m}) \qquad (8)$$

$$\delta_{mn} = d(\partial^{n}, \partial^{m}), \qquad (9)$$

where $|O^{n}|$ and $|O^{m}|$ are the numbers of members of the $n$th and $m$th clusters, respectively, $\sum_{\partial_i \in O^{n}} d(\partial_i, \partial^{n})$ and $\sum_{\partial_j \in O^{m}} d(\partial_j, \partial^{m})$ are the total Euclidean distances of the sensors in the $n$th and $m$th clusters from their own cluster centers, $d(\partial^{n}, \partial^{m})$ is the total Euclidean distance of the cluster centers from each other, and $\delta_{mn}$ is the similarity between clusters $m$ and $n$. The similarity is based on the proximity in terms of the Euclidean distance to the cluster centers.

For example, consider the distribution of sensors and clusters shown in Fig. 5. In this example, the sensors with coordinates (1, 24), (2, 20), (5, 22), and (4, 15) are the members of cluster 1, and the sensors with coordinates (3, 13), (1, 5), (5, 3), (3, 3), and (4, 8) are the members of cluster 2. In addition, the sensors with coordinates (16, 11), (20, 10), (17, 19), (15, 5), (9, 5), and (12, 10) are the members of cluster 3. Thus, $k = 3$ and $N = 100$. According to the definitions given above, we have the following.

$$\bar{O} = \{\partial^{n} \mid n = 1, 2, 3\} = \{(3, 20.25), (3.2, 6.4), (14.8333, 10)\}$$


Fig. 5. Distribution of nodes in the hypothetical clusters.


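$$O = \{\partial_i \mid i = 1, 2, \ldots, 97\} = \{\partial_1, \partial_2, \ldots, \partial_{97}\}$$

The average distance $\delta_n$ between the members of each cluster and its center is as follows.

$$\delta_1 = \frac{1}{|O^{1}|} \sum_{\partial_i \in O^{1}} d(\partial_i, \partial^{1}) = \frac{1}{4} \Big[ \sqrt{(1 - 3)^2 + (24 - 20.25)^2} + \sqrt{(2 - 3)^2 + (20 - 20.25)^2} + \sqrt{(5 - 3)^2 + (22 - 20.25)^2} + \sqrt{(4 - 3)^2 + (15 - 20.25)^2} \Big] = 3.3207$$

$$\delta_2 = \frac{1}{|O^{2}|} \sum_{\partial_i \in O^{2}} d(\partial_i, \partial^{2}) = \frac{1}{5} \Big[ \sqrt{(3 - 3.2)^2 + (13 - 6.4)^2} + \sqrt{(1 - 3.2)^2 + (5 - 6.4)^2} + \sqrt{(5 - 3.2)^2 + (3 - 6.4)^2} + \sqrt{(3 - 3.2)^2 + (3 - 6.4)^2} + \sqrt{(4 - 3.2)^2 + (8 - 6.4)^2} \Big] = 3.6505$$

$$\delta_3 = \frac{1}{|O^{3}|} \sum_{\partial_i \in O^{3}} d(\partial_i, \partial^{3}) = \frac{1}{6} \Big[ \sqrt{(16 - 14.8333)^2 + (11 - 10)^2} + \sqrt{(20 - 14.8333)^2 + (10 - 10)^2} + \sqrt{(17 - 14.8333)^2 + (19 - 10)^2} + \sqrt{(15 - 14.8333)^2 + (5 - 10)^2} + \sqrt{(9 - 14.8333)^2 + (5 - 10)^2} + \sqrt{(12 - 14.8333)^2 + (10 - 10)^2} \Big] = 5.2466$$

The separation term $\delta_{mn}$ between the cluster centers is

$$\delta_{mn} = d(\partial^{n}, \partial^{m}) = \sqrt{(3 - 3.2)^2 + (20.25 - 6.4)^2 + (3 - 14.8333)^2 + (20.25 - 10)^2 + (3.2 - 14.8333)^2 + (6.4 - 10)^2} = 24.1919$$

and the first criterion $\Delta(k)$ is therefore

$$\Delta(k) = \frac{1}{k} \sum_{n=1}^{k} \min_{\substack{1 \le m \le k \\ n \ne m}} \left( \frac{\delta_m + \delta_n}{\delta_{mn}} \right) = \frac{1}{3} \min \left\{ \frac{\delta_1 + \delta_2}{\delta_{mn}}, \frac{\delta_1 + \delta_3}{\delta_{mn}}, \frac{\delta_2 + \delta_3}{\delta_{mn}} \right\} = \frac{1}{3} \min\{0.2882, 0.3541, 0.3678\} = 0.0961$$

Next, we assume that the clusters are formed as shown in Fig. 6. Repeating the calculation given above gives the following parameters.

$$\delta_1 = 4.1273, \qquad \delta_2 = 3.0826, \qquad \delta_3 = 4.4782$$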


Fig. 6. Distribution of nodes in the hypothetical clusters.


Fig. 7. Distribution of nodes in the hypothetical clusters.


$$\delta_{mn} = 24.5112, \qquad \Delta(k) = 0.0980$$

Comparing the values of the first criterion for the first and second cases (0.0961 and 0.0980), we conclude that the first clustering has better quality because its criterion value is lower. Thus, the intra-cluster and inter-cluster distances are better in the first case than in the second case.

In the next example, we assume that the sensors are distributed as shown in Fig. 7. Then, we have the following.

$$\partial^{1} = (2.875, 4.625), \qquad \partial^{2} = (3, 1.8571), \qquad \partial^{3} = (7.75, 2.625)$$

$$\delta_1 = 1.6815, \qquad \delta_2 = 1.6993, \qquad \delta_3 = 1.4281, \qquad \delta_{mn} = 12.8517, \qquad \Delta(k) = 0.0807$$

Next, we assume that the clusters are formed as shown in Fig. 8. In this case, the first criterion is equal to 0.07964, which indicates that the clusters are better in the first case.

During clustering, entropy is considered as an external criterion and its value ranges between zero and one. The cluster quality is lower when the entropy value is closer to one.
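The arithmetic of the Fig. 5 example can be checked with a short script. The sketch below is an illustration rather than the authors' implementation. The function names are hypothetical, and the computation follows the worked example rather than a literal reading of equation (6): a single separation term is computed as the square root of the summed squared center-to-center distances, and the minimum is taken over cluster pairs before dividing by k. The member coordinates are those used in the Fig. 5 calculation.

```python
import math
from itertools import combinations


def centroid(points):
    """Center of gravity of a list of (x, y) points."""
    return (sum(p[0] for p in points) / len(points),
            sum(p[1] for p in points) / len(points))


def first_criterion(clusters):
    """First quality criterion, evaluated as in the worked Fig. 5 example."""
    k = len(clusters)
    centers = [centroid(c) for c in clusters]
    # Average Euclidean distance of each cluster's members to its own center.
    intra = [sum(math.dist(p, c) for p in pts) / len(pts)
             for pts, c in zip(clusters, centers)]
    # One separation term shared by all pairs, as in the worked example.
    inter = math.sqrt(sum(math.dist(a, b) ** 2 for a, b in combinations(centers, 2)))
    best_pair = min((intra[m] + intra[n]) / inter
                    for m, n in combinations(range(k), 2))
    return best_pair / k


fig5 = [
    [(1, 24), (2, 20), (5, 22), (4, 15)],
    [(3, 13), (1, 5), (5, 3), (3, 3), (4, 8)],
    [(16, 11), (20, 10), (17, 19), (15, 5), (9, 5), (12, 10)],
]
value = first_criterion(fig5)  # about 0.0961
```

Evaluating the same function on the two clusterings produced by HQCA-WSN and keeping the one with the lower value mirrors the comparison made in the text.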


Fig. 8. Distribution of nodes in the hypothetical clusters.


Table 4
Error table.

            Cluster A   Cluster B
Cluster A   a           b
Cluster B   c           d


Definition 1. If $c$ and $p_i$ are the number of clusters and the probability of data belonging to the $i$th cluster, respectively, then the entropy is equal to:

$$\text{Entropy} = -\sum_{i=1}^{c} p_i \log(p_i). \qquad (10)$$

Lemma 1. Entropy is always between zero and one, and never negative.

Proof. Suppose that $0 \le p_i \le 1$, so that $\log(p_i) \le 0$. Thus, $p_i \log(p_i)$ is never positive. Therefore, the entropy value is never negative and its value ranges between 0 and 1. □

The clustering entropy in Example 1 for the first mode (Fig. 5) is as follows.

$$\text{Entropy} = -\frac{4}{15} \log_{10} \frac{4}{15} - \frac{5}{15} \log_{10} \frac{5}{15} - \frac{6}{15} \log_{10} \frac{6}{15} = 0.4713$$

The clustering entropy in Example 1 for the second mode (Fig. 6) is as follows.

$$\text{Entropy} = -\frac{5}{15} \log_{10} \frac{5}{15} - \frac{5}{15} \log_{10} \frac{5}{15} - \frac{5}{15} \log_{10} \frac{5}{15} = 0.4771$$

Therefore, the clustering entropy is lower in the first case, and thus the quality is higher. After calculating the clustering entropy in Example 2, we also find that the clustering quality is better in the first case than in the second case. Thus, low values of the entropy and of the first criterion indicate that the clustering quality is better. Therefore, we can demonstrate the accuracy of the first criterion by calculating the entropy.

4.2. Second criterion: error resulting from clustering (α)

As mentioned previously, HQCA-WSN performs two types of clustering, where one is based on the center of gravity for the sensors in each grid and the other is based on the center of gravity for the grid. After forming both types of clusters, we apply the first criterion and the entropy criterion to select one of the two clusterings; the clustering with the lowest values for both criteria is selected. The clustering error is denoted by α, which is calculated using a table called the error table. Table 4 shows the error table for two clusters. This table can be expanded to any number of clusters.
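As a check on the entropy values quoted for Figs. 5 and 6, and to illustrate the selection step, the following sketch computes the entropy from the cluster sizes. It assumes that $p_i$ is the fraction of the clustered sensors assigned to cluster $i$ and that the logarithm is taken to base 10, which is what reproduces the quoted values. The function name and the dictionary of candidates are hypothetical.

```python
import math


def clustering_entropy(sizes):
    """Entropy of a clustering given the number of members in each cluster."""
    total = sum(sizes)
    # Empty clusters contribute nothing (p * log p tends to 0 as p tends to 0).
    return -sum((s / total) * math.log10(s / total) for s in sizes if s > 0)


# Cluster sizes of the two candidate clusterings in Example 1.
candidates = {"Fig. 5": [4, 5, 6], "Fig. 6": [5, 5, 5]}
entropies = {name: clustering_entropy(sizes) for name, sizes in candidates.items()}

# The candidate with the lower entropy is preferred.
selected = min(entropies, key=entropies.get)
```

With these sizes the entropies come out at roughly 0.4713 and 0.4771, so the Fig. 5 clustering is the one selected.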


In Table 4, parameter a is the number of sensors that belong to cluster A and are correctly grouped in cluster A. Similarly, parameter d is the number of sensors that belong to cluster B and are correctly grouped in cluster B. Parameter c is the number of sensors in cluster B that are mistakenly grouped in cluster A. Finally, parameter b is the number of sensors in cluster A that are mistakenly grouped in cluster B. According to the error table, the clustering error is defined as follows:

$$\alpha = \frac{b + c}{a + b + c + d}, \qquad (11)$$

where (b + c) represents the total number of members grouped mistakenly with the other clusters.

Lemma 2. If $\mu \le \sqrt{\dfrac{\text{Nodes Number}}{\text{Clusters Number}}}$, then the distribution of the sensors in a cluster is balanced or uniform.

Proof. Suppose that the value of $\sqrt{\text{Nodes Number} / \text{Clusters Number}}$ is always equal to a constant number denoted by $\mu_0$, as follows.

$$\mu_0 = \sqrt{\frac{\text{Nodes Number}}{\text{Clusters Number}}} \qquad (12)$$

According to the statistical hypothesis, the accuracy of Lemma 2 can be determined at the significance level α (or error). □

Based on the statistical hypothesis, we can define the following hypotheses.

$$H_0: \mu \le \mu_0, \qquad H_1: \mu > \mu_0 \qquad (13)$$

In equation (13), $H_0$ is the null hypothesis and $H_1$ is the alternative hypothesis. The sensor distributions follow either the Z-distribution or Student's t-distribution. If the number of sensors is less than or equal to 30, the distribution of the sensors follows Student's t-distribution. If the number of sensors is greater than 30, then according to the central limit theorem, the distribution of the sensors follows the Z-distribution. Therefore, if the number of sensors is denoted by $n$, we have the following two cases.

• If $n \le 30$, then the distribution of the sensors follows Student's t-distribution and the test statistic is defined as follows.

$$t = \frac{\bar{X} - \mu_0}{S_{\bar{X}}} \qquad (14)$$

• If $n > 30$, then the distribution of the sensors follows the Z-distribution and the test statistic is defined as follows.

$$z = \frac{\bar{X} - \mu_0}{S_{\bar{X}}} \qquad (15)$$

In equation (15), $S_{\bar{X}}$ is equal to:

$$S_{\bar{X}} = \frac{T}{\sqrt{n}}, \qquad (16)$$

where $T$ and $\bar{X}$ represent the standard deviation and the average number of sensors in each cluster, respectively. After specifying the values defined above, the critical values (the boundaries of $H_0$ and $H_1$) should be specified based on α, as shown in Appendixes A and B. For clarity, each of the above definitions is shown on the curve in Fig. 9. If the test statistic is in the acceptance zone for $H_0$, then $H_0$ is accepted; otherwise, $H_0$ is rejected.
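Equations (11) and (12) translate directly into code. The sketch below is illustrative only: the function names are hypothetical, the error table is generalized to any number of clusters by treating every off-diagonal entry as a misgrouped sensor, and the 2 × 2 counts are made-up values rather than results from the paper.

```python
import math


def clustering_error(table):
    """Error rate alpha from a square error table (rows and columns are clusters).

    Diagonal entries count correctly grouped sensors; all other entries
    count sensors grouped with the wrong cluster.
    """
    total = sum(sum(row) for row in table)
    correct = sum(table[i][i] for i in range(len(table)))
    return (total - correct) / total


def balance_threshold(nodes_number, clusters_number):
    """mu_0 of equation (12): sqrt(Nodes Number / Clusters Number)."""
    return math.sqrt(nodes_number / clusters_number)


# Hypothetical error table in the layout of Table 4: [[a, b], [c, d]].
alpha = clustering_error([[18, 1], [2, 19]])   # (1 + 2) / 40
mu0 = balance_threshold(200, 9)                # 200 nodes in 9 clusters
```

For 200 nodes in nine clusters the threshold evaluates to about 4.7140.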


Fig. 9. Critical values based on the parameter α.


Fig. 10. Fuzzy set of energy levels.


5. CH selection


In the proposed protocol, CHs are selected using fuzzy logic. Fuzzy logic involves a fuzzifier, inference system, rule base, and defuzzifier. The input of the fuzzy system is usually a crisp value, which is converted into a suitable fuzzy variable. The fuzzified values are sent to the fuzzy decision block (FDB), which comprises fuzzy rules and a fuzzy inference system. The FDB maps the fuzzy output based on the fuzzy rules. Finally, the fuzzy output is converted into a crisp output using defuzzification approaches.

CHs are selected based on the remaining energy in the sensors, the maximum and minimum distances of the sensors to the BS, the lowest and highest amounts of energy per cluster of sensors, the cluster quality criteria, the distribution of sensors in the cluster, the mean distance of sensors in the cluster, and the cluster density. We propose four total energy levels comprising low, medium, high, and very high. These levels are the fuzzy linguistic values for the total energy. Fig. 10 shows the fuzzy set of energy levels. According to Fig. 10, the membership functions of the fuzzy set are defined as follows.

$$\text{Low} = \begin{cases} 1, & \text{Energy} \le 0.25 \\ \dfrac{0.35 - \text{Energy}}{0.1}, & 0.25 < \text{Energy} \le 0.35 \end{cases} \qquad (17)$$

$$\text{Medium} = \begin{cases} \dfrac{\text{Energy} - 0.25}{0.25}, & 0.25 < \text{Energy} \le 0.5 \\ \dfrac{0.6 - \text{Energy}}{0.1}, & 0.5 < \text{Energy} \le 0.6 \end{cases} \qquad (18)$$

$$\text{High} = \begin{cases} \dfrac{\text{Energy} - 0.5}{0.3}, & 0.5 < \text{Energy} \le 0.8 \\ \dfrac{0.89 - \text{Energy}}{0.09}, & 0.8 < \text{Energy} \le 0.89 \end{cases} \qquad (19)$$


Fig. 11. Fuzzy set of distance levels.


Fig. 12. Minimum and maximum distances to BS.


$$\text{Very High} = \begin{cases} \dfrac{\text{Energy} - 0.8}{0.09}, & 0.8 < \text{Energy} \le 0.89 \\ 1, & \text{Energy} > 0.89 \end{cases} \qquad (20)$$
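The four energy membership functions in equations (17)–(20) can be written as a single function. This is a minimal sketch: the function name is hypothetical, and returning a membership of zero outside the intervals listed in each equation is an assumption, since the equations only specify the non-zero branches.

```python
def energy_memberships(energy):
    """Membership degrees of a crisp energy value in the four energy levels."""
    if energy <= 0.25:
        low = 1.0
    elif energy <= 0.35:
        low = (0.35 - energy) / 0.1
    else:
        low = 0.0  # assumed: zero outside the ranges of equation (17)

    if 0.25 < energy <= 0.5:
        medium = (energy - 0.25) / 0.25
    elif 0.5 < energy <= 0.6:
        medium = (0.6 - energy) / 0.1
    else:
        medium = 0.0

    if 0.5 < energy <= 0.8:
        high = (energy - 0.5) / 0.3
    elif 0.8 < energy <= 0.89:
        high = (0.89 - energy) / 0.09
    else:
        high = 0.0

    if 0.8 < energy <= 0.89:
        very_high = (energy - 0.8) / 0.09
    elif energy > 0.89:
        very_high = 1.0
    else:
        very_high = 0.0

    return {"low": low, "medium": medium, "high": high, "very_high": very_high}


levels = energy_memberships(0.3)  # partly "low", partly "medium"
```

A sensor with an energy of 0.3 therefore belongs to the low level with degree 0.5 and to the medium level with degree 0.2.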


Fig. 11 shows the different fuzzy levels (or membership functions) for the distances of sensors to the BS in each cluster, i.e., near, average, and far. The membership functions for the distance parameter are defined as follows:

$$\text{Near} = \begin{cases} 1, & x \le c_1 \\ \dfrac{L - x}{L - c_1}, & c_1 < x \le L \end{cases} \qquad (21)$$

$$\text{Average} = \begin{cases} \dfrac{x - c_1}{L - c_1}, & c_1 < x \le L \\ \dfrac{c_2 - x}{c_2 - L}, & L < x \le c_2 \end{cases} \qquad (22)$$

$$\text{Far} = \begin{cases} \dfrac{x - L}{c_2 - L}, & L < x \le c_2 \\ 1, & x > c_2 \end{cases} \qquad (23)$$

In the equations above, $c_1$ and $c_2$ are the minimum and maximum sensor distances to the BS, respectively, $x$ is the sensor's distance to the BS, and $L$ is the average distance to the BS, which is calculated as follows.

$$L = (c_1 + c_2)/2 \qquad (24)$$
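The distance levels in equations (21)–(24) can be sketched in the same way. The function name is hypothetical, memberships are assumed to be zero outside the listed intervals, and the values of c1 and c2 in the example are illustrative rather than taken from the simulations.

```python
def distance_memberships(x, c1, c2):
    """Membership degrees of a sensor-to-BS distance x in near/average/far.

    c1 and c2 are the minimum and maximum sensor distances to the BS in the
    cluster; L is their midpoint, as in equation (24).
    """
    if not c1 < c2:
        raise ValueError("c1 must be smaller than c2")
    mid = (c1 + c2) / 2  # L in equation (24)

    if x <= c1:
        near = 1.0
    elif x <= mid:
        near = (mid - x) / (mid - c1)
    else:
        near = 0.0  # assumed: zero outside the ranges of equation (21)

    if c1 < x <= mid:
        average = (x - c1) / (mid - c1)
    elif mid < x <= c2:
        average = (c2 - x) / (c2 - mid)
    else:
        average = 0.0

    if mid < x <= c2:
        far = (x - mid) / (c2 - mid)
    elif x > c2:
        far = 1.0
    else:
        far = 0.0

    return {"near": near, "average": average, "far": far}


degrees = distance_memberships(20, c1=10, c2=50)  # L = 30
```

With c1 = 10 and c2 = 50, a sensor at distance 20 is near with degree 0.5 and average with degree 0.5.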


Fig. 12 shows the parameters $c_1$ and $c_2$. Fig. 13 shows the different fuzzy levels used for choosing the optimal CH. The selected CHs are the sensors with the highest energy and the shortest distance to the BS. Thus, in each cluster, several sensors are candidates for being a CH, and the sensor that best satisfies the conditions is selected as the final CH. In Fig. 13, $T_1$ is the product of the minimum energy and the maximum distance to the BS, and $T_2$ is the product of the maximum energy and the minimum distance to the BS. Thus, we have the following.

$$T_1 = E_{\min} \ast Dis_{\max}, \qquad T_2 = E_{\max} \ast Dis_{\min} \qquad (25)$$

Table 5 shows the fuzzy rules employed for selecting tentative CHs. These rules are written based on the Mamdani approach.


Fig. 13. Fuzzy diagram for final CH selection.


Table 5
Fuzzy rules for CH selection.

Number  Rule
1       If Energy is “Very High” ∧ distance to BS is “Nearest,” then CH is “Candidate”
2       If Energy is (“Very High” ∨ “High”) ∧ distance to BS is “Nearest,” then CH is “Candidate”
3       If Energy is (“Very High” ∨ “High”) ∧ distance to BS is “Near,” then CH is “Candidate”
4       If Energy is “Very High” ∧ distance to BS is “Near,” then CH is “Candidate”
5       If Energy is “Medium” ∧ distance to BS is “Nearest,” then CH is “Candidate”
6       If Energy is “Medium” ∧ distance to BS is (“Near” ∨ “Average”), then CH is “Candidate”
7       If Energy is “Very High” ∧ distance to BS is “Average,” then CH is “Candidate”
8       If Energy is “Medium” ∧ distance to BS is “Nearest,” then CH is “Candidate”
9       If Energy is (“Medium” ∨ “High”) ∧ distance to BS is “Near,” then CH is “Candidate”
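One common way to evaluate rules of this form in a Mamdani-style system is to take the minimum for ∧, the maximum for ∨, and the maximum over all rules that share a consequent. The sketch below applies that convention to the rules of Table 5 that use the Near and Average distance levels; the Nearest level is left out because its membership function is not given in this section. The rule encoding, the function name, and the membership values in the example are hypothetical, and the aggregation convention is an assumption rather than a detail confirmed by the paper.

```python
# Each rule: (energy labels joined by OR, distance labels joined by OR).
# Only the Table 5 rules that use the "near" and "average" levels are encoded.
RULES = [
    (("very_high", "high"), ("near",)),       # rule 3
    (("very_high",), ("near",)),              # rule 4
    (("medium",), ("near", "average")),       # rule 6
    (("very_high",), ("average",)),           # rule 7
    (("medium", "high"), ("near",)),          # rule 9
]


def candidate_strength(energy, distance):
    """Degree to which a sensor is a CH candidate.

    energy and distance map level names to membership degrees.
    OR is evaluated with max, AND with min, and the rule outputs
    (all "Candidate") are aggregated with max.
    """
    strengths = []
    for energy_labels, distance_labels in RULES:
        e = max(energy.get(label, 0.0) for label in energy_labels)
        d = max(distance.get(label, 0.0) for label in distance_labels)
        strengths.append(min(e, d))
    return max(strengths)


# Hypothetical fuzzified inputs for two sensors in one cluster.
sensors = {
    "s1": ({"medium": 0.2, "high": 0.6, "very_high": 0.0}, {"near": 0.5, "average": 0.5}),
    "s2": ({"medium": 0.0, "high": 0.1, "very_high": 0.9}, {"near": 0.8, "average": 0.2}),
}
scores = {name: candidate_strength(e, d) for name, (e, d) in sensors.items()}
best = max(scores, key=scores.get)
```

The sensor with the highest aggregated strength is the natural choice for the final CH under this reading.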


We emphasize that in the proposed method, the quality of the clusters formed is more important than the clustering process. Accordingly, in the HQCA-WSN protocol, we provide three new criteria for measuring the cluster quality. Algorithm 1 presents the HQCA-WSN protocol.

6. Simulation results


In this study, simulations were conducted with MATLAB. The initial energy of the sensors ranged between 0.2 J and 0.9 J. The sensors were randomly distributed in a square measuring n ∗ n. The simulation parameters are shown in Table 6. The efficiency of the proposed algorithm was evaluated in terms of the following parameters and the results were compared with those obtained in previous studies.

• Network lifetime: One of the main goals of the proposed method is to increase the network lifetime. The two main parameters that reflect the lifetime of WSNs are first node dies (FND) and last node dies (LND). Higher values for these two parameters indicate a longer network lifetime.
• The number of CHs: The energy consumption in each round is better when the CHs selected in that round are more balanced.
• Improving the initial energy level for network setup: During the initial setup of the network, assigning low energy to each sensor is highly effective because it reduces the total amount of network energy in the initial setup.
• Improving the energy consumption in each cluster: The energy consumption in each cluster should be improved in order to minimize the average energy in the entire network.
• Evaluating the cluster quality using the confidence interval.
• Determining the number of clusters using μ and α.
• Evaluating the performance of large-scale networks with a large number of sensors.
• Assessing the network performance with different BS positions.
• Computational complexity of the proposed method.


Algorithm 1 HQCA-WSN protocol.

Distribute nodes randomly and uniformly;
Set up the environments for the nodes with equal size (grid);
For (i = 1; i ≤ grid_number; i++)
    Counter = Number of grid_nodes[i];
    If (Counter == “empty”)
        Delete the grid.
    End if;
    If (Counter < 3)
        The nodes in the grid are added to the nearest adjacent grid.
    End if;
End for;
Calculate the initial centers;
Flag = true;
While (Flag == true)
    Calculate the new centers using the distance between the nodes in each grid from the center of gravity for the grid and to the base station;
    Start clustering;
    New centers = Calculate the new centers using the distance between the nodes in each grid to the center of gravity for the grid and to the base station;
    Calculate the first cluster quality criterion;
    Calculate the second cluster quality criterion (α);
    Calculate the entropy;
    Perform Lemma 1 and Lemma 2, and confirm one of the clustering processes in terms of the entropy, first criterion, and α;
    If (New centers == Initial centers)
        Flag = false;
    End if;
End while;
Select the optimal CH using fuzzy logic and rules (Table 5);
Start the transmission phase;
End.
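The While loop of Algorithm 1 can be read as a fixed-point iteration on the cluster centers. The following sketch shows one way to write such a loop; it is not the authors' code. It assumes that each sensor joins the nearest center by Euclidean distance and that each center is then moved to the center of gravity of its members. The algorithm also mentions the distance to the base station, which is not modeled here. The function name and the `max_rounds` safeguard are hypothetical.

```python
import math


def refine_centers(sensors, centers, max_rounds=100):
    """Repeat assignment and center updates until the centers stop changing."""
    centers = list(centers)
    groups = [[] for _ in centers]
    for _ in range(max_rounds):
        # Assign every sensor to its nearest current center.
        groups = [[] for _ in centers]
        for p in sensors:
            idx = min(range(len(centers)), key=lambda i: math.dist(p, centers[i]))
            groups[idx].append(p)
        # Move each center to the center of gravity of its members;
        # a center that attracts no sensors is left where it is.
        new_centers = [
            (sum(x for x, _ in g) / len(g), sum(y for _, y in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
        if new_centers == centers:  # same stopping test as Flag = false
            break
        centers = new_centers
    return centers, groups


final_centers, final_groups = refine_centers(
    [(0, 0), (0, 2), (10, 0), (10, 2)], [(1, 1), (9, 1)]
)
```

In HQCA-WSN the loop would be started twice, once from the grid centers of gravity and once from the sensor centers of gravity, and the quality criteria would then decide between the two results.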

Table 6
Simulation parameters.

Parameter name                          Parameter value
Number of nodes                         50, 100, 200, 300, 500
Network size (monitoring area)          100 ∗ 100, 200 ∗ 200, 300 ∗ 300, 500 ∗ 500
Initial energy                          Rand [0.2–0.9]
Relative position of the base station   Variable
Simulation end condition                Number of nodes < 5
Eelect                                  50 nJ/bit
εfs                                     10 pJ/bit/m²
εmp                                     0.0013 pJ/bit/m⁴
Data aggregation energy                 5 nJ/bit/signal
d0                                      87 m

6.1. Network lifetime

Fig. 14 shows the energy consumption with the proposed method (HQCA-WSN). Figs. 15 and 16 show the energy consumption with the LEACH, LEACH-SWDN, LEACH-DCHS, ALEACH, HUCL, EAUCF, LEACH-C, CBL, PEGASIS, O-LEACH, GADA-LEACH, and MOFCA protocols. The proposed method significantly improved the network lifetime as well as the FND and LND parameters. Fig. 17 shows that the HQCA-WSN method significantly improved the FND and LND metrics.

6.2. Number of CHs

Selecting appropriate CHs is important for reducing the energy consumption in WSNs. In particular, selecting a node as a CH in a high-density area rather than a low-density area as well as the overall coverage of the cluster by CHs


Fig. 14. Network lifetime with HQCA-WSN protocol.


Fig. 15. Network lifetime with LEACH, LEACH-SWDN, LEACH-DCHS, and ALEACH protocols.


Fig. 16. Network lifetime with HUCL, EAUCF, LEACH-C, CBL, PEGASIS, O-LEACH, GADA-LEACH, and MOFCA protocols. (For interpretation of the colors in the figure(s), the reader is referred to the web version of this article.)


will greatly increase the network lifetime and reduce the energy consumption because the presence of more CHs in a high-density area reduces the total distance between the sensors and the CHs, which enhances the network efficiency. Fig. 18 shows an example with a random distribution of nodes. Because of the random distribution, the density of the nodes is higher in some areas of the network than in others. Thus, more CHs are selected in high-density areas


Fig. 17. FND and LND metrics with HQCA-WSN and other protocols.


Fig. 18. Random distribution of nodes and dense regions.


and the performance of the algorithm will be better. Figs. 19, 20, and 21 show examples of CH selection, where more CHs are clearly selected in high-density areas. Fig. 19 shows the simulation results obtained in a square measuring 200 m × 200 m with nine clusters and 100 nodes. In Fig. 19, the white areas are regions where the density of the nodes is higher than in other areas, and thus more of the nine clusters are selected in these areas. Fig. 20 shows the simulation results obtained with 16 clusters and 100 nodes. According to Fig. 20, the HQCA-WSN protocol has high scalability with respect to the number of nodes, the network size, the number of clusters, and the density. Fig. 21 shows the simulation results obtained with 200 nodes, where 16 CHs are selected (red areas). Thus, the proposed algorithm exhibits better scalability in large-scale networks than other methods: if the number of nodes is high in high-density areas, more clusters are selected in these areas. Selecting appropriate cluster centers in each round is important for the formation of optimal clusters. In particular, a more balanced selection of cluster centers leads to the formation of better final clusters in terms of the inter-cluster and intra-cluster distances. Fig. 22 shows the selection of cluster centers in each round, where it is clear that the HQCA-WSN method achieves an appropriate balance in the selection of cluster centers.


Fig. 19. CH selection in a network with nine clusters.


Fig. 20. CH selection with 100 nodes and 16 clusters.


6.3. Improved energy consumption in the overall network and each cluster


The network setup will be more cost-effective if the initial amount of energy is lower for each node. Table 7 shows the initial amounts of energy used when forming various networks. According to Table 7, the proposed method requires less initial energy than previously reported methods. Fig. 23 shows the energy consumption in each cluster. According to Fig. 23, the energy consumption in each cluster is balanced in an appropriate manner, where the highest energy consumption is between 0.4 J and 0.6 J. As shown in Table 7, the total amount of energy during the initial setup of the network is very low when using the HQCA-WSN protocol compared with other methods, which also explains why the proposed method works well in high-density areas.

6.4. Determination of the number of clusters and node balance in each cluster


Selecting appropriate clusters is important for reducing the energy consumption. In the HQCA-WSN method, we introduce a criterion called the confidence interval (μ0 ), which is equal to the distance of the center of gravity for the


Fig. 21. CH selection with 200 nodes and 16 clusters.


Fig. 22. Selection of cluster centers in each round.


Table 7
Initial energy in HQCA-WSN and similar protocols.

Protocol name   Number of nodes   Initial energy of nodes   Total energy
LEACH-SWDN      100               5 J                       500 J
CBL             100               1 J                       100 J
O-LEACH         100               0.5 J                     50 J
DCHS            100               2 J                       200 J
ALEACH          100               2 J                       200 J
HUCL            100               2 J                       200 J
PECRP           100               10 J                      1000 J
ECPF            100               2 J                       200 J
PEGASIS         100               1 J                       100 J
CBCCP           100               0.5 J                     50 J
OCM-FCM         100               2 J                       200 J
FUZZY-TOPSIS    100               0.5 J                     50 J
ERA             200               0.5 J                     100 J
HQCA-WSN        100               [0.2–0.9] J               45 J


Fig. 23. Energy consumption in each cluster.


Fig. 24. Clustering based on the centers of gravity for the sensors and grid.


sensors minus the center of gravity for the grid. The number of clusters can be determined based on this criterion and the number of sensors. Figs. 24 and 25 show the formation of clusters based on the center of gravity for the sensors in each grid and the center of gravity for the grid, respectively. Clearly, 200 sensors are organized in nine clusters. Table 8 shows the confidence intervals with both clustering methods, where it is clear that the clustering process in cluster 6 has an error, and this error is also clear in Figs. 24 and 25. Fig. 25 shows that a sensor in cluster 6 is incorrectly grouped in cluster 9 and a sensor in cluster 9 is incorrectly grouped in cluster 6. The errors are shown in Table 9. Based on the errors in Table 9 and the explanations given in the previous sections, we obtain the following.


$$\alpha = \frac{1 + 1}{29 + 23 + 19 + 22 + 20 + 18 + 25 + 21 + 23} = \frac{2}{200} = 0.01$$

If we assume that the number of nodes is greater than 30, then according to the central limit theorem, the distribution of the sensors follows the Z-distribution. The value of $\mu_0$ is equal to $\sqrt{200/9}$, or 4.7140. Thus, according to Lemma 2, the hypotheses are defined as follows.

$$H_0: \mu \le 4.7140, \qquad H_1: \mu > 4.7140$$


Fig. 25. Clustering based on the center of gravity for the nodes.


Table 8
Confidence interval in HQCA-WSN.

Cluster number   Clustering based on the center of gravity for the grid   Clustering based on the center of gravity for the nodes
Cluster 1        μ0 = 3.8276                                              μ0 = 3.8276
Cluster 2        μ0 = 6.1155                                              μ0 = 9.2190
Cluster 3        μ0 = 0.3205                                              μ0 = 4.2226
Cluster 4        μ0 = 6.4854                                              μ0 = 7.6785
Cluster 5        μ0 = 0.9014                                              μ0 = 0.9630
Cluster 6        μ0 = 1.8889                                              μ0 = 0.5884
Cluster 7        μ0 = 6.2791                                              μ0 = 8.4677
Cluster 8        μ0 = 1.4134                                              μ0 = 1.4134
Cluster 9        μ0 = 8.5576                                              μ0 = 10.1856


Table 9
Errors with HQCA-WSN.

            Cluster 1  Cluster 2  Cluster 3  Cluster 4  Cluster 5  Cluster 6  Cluster 7  Cluster 8  Cluster 9
Cluster 1   29         0          0          0          0          0          0          0          0
Cluster 2   0          23         0          0          0          0          0          0          0
Cluster 3   0          0          19         0          0          0          0          0          0
Cluster 4   0          0          0          22         0          0          0          0          0
Cluster 5   0          0          0          0          20         0          0          0          0
Cluster 6   0          0          0          0          0          17         0          0          1
Cluster 7   0          0          0          0          0          0          25         0          0
Cluster 8   0          0          0          0          0          0          0          21         0
Cluster 9   0          0          0          0          0          1          0          0          22

34

44 45 46 47 48 49 50 51 52

36 37 38 39 40 41 42

42 43

35
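As a cross-check, the error rate α = 0.01 can be reproduced from the entries of Table 9. The sketch below assumes that the table can be read as a confusion matrix in which off-diagonal entries count incorrectly grouped sensors; the function name `error_rate` is hypothetical.

```python
# Entries of Table 9, one row per cluster (clusters 1 to 9).
confusion = [
    [29, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 23, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 19, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 22, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 20, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 17, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 25, 0, 0],
    [0, 0, 0, 0, 0, 1, 0, 0, 22],
][:5] + [
    [0, 0, 0, 0, 0, 17, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 25, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 21, 0],
    [0, 0, 0, 0, 0, 1, 0, 0, 22],
]


def error_rate(matrix):
    """Fraction of sensors lying off the main diagonal (assumed to be misgrouped)."""
    total = sum(sum(row) for row in matrix)
    on_diagonal = sum(matrix[i][i] for i in range(len(matrix)))
    return (total - on_diagonal) / total


row_sizes = [sum(row) for row in confusion]  # cluster sizes used in the denominator of alpha
alpha = error_rate(confusion)
print(row_sizes, alpha)
```

The row sums recover the cluster sizes 29, 23, 19, 22, 20, 18, 25, 21, and 23 that appear in the denominator of α, and the two off-diagonal entries (clusters 6 and 9) give the numerator 1 + 1.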

The first mode in Fig. 9 is used to describe H0 and H1. For example, we may calculate the values of T, X̄, Z, and S_X̄ for the first cluster with 29 nodes as follows.

\[
\bar{X} = \frac{200}{29} = 6.8965, \qquad T = 99.8459, \qquad S_{\bar{X}} = \frac{T}{\sqrt{n}} = \frac{99.8459}{\sqrt{200}} = 7.0602
\]

\[
z = \frac{\bar{X} - \mu_0}{S_{\bar{X}}} = \frac{6.8965 - 4.7140}{7.0602} = 0.3091
\]
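The arithmetic of this worked example can be checked with a few lines of code. This is a minimal sketch that takes T = 99.8459 as given in the text rather than deriving it; the function name `z_statistic` and the variable names are assumptions made for illustration.

```python
import math


def z_statistic(sample_mean, mu0, t_value, n):
    """Return (z, standard_error), with standard_error = t_value / sqrt(n).

    Mirrors the worked example in the text; `t_value` is taken as given.
    """
    standard_error = t_value / math.sqrt(n)
    return (sample_mean - mu0) / standard_error, standard_error


n_nodes = 200       # total number of sensors
n_clusters = 9      # number of clusters
cluster_size = 29   # size of the first cluster

mu0 = math.sqrt(n_nodes / n_clusters)   # approximately 4.7140
x_bar = n_nodes / cluster_size          # approximately 6.8966 (6.8965 in the text)
z, s_x = z_statistic(x_bar, mu0, 99.8459, n_nodes)

print(round(mu0, 4), round(s_x, 4), round(z, 4))
```

Rounded to four decimal places, the standard error and z agree with the values 7.0602 and 0.3091 in the text; the small difference in the sample mean arises because the text truncates 200/29 rather than rounding it.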


Fig. 26. Decision diagram.


Fig. 27. Quality of HQCA-WSN protocol (μ).

According to Appendices A and B, the statistical value for the error of 0.01 is 0.5040. Fig. 26 shows the graph obtained based on the findings described above. According to Fig. 26 and the values obtained, it is clear that the value obtained for α is above the value of Z. Thus, the value obtained for α is in the H1 area, so assumption H1 is verified. In addition, according to Lemma 2, if the distribution of the sensors is balanced, then μ must be greater than 4.7140. Therefore, we have the following.

\[
\mu > 4.7140 \Rightarrow \sqrt{\frac{\text{Nodes Number}}{\text{Clusters Number}}} > 4.7140 \Rightarrow \sqrt{\frac{200}{\text{Clusters Number}}} > 4.7140
\]

\[
\Rightarrow \frac{200}{\text{Clusters Number}} > 22.221796 \Rightarrow \text{Clusters Number} < \frac{200}{22.221796} \Rightarrow \text{Clusters Number} < 9.000172
\]

Thus, the number of clusters should be less than 9.000172, which indicates that selecting nine clusters at the beginning is a good choice because the largest integer smaller than this value is 9. Fig. 27 shows the clustering quality with both methods, where it is clear that the first method obtains better quality clusters (less μ) than the second method, although a violation occurs in cluster 6. Therefore, the two indexes μ and are the fundamental factors used for evaluating and verifying the quality of clusters.

6.5. Improvement of C-means clustering and selecting better clustering centers

As discussed above, selecting better cluster centers enhances the inter-cluster and intra-cluster distances. An advantage of the HQCA-WSN method is the better selection of cluster centers compared with other methods, especially when compared with the C-means clustering method. Table 10 shows the clustering results obtained with the HQCA-WSN and C-means protocols. Fig. 28 compares the cluster centers obtained with the HQCA-WSN and C-means protocols under different scenarios. Clearly, HQCA-WSN obtains better and more balanced cluster centers.

Table 10
Comparison of HQCA-WSN and C-means.

| Network size | Number of nodes | Number of clusters | Quality criterion ( ) with HQCA-WSN | Quality criterion ( ) with C-means |
|---|---|---|---|---|
| 300 ∗ 300 | 500 | 9 | 4.3523 | 6.60 |
| 300 ∗ 300 | 500 | 4 | 3.5873 | 38.16 |
| 300 ∗ 300 | 300 | 9 | 7.6650 | 9.43 |
| 300 ∗ 300 | 300 | 4 | 18.2551 | 35.01 |
| 300 ∗ 300 | 100 | 9 | 5.5522 | 7.80 |
| 300 ∗ 300 | 100 | 4 | 15.1578 | 35.48 |
| 200 ∗ 200 | 500 | 9 | 3.6887 | 6.39 |
| 200 ∗ 200 | 500 | 4 | 2.4718 | 25.17 |
| 200 ∗ 200 | 300 | 9 | 4.8412 | 5.87 |
| 200 ∗ 200 | 300 | 4 | 1.9009 | 23.00 |
| 200 ∗ 200 | 100 | 9 | 3.9867 | 5.67 |
| 200 ∗ 200 | 100 | 4 | 14.5233 | 25.25 |
| 100 ∗ 100 | 500 | 9 | 2.1425 | 3.02 |
| 100 ∗ 100 | 500 | 4 | 7.6347 | 12.06 |
| 100 ∗ 100 | 300 | 9 | 2.8327 | 3.04 |
| 100 ∗ 100 | 300 | 4 | 6.6699 | 12.36 |
| 100 ∗ 100 | 100 | 9 | 1.8998 | 2.39 |
| 100 ∗ 100 | 100 | 4 | 6.3542 | 14.64 |

An important feature of the HQCA-WSN protocol is that changing the position of the BS does not greatly affect the network lifetime. Fig. 29 shows the network lifetime with different BS positions. Another important feature of energy-efficient algorithms is their high scalability with the sensor network size. Thus, a scalable algorithm should perform well both with small-scale networks containing a low number of sensors and with large-scale networks containing a high number of sensors. Fig. 30 shows the lifetime for a large-scale network containing a large number of sensors with the HQCA-WSN method. Clearly, the HQCA-WSN method performs well with large-scale networks containing a large number of sensors.

6.6. Computational complexity of HQCA-WSN protocol

An important characteristic of the HQCA-WSN algorithm is its low computational complexity. The most important advantage of algorithms with low computational complexity is that they can be applied simply in high-density areas. Most hierarchical algorithms, such as single-linkage, complete-linkage, and average-linkage, have a complexity of O(N²) in both time and space, whereas partitioning algorithms such as k-means have a complexity close to O(N) in both space and time. To assess the computational complexity of HQCA-WSN, we assume that the number of nodes in the area is n. One CH is present in each cluster and the average number of common nodes per cluster is equal to n/k − 1, where k is the number of clusters. In the worst case, m comparisons are performed in k clusters and each cluster has n/k − 1 nodes, which gives m × k × (n/k − 1) = m(n − k) comparisons. Therefore, the time complexity of the HQCA-WSN algorithm is O(nm). Thus, the low computational complexity of HQCA-WSN facilitates its scalability to large-scale sensor networks.

7. Conclusion and future works

Reducing the energy consumption and increasing the network lifetime are major challenges in WSNs. In this study, we proposed a method called HQCA-WSN for clustering based on the cluster quality. The nodes are classified using two methods based on several criteria and the highest quality cluster is selected. The cluster quality is evaluated based on the inter-cluster and intra-cluster intervals as well as the distribution of nodes in the network. We also proposed a measure of the error generated by clustering. In HQCA-WSN, we can determine the number of clusters according to a criterion called the confidence interval. The proposed protocol can select more clusters in dense areas and in areas where the distribution of nodes is high. In HQCA-WSN, CHs are selected based on the remaining energy, the energy consumption of the nodes, the number of neighboring nodes, the mean distance between the neighboring nodes, and the distance to the BS. Simulation results showed that HQCA-WSN could improve the network lifetime, FND, and LND compared with similar methods. HQCA-WSN reduced the average total energy in the network during the initial setup, as well as improving the average energy consumption in each cluster. The important advantages of the proposed method are that it can improve the clustering process and select better primary centers for clustering. In methods such as C-means, the cluster quality depends on the initial selection of the cluster centers. The HQCA-WSN method exhibits high scalability in large-scale networks. In the future, the performance of the HQCA-WSN method may be enhanced by including more parameters such as a new energy model, optimal cluster estimation based on the density, and the peripheral density of each node.

Appendix A. Z-table


Appendix B. Student’s t table


References

40 41 42 43 44 45 46 47 48 49 50 51 52

38 39

39

[1] D. Yi, H. Yang, HEER–a delay-aware and energy-efficient routing protocol for wireless sensor networks, Comput. Netw. 104 (2016) 155–173. [2] M. Sabet, H. Naji, An energy efficient multi-level route-aware clustering algorithm for wireless sensor networks: a self-organized approach, Comput. Electr. Eng. 56 (2016) 399–417. [3] G. Kannan, T.S.R. Raja, Energy efficient distributed cluster head scheduling scheme for two tiered wireless sensor network, Egypt. Inform. J. 16 (2015) 167–174. [4] Z. Zhang, M. Ma, Y. Yang, Energy-efficient multihop polling in clusters of two-layered heterogeneous sensor networks, IEEE Trans. Comput. 57 (2008) 231–245. [5] Z. Zong, A. Manzanares, X. Ruan, X. Qin, EAD and PEBD: two energy-aware duplication scheduling algorithms for parallel tasks on homogeneous clusters, IEEE Trans. Comput. 60 (2011) 360–374. [6] M. Aslam, E.U. Munir, M.M. Rafique, X. Hu, Adaptive energy-efficient clustering path planning routing protocols for heterogeneous wireless sensor networks, Sustain. Comput., Inform. Syst. 12 (2016) 57–71. [7] U. Hari, B. Ramachandran, C. Johnson, An unequally clustered multihop routing protocol for wireless sensor networks, in: 2013 International Conference on Advances in Computing, Communications and Informatics, ICACCI, 2013, pp. 1007–1011. [8] N. Sabor, M. Abo-Zahhad, S. Sasaki, S.M. Ahmed, An unequal multi-hop balanced immune clustering protocol for wireless sensor networks, Appl. Soft Comput. 43 (2016) 372–389.

40 41 42 43 44 45 46 47 48 49 50 51 52

JID:FSS AID:7775 /FLA

[m3SC+; v1.304; Prn:5/12/2019; 13:31] P.29 (1-30)

A.A. Baradaran, K. Navi / Fuzzy Sets and Systems ••• (••••) •••–•••

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52

29

[9] S. Bhattacharjee, P. Roy, S. Ghosh, S. Misra, M.S. Obaidat, Wireless sensor network-based fire detection, alarming, monitoring and prevention system for Bord-and-Pillar coal mines, J. Syst. Softw. 85 (2012) 571–581. [10] S.-H. Moon, S. Park, S.-j. Han, Energy efficient data collection in sink-centric wireless sensor networks: a cluster-ring approach, Comput. Commun. 101 (2017) 12–25. [11] J.A. Rodríguez-Serrano, S. Singh, Trajectory clustering in CCTV traffic videos using probability product kernels with hidden Markov models, Pattern Anal. Appl. 15 (2012) 415–426. [12] J. Zhu, C.-H. Lung, V. Srivastava, A hybrid clustering technique using quantitative and qualitative data for wireless sensor networks, Ad Hoc Netw. 25 (2015) 38–53. [13] T. He, C. Huang, B.M. Blum, J.A. Stankovic, T. Abdelzaher, Range-free localization schemes for large scale sensor networks, in: Proceedings of the 9th Annual International Conference on Mobile Computing and Networking, 2003, pp. 81–95. [14] A. Liu, Z. Zheng, C. Zhang, Z. Chen, X. Shen, Secure and energy-efficient disjoint multipath routing for WSNs, IEEE Trans. Veh. Technol. 61 (2012) 3255–3265. [15] S.A. Sert, H. Bagci, A. Yazici, MOFCA: multi-objective fuzzy clustering algorithm for wireless sensor networks, Appl. Soft Comput. 30 (2015) 151–165. [16] Y. Chatei, K. Ghoumid, M. Hammouti, B. Hajji, Efficient coding techniques algorithm for cluster-heads communication in wireless sensor networks, AEÜ, Int. J. Electron. Commun. 82 (2017) 294–304. [17] O. Moh’d Alia, Dynamic relocation of mobile base station in wireless sensor networks using a cluster-based harmony search algorithm, Inf. Sci. 385 (2017) 76–95. [18] K. Muthukumaran, K. Chitra, C. Selvakumar, An energy efficient clustering scheme using multilevel routing for wireless sensor network, Comput. Electr. Eng. (2017). [19] N. Sirdeshpande, V. Udupi, Fractional lion optimization for cluster head-based routing protocol in wireless sensor network, J. Franklin Inst. (2017). 
[20] X. Xiao, B. Tang, L. Deng, High accuracy synchronous acquisition algorithm of multi-hop sensor networks for machine vibration monitoring, Measurement 102 (2017) 10–19. [21] K. Akkaya, M. Younis, A survey on routing protocols for wireless sensor networks, Ad Hoc Netw. 3 (2005) 325–349. [22] S. Arjunan, S. Pothula, A survey on unequal clustering protocols in wireless sensor networks, J. King Saud Univ, Comput. Inf. Sci. (2017). [23] A. Shahraki, M.K. Rafsanjani, A.B. Saeid, A new approach for energy and delay trade-off intra-clustering routing in WSNs, Comput. Math. Appl. 62 (2011) 1670–1676. [24] M. Moshtaghi, S. Rajasegarar, C. Leckie, S. Karunasekera, An efficient hyperellipsoidal clustering algorithm for resource-constrained environments, Pattern Recognit. 44 (2011) 2197–2209. [25] P. Roxy, D. Toshniwal, Clustering unstructured text documents using fading function, in: Proceedings of the World Academy of Science, Engineering and Technology, 2009, pp. 149–156. [26] A. Ribeiro, G.B. Giannakis, Bandwidth-constrained distributed estimation for wireless sensor networks-part I: Gaussian case, IEEE Trans. Signal Process. 54 (2006) 1131–1143. [27] A.A. Abbasi, M. Younis, A survey on clustering algorithms for wireless sensor networks, Comput. Commun. 30 (2007) 2826–2841. [28] P.A. Forero, A. Cano, G.B. Giannakis, Distributed clustering using wireless sensor networks, IEEE J. Sel. Top. Signal Process. 5 (2011) 707–724. [29] A.Y. Ng, M.I. Jordan, Y. Weiss, On spectral clustering: analysis and an algorithm, in: Advances in Neural Information Processing Systems, 2002, pp. 849–856. [30] X. Liu, Atypical hierarchical routing protocols for wireless sensor networks: a review, IEEE Sens. J. 15 (2015) 5372–5383. [31] S. Alam, G. Dobbie, Y.S. Koh, P. Riddle, S.U. Rehman, Research on particle swarm optimization based clustering: a systematic review of literature and techniques, Swarm Evol. Comput. 17 (2014) 1–13. [32] S. Shamshirband, A. Amini, N.B. Anuar, M.L.M. Kiah, Y.W. 
Teh, S. Furnell, D-FICCA: a density-based fuzzy imperialist competitive clustering algorithm for intrusion detection in wireless sensor networks, Measurement 55 (2014) 212–226. [33] J.N. Al-Karaki, A.E. Kamal, Routing techniques in wireless sensor networks: a survey, IEEE Wirel. Commun. 11 (2004) 6–28. [34] Z. Vincze, D. Vass, R. Vida, A. Vidács, A. Telcs, Adaptive sink mobility in event-driven multi-hop wireless sensor networks, in: Proceedings of the First International Conference on Integrated Internet Ad Hoc and Sensor Networks, 2006, p. 13. [35] D. Kumar, Performance analysis of energy efficient clustering protocols for maximising lifetime of wireless sensor networks, IET Wirel. Sens. Syst. 4 (2013) 9–16. [36] W.R. Heinzelman, A. Chandrakasan, H. Balakrishnan, Energy-efficient communication protocol for wireless microsensor networks, in: Proceedings of the 33rd Annual Hawaii International Conference on System Sciences, vol. 2, 2000, p. 10. [37] W.B. Heinzelman, A.P. Chandrakasan, H. Balakrishnan, An application-specific protocol architecture for wireless microsensor networks, IEEE Trans. Wirel. Commun. 1 (2002) 660–670. [38] S. Lindsey, C.S. Raghavendra, PEGASIS: power-efficient gathering in sensor information systems, in: Aerospace Conference Proceedings, 2002, IEEE, 2002, pp. 3–3. [39] A. Manjeshwar, D.P. Agrawal, TEEN: a routing protocol for enhanced efficiency in wireless sensor networks, in: Null, 2001, p. 30189a. [40] A. Chamam, S. Pierre, A distributed energy-efficient clustering protocol for wireless sensor networks, Comput. Electr. Eng. 36 (2010) 303–312. [41] J.A. Torkestani, M.R. Meybodi, LLACA: an adaptive localized clustering algorithm for wireless ad hoc networks, Comput. Electr. Eng. 37 (2011) 461–474. [42] M.E. Haque, N. Matsumoto, N. Yoshida, Context-aware cluster-based hierarchical protocol for wireless sensor networks, Int. J. Ad Hoc Ubiq. Comput. 4 (2009) 379–386.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52

JID:FSS AID:7775 /FLA

30

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

[m3SC+; v1.304; Prn:5/12/2019; 13:31] P.30 (1-30)

A.A. Baradaran, K. Navi / Fuzzy Sets and Systems ••• (••••) •••–•••

[43] M.K. Watfa, O. Mirza, J. Kawtharani, BARC: a battery aware reliable clustering algorithm for sensor networks, J. Netw. Comput. Appl. 32 (2009) 1183–1193. [44] A. Wang, D. Yang, D. Sun, A clustering algorithm based on energy information and cluster heads expectation for wireless sensor networks, Comput. Electr. Eng. 38 (2012) 662–671. [45] S.E. Khediri, N. Nasri, A. Wei, A. Kachouri, A new approach for clustering in wireless sensors networks based on LEACH, Proc. Comput. Sci. 32 (2014) 1180–1185. [46] N.A. Latiff, C.C. Tsimenidis, B.S. Sharif, Energy-aware clustering for wireless sensor networks using particle swarm optimization, in: IEEE 18th International Symposium on Personal, Indoor and Mobile Radio Communications, 2007, PIMRC 2007, 2007, pp. 1–5. [47] M. Tabibzadeh, M. Sarram, F. Adibnia, Hybrid routing protocol for prolonged network lifetime in large scale wireless sensor network, in: International Conference on Information and Multimedia Technology, 2009, ICIMT’09, 2009, pp. 179–183. [48] M.S. Al-Fares, Z. Sun, H. Cruickshank, A hierarchical routing protocol for survivability in wireless sensor network (WSN), in: Proceedings of the International MultiConference of Engineers and Computer Scientists 2009 Vol. I, 2009, pp. 262–268. [49] T. Liu, F. Li, Power-efficient clustering routing protocol based on applications in wireless sensor network, in: 5th International Conference on Wireless Communications, Networking and Mobile Computing, 2009, WiCom’09, 2009, pp. 1–6. [50] L. Malathi, R. Gnanamurthy, K. Chandrasekaran, Energy efficient data collection through hybrid unequal clustering for wireless sensor networks, Comput. Electr. Eng. 48 (2015) 358–370. [51] H. Chen, C.-S. Wu, Y.-S. Chu, C.-C. Cheng, L.-K. Tsai, Energy residue aware (ERA) clustering algorithm for Leach-based wireless sensor networks, in: Second International Conference on Systems and Networks Communications, 2007, ICSNC 2007, 2007, pp. 40–40. [52] T. Bhatia, S. Kansal, S. Goel, A. 
Verma, A genetic algorithm based distance-aware routing protocol for wireless sensor networks, Comput. Electr. Eng. 56 (2016) 441–455. [53] S. Rani, J. Malhotra, R. Talwar, Energy efficient chain based cooperative routing protocol for WSN, Appl. Soft Comput. 35 (2015) 386–397. [54] B.M. Khan, R. Bilal, R. Young, Fuzzy-TOPSIS based cluster head selection in mobile wireless sensor networks, J. Electr. Syst. Inf. Technol. (2017). [55] H. Taheri, P. Neamatollahi, O.M. Younis, S. Naghibzadeh, M.H. Yaghmaee, An energy-aware distributed clustering protocol in wireless sensor networks using fuzzy logic, Ad Hoc Netw. 10 (2012) 1469–1481. [56] S. Su, S. Zhao, An optimal clustering mechanism based on fuzzy-C means for wireless sensor networks, in: Sustainable Computing: Informatics and Systems, 2017. [57] B. Baranidharan, B. Santhi, Ducf: distributed load balancing unequal clustering in wireless sensor networks using fuzzy approach, Appl. Soft Comput. 40 (2016) 495–506. [58] J.-M. Kim, S.-H. Park, Y.-J. Han, T.-M. Chung, CHEF: cluster head election mechanism using fuzzy logic in wireless sensor networks, in: 10th International Conference on Advanced Communication Technology, 2008, ICACT 2008, 2008, pp. 654–659. [59] H. Bagci, A. Yazici, An energy aware fuzzy approach to unequal clustering in wireless sensor networks, Appl. Soft Comput. 13 (2013) 1741–1749.
