Power quality event characterization using support vector machine and optimization using advanced immune algorithm

Power quality event characterization using support vector machine and optimization using advanced immune algorithm

Neurocomputing 103 (2013) 75–86 Contents lists available at SciVerse ScienceDirect Neurocomputing journal homepage: www.elsevier.com/locate/neucom ...

2MB Sizes 1 Downloads 46 Views

Neurocomputing 103 (2013) 75–86

Contents lists available at SciVerse ScienceDirect

Neurocomputing journal homepage: www.elsevier.com/locate/neucom

Power quality event characterization using support vector machine and optimization using advanced immune algorithm B. Biswal a,n, M.K. Biswal b, P.K. Dash c, S. Mishra d a

GMR Institute of Technology, Rajam, Srikakulam , AP 532127, India Silicon Institute of Technology, Bhubaneswar, India c S’O’A University, Bhubaneswar, India d Indian Institute of Technology, Delhi, India b

a r t i c l e i n f o

abstract

Article history: Received 24 October 2011 Received in revised form 21 March 2012 Accepted 6 August 2012 Communicated by A. Zobaa Available online 12 October 2012

This paper presents a time-time transform (TT-transform) variant for classification of non-stationary power signal disturbance patterns. The TT-transform variant is derived from the well known S-transform and employs a new window function whose width is inversely proportional to the frequency raised to a constant power with values within 0 and 1. Features are derived from the TT-transform result of the power signal patterns. These features are used for automatic recognition of types of disturbances with the help of kernel based support vector machine (SVM) based clustering. Further, the clustering performance of the TT-SVM based pattern recognizer is improved by a modified immune optimization algorithm. Several test cases are provided to demonstrate the improvement in classification accuracy while resulting in significant reduction of support vectors. & 2012 Elsevier B.V. All rights reserved.

Keywords: Power signals TT-transform Support vector machine (SVM) Radial basis function Mexican hat wavelet kernel Modified immune optimization algorithm (MIOA)

1. Introduction The voltage and current signals, in electrical power networks, often exhibit fluctuations in amplitude, phase, and frequency due to the operation of solid-state devices that are prolifically used for power control. This gives rise to steady state as well as transient voltage disturbances like voltage sag, voltage swell, interruption, oscillatory transients, impulsive transients, multiple voltage notches due to solid-state converter switching and harmonics. Power sinusoids being modulated by low frequency signals are also observed in the electric power networks. To distinguish between these disturbance signal patterns in the normal sinusoidal signals of frequency 50 Hz or 60 Hz, advanced signal processing techniques along with intelligent system approach play an important role. The detection and localization of these disturbances are critical in order to determine the causes and sources of disturbances. Harmonics, which are major phenomena in power networks are well detected using neural networks and adaptive filters [1,2]. In the current research trends wavelet transform (WT) [3–7] is widely used in analyzing non-stationary signals for power quality assessment. Although WT has been extensively used for detection of power quality disturbances, the effect of electrical noise is not

n

Corresponding author. E-mail address: [email protected] (B. Biswal).

0925-2312/$ - see front matter & 2012 Elsevier B.V. All rights reserved. http://dx.doi.org/10.1016/j.neucom.2012.08.031

adequately considered in many of the cases. In [5] it is shown that when noise is present in the signal wavelet based MLP neural network performs badly in comparison with other time-frequency techniques for power quality event classification. A de-noising scheme has been proposed in [6] for enhancing wavelet based power quality monitoring system. Apart from WT, the short time Fourier transform (STFT) and S-transform (ST) are often preferred for Power signal classification. The main objective of this paper is to apply TT-transform (TT) [8] with a modified Gaussian window and SVM clustering to the problem of power signal classification. The TT transform is derived from the ST by taking the inverse Fourier transform along the row vectors of ST result, corresponding to frequency axis. Thus the TT transform results in a time-time domain decomposition of the signal pattern. The first paper in the area of power quality analysis using S-transform [9] uses a Gaussian window with one scaling factor. In the present paper the shape of the Gaussian window is slightly modified to have a better time and frequency resolution. Correspondingly the TT transform with the modified Gaussian widow provides an improved time–time resolution. Further, the invertibility of the TT-transform leads to the possibility of filtering and signal to noise improvements in the time domain. After the features are extracted from the time-series data using modified TT-transform a SVM algorithm with the choice of different kernel functions, is used to group the data into clusters and thereby identifying belongingness of the data to a particular class. The SVM

76

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

[10] performs structural risk minimization and creates a classifier with minimized Vapnik–Chervonenkis (VC) dimension. As the VC dimension is low, the expected probability of error is low to ensure a good generalization. The SVM keeps the training error fixed while minimizing the confidence interval. So, it has good generalization ability and can simultaneously minimize the empirical risk and the expected risk for pattern classification problems. More importantly, an SVM can work very well in a high dimensional feature space. The support vector learning mechanism provides architecture to extract support vectors from a given number of training data pairs for the purpose of classification of power signal disturbances using variety of kernel functions. SVMs rely heavily on the property of selected kernel functions, whose parameters are always fixed and are chosen solely on the basis of heuristics or trial and error. There have been some instances where the SVM is combined with FNN [11–14] for yielding robust classification performance. The SVM network developed in this paper results in higher classification accuracy, and when tuned with an evolutionary computing based approach, results reduction of the number of support vectors as compared with the regular SVM. In this paper, a modified immune optimization algorithm [15,16] is used to refine the center of the SVM clusters in order to improve compactness of the clusters and the classification accuracy. The modified immune algorithm is based on a process that drives motivation from biological evolution. A population is chosen consisting of the initial cluster centers and Over successive generations, the chosen initial population evolves toward an optimal solution. Compared with the earlier existing immune algorithm, in the modified immune algorithm, the concentration of antibodies is employed to scale the diversity of solutions, and the mutation rate of antibody’s clone is dynamically adjusted based on its concentration and fitness. Therefore, one major contribution of this paper is to propose a systematical procedure to classify power signal disturbances, by a SVM classifier with reduced number of support vectors and association of modified immune optimization algorithm to enhance accuracy of recognition.

The S-transform often suffers from poor concentration of energy at higher frequency and hence poor frequency localization. So the dependence of standard deviation of the Gaussian window on frequency can be altered to improve energy concentration. Defining the standard deviation of the window as k f 

sðf Þ ¼  c k4 0 and

Resulting in a modified S-transform as  c Z 1 f  ðttÞ2 jf j2c ffiffiffiffiffiffi p uðt Þ e 2k2  e2pf t dt Sðt,f Þ ¼ k 2p 1

ð4Þ

ð5Þ

The parameter ‘c’ is initially introduced as a switch between STFT (c ¼0) and S-transform (c ¼1) but later it was observed that better localization of the time-frequency distribution can be obtained by varying ‘c’ value. Here, co0 cannot be taken, since on doing so s instead of being proportional to the time period of the signal becomes inversely proportional to it, which takes it outside the basic definition of S-transform. We define a modified time–time distribution or TT-transform by taking the inverse Fourier transform of the modified S-transform along the frequency vectors. Z 1 TT ðt, tÞ ¼ Sðt,f Þ e2pif t  df ð6Þ 1

Also inverting the TT transform, the original signal u(t) is obtained as Z 1 TT ðt, tÞ dt ð7Þ uðt Þ ¼ 1

In the discrete domain the modified TT transform is expressed as: N

2 1 X

^ ðj,kÞ ¼ TT

S^ ðj,nÞ  e2pink

ð8Þ

m ¼ N2

and ^ XðkÞ ¼

2. The modified TT-transform

0 rc r 1

N 1 X

T^ T^ ðj,kÞ

ð9Þ

j¼0

The time-frequency distribution of a signal depicts the variation of its spectral characteristics with time. The recently proposed S-transform is a combination of short-time Fourier transform (STFT) and wavelet transforms. It uses a Fourier kernel with a wavelet like Gaussian window which translates in time and dilates in inverse proportion to the frequency. Thus, the S-transform preserves the phase information of the signal, and provides a variable resolution similar to wavelet transform. The standard S-transform of a signal u(t) is given by a convolution integral as Z 1 ðttÞ2 1 pffiffiffiffiffiffi e 2sf 2  e2pf t dt Sðt,f Þ ¼ uðt Þ ð1Þ sðf Þ 2p 1

3. Illustrations The contour plots of the amplitude spectrum of the modified TTtransform have been shown in Figs. 2–5, for different values of the scaling parameter ‘c’. The analyzed signal (Fig. 1) is a mixture of sinusoid of frequency order 2, having amplitude 1. The transients taken are of the order of 10 and 20, respectively, each having amplitude 1. The first transient occurs in between the sampling points 100 and 200 and the second lies in between the points 300

where, f is frequency, t and t are time variables. The standard deviation s(f) in Eq. (1) is a function of frequency f, and in normal S-transform is defined as 1 sðf Þ ¼   f

ð2Þ

Since the window depends on the frequency, it provides a good localization in the frequency-domain for low frequencies, a good localization in the time domain for higher frequencies. Further the integral of S-transform over time is the Fourier transform Z  U fÞ ¼

1

Sðt,f Þ dt 1

ð3Þ

Fig. 1. Original time domain signal consisting of sinusoidal content of frequency order 2 and two transients of frequency order10 and 20 centered at time points 150 and 350, respectively.

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

30

20

Frequency

Frequency

30

77

10

20

10

0 0

100

200

300

400

500

0

Time

0

100

200

300

400

500

Time

Fig. 2. Frequency contour plot of S-transform for c¼ 1.

Fig. 5. Frequency contour plot of modified S-Transform for c ¼1.1.

Frequency

30

20

10

0 0

100

200

300

400

500

are well reflected. The TT contour in Fig. 6(c) shows higher energy conentation of the TT distribution during the occurance of transients. Fig. 7(b), Fig. 8(b) shoes the decrease/increase in amplitude during the occurrence of voltage sag/swell, respectively. Fig. 9(b) depicts the envelope of the 10 Hz amplitude modulation caused due to flicker. Similar informations can be detected from Figs. 10 and 11 for harmonis and sag with harmonics. Fig. 12(b) shows the sudden reduction in amplitude due to momentary interruption. Fig. 13(b) accurately localized the notches and their depths.

Time

Fig. 3. Frequency contour plot of modified S-transform for c ¼0.9.

5. Structure of the SVM classifier

Frequency

30

20

10

0 0

100

200

300

400

500

Time

Fig. 4. Frequency contour plot of modified S-transform for c ¼0.7.

and 400. Thus the center or peak points for the transients are positioned at 150 and 350 sampling interval, respectively. The contour plots of Fig. 3 and Fig. 4 has been obtained for c¼0.9 and c¼0.7 and it is observed that, when co1, better frequency localization is obtained. For c¼1.1, the window is squeezed around the center points, thus giving good time localization, while the contours are scattered over the frequency axis, which indicates poor spectral information as depicted in Fig. 5. Thus it is observed that the scaling parameter ‘c’ acts as a further fine tuning element for a particular resolution level.

4. Applications to PQ disturbance The proposed technique is used to analyze the power signal disturbance in a realistic power network simulated by power system block set supported by MATLAB software. The sampling rate for the collection of power quality data is taken to be equal to 3.2 kHz. The time–time contours obtained by modified TTtransform and the magnitude of change are shown in Figs. 6–13 for different power quality patterns. The magnitude of change is derived from the TT transform resultant matrix by picking the maximum of the absolute values along each time index. The TT contour plots are obtained by plotting the absolute value of the elements of TT-transform matrix. The intensity values along each contour are nearly equal. As it can be observed in Fig. 6(b), the magnitude of change during the occurrence of oscillatory transient

A generalized four-layered support vector neural network (SVNN) is shown in Fig. 14, which comprises of the input, membership function, kernel function, output layers. Layer-1 accepts input variables whose nodes represent input variables. In layer-2 the membership values whose nodes represent the terms of the respective variables are computed. Nodes at layer-3 represent the kernel function used in SVM. Layer-4 is the output layer. The signal propagation and the basic functions in each layer are described as follows: Layer 1(input layer): No computation is done in this layer. Each node in this layer, which corresponds to one input variable, only transmits input values to the next layer directly. No computation takes place in this layer and the output of this layer o1i is expressed as o1i ¼ xi .where xi, i¼1,2,y,M are the input features to the SVNN. Here we have considered only two input features x1 and x2. Layer 2 (membership layer): Each node in this layer is a Gaussian membership function that corresponds one label of one of the input variables in layer-1. In other words, the membership value, which specifies the degree to which an input value belongs to a fuzzy set, is calculated in layer-2. The Gaussian membership functionis defined as:

mi ð:Þ : R-f0:1g,  

 mi o1i ¼ e



:o1 cj :2 i s2 j

i ¼ 1,2,. . .,M:  ð10Þ

where, cj and sj are, respectively, the center and the spread of the Gaussian membership function of the jth term of the ith input variable o1i . Then centers of membership function corresponding to each individual non-stationary power signal disturbances will be updated by the evolutionary computing approach explained in Section 8. The spread is calculated by using the formula sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2 :o1i cj : ð11Þ sj ¼ N N is the no of data points.

78

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

Magnitude of change

Time Index 0.1

0.05

0

0

100

200

300 Time Index

Time Index

600 400 200 0

0

100

200

300 Time Index

600

Amplitude

Fig. 6. (a) A voltage signal with oscillatory transient and an impulsive transient and (b) its time–time contour plot.

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.01

0.005

0 0

100

200

300 Time Index

Time Index

600 400 200 0 0

100

200

300 Time Index

Fig. 7. (a) Signal with voltage sag (b) magnitude of change (c)) its time–time contour plot.

The center cj and spread sj are fed as inputs to the kernel layer.   o2j ¼ cj , sj

Layer 3 (kernel layer): In this layer the low dimensional input space is transformed into a high dimensional feature space. The reason behind this transformation lies in Cover’s theorem. Cover’s theorem states that ‘‘A complex pattern classification

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

79

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.015 0.01 0.005 0

0

100

200

300 Time Index

Time Index

600 400 200 0

0

100

200

300 Time Index

Fig. 8. (a) Signal with voltage swell (b) magnitude of change (c) its time–time contour plot.

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.01

0.005

0

0

100

200

300 Time Index

Time Index

600 400 200 0 0

100

200

300 Time Index

Fig. 9. (a) Signal with 10 Hz voltage flicker (b) magnitude of change (c) its Time-Time contour plot.

problem cast in a high dimensional space nonlinearly is more likely to be linearly separable than in a low dimensional space’’. In our SVM the fallowing types of kernel functions, which are listed below. a. Radial basis function  kðp,qÞ ¼ e



:pi qj :2 2s2

ð12Þ

b. Mexican hat wavelet kernel  2! :p q :2 :pi qj :  i 2j 1 s kðp,qÞ ¼ pffiffiffiffiffiffiffiffiffiffiffiffi 1 e s2 2ps3

ð13Þ

The average value of the calculated spread in layer 2 is passed as one of the parameters pi, of the kernel function. The other parameter of the kernel function is the center ‘qi’, which is a constant and can be assigned any value for achieving better classification.

80

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.02

0.01

0 0

100

200

300 Time Index

Time Index

600 400 200 0 0

100

200

300 Time Index

Fig. 10. (a) Voltage Signal with 30% 3rd, 10% 5th, 5% 11th Harmonic (b) Magnitude of change (c) ) its Time-Time contour plot.

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.02

0.01

0 0

100

200

300 Time Index

Time Index

600 400 200 0 0

100

200

300 Time Index

Fig. 11. (a) Voltage signal with 30% 3rd, 10% 5th, 5% 11th harmonic and a sag (b) magnitude of change (c) its time–time contour plot.

Layer 4 (output layer): In this layer the support vectors are obtained by solving the constrained quadratic programming problem. In this layer the support vector machine algorithm for classification has been applied.

6. SVM framework Support vector machine is a very useful technique for data classification and regression problems. In most cases, the searching

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

81

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.01

0.005

0

0

100

200

300 Time Index

Time Index

600 400 200 0

0

100

200

300 Time Index

Fig. 12. (a) Voltage signal with momentary interruption (b) magnitude of change (c) its time–time contour plot.

Amplitude

1 0 -1 0

100

200

300

400

500

600

400

500

600

400

500

600

Magnitude of change

Time Index

0.04 0.02 0 0

100

200

300 Time Index

Time Index

600 400 200 0 0

100

200

300 Time Index

Fig. 13. (a) Voltage signal with notch (b) magnitude of change (c) its time–time contour plot.

of the suitable hyperplane in an input space is too restrictive to be of practical use.A simple intuition about the SVM can be obtained in the case of a linear classifier. Let a set S of labelled training patterns (u1,v1),y,(ul,vl), each training point u1 belongs to either of two classes with label y{1,  1}, for i¼0,1,y,l. In this case the SVM separates the data with a hyper plane described by WTU þ b¼ 0 that leaves the maximum margin between the two classes. Here, W is a set of weight vector and U is a vector consisting of input data. The

task of maximizing the margin is equivalent to minimizing an upper bound on the generalization error of the classifier. The hyper plane

WTU þb ¼ 0

separates the data if

 f WTU þ b Z1

ð14Þ To find the optimal separating hyper plane, one needs to find the plane, which maximizes the distance between the hyper plane

82

B. Biswal et al. / Neurocomputing 103 (2013) 75–86 1

2

o1

x1

Any symmetric function K(u,v) in input space can represent a scalar product in feature space if ZZ   K ðu,u0 ÞgðuÞg ðu0 Þdudu0 40, 8g A L2 Rn , ð21Þ

o1

3

o1

x2

Input Layer

- o M3

-

- o1 M

XM

3

o2

2

o2

1 o2

2

oM

where g(.) is any function with a finite L2 norm in input space or a R function for which g2(x)dxo N. Corresponding features in a Z-space are the eigen-vectors of an integral operator associated with K, Z K ðu,u0 Þfi ðuÞdu ¼ li fi ðuÞ ð22Þ



Output Layer

li are the eigen values.

Layer-3

The kernels functions considered in this paper are Gaussian RBF kernel given by  2

Layer-2

Fig. 14. Four-layered support vector network (SVN).

kðp,qÞ ¼ e

and the closest positive and negative samples. Therefore the classification problem is equivalent to minimizing. !h 2 l X :W: þC xi ð15Þ min 2 W,b, x i¼1 subject to:  f W T U þ b Z 1xi

l X

ai fyi ½wi ui þ b1 þ xi g

i¼1

l X i¼1

ai 

and Mexican hat wavelet kernel  2! :p q :2 :pi qj :  i 2j 1 s kðp,qÞ ¼ pffiffiffiffiffiffiffiffiffiffiffiffi 1 e s2 2ps3

ð23Þ

ð24Þ

ð16Þ

l X

bi xi

ð17Þ

l 1 X vvaauu 2 i,j ¼ 1 i j i j i j

Energy, standard deviation, autocorrelation, mean, variance, and normalized values have been extracted as features from the TTtransform analysis of the non-stationary power signals. Among these features the standard deviation and normalized values were found to be the most distinguished features. This conclusion was made from the visual verification from the feature plot, with different combination of features. Feature vectors (hundreds of each disturbance) are given as the input to the SVM network. Figs. 15–17 depict classification of voltage sag, voltage flicker, voltage swell, impulsive transient, harmonics, voltage notch, sag with harmonics, momentary interruption, oscillatory transient and voltage flicker with harmonics using Mexican hat wavelet c function. When the classification was performed by the SVM algorithm with either RBF kernel or Mexican hat wavelet kernel function, the following serious drawbacks were observed.

i¼1

where ai and bi are the Lagrange multipliers. By solving Eq. (20) for optimal saddle point (w0, b0, x0, a0, b0) we can get the dual lagrangian Ld ðaÞ ¼

:pi qj : 2s2

7. Simulation results with Mexican hat wavelet kernel

xi Z 0 ði ¼ 1::lÞ

where, C is the penalty constraint, xi (i¼1,y,l) are the slack variables and h is a constant. Choosing the exponent h¼0 in the above equation results in nonappearance of the slack variables xi. Here, h is chosen as 1. This constrained optimization problem can be translated into a dual quadratic programming problem by Lagrangian multipliers. The saddle point of the primal Lagrangian gives the solution to a quadratic programming problem subject to inequality constraints. The Lagrangian function is: ! l X 1 Lp ðW,b, x, a, bÞ ¼ W T W þC xi 2 i¼1 



(1) In Figs.15 and 17, the support vector boundaries of different power signal data patterns are overlapping with each other.

ð18Þ

The roots of the above equation give the optimized Lagrange multipliers a. The values for which a is minimum and non-zero are the support vector points. The linear classifier presented in Eq. (21) is inadequate for the classification of the data points because of the overlapping nature of the data points. So for non-linear classifiers the input vectors UARn are mapped into vectors ZARf. U A Rn -zðuÞ ¼ ½f1 ðuÞ, f2 ðuÞ,. . ., fn ðuÞT A Rf :

ð19Þ

This mapping leads to a linear classifier in a high dimensional feature space creating a non-linear separating hyper plane. For this, the scalar product U T Uin Eq. (17) is replaced by the scalar product ZTZ¼[f1(u), f2(u) ,y,fn(u)][f1(u), f2(u) ,y,fn(u)]T in a feature space. This is further expressed by the kernel function,     K ui ,uj ¼ zi zj ¼ fðui Þf uj

ð20Þ

Fig. 15. Classification of voltage swell, sag, flicker, impulsive transient using Mexican hat wavelet kernel.

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

83

Start Input features(Antigens) Randomly selected centers (Antibodies) Mutation of Antibodies Reselection of Antibodies

Clone

Death

Activated antibody

Not activated antibody

Fig. 18. Flow chart of modified artificial immune algorithm for center updation.

Fig. 16. Classification of voltage sag with harmonics, notch and harmonics using Mexican hat wavelet kernel.

repertoire of antibodies with sufficient diversity to handle various antigens. Compared with the earlier existing immune algorithms, in the modified immune algorithm, the concentration of antibodies is employed to scale the diversity of solutions, and the mutation rate of antibody’s clone is dynamically adjusted based on its concentration and fitness. Moreover, this algorithm defines the clusters of antibodies, in which the elitist ones are always retained as the memory set, while other antibodies are replaced by fresh individuals Fig. 18 describes the flow chart for the modified artificial immune algorithm for center updation. In our work each data point is considered as the antigen and the randomly selected center is taken as the antibody. The center (antibody) of the individual non-stationary power signal disturbances is updated by the following equation cj ¼ ci þ aNð0,1Þ

ð25Þ

where ci cj

Initial random center from Eq. (15). Mutated version of ci. a Mutation rate of the antibody. N (0, 1) Gaussian random variable with zero mean and standard deviation.

Fig. 17. Classification of voltage flicker with harmonics, momentary interruption, and oscillatory transient using Mexican hat wavelet kernel.

This results in an increased number of support vectors. Hence chances of misclassification will be quite prominent. In addition to this neither the support vectors boundaries are smooth nor the power signal data pattern are compactly clustered. (2) In Fig. 16 it is seen that the support vector boundary for harmonic data pattern could not be drawn due the fact that some feature vector of harmonic data pattern belong simultaneously to sag with harmonic and voltage notch. This occurs because the Euclidean distance between the center of the harmonic data pattern and its corresponding feature vectors could not be minimized to a great extent. To circumvent this problem, its center as well as the spread of the proposed kernel function needs to be further fine-tuned using modified immune optimization algorithm given in Eqs. (25) and (26), respectively, of the next section. 8. Modified immune optimization algorithm (MIOA) and simulation result The natural immune system can protect from a large variety of antigens, and the first problem it must face is the generation of a

The mutation rate of the antibody is expreossed as      g n þ 1g  f a ¼ exp b  concentrationðci Þ

ð26Þ

where Decay of mutation rate. Fitness of the individual normalized in [0, 1]. g and (1  g) Are the two weighted parameters for the concentration and fitness, respectively. Concentration A measure of diversity.

b

f*

Concentration of (ci) is defined as follows: concentrationðci Þ ¼

maxðaf f inityÞ meanðaf f inityðxi ,ci ÞÞ

ð27Þ

where, max(affinity) is the maximum of the affinities among all the antibodies, and mean(affinity(xi, ci))is the mean of all the affinities between ci and other antibodies. Affinity is the Euclidean distance between two antibodies. The Euclidian distance is calculated according the formula Distance ¼ :xi ci :

ð28Þ

It can be seen from the definition, the concentration is based on the affinities (similarity) among the antibodies, and it is a measure of their diversity. If an antibody is relatively close to others in the search

84

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

space, its concentration is higher, and the mutation rate ‘a’of the clone is also higher. Hence, we expect this parent antibody to generate mutated clones with sparse locations and fitness to cover more domain space so that the solution diversity is maintained. The mutation rate of each antibody’s clone is a weighted value of the inverse of the concentration and fitness. If the fitness of the antibody is relatively high, its mutation rate is small. In other words, the mutation of the modified algorithm can achieve an appropriate trade-off between the exploration and exploitation in the search space. Figs. 19–21 show classification of voltage sag, voltage flicker, voltage swell, impulsive transient, harmonics, voltage notch, sag with harmonics, momentary interruption, oscillatory transient and voltage flicker with harmonics using Mexican hat wavelet kernel function with MIOA. The parameter used in the simulation of SVM classification for center updation is given below in Table 1. N is the number of input feature vectors for each of the power signal disturbances. Fig. 21. Classification of voltage flicker with harmonics, momentary interruption, and oscillatory transient using Mexican hat wavelet kernel with MIOA.

Table 1 Parameter selection in MIOA. Parameters

Values

Population size N Number of clones Nc Decay of mutation rate b Weighted parameter

100 10 10 0.9

0.8 0.7

Fig. 19. Classification of voltage swell, sag, flicker, impulsive transient using Mexican hat wavelet kernel with MIOA.

Fitness function

0.6

Old center

0.5 0.4 0.3

Updated center

0.2 0.1 0 0

2

4

6

8

10

12

14

16

No of Iterations Fig. 22. Center updation of normalized value using MIOA.

Fig. 20. Classification of voltage sag with harmonics, notch and harmonics using Mexican hat wavelet kernel with MIOA.

Nc is total no of clusters. The proposed SVM algorithm with Mexican hat wavelet kernel functions when blended with modified immune optimization algorithm (MIOA) has shown superior performance in terms of distinct and smooth SVM clusters, probability of misclassification is reduced to large extent and the problem of drawing the support vector boundary for harmonic power signal data pattern was successfully overcome when compared to Mexican hat wavelet kernel function. Fig. 22 describes the center updation of normalized value using Mexican hat wavelet kernel with modified immune optimization

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

85

Table 2 Accuracy and percentage reduction in support vector using RBF and Mexican hat wavelet kernel. Sl. no.

1 2 3 4 5 6 7 8 9 10

Disturbance

Impulsive transient Swell Sag Flicker Notch Sag with Harmonics Harmonics Oscillatory transient Momentary Interruption Flicker with harmonic

RBF kernel

Mexican hat wavelet kernel

Misclassification

(%) age accuracy

No. of support vectors

Misclassifications

(%) age accuracy

No. of support vectors

0 0 0 0 0 0 0 0 0 1

100 100 100 100 100 100 100 100 100 99

12 8 9 11 2 6 19 8 3 5

0 0 0 0 0 0 0 0 1 0

100 100 100 100 100 100 100 100 99 100

10 4 7 11 2 5 16 5 3 2

Table 3 Accuracy and percentage reduction in support vector using MIOA with Mexican hat wavelet kernel. Mexican hat kernel with MIOA

Misclassification % age accuracy

1 2 3 4 5 6 7 8 9 10

Impulsive transient Swell Sag Flicker Notch Sag with Harmonic Harmonic Oscillatory transient Momentary Interruption Flicker with harmonic

% age Reduction in support vectors (%)

No. of support vectors

0

100

2

80

0 0 0 0 0

100 100 100 100 100

3 8 10 2 4

0 0

100 100

6 4

62.5 20

0

100

2

33.33

1

99

1

50

25  14.28 9.09 0 20

algorithm comparing the random center (old center) and the updated center (new center).The later one showing better convergence than the former one Tables 2 and 3.

9. Conclusion This paper has proposed a new approach for detection, localization and classification of non-stationary power signal waveforms using modified TT-transform and SVM classifier. The modified TTtransform with a variable window as a function of frequency is used to generate time–time distributions and significant features were derived for classification. Unlike the wavelet transform based techniques, the TT-transform provides a complete characterization of power signal disturbances. The support vector learning mechanism based on wavelet kernel provides a structural framework to extract support vectors from the input patterns. The choice of kernel function had a great influence in reducing the number of support vectors. The number of support vectors and the compactness of different clusters are further enhanced using modified immune optimization algorithm. As it can be inferred from Table 4, the proposed classifier outperforms earlier proposed classifiers in terms of accuracy.

16.66 25 11.11 28.57 0 16.66 15.78 37.5 0 60

Table 4 Comparison of accuracy with other proposed methods. Proposed by

Sl. Disturbance no.

% age reduction in support vectors (%)

Overall accuracy (%)

Ghosh and Lubkeman [17] 72 Perunicic et al. [18] 89 Damarla et al. [19] 95 Wijayakulasooriya et al. [20] 98 Kanitpanyacharoean and Premrudeepreechacharn [21] 98.9 Kaewarsa et al. [22] 90.14 Hu et al. [23] 98.4 Ekici [24] 95.6 Proposed 99.9

References [1] H.C. Lin, Intelligent neural network-based fast power system harmonic detection, IEEE Trans. Ind. Electron. 54 (1) (2007) 43–53. [2] H.C. Lin, Fast tracking of time varying power system frequency and harmonics using iterative loop approaching algorithm, IEEE Trans. Ind. Electron. 54 (2) (2007) 974–983. [3] S.,Santoso , E.J., Powers, W., Grady, and A. Parsons 1997. Power quality disturbance waveform recognition using wavelet-based neural classifier, Part 1: Theoretical foundation. The 1997 IEEE/PES Winter Meeting, New York, NY, U.S.A. [4] S.R. Shaw, C.R. Laughman, S.B. Leeb, R.F. Lepard, A power quality prediction system, IEEE Trans. Ind. Electron. 47 (3) (2000) 511–517, Jun 2000. [5] P.K. Dash, Nayak Maya, M.R. Senapati, I.W.C. Lee, Mining for similarities in time series data using wavelet based feature vectors and neural networks, Eng. Appl. Artif. Intell. 20 (2007) 185–201. [6] H.T. Yang, C.C. Liao, A de-noising scheme for enhancing wavelet-based power quality monitoring systems, IEEE Trans. Power Delivery 16 (2001) 353–360. [7] H. Zang, P. Liu, O.P. Malik, Detection and classification of power quality disturbances in noisy conditions, IEE Proc.: Gener. Transm. Distrib. 150 (5) (2003) 567–572. [8] C.R. Pinnegar, L. Mansinha, A method of time-time analysis: the TTtransform, Elsevier Sci. Digital Signal Processing 13 (2003) 588–603. [9] P.K. Dash, B.K Panigrahi, G. Panda, Power quality analysis using S-transform, IEEE Trans. Power Delivery 18 (2) (2003). [10] Vapnik, Statistical Learning Theory, Wiley, New York, 1998. [11] C.H. Sohn, C.H. Dagli, Advantages of using fuzzy class memberships in selforganizing map and support vector machines, Proc. Int. Joint Conf. Neural Networks (IJCNN’01) 3 (2001) 1886–1890. [12] C.F. Lin, S.D. Wang, Fuzzy support vector machines, IEEE Tran. Neural Networks 13 (2002) 464–471. [13] T. Inoue, S. Abe, Fuzzy support vector machines for pattern classification, Proc. Int. Joint. Conf. Neural Networks (IJCNN’01) 2 (2001) 15–19. [14] I.W.C., Lee, P.K. Dash, S-transform based intelligent system for classification of power quality disturbance signals, IEEE Trans. Ind. Electron., 50, 4, pp. 800–806. [15] Hong-Gang, Zhou, Yang Chun-De, 2006. Using immune algorithm to optimize anomaly detection based on SVM, in: Fifth International Conference on Machine Learning and Cybernetics, Dalian, Aug. 2006, pp. 3–16. [16] Zhuo-Yue, Song, X.Z., Gao, Xian-Lin, Huang, H.S.,.A, Lin Modified immune optimization algorithm, in: Fifth International Conference on Machine Learning and Cybernetics, Dalian, Aug.2006, pp. 3–16.

86

B. Biswal et al. / Neurocomputing 103 (2013) 75–86

[17] A.K. Ghosh, D.L. Lubkeman, The classification of power system disturbance waveforms using a neural network approach, IEEE Trans. Power Delivery 10 (1) (1995) 109–115. [18] B., Perunicic, M., Malini, Z., Wang, & Y., Liu (1998). Power quality disturbance detection and classification using wavelets and artificial neural networks, in: Proceedings of the International Conference on Harmonics and Quality of Power, Athens, Greece, pp. 77–82. [19] G.P., Damarla, A., Chandrasekaran, & A., Sundaram (1994). Classification of power system disturbances through fuzzy neural network, in: Electrical and Computer Engineering Conference, Canada, pp. 68–71. [20] J.V. Wijayakulasooriya, G.A. Putrus, P.D. Minns, Electric power quality disturbance classification using self-adapting artificial neural networks, IEE Proc.: Gener. Trans. Distrib. 149 (1) (2002) 98–101. [21] W., Kanitpanyacharoean, & S., Premrudeepreechacharn (2004). Power quality problem classification using wavelet transformation and artificial neural networks, in: IEEE Power Systems Conference and Exposition, Chiang Mai, Thailand, pp. 1496–1501. [22] S., Kaewarsa, & K., Attakitmongcol (2004). Wavelet-based neural classification of power quality disturbances, in: Proceeding of International Symposium on Intelligent Signal Processing and Communication Systems, Seoul, Korea, pp. 299–304. [23] Feng-Feng Guo-Sheng Hu, Zhu, Zhen Ren, Power quality disturbance identification using wavelet packet energy entropy and weighted support vector machines, Expert Syst. Appl. 35 (2008) 143–149. [24] Sami Ekici, Classification of power system disturbances using support vector machines, Expert Syst. Appl. 36 (2009) 9859–9868.

Birendra Biswal received the Master of Engg. Degree from University College of Engineering, Burla, India in 2001 and Ph.D. from Biju Patnaik University of Technology, India. Presently working as Professor and Chairman Post Graduate Studies in GMR Institute of Technology, India. His primary research interest focuses on Digital Signal Processing, Adaptive Signal Processing, Statistical Signal Processing, Image Processing, Data Mining, System Identification and Machine Intelligence.

M. K. Biswal is an Assistant Professor in Silicon Institute of Technology, Bhubaneswar, India. He obtained his B. Tech. in Electronics and Telecommunications engineering from Silicon Institute of Technology. His primary research interests are applications of Signal Processing, and Machine Intelligence techniques for solving engineering problems.

P. K. Dash Ph.D., D.Sc, SMIEEE, FNAE,. He is currently working as Director (Research and Consultancy) in Sikha ‘‘O’’ Anusandhana University, Bhubaneswar, India. He is a visiting professor to USA, Canada, Malaysia & Singapore. Under his Supervision more than 40 research scholars have completed their PhD successfully. His area of research includes Power System, Control Engineering, Data mining, Intelligent Signal Processing, Soft Computing, Adaptive Signal Processing & Advanced Signal Processing.

S. Mishra (M’97-SM’04) received the B.E. degree from the University College of Engineering, Burla, Orissa, India, in 1990, and the M.E. and Ph.D. degrees from Regional Engineering College, Rourkela, Orissa, in 1992 and 2000, respectively. In 1992, he joined the Department of Electrical Engineering, University College of Engineering, as a Lecturer and, subsequently, became a Reader in 2001. Presently, he is an Associate Professor with the Department of Electrical Engineering, Indian Institute of Technology Delhi, New Delhi, India. His current research interests include soft computing applications to power system control, power quality, and renewable energy. Dr. Mishra has been honoured with many prestigious awards, such as INSA Young Scientist Medal in 2002, INAE Young Engineer’s Award in 2002, and recognition as the DST Young Scientist in 2001–2002. He is a fellow of the Indian National Academy of Engineering, New Delhi, the Institution of Engineering and Technology, London, U.K., and the Institution of Electronics and Communication Engineering, India.