Quasi-periodic modeling for heart sound localization and suppression in lung sounds

Biomedical Signal Processing and Control 8 (2013) 586–595 Contents lists available at SciVerse ScienceDirect Biomedical Signal Processing and Contro...

Download PDF

900KB Sizes 0 Downloads 15 Views

Report

PDF Reader
Full Text

Biomedical Signal Processing and Control 8 (2013) 586–595

Contents lists available at SciVerse ScienceDirect

Biomedical Signal Processing and Control journal homepage: www.elsevier.com/locate/bspc

Quasi-periodic modeling for heart sound localization and suppression in lung sounds Miroslav Zivanovic ∗ , Miriam González-Izal Dept. Ingeniería Eléctrica y Electrónica, Universidad Pública de Navarra, Campus Arrosadía, 31006 Pamplona, Spain

a r t i c l e

i n f o

Article history: Received 10 December 2012 Received in revised form 6 May 2013 Accepted 5 June 2013 Keywords: Heart sound localization Lung sound denoising Quasi-periodic modeling

a b s t r a c t We present a novel approach to treating lung sounds contaminated by heart sounds by means of quasiperiodic signal modeling. Heart sounds are described through the long and short-term quasi-periodicity generated by cardiac cycle repetition and oscillations caused by valves openings and closures, respectively. In terms of signals, these quasi-periodicities drive time-variant HS envelope and phase which are modeled by single and piecewise time polynomials, respectively. Single polynomials account for slow and continuous envelope time variations, while piecewise polynomials capture fast and abrupt phase changes in short time intervals. Such a compact signal description provides an efﬁcient way to fundamental heart sound (FHS) components localization and posterior removal from lung sounds. The results show that the proposed method outperforms two reference methods for medium (15 ml/s/kg) and high (22.5 ml/s/kg) air ﬂow rates. © 2013 Elsevier Ltd. All rights reserved.

1. Introduction Lung sound (LS) analysis is a key issue for accurate medical diagnosis as it provides valuable information about lungs psychologies and pathologies. The major general problem, however, remains the presence of heart sounds (HS) which interferes with LS in 20–150 Hz approximately [1–3]. Incorrectly localized and removed fundamental heart sound components (FHS) can usually render LS unﬁt for diagnosis purposes. Denoising of LS is often viewed as a two-step process, namely, (1) HS localization and (2) suppression. In the literature we ﬁnd a number of approaches to HS localization based on time–frequency ﬁltering [4], higher-order statistics [5], wavelet decomposition [6,7], sound entropy [8] and singular spectrum analysis [9]. Additional efforts have been made to suppress FHS components from LS with the aim of preserving most of the LS signal content in the HS frequency range. Approaches like independent component analysis [10], adaptive ﬁltering [11–13], linear and non-linear prediction [14,15] and blind source separation [16] were dealt with. A thorough analysis would be needed to evaluate the performance of the aforementioned methods and bring out their virtues and shortcomings. However, a general remark regarding limitations

∗ Corresponding author. Tel.: +34 948169024; fax: +34 948169720. E-mail addresses: [email protected] (M. Zivanovic), [email protected] (M. González-Izal). 1746-8094/$ – see front matter © 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.bspc.2013.06.003

usually stems from either an a priori assumption on HS stationarity or a lack of robustness in conditions of middle and high air ﬂow rates. Herein we present a novel approach to HS characterization by explicit modeling of the long and short-term quasi-periodicities, which arise from cardiac cycle repetition and vibrations caused by openings and closures of the valves, respectively. In the signal space the former account for the instantaneous envelope while the latter address the issue of the time-variant phase. Under the assumption of slow and continuous periodicity variation, the envelope is modeled by a harmonic structure whose coefﬁcients are low degree single time polynomials. Phase changes, on the contrary, are fast and abrupt and occur in short time intervals corresponding to FHS components; accordingly, the phase is modeled as a single sinusoid scaled by uniformly distributed piecewise polynomials. Such a modeling allows for a robust HS characterization in middle and high air ﬂow rates scenarios. Moreover, the method is computationally highly efﬁcient because the model parameters are estimated from an overdetermined linear system of equations. Note that single and piecewise polynomials have already been utilized in biomedical signal processing e.g. [17,18] but to the best of our knowledge never for the present application. The present paper is organized as follows: Section 2 is dedicated to HS modeling in terms of time-variant envelope and phase; in Section 3 we present experimental results in the context of a twofold comparative study; ﬁnally, the conclusions appear in Section 4.

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

587

The denominator in (5) stands for the fact that f(t) is the derivative of the instantaneous phase. By inserting (4) and (5) into (3) and after some elementary trigonometric manipulations (omitted for the sake of simplicity) A(t) becomes a linear combination of sine and cosine product terms of the non-linear argument 2nfm tm+1 /(m + 1), m = 1, . . ., M. Assuming slow time variations in an (t) and f(t), the small argument approximation can be applied:

sin

cos

2nfm t m+1 m+1 2nfm t m+1 m+1

≈

2nfm t m+1 , m+1

(6)

≈ 1.

(7)

The combination of (3)–(7) yields the ﬁnal model for the HS signal envelope: A(t) =

N

sn (t) sin(2nf0 t) + cn (t) cos(2nf0 t),

(8)

n=1

Fig. 1. Illustration of the quasi-periodicities: (a) a HS signal with the long-term quasi-period 1/f(t) and (b) a FHS component with the short-term quasi-period 1/F(t).

2M+1

sn (t) =

(n) ai

i−2

− 2n

2M+1

cn (t) =

2.1. HS signal model

(n) bi

i−2

+ 2n

In general terms we can represent a HS as a narrow-band signal h(t) with a time-variant envelope and phase: h (t) = A(t) cos ϕ(t).

(1)

A(t) characterizes the instantaneous RMS amplitude of the h(t) and it exhibits prominent peaks localized at the FHS components. The repetitive nature of the cardiac cycle induces quasi-periodic patterns in A(t) with the quasi-period of the order of 1 s (longterm quasi-periodicity). The phase ϕ(t) describes the time-variant frequency content of the FHS components with the quasi-period typically in the range 5–50 ms (short-term quasi-periodicity). For the sake of clarity both quasi-periodic trends are illustrated in Fig. 1. 2.2. HS envelope modeling Due to its quasi-periodic nature A(t) can be modeled by a set of N time-variant harmonics: A(t) =

N

(2)

The last expression can be rewritten in the following way: N

− (an (t) sin n ) sin(2nf (t)t)

n=1

+ (an (t) cos n ) cos(2nf (t)t).

(3)

The time-variant parameters in (3) are modeled by polynomials of order M under the assumption of continuous time variation: −an (t) sin n =

M

(n) am t m ,

m=0

f (t) =

M fm m=0

i−1

ti,

(9)

ti.

(10)

sn (t) and cn (t) are polynomials of degree 2M + 1 whose internal structure reveals the way the harmonic amplitudes are related to the frequency. This structure is transparent to the envelope modeling process; nevertheless, it can provide valuable information about cardiac cycle time evolution, which is related to the issue of heart rate variability (out of the scope of the present paper). From (8) to (10) we observe that if A(t) was periodic, then sn (t) = a0 (n) , cn (t) = b0 (n) , f(t) = f0 and (8) would become a set of harmonics of the fundamental cardiac period 1/f0 describing a stationary HS. Otherwise, long-term quasi-periodicity is captured by means of the non-zero polynomial coefﬁcients. Accordingly, (8) correctly characterizes the envelope of a HS signal allowing variations in energy and time distance between the FHS components. Moreover, A(t) conveys necessary information for LS–HS localization, which can be readily obtained e.g. by adaptive thresholding [9]. 2.3. HS phase modeling

an (t) cos[2nf (t) + n ].

n=1

A(t) =

(n) f am i−m−1

m=0

i=0

i−1

m=0

i=0

2. Methods

(n) f bm i−m−1

m+1

an (t) cos n =

M

The term cos ϕ(t) in (1) characterizes the oscillatory behavior of a HS signal in short time intervals (typically 40–100 ms) centered at the FHS components. The character of these oscillations has been studied in the past [19–21], where it was found that they resemble chirp signals. Accordingly, we model ϕ(t) by means of a linear trend instantaneous frequency F(t): ϕ(t) = 2F(t)t + ˛,

F(t) = F0 +

(4)

m=0

(11)

By inserting (11) into (1) and assuming A(t) is known we obtain:

h(t) = A(t) cos 2 F0 + (n) bm t m ,

F1 t. 2

F1 t t+˛ . 2

(12)

After trigonometrically decomposing (12) and applying the approximations (6) and (7) we can write: h(t) = (− sin ˛ − F1 t 2 cos ˛)A(t) sin 2F0 t

tm.

(5)

+ (cos ˛ + F1 t 2 sin ˛)A(t) cos 2F0 t.

(13)

588

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

The last expression shows that the short-term quasi-periodic behavior of the FHS components can be described by two parabolas in a short time interval. Observe that for F1 = 0 the oscillatory character of the FHS components becomes periodic with the period 1/F0 . It turns out, however, that (13) needs additional information about the FHS components localization for the sake of implementation (recall that (13) is limited to short time intervals between 40 ms and 100 ms approximately). Accordingly, we would ﬁrst have to localize and delimit the FHS components by examining A(t) and then estimate the model parameters for each FHS in the analysis window (compare this approach to (8)–(10) which model A(t) in the whole analysis window). Moreover, the computational time would strongly depend on the number of FHS components detected in the analysis window, rendering the approach unﬁt for the present application. An adequate way to overcome the aforementioned shortcomings is by using piecewise polynomial approximation in the whole analysis window. Piecewise polynomials, among which uniform splines are the most popular, have very convenient properties out of which we mention a few: (1) an accurate modeling in a short time interval with a low-degree polynomial, (2) continuity of derivatives of a certain order in the data points, (3) efﬁcient calculation due to uniform spacing [22–24]. For the present application we have chosen quadratic splines, which are represented as a linear combination of uniformly time-distributed third-order B-splines. Such a spline forms an approximation basis which perfectly ﬁts to the short-time quadratic polynomials in (13). Each third-order B-spline is constructed over three adjacent uniformly spaced time knots as follows:

Br,2 (t) =

⎧ ⎪ 0.5(t − tr )2 , ⎪ ⎨ ⎪ ⎪ ⎩

t ∈ [tr+1 , tr+2 ]

0.5(1 − (t − tr+2 ))2 ,

t ∈ [tr+2 , tr+3 ],

(14)

where Br,2 (t) is a third-order B-spline, r = 1, . . ., R is a knot index and R is a number of knots in the analysis window. Accordingly, (13) can be rewritten as: h(t) =

R

A = (S(1) S(2) . . . S(N) C(1) C(2) . . . C(N) ) × (s1 s2 . . . sN c1 c2 . . . cN )T , (16) (n)

lmod(2M+1)

sin(2nf0 tk ),

(17)

(n)

lmod(2M+1)

cos(2nf0 tk ),

(18)

Sk,l = tk

Ck,l = tk

for l = 0, . . ., 2N(M + 2), n = 1, . . ., N and k = 0, . . ., K − 1 where K is the number of data samples and T is the transpose operator. The vector A contains the output of the 10 Hz cut-off low-pass Butterworth ﬁlter while the subvectors s1 , . . ., cN contain the polynomial coefﬁcients. The splines coefﬁcients r and r are estimated from the following system written in the matrix form: T

HS = (1 2 . . . R 1 2 . . . R ) × (␰) ,

(19)

k,r = Br,2 (tk )A(tk ) sin 2F0 tk

(20)

k,r = Br,2 (tk )A(tk ) cos 2F0 tk

(21)

where r = 1, . . ., R and the vector HS contains the input data. Both overdetermined linear systems (16) and (19) can be represented as a linear least-squares problem which can easily be solved using, for instance, the QR factorization or singular value decomposition. 3. Results

t ∈ [tr , tr+1 ]

−(t − tr+1 )2 + (t − tr+1 ) + 0.5,

cut-off band-pass for F0 estimation. In both cases a third-order Butterworth ﬁlter was used. The rest of the parameters are estimated from two linear sets of Eqs. (8) and (15), respectively. The coefﬁcients in sn (t) and cn (t) are estimated from the following matrix system:

In order to best assess the validity of the proposed modeling method under different analysis circumstances, we have carried out a thorough comparative study in terms of HS localization as well as LS denoising by means of both artiﬁcial and real data mixtures. In the following sections we describe in detail all the issues related to the experiments and discuss the results. 3.1. Reference methods

r Br,2 (t)A(t) sin 2F0 t +

r Br,2 (t)A(t) cos

2F0 t,

(15)

r=1

where the parameters r and r are the B-spline coefﬁcients. Regarding the last expression we give a few remarks. First, observe that (15) provides not only the phase model but also the full model for HS signals. Accordingly, HS removal from LS is carried out by a simple h(t) subtraction from the original LS signal. Second, (15) can be useful in applications where HS recordings are corrupted by external noises (skin contact, muscular activity, surrounding speech) e.g. [25]. Third, the choice of the uniform time-distribution of Br,2 (t) has been motivated by its low computational complexity, although some studies show that non-uniform distribution can achieve more optimal signal approximations in terms of the residual norm [26–28]. 2.4. Parameter estimation In order to fully characterize a HS signal the following set of parameters must be estimated: the coefﬁcients of the polynomials sn (t) and cn (t) in (8), the scalars r and r in (15) and the frequencies f0 and F0 . The frequencies are straightforwardly estimated by the classical auto-correlation based approach [29] after the HS has been adequately ﬁltered: 10 Hz cut-off low-pass for f0 and 20–150 Hz

Regarding HS localization in LS signals we have compared the proposed method to [9]. This method relies on a singular spectrum analysis approach (SSA) which aims at decomposing HS–LS mixtures onto two signal subspaces by means of detecting different trends in the eigenspectra. As shown in [9], the SSA approach seems to be more suitable for HS localization than the multiresolution decomposition and entropy-based approach. As for LS denoising we have chosen [14] as a reference method. The method performs HS localization and LS denoising through multiscale product (MP) and linear prediction (LP), respectively (in the present study we will refer to this method as the MP–LP). First, it completely removes segments which contain HS, thus creating gaps in the wavelet decomposition of the LS signal; next, the gaps are “ﬁlled” through LP under the assumption of a non-stationary LS. The results show that the MP–LP yields a fairly good HS suppression in terms of power spectral density for low and medium ﬂow rates. 3.2. Signals For the present study we have used two sets of test signals. The ﬁrst set comprises real HS and LS recordings from [30,31] which we have artiﬁcially mixed in order to simulate different analysis scenarios. By adjusting the Signal-to-noise ratio (SNR) between the LS

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

1 0.8

Normalized amplitude

and HS we could simulate different levels of ﬂow rates. In particular, medium ﬂow rates were simulated by the SNR in 5–7 dB while for high ﬂow rates the SNR was in the range 7–9 dB. Moreover, the knowledge of the underlying HS signal provided a quantitative study in terms of a set of parameters enumerated in the following subsection. An illustration of these test signals is given in Fig. 2 where the following time-waveforms are plotted: (a) heart signal, (b) lung signal and (c) their mixture for a medium ﬂow rate. The second set of test signals included LS recorded from four healthy subjects aged between 30 and 45. The signals were captured by a piezoelectric accelerometer Murata which was placed between the second and third intercostal spaces at the left sternal border. Five recordings (approximately 30 s each) were made for all subjects, which were instructed to breathe at medium and high ﬂow rates. The signals were digitized at 44.1 kHz with 16 bits precision and later downsampled to fs = 4410 Hz.

0.6 0.4 0.2 0 -0.2 -0.4 -0.6 -0.8 -1 0

1

2

3

3.3. Performance evaluation parameters

0.8 0.6

t =

t h2 (t ) k=q k FHS k , p h2 (t ) k=q FHS k

P

t hˆ 2 (t ) k=Q k FHS k

P

hˆ 2 (t ) k=Q FHS k

6

7

8

9

,

(23)

6

7

8

9

6

7

8

9

1

Normalized amplitude

(22)

p

tˆ =

5

(a)

• Duration error[samples] = fs−1 |(tp − tP ) − (tq − tQ ). • Position error[samples] =

4

Time [s]

3.3.1. HS localization Let us denote an original FHS signal component as hFHS (t), tq ≤ t ≤ tp and the corresponding component detected from a mixˆ FHS (t), tQ ≤ t ≤ tP . The quality of localization for a given FHS ture as h is measured by means of the following parameters:

fs−1 |t − tˆ|,

589

0.4 0.2 0 -0.2 -0.4 -0.6

where the operator “<·>” stands for the mean time or center of gravity of the detected component [32]. The center of gravity is more accurate than a simple geometric center because it points at the maximum energy time-concentration of the underlying FHS component.

-0.8 -1 0

1

2

3

4

5

Time [s]

⎛

q

h2 (t ) k=p FHS k

SDRdB = 10 log10 ⎝

2 q ˆ (hFHS (tk ) − hFHS (tk )) k=p

(b)

⎞ ⎠

(24) 1

which measures time-waveform similarity between the original and estimated FHS component. • False positive and false negative errors [%] which evaluate the methods’ performance as a FHS component detector. 3.3.2. HS suppression Denoting the PSD of a clean LS as Y(fu ) and the PSD of a denoised ˆ (fu ), we measure the quality of HS removal from a LS signal by LS as Y the Power spectrum density difference (PSDD) in dB evaluated in a set of logarithmically spaced frequency bands. For a band delimited by the frequency indices [u1 , u2 ] the PSDD is calculated as follows:

u

u=u1

Y (fu )

u=u1

Yˆ (fu )

2

PSDDdB = 10 log10

u2

0.8

Normalized amplitude

• Signal-to-distortion ratio (SDR) [dB]:

0.6 0.4 0.2 0 -0.2 -0.4 -0.6 -0.8 -1

(25)

3.4. HS localization – comparative study The ﬁrst part of the study was performed on the artiﬁcially mixed signals by means of the duration error, position error and

0

1

2

3

4

5

Time [s]

(c) Fig. 2. Signals used for the HS localization comparative study: (a) heart signal, (b) lung signal and (c) their mixture for a medium ﬂow rate.

590

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

MEDIUM FLOW

HIGH FLOW SSA modeling

0.8

0.6

0.4

0.2

0

SSA modeling

1

Norm. histogram

Norm. histogram

1

0.8

0.6

0.4

0.2

0

5

10

15

20

25

30

35

0

40

0

10

20

Position error [samples]

SSA modeling

SSA modeling

1

0.8

Norm. histogram

Norm. histogram

60

HIGH FLOW

1

0.6

0.4

0.2

0.8

0.6

0.4

0.2

0

20

40

60

80

0

100

0

50

Duration error [samples]

100

150

200

Duration error [samples]

(b)

(e)

MEDIUM FLOW

HIGH FLOW SSA modeling

1

SSA modeling

1

0.8

Norm. histogram

Norm. histogram

50

(d)

MEDIUM FLOW

0.6

0.4

0.2

0

40

Position error [samples]

(a)

0

30

0.8

0.6

0.4

0.2

2

4

6

8

10

SDR [dB]

(c)

12

14

16

18

0

0

2

4

6

8

10

12

14

16

18

SDR [dB]

(f)

Fig. 3. Position error, duration error and SDR histograms obtained for the SSA and proposed modeling method: (a–c) correspond to medium ﬂow rates; (d–f) correspond to high ﬂow rates.

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

SIGNAL

591

SIGNAL SEGMENT

2.5

2

2

1.5

1.5 1

Amplitude

Amplitude

1 0.5 0 -0.5

0.5 0 -0.5

-1 -1

-1.5 -1.5

-2 -2.5

0

1

2

3

4

5

6

7

8

-2

9

5

5.5

Time [s]

(a)

SSA 1

2

0.8

1.5

0.6

1

0.4

Amplitude

Amplitude

LS ESTIMATED

0.5 0 -0.5

0.2 0 -0.2

-1

-0.4

-1.5

-0.6

-2

-0.8

0

1

2

3

4

5

6

7

8

-1

9

5

5.5

Time [s]

MODELING 1

0.8

0.8

0.6

0.6

0.4

0.4

Amplitude

Amplitude

HS ESTIMATED

0.2 0 -0.2

0.2 0 -0.2

-0.4

-0.4

-0.6

-0.6

-0.8

-0.8

1

2

3

4

5

Time [s]

(c)

6.5

(e)

1

0

6

Time [s]

(b)

-1

6.5

(d)

2.5

-2.5

6

Time [s]

6

7

8

9

-1

5

5.5

6

6.5

Time [s]

(f)

Fig. 4. High ﬂow HS–LS separation illustration: (a) mixture signal, (b and c) estimated LS and HS by the modeling method, (d) original signal segment, (e) the segment processed by the SSA, and (f) the segment processed by the modeling method.

592

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

Table 1 Position error, duration error and SDR for the artiﬁcial mixtures. Position error (samples) SSA (low ﬂow) Modeling (low ﬂow) SSA (medium ﬂow) Modeling (medium ﬂow) SSA (high ﬂow) Modeling (high ﬂow)

± ± ± ± ± ±

4.88 2.49 15.07 4.20 23.33 6.98

Duration error (samples)

3.06 1.91 8.72 3.84 13.33 5.57

6.21 5.98 21.11 6.33 68.55 6.66

± ± ± ± ± ±

2.66 2.80 34.44 7.58 32.62 8.73

Table 4 Mean execution time (s) as a function of the number of samples on an Intel 3.19 GHz processor.

SDR (dB) 7.93 13.22 5.49 11.14 5.90 8.34

± ± ± ± ± ±

1.80 2.71 2.13 2.23 1.72 2.22

Table 2 False positives and negatives (%) for the artiﬁcial mixtures. False positives (%) SSA (low ﬂow) Modeling (low ﬂow) SSA (medium ﬂow) Modeling (medium ﬂow) SSA (high ﬂow) Modeling (high ﬂow)

0.86 0.91 2.84 1.58 52.53 3.21

± ± ± ± ± ±

0.30 0.53 1.72 1.27 22.28 2.78

False negatives (%) 0.70 0.62 1.84 1.40 15.84 2.67

± ± ± ± ± ±

0.23 0.41 1.05 1.52 8.36 1.90

Number of samples

10,000

20,000

30,000

40,000

SSA Modeling

2.05 0.27

3.54 0.53

4.79 0.65

6.23 0.82

percentage errors. For each subject and ﬂow rate these parameters were calculated and the results are shown in Tables 2 and 3. We can see that in general for medium ﬂow rates both methods exhibit statistically large scores (small false errors percentages). However, for high ﬂow rates the SSA undergoes an important degradation due to a stronger impact of the LS onto HS localization. The reason behind this effect is the principal component (PC) pairs selection criterion which interprets the smallest eigenvalues as noise and largest as signal. It turns out that this criterion does not hold for high ﬂow rates, as the noise starts generating larger eigenvalues

MEDIUM FLOW

Table 3 False positives and negatives (%) for the real mixtures from four subjects.

False positives (%) SSA (low ﬂow) Modeling (low ﬂow) SSA (medium ﬂow) Modeling (medium ﬂow) SSA (high ﬂow) Modeling (high ﬂow) False negatives (%) SSA (low ﬂow) Modeling (low ﬂow) SSA (medium ﬂow) Modeling (medium ﬂow) SSA (high ﬂow) Modeling (high ﬂow)

Subject 1

Subject 2

Subject 3

Subject 4

0.87 0.75 2.16 1.27 62.76 3.82

0.95 0.82 1.86 1.76 58.93 2.20

1.13 1.04 3.06 0.91 46.80 1.68

0.79 0.80 1.88 1.31 55.48 2.09

0.68 0.54 1.98 1.46 19.74 3.05

0.61 0.48 1.86 1.36 14.17 1.28

0.93 0.73 2.88 0.72 13.29 2.81

0.55 0.68 1.41 1.50 22.10 4.72

4

Modeling MP-LP

3.5

Average PSDD [dB]

3 2.5 2 1.5 1 0.5 0

1

2

3

4

Frequency band

(a) HIGH FLOW 6

Modeling MP-LP 5

Average PSDD [dB]

SDR. A total of 100 10-s mixtures were generated representing different medium and high-ﬂow rate scenarios. These signals were processed by the SSA and proposed modeling method and the outcomes are represented as normalized histograms. For the modeling method we used M = 1, R = 20 (expressions (4) and (15)) while the number of harmonics in A(t) was chosen according to the following criterion. The lower limit on FHS duration is assumed 40 ms: hence, the envelope bandwidth spans up to 1/40 ms = 25 Hz approximately. By taking the mean cardiac period as 1 s, we conclude that A(t) can be properly modeled with N = 25 harmonics. In addition, we used the same adaptive thresholding criterion as in [9] for delimiting components in the estimated HS signals. Regarding the SSA method, the same parameters as those in [9] were used. The histograms are shown in Fig. 3 where the left column (a–c) corresponds to the medium and right column (d–f) to high ﬂow rates. Judging by the histograms’ shape and localization on the horizontal axes we can say that the modeling method outperforms the SSA regarding the three performance evaluation parameters. For both medium and high ﬂow rates the modeling method yields position and duration error histograms with a smaller mean and deviation than those of the corresponding SSA histograms. Regarding the SDR both methods give rise to a similar deviation but the modeling method achieves larger mean values. The concrete mean and deviation values for the histograms are given in Table 1. The second part of the present study concerned the artiﬁcial and real mixtures processed in terms of false positive and negative

4

3

2

1

0

1

2

3

4

Frequency band

(b) Fig. 5. Average PSDD for the MP–LP and modeling method under (a) medium and (b) high ﬂow rates. The frequency bands 1–4 correspond to: 20–40, 40–70, 70–150 and 150–300 Hz.

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

which strongly overlap with the signal’s eigenvalues. Consequently, the selected eigenvalue pairs contain a signiﬁcant amount of noise which hampers the S1–S2 component localization by means of adaptive thresholding [9]. According to Table 3 the false positives detection increases signiﬁcantly over the false negatives detection. This effect is probably due to the envelope distortion through a number of spikes clustered around the local energy maxima of LS. The modeling method, nevertheless, proves to be more robust against the LS, as its statistical measures clearly indicate. Fig. 4 shows a high ﬂow rate time-waveform example in order to illustrate the methods’ performance. In subﬁgures (a–c) the original respiratory signal, the estimated LS and the estimated HS for the modeling method, respectively are shown. We can see that the estimated HS still contains certain LS residual which cannot be completely removed. Nevertheless, the FHS components can be clearly identiﬁed and used for further processing. In subﬁgures (d–f) we show, for the sake of comparison, a short segment (around 2 cardiac cycles) of the respiratory signal and the output of the SSA and modeling algorithm, respectively. We see that the modeling method is more robust against LS than the SSA regarding both localization and characterization of FHS components. Furthermore, we have compared the SSA and modeling method in terms of mean execution time as a function of the number of

593

samples to be processed. The results are obtained on an Intel 3.19 GHz processor and given in Table 4. Observe that the modeling method outperforms the SSA by almost an order of magnitude independently on the record length. Although this outcome might come as a surprise, it is in fact due to the singular value decomposition (SVD) computation of a large trajectory matrix in the SSA method [9]. 3.5. HS suppression – comparative study This study was performed on the same signals from the previous subsection, but herein the modeling method was compared to the MP–LP in terms of the PSDD parameter. The PSDs were calculated by the Welch method using a 200 ms Hamming window with 50% overlap. The PSDD was evaluated in the frequency subbands 20–40, 40–70, 70–150 and 150–300 Hz and averaged over all trials. Regarding the artiﬁcially mixed signals the results are shown in Fig. 5 for (a) medium and (b) high ﬂow rates. Both methods seem to obey a similar trend which exhibits the minimum HS suppression capacity in 40–70 Hz (the maximum PSDD). This is due to the fact that this band contains most of the HS energy; therefore, it will enclose the major part of the HS residual after denoising. The modeling method is superior in the whole analysis range by

SIGNAL

MP-LP

1.5

1 0.8

1

0.6 0.4

Amplitude

Amplitude

0.5

0

-0.5

0.2 0 -0.2 -0.4 -0.6

-1

-0.8 -1.5

0

1

2

3

4

5

6

7

8

-1 0

9

1

2

3

Time [s]

(a)

0.8

0.8

0.6

0.6

0.4

0.4

0.2 0 -0.2

-0.6

-0.8

-0.8

5

Time [s]

(b)

9

6

7

8

9

-0.2 -0.4

4

8

0

-0.6

3

7

0.2

-0.4

2

6

MODELING 1

Amplitude

Amplitude

LS

1

5

(c)

1

-1 0

4

Time [s]

6

7

8

9

-1 0

1

2

3

4

5

Time [s]

(d)

Fig. 6. Illustration of LS estimation: (a) mixture signal, (b) LS, (c) estimated LS by the MP–LP method, (d) estimated LS by the modeling method.

594

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595

MEDIUM FLOW -30

Clean LS MP-LP Modeling

Average PSD [dB]

-35 -40 -45 -50 -55 -60 -65

signal estimation, as most of the energy in the bursts is preserved (compare (c) and (d) to (b)). In order to support the above discussion we have compared average PSDs of the real mixtures, clean LS and estimated LS across the methods. The PSDs of the clean LS were obtained by processing the segments of the LS signals which did not contain HS. The PSDs for medium and high ﬂow rates are shown in Fig. 7(a and b), respectively. We can see that below 200 Hz approximately the MP–LP introduces important differences between the clean and estimated LS for medium ﬂow rates. For high ﬂow rates this effect spreads up to 300 Hz. The LS estimated by the modeling method resembles the clean LS in almost whole analysis range, except for the small differences at low frequencies (below 100 Hz). Therefore, we state that the modeling method is more robust against LS than the MP–LP in terms of HS suppression for medium and high ﬂow rates.

2

10

4. Conclusions

Frequency [Hz]

(a) HIGH FLOW -25 Clean LS MP-LP Modeling

Average PSD [dB]

-30 -35 -40 -45 -50 -55 -60

2

10

Frequency [Hz]

(b) Fig. 7. Average PSD from the subjects for the MP–LP and modeling method under (a) medium and (b) high ﬂow rates.

We have presented a novel approach to HS localization and suppression in LS by means of quasi-periodic modeling. We have shown that inherent quasi-periodicities in HS (envelope and phase) can be compactly modeled by a joint action of single and piecewise polynomials. Such a modeling scheme allows for a correct HS characterization even in conditions of large LS. The model parameters are estimated in an efﬁcient fashion from an overdetermined linear system of equations. The HS model has proven to be capable of ﬁtting well to data. Considering LS localization we have shown that the proposed modeling method outperforms a singular spectrum analysis method in terms of fundamental HS component position, duration and signalto-distortion ratio. The differences become even more prominent for high ﬂow rates, indicating a very good robustness against LS for the proposed method. As for HS suppression in LS, the proposed method has proven to be better than an approach based on multiscale product and linear prediction. It was shown that in terms of average PSD difference between clean and denoised LS for both medium and high ﬂow rates, the proposed method is situated 2–5 dB below the other method. In view of these results, we think that the proposed method is a good candidate for clinical applications which include LS sound analysis. Competing interests None declared.

approximately 3 dB for low and 4 dB for high ﬂow rates. Such a difference in performance is a result of the errors generated by both HS localization and suppression. The MP–LP FHS localization is based on the multiscale product of the wavelet coefﬁcients of the original signal. Although this approach works well for low ﬂow rates, its performance degrades for medium and especially high ﬂow rates. In addition, the HS suppression “gap-ﬁlling” based approach is also error prone, because connecting two signal segments to new data introduces errors in the whole frequency range, not only around the HS frequencies. The modeling method, on the contrary, provides a very good localization and characterization of the FHS components (Section 2) under both medium and high ﬂow rate conditions (Section 3). Accordingly, HS suppression will only have a small impact on the LS signal in the HS frequencies. As an illustration, we have shown in Fig. 6 the following waveforms: (a) original mixture, (b) original LS, (c) estimated LS by the MP–LP method, and (d) estimated LS by the modeling method. We see that the synthesized LS signal by the MP–LP method turns to be seriously degraded at the position of the bursts. This effect is due to HS detection mechanism which is very sensible to low signal-tonoise ratios. The modeling method, however, generates a better LS

Funding None declared. References [1] S. Reichert, R. Gass, C. Brandt, E. Andres, Analysis of respiratory sounds: state of the art, clinical medicine insight, Circulatory, Respiratory and Pulmonary Medicine 2 (2008) 45–58. [2] P.J. Arnott, G.W. Pfeiffer, M.E. Tavel, Spectral analysis of heart sounds: relationships between some physical characteristics and frequency spectra of ﬁrst and second sounds in normal and hypertensives, Journal of Biomedical Engineering 6 (2) (1984) 121–128. [3] A. Sovijarvi, L. Malmberg, G. Charbonneau, F. Vanderschoot, J. Dalmasso, C. Sacco, M. Rossi, J. Earis, Characteristic of breath sounds and adventitious respiratory sounds, Journal of European Respiratory Review 10 (2000) 591–596. [4] M.T. Pourazad, Z. Moussavi, G. Thomas, Heart sound cancellation from lung sound recordings using time–frequency ﬁltering, Medical & Biological Engineering & Computing 44 (2006) 216–225. [5] L.J. Hadjileontiadis, S.M. Panas, Separation of discontinuous adventitious sounds from vesicular sounds using a wavelet-based ﬁlter, IEEE Transactions on Biomedical Engineering 44 (12) (1997) 1269–1281. [6] Z. Moussavi, D. Flores, G. Thomas, Heart sound cancellation based on multiscale products and linear prediction, in: Proceedings of IEEE Conference on Engineering in Medicine & Biological Society, 2004, pp. 3840–3843.

M. Zivanovic, M. González-Izal / Biomedical Signal Processing and Control 8 (2013) 586–595 [7] C. Ahlstrom, T. Lanne, P. Ask, A. Johansson, A method for accurate localization of the ﬁrst heart sound and possible applications, Physiological Measurement 29 (3) (2008) 417–428. [8] A. Yadollahi, Z.M.K. Moussavi, A robust method for heart sounds localization using lung sound entropy, IEEE Transactions on Biomedical Engineering 53 (3) (2006) 497–502. [9] F. Ghaderi, H.R. Mohseni, S. Sanei, Localizing heart sounds in respiratory signals using singular spectrum analysis, IEEE Transactions on Biomedical Engineering 58 (12) (2011) 3360–3367. [10] M.T. Pourazadi, Z. Moussavi, F. Farahmand, R.K. Ward, Heart sounds separation from lung sounds using independent component analysis, in: Proceedings of IEEE Conference on Engineering in Medicine & Biological Society, 2005, pp. 2736–2739. [11] J. Gnitecki, I. Hossain, H. Pasterkamp, Z. Moussavi, Qualitative and quantitative evaluation of heart sound reduction from lung sound recordings, IEEE Transactions on Biomedical Engineering 52 (10) (2005) 1788–1792. [12] J. Gnitecki, Z. Moussavi, Separating heart sounds from lung sounds: accurate diagnosis of respiratory disease depends on understanding noises, IEEE Engineering in Medicine and Biology Magazine 26 (1) (2007) 20–29. [13] K. Basak, S. Mandal, M. Manjunatha, J. Chatterjee, A.K. Ray, Phonocardiogram signal analysis using adaptive line enhancer methods on mixed signal processor, in: Proceedings of International Conference on Signal Processing and Communications, 2010, pp. 1–5. [14] D. Flores-Tapias, Z.M.K. Moussavi, G. Thomas, Heart sound cancellation based on multiscale products and linear prediction, IEEE Transactions on Biomedical Engineering 54 (2) (2007) 234–243. [15] C. Ahlstrom, O. Liljefeldt, P. Hult, P. Ask, Heart sound cancellation from lung sound recordings using recurrence time statistics and nonlinear prediction, IEEE Transactions on Biomedical Engineering 12 (12) (2005) 812–815. [16] T. Tsalaile, S.M. Naqvi, K.S. Nazarpour, J.A. Chambers, Blind source extraction of heart sound signals from lung sound recordings exploiting periodicity of the heart sound, in: Proceedings of International Conference on Acoustics Speech and Signal Processing, 2008, pp. 461–464.

595

[17] M. Zivanovic, M. Gonzalez-Izal, Non-stationary harmonic modelling for ECG removal in surface EMG signals, IEEE Transactions on Biomedical Engineering 59 (6) (2012) 1633–1640. [18] A. Ahmadian, S. Karimifard, H. Sadoughi, M. Abdoli, An efﬁcient piecewise modelling of ECG signals based on hermitian basis functions, in: Proceedings of IEEE Conference on Engineering in Medicine & Biological Society, 2007, pp. 3180–3183. [19] J. Xu, L.G. Durand, P. Pibarot, Nonlinear transient chirp signal modeling of the aortic and pulmonary components of the second heart sound, IEEE Transactions on Biomedical Engineering 47 (10) (2000) 1328–1335. [20] J. Xu, L.G. Durand, P. Pibarot, Extraction of the aortic and pulmonary components of the second heart sound using a nonlinear transient chirp signal model, IEEE Transactions on Biomedical Engineering 48 (3) (2001) 277–283. [21] A. Djebbari, F. Bereksi-Reguig, A new chirp-based wavelet for heart sound time–frequency analysis, Journal on Communications Antenna and Propagation 1 (1) (2011) 92–102. [22] M. Unser, A. Aldroubi, M. Eden, B-spline signal processing: part I. Theory, IEEE Transactions on Signal Processing 41 (2) (1993) 821–833. [23] M. Unser, A. Aldroubi, M. Eden, B-spline signal processing: part II. Efﬁcient design and applications, IEEE Transactions on Signal Processing 41 (2) (1993) 834–848. [24] C. de Boor, A Practical Guide to Splines, Springer-Verlag, New York, 1978. [25] T. Li, H. Tang, T. Qiu, Y. Park, Best subsequence selection of heart sound recording based on degree of sound periodicity, Electronic Letters 47 (15) (2010) 841–843. [26] L. Chua, A. Deng, Canonical piece-wise representation, IEEE Transactions on Circuits and Systems 35 (1) (1988) 4425–4431. [27] G. Bucci, M. Faccio, C. Landi, New ADC with piecewise characteristic: case study. Implementation of smart humidity sensor, IEEE Transactions on Instrumentation and Measurement 49 (6) (2000) 1154–1166. [28] T. Blu, P. Thevenaz, M. Unser, Linear interpolation revitalized, IEEE Transactions on Image Processing 13 (5) (2004) 471–475. [29] W.J. Hess, Pitch Determination of Speech Signals, Springer, New York, 1993. [30] http://www.med.umich.edu/lrc/psb/heartsounds/index.htm [31] http://darwin.unmc.edu/hnp/m2/breathsounds.htm [32] L. Cohen, Time–Frequency Analysis, Prentice-Hall, Englewood Cliffs, NJ, 1995.

Quasi-periodic modeling for heart sound localization and suppression in lung sounds

Quasi-periodic modeling for heart sound localization and suppression in lung sounds

Recommend Documents