Spectral estimation method for noisy data using a noise reference

Applied Acoustics 72 (2011) 11–21 Contents lists available at ScienceDirect Applied Acoustics journal homepage: www.elsevier.com/locate/apacoust Sp...

Download PDF

2MB Sizes 0 Downloads 124 Views

Report

PDF Reader
Full Text

Applied Acoustics 72 (2011) 11–21

Contents lists available at ScienceDirect

Applied Acoustics journal homepage: www.elsevier.com/locate/apacoust

Spectral estimation method for noisy data using a noise reference Daniel Blacodon ⇑ ONERA, BP 72, Châtillon 92322, France

a r t i c l e

i n f o

Article history: Received 7 December 2009 Received in revised form 15 June 2010 Accepted 14 September 2010

Keywords: Source localization Array processing Background noise

a b s t r a c t The development of array processing methods to extract the useful characteristics of acoustic sources such as their locations and absolute levels, starting from the measured sound ﬁeld is one of the main issues in aero-acoustics. The conventional beamforming method is a very popular technique investigated to solve the power level estimation problem. It has the advantage of being robust, easy to implement and cheap in computation time. However, this technique is also known for having poor spatial resolution capabilities which prevents the correct source levels being obtained for numerous practical applications. Deconvolution techniques of the result computed with CBF, with the point spread function of the array manifold, may restore the power levels of the acoustic sources that would be observed in the absence of the array resolution effects. However, the accuracy of the results provided by deconvolution methods is very sensitive to background noise, always present in acoustic measurements. This process should be carried out after the additive noise has been suitably attenuated and, ideally, the deconvolution operator should amplify the noise as little as possible. Another approach is described in the article. It consists in using a noise reference and a new technique called spectral estimation method with additive noise to remove both the smearing effect produced by the array response and the background noise. The technique has been applied to computer and experimental simulations conducted both in an anechoic chamber and in the test section of an open wind tunnel involving acoustic sources radiating in a noisy environment. The levels of the sources were found with a good level of accuracy and the background noise greatly reduced, conﬁrming the validity of the approach and the satisfactory performance of the method proposed. Ó 2010 Elsevier Ltd. All rights reserved.

1. Introduction Array processing is a powerful tool to extract noise source characteristics, both in terms of locations and levels in the frequency band of interest, from the acoustic signals measured during wind tunnel tests. For many years this means of investigation has been commonly used to study airframe noise [1–3], which is a signiﬁcant contributor to the overall acoustic emissions of a civil aircraft when landing. The reduction of aerodynamic noise is a priority for the civilian aeronautics industry, which in the next decade will be confronted with more restrictive noise regulation in urban ﬂy-over areas. In the classic multiple acoustic sources localization problem the sensor signals produced by the wave ﬁelds are processed in order to isolate the incident sources and estimate their associated positions. A wide variety of techniques have been developed to provide solutions to this problem. Conventional beamforming (CBF) is one of the most popular methods used as a diagnostic method because it supplies a source map in a single step at each computational ⇑ Tel.: +33 (1) 46 73 48 05. E-mail address: [email protected] 0003-682X/$ - see front matter Ó 2010 Elsevier Ltd. All rights reserved. doi:10.1016/j.apacoust.2010.09.004

frequency [4]. However, it should be emphasized that processing based on CBF is the optimum processing for a single monopole. The classic result is that the overall array response to a point source exhibits several sidelobes and a main lobe of which the location and amplitude correspond respectively to the location and amplitude of the point source. For multiple sources, the CBF method is able to perform the localization with a good degree of accuracy in many practical situations [5]. The power levels of the sources are accurately obtained when the sources are separated from each other by at least one 3 dB beam width, and when their amplitudes are larger than those of the side lobes. Unfortunately, the two previous criteria are not satisﬁed in situations of interest like aircraft noise, where the sources under study are continuously distributed, and the beam pattern associated with different sources overlap. In these cases, it is obvious that, because of its poor spatial resolution, the CBF method fails to provide correct levels. The estimation of the source power levels in a multiple source environment is essential in order to characterize spread sources like airframe noise. To solve the problem, we must resort to methods, which improve the overall performance of the CBF. An interesting solution has been suggested [6] for computing the total

12

D. Blacodon / Applied Acoustics 72 (2011) 11–21

Nomenclature c R E[] f J L M Nq P0 Q t djk Ds

r2N s Xq(f)

340 m/s is speed of sound in the propagation medium distance between the source region and the array expectation operator frequency in Hertz number monopole sources to model D length of the array number of microphones of the array number of monopole sources in the source area q 20 lPa number of source areas in D time in seconds Kronecker delta minimum distance between two monopole sources variance of the white background noise time delay actual power spectral density of acoustic source area q

output of distributed sources. It consists in recovering the actual source power levels of speciﬁed areas from the CBF output, knowing the characteristics of the array pattern. The undesirable effects of smearing by the array pattern are removed from the output of the CBF by using a deconvolution technique that considers the beamform. However, the measured data not only contains the noise radiation from the source of interest, but also random noise, which always contaminates measurements. This situation arises, for example, in wind tunnels with a closed test section, where acoustic sources radiate in very a noisy environment, making it difﬁcult to characterize them. Generally, the background noise has a spectrum, which is more extended than the spectra of the source areas, and it can often become dominant as the frequency increases. Consequently, the particular solutions associated with background noise may be superposed onto the solution of our problem, in an unfavourable way, by the normalization operation used to approximate the source power levels, leading to inaccurate results. The problem becomes critical when corrupting noise is changing the actual levels and the spectral distribution in the measured cross-spectral matrix. Besides, undesired dominant noise sources may be present in the corrupting noise, making it more difﬁcult to decide in the localization maps whether the sources with high levels correspond to those of interest or not. In order to get over this drawback, a lot of research on the modeling and removal of the effects of noise and distortion has been done. The main conclusions of these studies are that the success of noise processing methods depends on their ability to characterize and model the noise process, and to use the noise characteristics advantageously to differentiate the signal from the noise [7]. Various techniques such as beamforming (non-adaptive and adaptive) and spatial–temporal ﬁltering can be used to achieve noise reduction [8–10]. In certain applications, it is not possible to access the instantaneous frame of the contaminating noise, and only the noisy signal is available. Thus, the noise cannot be cancelled out, but it may be reduced, in an average sense, using the statistics of the signal and the noise process. In other situations, it is not possible to get a noise reference: the reference signal can either be generated artiﬁcially or extracted from the primary signal or from the imaging results [11]. Sometimes we have access to a noise reference and under some assumptions on the noise characteristics; several strategies exist that can be used to remove its effects. There is an interesting approach [12] that shows how the generalized singular value decomposition can be used to achieve noise reduction. Another popular solution uses the Spectral Subtraction method to reduce the noise

XCBF q ðf Þ

power spectral density of acoustic source area q estimated with CBF XSEM ðf Þ power spectral density of acoustic source area q estiq mated with SEM XSEMWAN ðf Þ power spectral density of acoustic source area q estiq mated with SEWMAN Abbreviations CBF conventional beamforming CSM cross-spectral matrix PSD power spectral density PSS power spectrum subtraction SEM spectral estimation method SEMWAN spectral estimation method with additive noise SPL spectral power level

contamination [13]. It is based on the subtraction of the short-term spectral magnitude of noise from that of the noisy spectrum. A generalized form of the basic spectral subtraction is given in [14] where power spectral subtraction uses short-time power spectrum estimates instead of a magnitude spectrum. Another interesting solution consists in removing the main diagonal of the array cross-spectral matrix before performing the imaging processing [15]. However, this solution is tractable when the background noise has both the same variance on all the microphones of the array and is also uncorrelated between the microphones. The present study is focused in the presentation on an extension of the spectral estimation method (SEM) which is based on a prior knowledge of a noise reference. This new method called spectral estimation method with additive noise (SEMWAN) has the advantage of being able to reduce the smearing effect due to the array response and at the same time the inaccuracy of the results caused by noise sources which can be coherent (i.e. the phase difference between the wave ﬁeld emitted by the source is constant) as well as incoherent (i.e. the phase difference between the wave ﬁeld radiated by the source is random), with high or low levels. This method is well suited to applications in a wind tunnel since we can, for example, get a noise reference or record the environmental noise before installing the models in the test section. This paper is organised as follows: Section 2 begins with a presentation of the problem we want to solve. Section 3 is devoted to a description of the array and model cross-spectral matrices used in the development of the methods considered in this study. A theoretical background for the estimation of the level of the acoustic sources with SEM and CBF, starting from noiseless data, is given in Section 4. The performances of CBF and SEM are compared in Section 5 using noiseless data. Section 6 presents the limitations of SEM when this method is used with noisy data. A short presentation of the Power Spectrum Subtraction is made in Section 7 and the principle of SEMWAN described in Section 8. SEM and SEMWAN are applied on numerical simulations, experimental simulations and in tests conducted in an open wind tunnel in Sections 9–11 respectively.

2. Problem of source levels estimation with noisy data Two extended sources S1 and S2 of power spectral densities (PSDs) X1(f) and X2(f) radiate in a very noisy environment. It is characterized by a coherent noise S3 of PSD X3(f), which can be between the acoustic sources or at the same location as one of these,

13

D. Blacodon / Applied Acoustics 72 (2011) 11–21

Fig. 1. Geometry of the problem of acoustic SPLs estimation using noisy data.

and an incoherent broadband noise SN of variance r2N , as depicted in Fig. 1. The objective here is to ﬁnd the location of S1 and S2 and estimate their absolute levels X1(f) and X2(f), starting from the noisy signals and the measurement of a noise reference. 3. Array and model cross-spectral matrices The methods studied in this paper are based on an array matrix and a model cross-spectral matrix and these are described below. 3.1. Array CSM

that an extended noise source may only be viewed as an equivalence class between source functions radiating the same pressure ﬁeld on a phased array. A valuable equivalent source model could be a distribution of correlated monopoles, allowing the characterization of any directivity. However, the problem of ﬁnding both the amplitude of the sources and their correlation function in amplitude and phase involves a very large number of parameters, and it is not easy to solve with a limited number of microphones. In many applications, a majority of noise sources have smooth directivity patterns. This means that if the aperture angle of an array of microphones seen from the overall source region is not too large, the directivity pattern of each region may be considered as isotropic within this aperture. Therefore, a model based on a set of uncorrelated monopoles, like that used in the paper, is quite appropriate to study airframe noise. Let us consider that each monopole is characterized by a source function ai(t). The correlation function between two monopoles is:

C i;j ðsÞ ¼ E½ai ðtÞaj ðt þ sÞ ¼ C i ðsÞdi;j

The power spectral density of ai(t) is Ai(f), the Fourier Transform of Ci(s) (Eq. (3)). The problem of the source level estimation is then to ﬁnd the J parameters Ai(f) of the model for each frequency. In order to proceed, it is necessary to deﬁne the general structure of the array CSM for uncorrelated monopole sources, and then choose the parameters Ai(f) such that the model CSM ﬁts the SPLs measured as well as possible. To compute the model CSM, we begin by expressing the pressure radiated on the array:

pmod ðtÞ ¼ n

We consider an overall source domain D located in the (x, y) plane split into Q effective noise radiating areas Dq. An array of M microphones located in front of D records the radiated pressure. We assume that the acoustic sources are random and stationary. Let pmes n ðtÞ be the pressure recorded by the microphone n. The cross-correlation function C mes m;n ðsÞ between the microphone signals n and m is deﬁned as: mes mes C mes m;n ðsÞ ¼ E½pm ðtÞpn ðt þ sÞ

ð1Þ

Cmes m;n ðf Þ ¼

Z

1

1

2pjf s C mes ds m;n ðsÞe

½C

mes

6 ðf Þ ¼ 4

C

mes 1;1 ðf Þ

C

mes 1;M ðf Þ

Cmes Cmes M;1 ðf Þ M;M ðf Þ

ð4Þ

Cmod m;n ðf Þ ¼

J X

Gm;i Ai ðf ÞGn;i

ð5Þ

i¼1 j2pfRmi c

with Gm;i ðf Þ ¼ e conjugation.

Rmi

, the superscript stands for the complex

ð2Þ

The measured array CSM of all signals pmes m ðtÞ (m = 1, 2, . . . , M) is deﬁned by

2

J X ai ðt Rin =cÞ Rin i¼1

where Rin is the distance from the source i to the microphone n and J is the number of monopole sources considered for the description of all acoustic sub-regions. The model cross-power spectrum Cmod m;n ðf Þ, between the output signals of the mth and nth microphones, can be expressed as follows:

mes m;n ðf Þ

mes The cross-power spectral C between pmes m ðtÞ and pn ðtÞ is the Fourier Transform of C mes m;n ðsÞ (Wiener–Khintchine theorem):

ð3Þ

3 7 5

4. Source level estimation with noiseless data using SEM Consider ½Cmes ðf Þ, the array cross-spectral matrix, and ½Cmod ðf Þ, the model cross-spectral matrix due to the acoustic sources of component Cmod m;n ðf Þ (Eq. (5)). In SEM [5], the problem of ﬁnding the PSDs Ai(f) is based on the following cost function:

EðAðf ÞÞ ¼ 3.2. Model CSM In practical situations, it is not the details of noise sources such as the slats, the ﬂaps and the landing gears for an aircraft at landing that are needed, but their directivity pattern in the far ﬁeld to assess their contribution to the overall noise radiated. However, the problem is complex if we want to extract the directivity pattern of different extended source regions radiating simultaneously from the phased array data. At this point, the concept of extended noise source measured through its radiation must be clariﬁed to get a better understanding of the true nature of the problem that we want to solve. It has been shown [16] that the measurement of the acoustic ﬁeld outside the source region is not sufﬁcient to determine the characteristics of the source uniquely. This implies

2 J M X X mes Gm;i ðf ÞAi ðf ÞGn;i ðf Þ Cm;n ðf Þ m;n¼1 i¼1

ð6Þ

The minimization of the mean squares error E(A(f)) between the array and model matrices with respect to Ai(f) gives the square linear system: J X

V i;j ðf ÞAi ðf Þ ¼ U j ðf Þ

ð7Þ

j¼1

P mes with U j ðf Þ ¼ M m;n¼1 Gm;j ðf ÞCm;n ðf ÞGm;j ðf Þ; 2 P V i;j ðf Þ ¼ Nm¼1 Gm;i ðf ÞGm;j ðf Þ . b i ðf Þ of the deThe solutions of Eq. (7) provide the estimates A sired PSDs Ai(f). However, the Ai(f) obtained after the minimization

14

D. Blacodon / Applied Acoustics 72 (2011) 11–21

process are not guaranteed to be positive. This can lead to physically unrealistic solutions for the recovered spectra (i.e. spectra with negative amplitudes). Adding a positivity constraint on parameters Ai(f) can easily circumvent this drawback. b i ðf Þ depends on the variance of However, the accuracy of the A the background noise. So we consider that the acoustic sources radiate in an ideal medium without background noise. In this case, the array cross-spectral matrix [Cmes] may be assumed equal to [CS], the source cross-spectral matrix that would be obtained with noiseless data. Thus, Eq. (7) may be written as: J X

V i;j ðf ÞAi ðf Þ ¼ U Sj ðf Þ

ð8Þ

j¼1

with U Sj ðf Þ ¼

PM

m;n¼1 Gm;j ðf Þ

Cmod m;n ðf ÞGm;j ðf Þ.

is composed of 15 microphones equally spaced every Dx = 0. 1 cm: the length of the array is L = 1.4 m (Fig. 1).

5.1. Two sources spaced more than 3 dB beam width apart In the ﬁrst test, S1 and S2 radiate at 4 kHz, so that their spatial separation DS = 0.2 m (i.e. DS = 2.35k, k = c/f = 0.085 m) is more than a standard beam width Dx3 dB ¼ 1:34k. The result found with CBF exhibits two main peaks (Fig. 2 upper plot). Their locations and amplitudes coincide with those of the actual sources S1 and S2. The spatial resolution of SEM is much better than that of the CBF (Fig. 2 lower plot) due to the spurious effect of the array response that has been removed from the result by solving the deconvolution

b i ðf Þ obtained by solving Eq. (8) will be used to The estimates A characterize the actual levels of the acoustic sources as it has been shown with numerical simulations [5]. 4.1. Source power estimation with SEM and CBF In this paper, we compare the results provided by SEM and CBF. It is thus necessary to deﬁne the total power level output from the source areas for both methods. 4.1.1. Source power estimation with SEM The total power level output XSEM ðf Þ of the source area Dq is obq b i ðf Þ of all monopole sources tained by summing the power levels A that are used in the modeling of sub region Dq:

XSEM q ðf Þ ¼

Nq X

b i ðf Þ A

ð9Þ

i¼1

The spectral estimation method (SEM) is the overall technique aiming to ﬁnd XSEM ðf Þ. q 4.1.2. Source power levels estimation with CBF In the case of CBF, the estimation of the amplitude As(f) of a single compact source at location S is based on the following relation (more details are given in [4]): mes þ c ðf ; iÞ ¼ Gi ðf Þ½C ðf ÞGi ðf Þ W kGi ðf Þk4

Fig. 2. Scenario of two uncorrelated monopoles S1 ð70 dBÞ and S2 ð70 dBÞ spaced at more than one 3 dB bandwidth – results provided by the CFB method (upper plot) and SEM (lower plot).

ð10Þ

the vector Gþ i ðf Þ ¼ ½G1;i ðf Þ; G2;i ðf Þ; . . . ; GM;i ðf Þ and the superscript + stands for the transposed complex conjugate. It is pointed out here that the source map obtained by computing Eq. (10) over the source region Dq (for i = 1, . . . , Nq) will show a main lobe centered at the location of S and sidelobes due to the arc ðf ; iÞ is maximum at the location of S (i.e. for i = S) ray pattern. W c ðf ; iÞ ¼ As ðf Þ. and W The total power level output XCBF q ðf Þ of the source area Dq is obc ðf ; iÞ of all monopole tained by summing the power levels W sources that are used in the modeling of Dq:

XCBF q ðf Þ ¼

Nq X

c ðf ; iÞ W

ð11Þ

i¼1

5. Numerical simulations with noiseless data This section compares SEM and CBF using numerical simulations to show the capacity of both methods to estimate the actual levels of two compact sources S1 and S2 radiating in a noiseless environment. The sources are placed on a line of a length of 2 m, located at a distance R = 2 m. S1 is located at x = 0.1 m and S2 at x = +0.1. Both sources have the same amplitude of 70 dB. The array

Fig. 3. Scenario of two uncorrelated monopoles S1 ð70 dBÞ and S2 ð70 dBÞ spaced at less than one 3 dB bandwidth – results provided by the CFB method (upper plot) and SEM (lower plot).

15

D. Blacodon / Applied Acoustics 72 (2011) 11–21

problem of Eq. (7). However, contrary to the CBF, the amplitudes of the two sources do not result from the maxima of the peaks. An integration over the spatial extended of the two peaks is needed to ﬁnd the real levels of the sources. Indeed, the amplitudes of S1 and S2 are respectively provided with a good accuracy by computing XSEM ¼ 70:13 dB and XSEM ¼ 70:13 dB. 1 2 5.2. Two sources spaced less than 3 dB beam width apart The parameters of the simulation are the same as those used previously, except that here S1 and S2 radiate at 1.5 kHz, so are separated from each other by DS = 0.88k which is less than a standard beam width Dx3 dB ¼ 1:29k. CBF fails to resolve the two sources (Fig. 3 upper plot). However, the result exhibits a power level of 71.56 dB at the location of S1 and S2 (instead of 70 dB as expected). The source locations are correctly found with SEM (Fig. 3 lower plot). The integrated power levels XSEM ¼ 70:0 dB and XSEM ¼ 1 2 70:0 dB over the spatial extended of the two peaks correspond respectively at the actual power levels of S1 and S2. 6. Least squares solution for source level estimation with noisy data Now assuming that the wave ﬁelds radiated by the acoustic sources are contaminated by an additive noise, and that the source and the noise signals are uncorrelated. It follows that the true array cross-power spectral [Cmes,T(f)] obtained in practical cases can be expressed as the sum of [Cmes(f)] and [Cmes,N(f)] which are the cross-spectral matrices of the acoustic sources and the background noise respectively:

½Cmes;T ðf Þ ¼ ½Cmes ðf Þ þ ½Cmes;N ðf Þ

ð12Þ mes

It is again assumed that the cross-spectral matrix [C (f)] of the acoustic sources is equal to ½CS ðf Þ. Therefore the substitution of [Cmes(f)] with [Cmes,T(f)] in Eq. (7) yields: J X

V i;j ðf ÞAi ðf Þ ¼ U Sj ðf Þ þ U Nj ðf Þ

ð13Þ

i¼1

with U Nj ðf Þ ¼

PM

m;n¼1 Gm;j ðf Þ

Cmes;N m;n ðf ÞGm;j ðf Þ.

Eq. (13) may be written with the following compact form as: J X i¼1

" V i;j ðf ÞAi ðf Þ ¼ U Nj ðf Þ 1 þ

U Sj ðf Þ

# ð14Þ

U Nj ðf Þ

We may analyze Eq. (14) as follows: U S ðf Þ

(a) when the power ratio veriﬁes U Nj ðf Þ 1, the estimated j parameters will allow an estimation of the acoustic source levels with a high degree of accuracy, U S ðf Þ

(b) when the power ratio veriﬁes U Nj ðf Þ 1, the estimated j b i ðf Þ will characterize the undesired background parameters A noise, (c) In the other situations, actual PSDs of the acoustic source will be more or less corrupted by background noise.

because the computational complexity of the spectral subtraction is not expensive, it has a major drawback. In fact, the effectiveness of the noise removal process is dependent on obtaining an accurate spectral estimate of the noise signal. The better the noise estimate, the lower the level of residual noise appearing in the modiﬁed spectrum. However, since the noise spectrum cannot be directly obtained, it is necessary to use an averaged estimate of the noise. Hence, there are some signiﬁcant variations between the estimated noise spectrum and the actual noise content present in the instantaneous spectra to denoise. The subtraction of these quantities results in the presence of isolated residual noise levels of large variance. The increase in the variance constitutes an irrevocable distortion in the reconstructed spectra.

8. Spectrum estimation method with additive noise The conclusion drawn in Section 6 is that SEM will give results that are strongly erroneous if it is applied to very noisy data. This drawback may be removed with the spectral estimation method with additive noise which is an improvement on SEM. This implies taking into account the contribution of the background noise in the acoustic measurements, and modifying the optimization problem using Eqs. (6) and (12), so that we obtain.

HðAðf ÞÞ ¼

mes;N jCmes;T m;n ðf Þ Cm;n ðf Þ

m;n¼1

J X

Gm;i ðf ÞAi ðf ÞGn;i ðf Þj2

ð15Þ

i¼1

b mes;N ðf Þ is generally not available The cross-spectral matrix ½ C because the measurements of acoustic sources and noise sources cannot be obtained separately. Nevertheless, in many practical situations we can measure the environmental noise with the array of microphones and thus access a reference noise. This data can be used to compute [Cmes,ref(N)(f)], called a reference noise cross-spectral matrix. At this point in the paper, we should consider several assumptions about the noise reference and the acoustic sources: (a) the noise reference is locally stationary, so that its magnitude measured before the tests without the acoustic sources is equal to its level produced during the tests with the acoustic sources, (b) the source signals and additive noise components are statistically independent. It follows from the above assumptions that [Cmes,ref(N)(f)] can be considered to be very good characterization of [Cmes,N(f)], so we can replace [Cmes,N(f)] with [Cmes,ref(N)(f)] in Eq. (15):

2 J M X X mes;T mes;refðNÞ HðAðf ÞÞ ¼ ðf Þ Gm;i ðf ÞAi ðf ÞGn;i ðf Þ Cm;n ðf Þ Cm;n m;n¼1 i¼1 ð16Þ Let us develop Eq. (16) in the following form:

HðAðf ÞÞ ¼ Bmes ðf Þ 2

J X

U mes;T ðf ÞAi ðf Þ þ 2 i

i¼1

7. Power spectrum subtraction The conclusions of the previous section show that the lower the magnitude of the background noise then the more accurate are the estimates of the sources PSDs. In other words, when noisy data is processed, a reduction of the noise will generally improve the accuracy of array processing methods based on the resolution of Eq. (13). A very simple way to achieve this, when a noise reference is available, consists in subtracting the noise reference from the noisy spectra. However, while this solution is very attractive

M X

þ BN ðf Þ 2Bmes;N ðf Þ þ

J X

Ai ðf ÞV i;j ðf ÞAj ðf Þ

With M X M X

2 mes;T jCmes;T ðf Þ m;n ðf Þj ; U i

m¼1 n¼1

¼

M X M X m¼1 n¼1

U Ni ðf ÞAi ðf Þ

i¼1

i;j

Bmes ðf Þ ¼

J X

Gm;i ðf ÞCmes;T m;n ðf ÞGn;i ðf Þ

ð17Þ

16

D. Blacodon / Applied Acoustics 72 (2011) 11–21

U Ni ðf Þ ¼

M X M X

and from a numerical point of view [19]. An efﬁcient welldebugged software version of the algorithm is available [20]. The minimization of the cost function H(a(f)) with respect to ai(f) provides positive noise-free PSDs as it will be shown in the next section. After this step, the total power level output XSEMWAN ðf Þ of q the source area Dq is obtained by summing the power levels a2i ðf Þ of all monopole sources that are used in the modeling of Dq:

mes;ref ðNÞ Gm;i ðf ÞCm;n ðf ÞGn;i ðf Þ; BN ðf Þ

m¼1 n¼1

¼

M X M X

ðNÞ jCmes;ref ðf Þj2 m;n

m¼1 n¼1

Bmes;N ðf Þ ¼

N X N X

ðNÞ Cmes;T ðf ÞCmes;ref ðf Þ and V i;j ðf Þ m;n m;n

m¼1 n¼1

XSEMWAN ðf Þ ¼ q

2 X M Gm;i ðf ÞGm;j ðf Þ ¼ m¼1

Nq X

a^ 2i ðf Þ

ð19Þ

i¼1

In this form, it appears that the cost function H(A(f)) (Eq. (17)) takes into account: (a) the bias due to the variance of the background noise with BN(f), (b) the background noise that was propagated in the source domain with U Ni ðf Þ, (c) the background that may be correlated with the acoustic sources with Bmes,N(f). The noise free power spectral densities Ai(f) are deﬁned as the solution of the following problem:

b Þ ¼ arg min HðAðf ÞÞ Aðf

We will call this overall technique, to ﬁnd XSEMWAN ðf Þ, the spectral q estimation method (SEMWAN). The result is valid only around the direction ~ u and in the aperture D~ u of the measuring array, seen from the source region. Consequently, in the acoustic far ﬁeld, the SPL at a large distance R from the sources and around the direction ~ u is modeled as:

Pðf ; R~ uÞ ¼

Q X XSEMWAN ðf Þ q q¼1

R2

ð20Þ

In conclusion, XSEMWAN ðf Þ may be considered as the mean direcq tivity pattern of each radiating area q in the direction ~ u averaged on u. Then, rotating the measuring array around the global radiating D~ region enables us to estimate the full directivity pattern Pðf ; ~ uÞ.

A

As for SEM, the power spectral densities Ai obtained after the minimization process (Eq. (16)) are not guaranteed to be positive. This can lead to physically unrealistic solutions for the recovered spectra (i.e. spectra with negative amplitudes). In order to get over this drawback, a new set of parameters is introduced such as a2i ðf Þ ¼ Ai ðf Þ. Thus, the least-squares problem can be formulated as follows:

Hðaðf ÞÞ ¼ Bmes ðf Þ 2

J X

U mes;T ðf Þa2i ðf Þ þ 2 i

i¼1

þ BN ðf Þ 2Bmes;N ðf Þ þ

J X

U Ni ðf Þa2i ðf Þ

i¼1 J X

a2i ðf ÞV i;j ðf Þa2j ðf Þ

ð18Þ

i;j

9. Numerical simulations with noisy data The purpose of this section is to evaluate the capacity of SEMWAN to accurately estimate the levels and locations of two extended acoustic sources S1 and S2 radiating at a frequency of 4 kHz in a noisy environment (Fig. 4). S1 and S2 are composed of 40 uncorrelated monopoles evenly spaced with Ds = 0.005 m. The levels of each monopole source for S1 and S2 are 20 logðA1 Þ ¼ 55 dB and 20 logðA2 Þ ¼ 70 dB respectively, in decibels. The integrated power levels for S1 is given by the following relation:

X1 ¼ 10 log

40 X

A2i ¼ 71:02 dB

ð21Þ

i¼1

H(a(f)) is a polynomial of fourth degree, and its derivative a polynomial of third degree. Now, ﬁnally, two questions arise: on the existence of a unique global maximum (uniqueness of the solution) and on the choice of the initial guess to ﬁnd a global minimum (and not a local one). It is generally difﬁcult to answer these questions theoretically when the estimation problem has nonlinearity characteristics such as the one that is the subject of this paper. Today, we may only say that we have obtained different solutions and minima starting from very different values of the initial guess. However, it has been found in all the simulations performed that the initial guess a2i ðf Þ ¼ U mes;T ðf Þ (Eq. (18)) leads to the global minimum, although i we lack general proof for this observation. Furthermore, the minimization of the cost function H(a(f)) with respect to ai provides positive noise-free PSDs.

If we substitute A1 by A2 in Eq. (21), we ﬁnd that the integrated power level for S2 is equal to X2 ¼ 86:02 dB. The wave ﬁelds generated by the two sources are assumed to be measured by the same array of microphones considered in Section 5. In order to simulate the noisy environment the virtual microphone signals are contaminated by an additive coherent noise source S3 composed of 24 uncorrelated monopole sources, each with a level of 20 logðA3 Þ ¼ 50 dB such that the integrated power levels X3 ¼ 63:8 dB, and also with an incoherent noise SN with a variance in decibels equal to ðr2B ÞdB ¼ 50 dB.

8.1. Source power levels computation The computation procedure consists ﬁrst in sampling the source areas, then in determining the power levels a2i ðf Þ of virtual uncorrelated monopoles located at each of the sampling points, by minimizing the error in Eq. (18). From all the methods available to perform nonlinear optimization [17], we chose an efﬁcient and robust algorithm called the Restarted Conjugate Gradient Method to compute the a2i ðf Þ. This method is well documented from a statistical point of view [18]

Fig. 4. Simulation of two extended sources S1 and S2 contaminated by a coherent noise source S3 and a broadband noise SN with a variance of 50 dB.

D. Blacodon / Applied Acoustics 72 (2011) 11–21

17

9.1. Results with CBF–SEM and SEMWAN for the ﬁrst scenario Fig. 5 shows the result given by conventional beamforming (CBF) and SEM at the emission frequency of 4 kHz for the conﬁguration presented in Fig. 4. Predictably, CBF neither resolves the two extended sources S1 and S2 nor gives their actual levels. In contrast, SEM separates S1 and S2 and correctly characterizes the two sources in terms of both location and levels. It also appears that the coherent spurious source S3 is found with SEM. However, it will be difﬁcult to decide in practical situations if S3 is a source of interest or not. We also observe several unwanted sources due to the incoherent background noise. Fig. 6a shows the noise reference (i.e. S3 and SN) used to compute [Cmes,ref(N)(f)] and in Fig. 6b the result provided by SEMWAN. Now, the spurious sources are removed and S1 and S2 are found at their actual locations with their levels within 1 dB.

Fig. 7. Simulation of two extended sources S1 and S2 contaminated by a coherent noise source S3 and a broadband noise SN with a variance of 100 dB.

9.2. Results with CBF–SEM and SEMWAN for the second scenario We consider in Fig. 7 a conﬁguration with a high level of 100 dB for SN. This situation may be representative of the tests carried out in a wind tunnel with closed test section. The characteristics of the acoustic sources S1 and S2 remain unchanged compared to those presented in Fig. 4. In contrast, the spurious source S3 is now situated at the same location of S2 and is composed of 24 uncorrelated

Fig. 5. Result obtained with CBF and SEM for the conﬁguration presented in Fig. 2.

Fig. 8. (a) Result obtained with CBF and SEM for the conﬁguration presented in Fig. 7. (b) Result obtained with SEMWAN using the noise references S3 and SN.

monopole sources, each with a level of 20 logðA3 Þ ¼ 90 dB such that the integrated power levels X3 ¼ 103:8 dB. Again, CBF does not resolve S1 and S2 but it shows the location of the undesired source S3 (Fig. 8a). The degradation of the result found by SEM (Fig. 8a) is clear compared to that shown in Fig. 5. It is not possible to distinguish the sources S1 and S2 because the result is completely erratic over the whole computational domain. This is in agreement with the study described in Section 6. Fig. 8b shows the result obtained with SEMWAN. Now, S1 and S2 are resolved, their levels are correctly estimated and the smearing effect of the background noise removed. 10. Experimental simulations 10.1. Tests conducted in an anechoic chamber

Fig. 6. (a) Noise references used to compute [Cmes,ref(N)(f)] and apply SEMWAN for the conﬁguration presented in Fig. 4. (b) Result obtained with SEMWAN.

A simple experiment was conducted in an anechoïc chamber to validate SEMWAN with experimental data. A photograph and a schematic view of the experimental set-up are presented in Fig. 9a and b respectively. We consider here that the acoustic sources are three driver units S1, S2 and S3 mounted in the middle of a wooden plate and spatially separated at 0.29 m. The background noise is generated by the driver units S4 and S5 placed near the array of microphones and between the two wooden plates 3 m

18

D. Blacodon / Applied Acoustics 72 (2011) 11–21

from each other. The phased array is composed of 30 microphones, 1/2 in. 3211 Brüel&Kjaer, evenly spaced at 3 cm.

10.2. Methodology of the tests The driver units S1, S2 and S3 are the acoustic sources. They were connected independently at three generators delivering sinusoidal signals at the frequency of 2.5 kHz, 4.83 kHz and 8 kHz, respectively. The driver units S4 and S5 are the noise sources. They were also connected to independent generators delivering a broadband signal ﬁltered at 10 kHz for S4 and a sinusoidal signal at 3 kHz for S5. The acoustic waves radiated by each of the ﬁve driver units were individually measured by the array of microphones. The spectra of S1, S2, S3, S4 and S5 are presented in the Fig. 10a–e respectively. The levels of the measured spectra by the microphones 29, 15 and 1 opposite the sources S1, S2, S3 respectively will be compared to the results provided by SEM and SEMWAN. Thus, it will possible to assess the capacity of both methods to accurately reproduce the levels of acoustic sources radiating in a noisy environment. The noise reference matrix [Cmes,ref(N)(f)] required by SEMWAN is computed with the data measured when only S4 and S5 impinge on the array of microphones. An example of the spectra of [Cmes,ref(N)(f)] is presented in Fig. 10f. It shows the broadband noise and tone noise emerging, of about 10 dB at 3 kHz.

10.3. Data processing

Fig. 9. (a) Photograph of the experimental set-up in an anechoic chamber used to validate SEMWAN. (b) Schematic view of the experiment set-up.

The array cross-matrices [Cmes,T(f)] and [Cmes,ref(N)(f)] were estimated using 200 data blocks, of 1024 samples each, sampled at 31,250 Hz (i.e. Df = 31,250/1024 = 30.51 Hz). The levels of

Fig. 10. Spectra of the acoustic sources: (a) Tone noise at 2.5 kHz. (b) Tone noise at 4.83 kHz. (c) Tone noise at 8 kHz. Spectra of the noise sources. (d) White noise ﬁltered with a low pass ﬁlter at 10 kHz. (e) Tone noise at 3 kHz. (f) Spectrum of S4 + S5.

D. Blacodon / Applied Acoustics 72 (2011) 11–21

19

Nq = 200 virtual monopole sources were computed at spatial sampling points equally spaced every Dy = 0.005 m, on a line passing through the middle of the driver units S1, S2 and S3, from y = 0.4 m to y = 1.4 m, at the discrete frequencies fg = gDf (g = 1, . . . , 328), and in the frequency range [0, 10 kHz]. The measured spectra are compared to the estimated spectra with SEM and SEMWAN after their propagation at the location of microphone n using the following relation:

PN q Pn ðf Þ ¼ 10 log

a^ 2i ðf Þ i¼1 R2 ni

P20

ð22Þ

10.4. Results This section presents a result obtained with Power Spectrum Subtraction (PSS), and with SEM and SEMWAN, ﬁrstly in the characterization of the sources S1 and S2, then S2 and S3 when the array measurements are contaminated by S4 and S5.

Fig. 12. (a) Result found with SEM for the conﬁguration shown in Fig. 11a. (b) Result found with SEMWAN.

10.4.1. Result with PSS The Fig. 11a presents the spectra of S1 (solid line) and S2 (dashed line), and the spectrum of the signal Microphone 16 (black curve) when S1, S2, S4 and S5 impinge on the array. In Fig. 11b the spectrum of Microphone 16 (thick line) and the result found with PSS (grey curve) are superimposed. Reduction of the background is obtained with PSS, but there is a lot of frequency components with negative levels, certainly due to the weak signal-to-noise ratio considered in this test. We can also see that the sources S1 and S2 do not clearly appear at their emission frequencies. This result is a good illustration of the drawback of PSS. 10.4.2. Results with SEM and SEMWAN The same conﬁguration is used as that considered in the previous section. Fig. 12a shows the result given by SEM. The level of S1 is correct, while that of S2 is underestimated by about 3 dB. There are also numerous spurious sources with high levels generated by the noise sources S4 and S5. The result given by SEMWAN is presented in Fig. 12b. The noise sources have been highly reduced and the levels of S1 and S2 found within 1 dB. This demonstrates the efﬁciency of SEMWAN for reducing the undesired noise sources when a reference of these noise sources is available. The second scenario is shown in Fig. 13a. The objective here is the characterization of S2 and S3 starting from noisy measurements of S4 and S5. The graph in Fig. 13b shows the result with SEM. The levels of source S2 is underestimated by about 3 dB while that of S3 is within 1.5 dB. Again, there are numerous spurious sources with

Fig. 13. (a) Spectra of acoustic sources S2 (solid line) and S3 (dashed line) to be estimated and the spectrum measured by microphone 16 (thick line) due to S2, S3, S4 and S5. (b) Result found with SEM. (c) Result found with SEMWAN.

high levels due to S4 and S5. The result obtained with SEMWAN (Fig. 13c) shows that S2 and S3 appear more clearly due to the reduction of noise. The level of S1 is accurately estimated and that of S2 obtained within 2.54 dB. It also appears that S5 was not completely removed by SEMWAN. 11. Tests conducted in an open wind tunnel

Fig. 11. (a) Comparison between the noisy spectrum on microphone 16 due to S1, S2, S4 and S5. (b) Result obtained with Power Spectrum Subtraction using the noise reference given in Fig. 10f.

Tests similar to those carried out in the anechoic chamber were conducted in the test section of the Onera open wind tunnel F2 (Fig. 14). The acoustic sources are three driver units S1, S2 and S3 installed in the ceiling of the test section. S1, S2 and S3 were

20

D. Blacodon / Applied Acoustics 72 (2011) 11–21

Fig. 14. (a) Photograph of the experimental set-up in the Onera open wind tunnel F2. (b) Schematic view of the experiment set-up.

connecting to three independent generators delivering sinus signals of frequencies 2.5 kHz, 4.83 kHz and 8 kHz respectively. In this study, the noise sources are characterized by the noise radiated by the fan of the wind tunnel and by the turbulent boundary layer ﬂowing over the cross-shaped array installed in the ﬂoor of the test section. During the tests, the noise radiated by the acoustic sources are measured using a cross-shaped array, equipped with 50 half-inch microphones AKSUD 3211 evenly spaced at 3 cm on each arm. Thirty microphones were distributed along the arm of the array, which is in the y-axis, parallel to the ﬂow direction. In contrast, twenty microphones were distributed along the arm in the x-axis,

perpendicular to the ﬂow direction. Because the antenna was located in the ﬂow, a convection effect occurs between the acoustic sources and the locations of the microphones. This is taken into account by applying ﬂow corrections on the propagation time [21,22] which are used in the computation of the model array CSM. The noisy cross-spectral matrix [Cmes,T(f)] and the noise reference matrix [Cmes,ref(N)(f)] needed to apply SEMWAN were estimated using 1800 data blocks, of 1024 samples each, sampled at a frequency of 62,500 Hz (i.e. Df = 62,500/1024 = 61.035 Hz). It should be pointed out here that [Cmes,ref(N)(f)] is computed with signals measured by the array of microphones when the three sinus generators connected to S1, S2 and S3 were off.

Fig. 15. Comparison of the spectra measured with microphone 15 (see Fig. 14) and those computed with SEM and SEMWAN when the ﬂow is at speeds V = 10 m/s and 50 m/s. The spectra noise reference measured with microphone 15 are also shown for the two ﬂow speeds.

D. Blacodon / Applied Acoustics 72 (2011) 11–21

The levels of 2700 virtual monopole sources were computed at the nodes of a grid (from x = 2.06 to 2.95 m and from y = 0.15 m to 0.15 m) with an elementary mesh of size Ds = Dx Dy (with Dx = Dy = 0.01 m) at the discrete frequencies fg = gDf(g = 1, . . . , 250) in the frequency range [0, 15 kHz]. 11.1. Results The aim now is to assess the capability of SEM and SEMWAN to restore the frequency components of S1, S2 and S3 for two conﬁgurations when the data measured by the array is corrupted by low and high background noise. The overall measured spectrum with microphone 15 near the centre of the array (see Fig. 14) up to the frequency 15 kHz is compared to the integrated power levels computed with the SEM and SEMWAN. 11.1.1. Test at V = 10 m/s At this low speed of ﬂow the background generated in the wind tunnel weakly corrupts the data measured by the array of microphones. It follows that the spectral components of S1, S2 and S3 (Fig. 15a) are correctly found with SEM. We also observe that the source levels are quite similar to those shown by the spectrum of microphone 15 (see Fig. 14). Fig. 15b exhibits the result obtained with SEMWAN using a cross-spectral matrix of the noise reference computed with data measured when S1, S2 and S3 did not impinge on the array. Clearly, the three acoustic sources are correctly characterized in frequency and the background noise level has been reduced for the whole of the frequency band [0, 15 kHz]. It is lower than the background noise measured with microphone 15 across almost all the frequency band considered in this test. However, the result provided by SEMWAN does not show more information than that provided by SEM. The results provided by SEMWAN does not show more information than the spectra provided by SEM. It is obvious that the radiation of the acoustic sources is not done in free ﬁeld but in reverberant conditions. Therefore, it is not possible to conclude that levels of the sources found with SEMWAN are equal to the actual ones simply by comparing the estimated and measured spectra. 11.1.2. Test at V = 50 m/s At a speed of ﬂow of 50 m/s the background noise measured with microphone 15 (Fig. 15a) has increased by about 25 dB compared to that measured at 10 m/s (Fig. 15a). SEM correctly restores the frequency components of S1 and S2 (Fig. 15c) which are at higher levels than the background noise. In contrast S3 does not appear at all due to its weak level. The reduction of 15 dB of the background obtained with SEMWAN means it is now possible to clearly show the three acoustic sources. However, as for the previous test, it is not possible to conclude that levels of the sources found with SEMWAN are equal to the actual ones simply by comparing the estimated and measured spectra. 12. Summary This study has presented a new method for estimating the real levels of acoustic sources radiated in a noisy environment. This method called spectral estimation method with additive noise (SEMWAN) is based on the prior knowledge of a noise reference. The minimization of a cost function including the noise reference cross-spectral matrix, the array and model matrices, gives

21

solutions with highly reduced background noise. SEMWAN has been applied to computer, experimental simulations and tests conducted in an open wind tunnel to estimate the levels of acoustic sources radiated in a noisy environment. The results have conﬁrmed the validity of the approach and the performance of the proposed method. In particular, the levels of the acoustic sources simulated numerically or tested in the anechoic chamber were restituted with a good degree of accuracy. Concerning the test conducted in the open wind tunnel, further investigations remain to be done in order to assess the inﬂuence of the reverberation effects on the source levels provided by SEMWAN. This will require examining techniques of de-reverberation that have been proposed [7]. Acknowledgements The author is grateful to MM. Williams Denis and fabrice Desmerger for many stimulating discussions. This work was supported by the Department of Numerical Simulation and Aeroacoustics at Onera as part of the MASA home project. References [1] Piet JF, Élias G. Airframe noise source localization using a microphone array. AIAA/CEAS aeroacoustics conference, 3rd, Atlanta, GA, 12–14 May, 1997. AIAA Paper No. 1997-1643. [2] Davy D, Remy H. Airframe noise characteristics of a 1/11 model airbus. AIAA/ CEAS aeroacoustics conference, 4th, Toulouse (France), 2–4 June, 1998. AIAA Paper No. 98-2335. [3] Blacodon D. Analysis of the airframe noise of an A320/A321 with a parametric method. J Aircraft 2007;44(1):26–34. [4] Élias G. Source localization with a two-dimensional focused array-optimal processing for a cross-shaped array. In: Inter-Noise 95, Newport Beach (USA), 10–12 July 1995. [5] Blacodon D, Elias G. Level estimation of extended acoustic sources using a parametric method. J Aircraft 2004;41(6):1360–9. [6] Brooks T. A deconvolution approach for the mapping of acoustic sources (DAMAS) determined from phased microphone arrays. In: 10th AIAA/CEAS aeroacoustics conference, Manchester, Great Britain, 10–12 May, 2004. AIAA Paper No. 2004-2954. [7] Benesty J, Chen J, Huang Y. Microphone array signal processing. Springer; 2008. [8] Cox H, Zeskind RM, Owen MM. Robust adaptive beamforming. In: IEEE trans. acoust., speech, signal process, vol. 35 (10).; 1987. p. 1365–75. [9] Capon J. High resolution frequency–wavenumber spectrum analysis. Proc IEEE 1969;57:1408–18. [10] Frost OL. An algorithm for linearly constrained adaptive array processing. Proc IEEE 1972;60:926–35. [11] Koop L. Microphone-array processing for wind-tunnel measurements with strong background noise. In: 14th AIAA/CEAS aeroacoustics conference, Vancouver, British Columbia, 5–7 May, 2008. AIAA Paper No. 2008-2907 [12] Bulté J. Acoustic array measurements in aerodynamic wind tunnels: a subspace approach for noise suppression. In: 13th AIAA/CEAS aeroacoustics conference, Rome, Italy, 21–23 May, 2007. AIAA Paper No. 2007-3446. [13] Boll SF. Suppression of acoustic noire in speech using spectral subtraction. IEEE Trans Acoustic Speech Signal Process 1979;27:113–20. [14] Berouti M, Schwartz- R, Makhoul J. Enhancement of speech corrupted by acoustic noise. Proc IEEE Int Conf Acoust 1979;4:208–11. [15] Sijtsma P. CLEAN based on spatial source coherence. In: 13th AIAA/CEAS aeroacoustics conference, Rome, Italy, 21–23 May, 2007. AIAA Paper No. 20073436. [16] Dowling AP, Ffowcs Williams JE. Sound and sources of sound. Chichester: Ellis Horwood Publishers; 1983. [17] Gill PE, Muttey W, Wright MH. Practical optimization. London: Academic Press, Inc.; 1981. p. 59–66. [18] Bard Y. Nonlinear parameter estimation. New York: Academic Press; 1974. [19] Bandler JW. Computed-aided circuit optimization. In: Temes GC, Mitra SK, editors. Modern ﬁlter theory and design. New York: Wiley; 1973. [20] Shanno DF, Phua KH. Minimization of unconstrained multivariate functions. ACM transactions on mathematical software 6; 1980. p. 618–22. [21] Candel S, Guédel A, Julienne A. Radiation, refraction and scattering on waves in a shear ﬂow. AIAA Paper-76-544. American Institute of Aeronautics and Astronautics. Aero-acoustics conference, 3rd, Palo Alto, Calif., Jul. 20–22, 1976. [22] Amiet RK. Refraction of sound by a shear layer. J Sound Vib 1978;58(3):467–82.

Spectral estimation method for noisy data using a noise reference

Spectral estimation method for noisy data using a noise reference

Recommend Documents