Infrared Physics & Technology 71 (2015) 481–491
Contents lists available at ScienceDirect
Infrared Physics & Technology journal homepage: www.elsevier.com/locate/infrared
Spectral blind deconvolution with differential entropy regularization for infrared spectrum Hai Liu a,b, Zhaoli Zhang a,⇑, Sanya Liu a,⇑, Jiangbo Shu a, Tingting Liu a, Luxin Yan b, Tianxu Zhang b a b
National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, Hubei 430079, China School of Automatic, Huazhong University of Science and Technology, Wuhan 430074, China
h i g h l i g h t s A blind spectral deconvolution method is proposed for infrared spectrum. Differential entropy regularization is proposed to preserve spectral details. Alternation minimization algorithm is described to solve the proposed model. The robustness of proposed method is verified by some simulation and real experiments.
a r t i c l e
i n f o
Article history: Received 8 April 2015 Available online 17 June 2015 Keywords: Infrared spectroscopy Spectral analysis Blind deconvolution Regularization Differential entropy
a b s t r a c t Estimating an accuracy spectrum from a linearly degraded and noisy data record is often desirable in infrared spectroscopy application. In this paper, we proposed a new framework which can estimate the desired spectrum and the instrument function simultaneously from the degraded data. Firstly, the instrument function is parametrically represented as a Lorentzian function for the first time. Then, we construct the cost functional with the DE regularization, and minimize the functional to obtain the desired spectrum and instrument function. It has been found that the program is very useful to determine accurate line widths and peak positions from degraded spectral spectrum. Comparative results including quantitative and qualitative analyses manifest that the proposed method outperforms the conventional methods on peak narrowing and noise suppression. Ó 2015 Elsevier B.V. All rights reserved.
1. Introduction
1.1. Problem formulation
Infrared spectroscopy has gained widespread acceptance in recent years as a powerful diagnostic tool, such as unknown material recognition [1], biomedicine [2], feature extraction [3]. In these applications, measurements need to be performed rapidly to analyze accurate compositions or quality parameters, and data may need to be recorded at multiple wavelengths in real time. However, the performance of instrument is easily degraded in industrial environment. These disadvantages limit the development of infrared spectroscopy in industrial and commercial applications.
In general, spectral restoration can be interpreted as estimating desired spectrum s(v) (v denotes the wavenumber of the spectral spectrum) from a linearly degraded and noisy spectrum
⇑ Corresponding authors at: National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, Hubei 430079, China (Z. Zhang, S. Liu). E-mail addresses:
[email protected] (Z. Zhang),
[email protected]. cn (S. Liu). http://dx.doi.org/10.1016/j.infrared.2015.06.008 1350-4495/Ó 2015 Elsevier B.V. All rights reserved.
yðv Þ ¼ sðv Þ kðv Þ þ eðv Þ;
ð1Þ
where the observed sample y(v) consists of unknown desired spectrum s(v), which is first degraded by a instrument function k(v) (also called kernel) and then corrupted by zeros-mean additive white Gaussian noise (AWGN) e(v). Fig. 1 shows the experimental infrared spectrum of ethanol, which was provided by Griffiths and Shao [4]. The spectrum degradation can be considered as peaks overlap and strong noise (positive noise and negative noise). In fact, the degradation error is caused by the instrument in measure process. We do not know what material the spectrum represents because of strong noise degradation. How can we accurately recognize the unknown
482
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
stimulatingly. Secondly, the instrument function is parametrically expressed as Lorentzian function. To the best of our knowledge, the parametric spectral blind deconvolution with differential entropy regularization is proposed for the first time. Blind deconvolution of the desired spectrum from a degraded spectrum has recently been presented in [19]. In their work, high-order statistical priori was proposed to constrain the desired spectrum. However, this method needs to designate the kernel width in advance. And the restoration results seem rather noisy.
1 overlappeaks
0.8
Intensity (a.u.)
positivenoise
0.6 0.4 0.2 0 −0.2
2. Spectral Priori analysis
negativenoise −0.4
800
850
900
950
1000 1050 1100 1150 1200
Wavenumber (cm-1) Fig. 1. Infrared spectrum of ethanol analysis. It often suffers from peaks overlap, positive noise, and negative noise.
In this section, the Shannon-entropy and cross-entropy are reviewed, and then a differential entropy expression is proposed. Following, we illustrate that why the instrument function can be parametric represented as a Lorentzian function. 2.1. Differential-entropy
spectrum from the known spectral database? To address this problem, we proposed a new method to restorate the desired spectrum in this paper. 1.2. Previous algorithm Spectroscopic restoration problem was first tackled by Kauppinen and Moffatt, who enhanced the resolution of IR spectra in the frequency domain [5]. After their work, many spatial domain restoration methods have been developed in recent decades, the solution model of the spectrum restoration problem can be represented by the following regularization-based least-squares problem [1,6]:
( ) N X 2 ^s ¼ arg min ksi k yi k2 þ aUðsÞ ;
Entropy reflects the uncertainty measurement of the information. The entropy theory was first used in spectral estimation by Ables [20], which attracted major research efforts [6], for its advantages in band narrowing. The most famous entropy is the Shannon entropy N X SEðsÞ ¼ si ln si ;
where s denotes spectrum intensity, N is the length of the spectrum. In the following, Ref. [21] exploits the cross-entropy (also called as the ‘‘Kullback-Leibler distance’’) to measure the information dissimilarity between any two probability density functions s and c, defined by
ð2Þ
i¼1
P where ksi k yi k2 is the fidelity item, which stands for the fidelity between the observed spectrum and the original spectrum. U(s) is the regularization item, which gives a prior model of the desired spectrum s, and a is the regularization parameter, which controls the tradeoff between the fidelity and prior items. The regularization item U(s) in (2) plays a very important role in the spectrum restoration process. It controls the perturbation of the solution and guarantees stable desired spectrum estimation. In the past decades, many regularization models have been proposed. In sum, all the priori methods can be mainly classified into two groups, vector-priori method and distribution-priori method. For the vector-priori, those methods consider the spectrum as a vector in mathematics, and use the derivation of the spectrum as the smoothness constraint, such as positivity constraint [7], P Tikhonov regularization (TR) jðsiþ1 si Þ=2j2 [8–11], L1 norm constraint [12,13]. Huber-Markov [14,15]. For the distributionpriori, those methods consider the spectrum as a probability distribution, including Shannon-entropy [6], Cross-entropy [16], two point entropy [17], and Burg-entropy [18]. Among the entropy-based methods, the model [6] is a popular and effective regularization model because of its ability to address nonlinearity problem. However, the entropy-based regularization model also has some shortcomings. One of them is that ME model only allows the positive value, although the well-known expression works very well for positive-restricted solutions. 1.3. Proposed algorithm There are two main contributions in this work. Firstly, the proposed method can recover the desired spectrum and kernel
ð3Þ
i¼1
CEðs; cÞ ¼
N X si si ln ðsi ci Þ: c i i
ð4Þ
In particular, [16] sets the function c as a uniform distribution to reflect the priori information of spectral shape. The cross-entropy regularization has the advantages of preserving spectral detail in the spectral denoising and restoration. However, Eq. (4) did not allow the existence of negative value. Thus, this article tries to propose a new expression without sign restriction. In spectra measurement, the object spectrum s can be achieved from the subtraction between observed spectrum o and background spectrum b. To measure the distance between object spectrum s and priori spectrum c, the distance function can be rewrote as,
0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi N s2i þ 4c2i þ si X @ A þ 2ci s2 þ 4c2 : DEðs; cÞ ¼ si ln i i 2ci i
ð5Þ
This distance can be called as differential-entropy (DE). The detailed derivation process is presented in the Appendix. 2.2. Parametric instrument function Conventional spectrometers require careful handling and frequent calibration to eliminate the effects of mechanical vibration and movement. Therefore, the accurate instrument function is difficult to be measured. In this paper, we propose a new method to overcome the problem. For our real measured spectrum, the instrument function must be a convolution of many factors, such as pressure broadening, natural lifetime broadening, transit time broadening, and dominating effect. And the instrument function approximates a
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
Lorentzian-like function. We assume that the instrument function shape is Lorentzian function (also called Cauchy distribution)
k c ðv Þ ¼
1 c ; p ðv 2 þ c2 Þ
ð6Þ
where c represents kernel width, symbol v denotes the wavenumber. In Fig. 2, solid-lines represent the instrument functions with the differential kernel width c = 5, 10 and 15 cm1. It is worth noting that the kernel can also be parametrically expressed as the Triangle [22], Gaussian [23], Voigt [24] functions. In those cases, only one unknown parameter (kernel width) needs to be estimated. If not, all kernel points need to be computed. Thus, the advantage of the parametric expression can reduce the computation complexity. 3. Proposed model In this section, our parametric blind deconvolution method is presented. The negative entropy is used as the smoothing function to be minimized (hence maximum entropy). The model can be wrote as
h^si ¼ arg min
N X
ksi k yi k22 þ aðSEðsÞÞ;
ð7Þ
i¼1
which is called as Shannon-entropy deconvolution method (SED). To estimate the kernel width automatically, kernel priori item is added to the model (7). The energy functional can be rewrote as
^ ¼ arg min h^s; ki
N X
ksi k yi k22 þ aðSEðsÞÞ þ bðDEðk; c2 ÞÞ;
ð8Þ
i¼1
where c2 denotes the kernel priori distribution, and b is the regularization parameter. To incorporate the prior information c1 about desired spectra, Shannon-entropy is replaced by the cross-entropy item, i.e.,
^ ¼ arg min h^s; ki
^Þ ¼ Lð^s; c
N X ksi k yi jj22 þ aðCEðs; c1 ÞÞ þ bðDEðk; c2 ÞÞ:
ð9Þ
483
N 1X ksi k yi k22 2 i¼1 0 1 0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi s2 þ 4c21 þ s 2 2 A þ 2c1 s þ 4c A þ a@s ln @ 1 2c1 q ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 0 1 0 1 2 qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi kc þ 4c22 þ kc 2 2A @ @ A þ 2c2 kc þ 4c2 ; þ b kc ln 2c2
ð11Þ where c1 and c2 are set as priori distribution values. It is worth noting that c1 is uniform distribution, i.e., a line of constant value in geometrical. The DE measures the distance between the desired spectrum and uniform distribution. And the recovered spectrum needs to be as smooth as possible, which approximates the uniform distribution. This is the reason why the CE and DE can suppress noise. Essentially, the entropy-based constraint is the constraint of spectral spectrum smoothness. Compared with the model in Ref. [18], the Model (11) can work without a known instrument function. We call the proposed framework as parametric blind deconvolution (PBD) method, in which the instrument function is parametrically expressed. Shannon-entropy is added to the PBD framework, Model (8) is called as SE-PBD. Similarly, the Model (9) with cross-entropy is called as CE-PBD, and Model (10) as DE-PBD. 4. Optimization The above cost functional can be optimized by the alternating minimization (AM) method. This iterative method starts with an initial estimation of k with c = 1. Then, for a fixed k, the cost functional is minimized with respect to s by conjugate gradient (CG) algorithm. Thus, the desired spectrum is estimated. Then, for the fixed s, a new estimation for k is obtained. This procedure continues until convergence is reached, i.e. s and k updated in a cyclic fashion. In this framework, optimal deconvolution mainly boils down to solving a combination optimization sub-problem.
i¼1
Considering that CE does not allow the existence of negative value, we introduce the proposed DE to our model. The DE can be used without sign restriction. The model can be expressed as
^ ¼ argmin h^s; ki
N X
ksi k yi k22 þ aðDEðs;c1 ÞÞ þ bðDEðk;c2 ÞÞ:
ð10Þ
4.1. Variation with respect of s(v) To optimize the cost functional L(s, c), the third term in (11) can be dropped since it can be considered as a constant item. Therefore, the cost functional can be rewrote as
i¼1
Submitting the DE expression (5) to Model (10), we get the model,
0.09
γ =5 cm-1 γ=10 cm-1
0.08
N 1X ksi k yi k22 2 i¼1 2 3 0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi N s2 þ 4c21 þ s X A þ 2c1 s2 þ 4c2 5: þ a 4s ln @ 1 2c1 i¼1
ð12Þ
The variation of L with respect to s(v) can be expressed as
γ =15 cm -1
0.07
dLðsÞ dLI ðsÞ dLII ðsÞ ¼ þa ; ds ds ds
0.06 0.05
ð13Þ
where LI ðsÞ and LII ðsÞ respect the first and second item in (12), respectively.
0.04 kernel width γ =5
0.03
dLI ðsÞ ¼ ðk s yÞ kðv Þ; ds
0.02 γ γ
0.01 0 -60
LðsÞ ¼
-40
-20
0
20
40
60
Index/cm-1 Fig. 2. Instrument functions with different kernel width. The symbol c represents half width at half height of instrument function. The larger the value c, the wider the kernel width.
0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 s2 þ 4c21 þ s dLII ðs; cÞ @ A: ¼ ln 2c1 ds
ð14Þ
ð15Þ
Computing (14) and (15) using the fundamental lemma of calculus of variations, we can obtain that
484
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 s2 þ 4c21 þ s dLðsÞ @ A: ¼ ðk s yÞ kðv Þ þ a ln 2c1 ds
4.3. Parameters setting and convergence condition
ð16Þ
The detailed optimization process in relation to s with fixed k is described as follows. Algorithm 1: CG algorithm for infrared spectrum optimization. Step 1: Initialize the conjugate vector; Step 2: Calculate the step size to update the desired spectrum s; Step 3: Update desired spectrum s; Step 4: Update the gradient of the cost function with respect to s; Step 5: Calculate the step size for updating the conjugate vector Step 6: Update the conjugate vector; Step 7: n = n + 1. The iteration is terminated when ||sn+1 sn||/||sn|| 6 d1, where d1 is a predetermined value. Otherwise goto step 1.
Initializing the regularization parameters, we set a = 107, b = 0.47. In the experiments, spectrum intensities are normalized to the range [0, 1]. We set a = a/t1, b = b/t2, where t1 = 1.01 and t2 = 1.02. In each iteration, the parameter a and b are adaptively updated, with the purpose of reducing the influence of the spectrum priori and increasing that of the spectrum likelihood (fidelity item in Model (10)). Afterward, we point out that the distribution priori parameter c1 and c2 respectively depends directly on the noise level and the severity of the overlap degree in the observed spectrum. We set, =10t , c2 2 [1/200, 1/100]. The symbol y and t represents the c1 ¼ y average value of the spectral intensity and non-linear enhancement factor, respectively. The noise variance d of the additive noise in (1) is typically unknown in practice and must be estimated from the observation y. d can be reliably estimated using a median estimator on the finest scale wavelet coefficients of y. We estimated Gaussian noise based on various differencing schemes. The median absolute difference to estimate d defined in the following equation [25]
d¼ 4.2. Differential with respect to c In this step, we fix s(v) and compute the optimal c. Eq. (11) is simplified to
LðcÞ ¼
1:4826 pffiffiffi medianfjyi yi1 j; 2
i ¼ 1; 2; . . . ; Ng:
ð22Þ
In this paper, we set t = 2 + 100d. We declared convergence when for more than two consecutive iterations both the solutions and the cost functional changed less than a threshold value: ksnþ1 sn k=ksn k 6 d1 , kcnþ1 cn k=kcn k 6 d2 and kLnþ1 Ln k=kLn k 6 d3 . The typically values for di used in this paper were between 105 and 107. This PBD algorithm was implemented as a visual program in Matlab 2010a. The optimization of Models (8) and (9) are similar with that of the Model (11). In summary, the optimization procedure is concluded as follows.
N 1X ksi k yi k22 2 i¼1 2 3 0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 2 qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi N X kc þ 4c22 þ kc 2 2 25 4 @ A þ 2c2 kc þ 4c2 : þb kc ln 2c22 i¼1
ð17Þ Minimization with respect to the scalar parameter c is determined by differentiation of the cost functional (17)
@kc @DEðkc ; c2 Þ @LðcÞ ¼ ðkc s yÞ s þb ¼ 0; @c @c @c
ð18Þ
the partial derivational respect to the width c
@kc v 2 c2 ¼ : @c pðv 2 þ c2 Þ2
ð19Þ
and
0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 2 2 2 kc þ 4c22 þ kc @DEðkc ; c2 Þ @ A v c ¼ ln : 2 2c2 @c pðv þ c2 Þ2
ð20Þ
0qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 2 kc þ 4c22 þ kc @kc @LðcÞ A þ b ln @ ¼ ðs kc yÞ s @c @c 2c2
v 2 c2 : pðv 2 þ c2 Þ2
Require: Initial spectrum s, parameters t, a, and b Ensure: Resolution of (10) while Convergence Condition do not hold do Compute s using CG algorithm, Compute c using bisection method, Update a, b end while Output: the recovered spectrum s and the width c.
5. Experiments and discussion 5.1. Evaluation indices
Thus, for the c, we get the optimal expression,
Algorithm 2: AM Algorithm for DE-PBD Optimization
ð21Þ
The bisection root-finding method was used to determine the correct value of c to within the tolerance mentioned above for the DE solution. The discrete support of the Lorentzian kernel is limited to 6c + 1, which is much smaller than the spectral length in our experiments. The numerical integral of kc is normalized to 1, and the width c0 is initialized to 1.
The simulated data and real data are provided by the net address [26]. The spectra tested in our experiment are at resolution 1 cm1. In this paper, in the simulated experiments, we use the spectra quality assessment index RMSE, SCM to evaluate the reconstruction results. (a) Root of mean square error (RMSE)
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi PN 2 ^ i¼1 ðsi si Þ RMSEð^si ; si Þ ¼ ; N (b) Spectral correlation measure [27] (SCM)
ð23Þ
485
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
0.9
1726
(a)
0.8
Intensity (a.u.)
0.7
1164
0.020
0.6 0.5
1200 1302 1326
0.4 0.3
0.015
1440
0.2
(b)
0.025
816
942
0.010
1639
1018
0.1
0.005
0 600
800
1000
1200
1400
1600
0
1800
−50
−40
−30
−20
−10
-1
Wavenumber (cm )
0.9
20
30
40
50
(d)
0.8 0.7
Intensity (a.u.)
0.7
Intensity (a.u.)
10
0.9
(c)
0.8
0
Index (cm-1)
0.6 1725
0.5
1166
0.4 0.3
0.6 0.5 0.4 0.3
1314
0.2 820
0.1
0.2
1445
941 1015
1637
0.1 0
0 600
800
1000
1200
1400
1600
600
1800
800
1000
1200
1400
1600
1800
Wavenumber (cm-1)
Wavenumber (cm-1)
Fig. 3. (a) Original infrared spectrum of methyl methacrylate from 500 to 2000 cm1. (b) Lorentzian kernel with c = 18 cm1. (c) Degraded spectrum (noise-free). (d) Noisy spectrum, adding the AWGN (with zero mean, bottom) to (c).
0.9 0.8
0.9
(a)
0.8
1726
1726
0.7
0.7 1164
0.6
Inteslty (a.u.)
intensity (a.u.)
(b)
0.5 1200
0.4 1302 1328
0.3
1443
940
0.2
818
0.1
1164
0.6 0.5
1200 1302 1327 1442
0.4 0.3 0.2
1636 1017
818
0.1
1017
0
0 600
800
1000
1200
1400
1600
600
1800
800
1000
Wavenumber (cm-1)
0.9
0.9
(c)
Intensity (a.u.)
1164
0.5 1200 1301 1328 1443
0.3 0.2 819
941
1726
1400
1600
1326 1442 817
941
1639 1018
0 1200
1301
0.3
0.1
Wavenumber (cm-1)
1200
0.4
0.2
1636
1000
1163
0.5
0 800
1800
0.6
0.1
600
1600
(d)
0.7
0.4
1400
0.8
1726
0.7 0.6
1200
Wavenumber (cm-1)
0.8
intensity (a.u.)
1637
940
1800
600
800
1000
1200
1400
1600
1800
Wavenumber (cm-1)
Fig. 4. Deconvolution results by four compared methods, where (a–c) deconvoluted with a known kernel. (a) FSD method [23]. (b) TRD method [8]. (c) SED method [6]. (d) DE-PBD.
486
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
(a)
14
(b) 0.12
γ valeve
12
0.10
10
0.08 8 0.06 6
0.04
4 2
0.02 0 0
5
10
15
20
25
50 5
15 Iteration nu mber
Iteration number
0
10
-50
20
index
Fig. 5. Instrument function estimation versus the iteration number by DE-PBD method. (a) Convergence curve of kernel width c. (b) Estimated kernels versus the iteration number.
Table 1 Comparison of band distortions in deconvoluted spectra by FSD, TRD, SED and DE-PBD methods (in Fig. 4).
a
Band position
816
942
1018
1164
1200
1302
1326
1440
1639
1726
RMSE
Position FSD TRD SED DE-PBD
+2a +2 +3 +1
2 2 1 1
1 1 0 0
0 0 0 1
0 0 0 0
0 0 1 1
+2 +1 +2 +0
+3 2 +2 +2
+3 +2 +3 1
0 0 0 0
1.761 1.342 1.673 0.948
Height FSD TRD SED DE-PBD
+0.005 0.027 +0.009 0.010
+0.001 +0.003 0.010 +0.017
0.006 0.006 0.009 +0.004
0.044 0.038 0.076 +0.021
+0.001 0.016 0.032 +0.027
0.090 0.061 0.079 0
+0.059 0.039 0.062 0.028
0.022 0.019 0.026 0.010
0.030 0.025 0.031 0.016
0.083 0.070 0.142 +0.096
0.0467 0.0369 0.0624 0.0295
‘‘+’’ or ‘‘’’ indicates larger or smaller than the original values respectively.
(a)
(c)
(b)
noise level=1%
noise level=3%
noise level=5%
noise level=7%
600 800 1000 1200 1400 1600 1800
Wavenumber (cm-1)
600 800 1000 1200 1400 1600 1800
Wavenumber (cm-1)
600 800 1000 1200 1400 1600 1800
Wavenumber (cm-1)
Fig. 6. Deconvolution results with different noise levels. (a) Simulated noisy absorption spectra at 1 cm1 resolution (from top to bottom noise levels 1%, 3%, 5%, 7%). (b) TRD method [8] (the width of Lorentzian kernel c = 18 cm1). (c) Proposed method.
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
P P P N ^si si ^si si SCMð^si ; si Þ ¼ rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi P ffi ; P 2 P 2 P N ^s2i ð ^si Þ N si ð si Þ2
ð24Þ
where N is the length of the spectrum, and si, ^si denote the i-th data point of the true and recovered data, respectively. RMSE and SCM represent the average difference and average similarity between the two spectra. A small RMSE and a large SCM correspond to a good match. To assess the relative merits of the DE-PBD, we compare it with Fourier self-deconvolution (FSD) [23], Tikhonov regularization deconvolution (TRD) [8] and Shannon entropy deconvolution (SED) [6] in the simulated data experiments. Because all the FSD, TRD and SED methods need a known instrument function in advance, we applied the predefined kernel. 5.2. Simulated experiments According to (1), we simulated the degraded spectrum on the basis of the real infrared spectrum. Fig. 3(a) shows the IR spectrum of methyl methacrylate from 500 to 2000 cm1. The molecular
Table 2 Figures of merit for the performance of FSD, TRD and DE-PBD methods in different noise levels. For SCM, higher values imply superior performance, while lower RMSE values imply improved performance. Merits
RMSE
SCM
Noise levels
Methods FSD
TRD
DE-PBD
Noise-free 1% 3% 5% 7%
0.0379 0.0382 0.0425 0.0526 0.0810
0.0278 0.0329 0.0390 0.0460 0.0501
0.0145 0.0156 0.0160 0.0175 0.0239
Noise-free 1% 3% 5% 7%
0.9889 0.9886 0.9883 0.9877 0.9870
0.9929 0.9920 0.9917 0.9914 0.9910
0.9953 0.9950 0.9948 0.9940 0.9926
(a)
600 800 1000 1200 1400 1600 1800
487
formula is C5H8O2. We convolved it with a Lorentzian kernel with c = 14 cm1, as illustrated in Fig. 3(b), to generate the degraded spectrum (Fig. 3(c)). The degraded spectrum becomes much smoother and less resolved, in which the bands become wider and lower. First, we applied the four methods on the noise-free degraded spectrum. We applied the predefined kernel (Fig. 3(b)) to FSD, TRD and SED methods. For DE-PBD, we started from a Lorentzian kernel with an initial width c = 1 cm1. The width c converged at about 13.8 cm1 after 20 iterations (shown in Fig. 5(a)). Fig. 4 shows the deconvolution results for noise-free degraded spectrum (Fig. 3(c)). In Fig. 4(a) and (b), we observe that many redundant peaks exist, such as the small peaks indicated by bold lines below the baseline. We call this phenomenon as ‘peak-ringing’. However, Fig. 4(c) and (d) seem rather smooth at the flat region (no peaks place), and do not suffer from those problems. It means that the entropy-based methods outperform the FSD and TRD methods at peak-ringing. Furthermore, for the deconvoluted spectra by FSD, TRD, SED and DE-PBD methods, the bands distortions were investigated. In the ground-truth spectrum (Fig. 3(a)), the ten peaks at 816, 942, 1018, 1164, 1200, 1302, 1326, 1440, 1639, and 1726 are taken as the references. Table 1 lists these peaks distortions in their position and height between the deconvoluted results (Fig. 4) and the ground-truth spectrum (Fig. 3(a)). Roots of mean square error (RMSE) of these distortions are calculated. In terms of RMSE, the distortions in peak position and height by DE-PBD method are smaller than those by the FSD, TRD and SED methods. It means that Fig. 4(d) is similar with ground-truth spectrum most. We may conclude that the proposed method can suppress the peak ringing well when recover more details.
5.3. Advantages of the PBD The width c converged at about 13.8 cm1 after 20 iterations, which was plotted in Fig. 5(a). The kernel width estimated by DE-PBD appears to be close to the preset value (Fig. 3(b)). Fig. 5(b) dynamically represents the kernels estimation processing versus the iteration number. However, the FSD, TRD and SED
(b)
1200 1400
(c)
1200 1400
(d)
600 800 1000 1200 1400 1600 1800
Wavenumber (cm-1) Fig. 7. Deconvolution results at noise level = 5% with the iteration number 1, 5, 10 and convergence. (a) TRD [8]. (b) SE-PBD method. (c) CE-PBD method. (d) DE-PBD method.
488
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
methods need to assume the accurate kernel. In general, the accurate kernel width is measured by the instrument or equal to the width of the narrowest peak. In fact, it is difficult to assume the accurate kernel shape and width. Most of the bands will be either infra-deconvoluted when c > ctruth, or over-deconvoluted when c < ctruth. However, the proposed PBD framework does not suffer from this problem.
means a better match between recovered spectrum and original spectrum. We may conclude that the proposed method performs much better than TRD in noise suppression. 5.5. Effect of the DE In Fig. 7, we display the recovered spectrum after 1, 5, 10 and last iterations. With the iteration processing, the restorated spectrum approach the ground-truth spectrum more and more. Hence, the AM algorithm is an efficient method for minimizing (11). To demonstrate the effectiveness of the spectral narrowing ability, we compared the performance of the TRD, SE-PBD (Model (8)), CE-PBD (Model (9)) and DE-PBD (Model (10)) methods. Three PBD models are set with the same parameters setting (a = 40, b = 0.47). Because the Shannon-entropy and cross-entropy expressions do not allow the negative probability value, we only intercept the positive spectrum from 1100 cm1 to 1500 cm1, which are shown in the dash-line blanks. The recovered results are deconvoluted from the overlapped spectrum in Fig. 6(a) (noise level = 5%), as shown in Fig. 7(a) (TRD), Fig. 7(b) (SE-PBD), Fig. 7(c) (CE-PBD) and Fig. 7(d) (DE-PBD) respectively. On the whole, all the deconvoluted spectra are clearly resolved. For TRD, the deconvoluted spectra appear rather noisy, especially from the 500 to 800 cm1. Three entropy regularization methods suppress the noise better than TRD does. For Fig. 7(b)–(d), the overlap peak 1166 cm1 can be split to two peaks clearly. Nevertheless, for the overlap peak 1314 cm1, Fig. 7(d) is the most resolved, and approaches the original spectrum best. It means that Fig. 7(d) performs much better than Fig. 7(b) and (c) in the recovery of local spectral details. Moreover, Fig. 7(d) can process the negative noise, while Fig. 7(b) and (c) cannot. Thus, we may conclude that the proposed DE-PBD method recovers more spectral details than the SE-PBD and CE-PBD.
5.4. Effect of the noise Afterward, to investigate the robustness of deconvolution methods to noise, additive zero-mean random noise is added to the degraded spectrum with four different levels 1%, 3%, 5% and 7%, as shown in Fig. 6(a). Fig. 6(b) and (c) shows the deconvoluted results from the noisy spectra by TRD and DE-PBD, respectively. For noise level 1%, we compute the d = 0.0021, and set c1 = 102.2, for noise level 3%, the d = 0.0041, and set c1 = 102.5; and when 5%, d = 0.0071 and set c1 = 102.9. In all noise levels, the overlapped peaks (1166 and 1314 cm1) are all split. For the TRD method, it can be seen that the recovered spectrum seems to be less resolved and with more residual noise (existing peak-ringing). At the same noise level, the residual noise in Fig. 6(c) is less than that in Fig. 6(b), and also, the peaks are separated more distinctly than those in Fig. 6(b). Moreover, even with a strong noise level (7%), the proposed algorithm can still recover a very sharp spectrum. This experiment demonstrates the robustness of the proposed algorithm. TRD method produced significant ringing peaks in the reconstructed spectrum. The reason is that Tikhonov regularization only measures the spectral local smoothness, but not considers the structure of all peaks. Table 2 shows the resulting RMSE, SCM for each method on the simulated data with different noise levels. It is clear that the proposed method give the lowest RMSE and highest SCM (shown in bold) in all noise levels. As mentioned above, the higher SCM 0.9
0.9
(a)
0.8 0.7
0.7
Intensity (a.u.)
Intensity (a.u.)
(b)
0.8
0.6 0.5 0.4 0.3
0.6 0.5 0.4 0.3
0.2
0.2
0.1
0.1
0
0 600
800
1000
1200
1400
1600
600
1800
800
1200
1400
1600
1800
0.028
0.023 0.022
(c)
0.021
(d)
0.026
noise5%
0.024
0.020 0.019
RMSE value
RMSE value
1000
Wavenumber (cm-1)
Wavenumber (cm-1)
noise3%
0.018 0.017
noise-free
0.022 0.020 noise5%
0.018
0.016
noise3%
0.016
0.015 0.014 10
-6
-5
10
-4
10
c1value
10
-3
10
-2
10
-1
0.014
noise-free
0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0 ×10-3
c2value
Fig. 8. Parameters determination. (a) Recovered spectrum with c1 = 2. (b) Recovered spectrum with c1 = 3. (c) RMSE with the priori distribution c1. (d) RMSE with the priori distribution c2. The smaller the RMSE value, the more similar between the deconvolution spectrum with ground-truth spectrum.
489
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
2892
(a)
2946
0.8
0.6
5.7. Application to experimental spectra 0.4
0.2
0
2600 2700 2800 2900 3000 3100 3200 3300 3400 3500
Wavenumber (cm-1)
1.8 1.6
(b) 2892
Three more challenging real examples are shown in Figs. 9–11, which contain overlap peaks and random noise. There is no actual kernel width to the experimental spectra. The kernel width used in TRD method is estimated from DE-PBD. Since there is no ground-truth regarding the experimental spectra, we compare the noise suppressing and overlap peaks splitting ability between TRD method and the proposed method. Fig. 9(a) shows the Infrared spectrum of D(+)-Glucopyranose (C6H12O6) from 2500 to 3600 cm1 at 1 cm1. In Fig. 9(b), the spectrum is deconvoluted by TR method using a Lorentzian instrument
1.4 2946
1.2
1
(a)
2877
0.8
2961
0.6 0.4 0.2 0
2600 2700 2800 2900 3000 3100 3200 3300 3400 3500
RamanIntensity (a.u.)
1
2915
RamanIntensity (a.u.)
that RMSE reaches global minimum when c2 is between 1 103 and 2 103 under the corresponding optimum c1 values. Therefore, it is advisable for practical spectrum to use the parameter c2 between 1 103 and 2 103. In this paper, the parameters c2 = 1.5 103 were used for the DE-PBD deconvolution.
2915
RamanIntensity (a.u.)
1
Wavenumber (cm-1)
0.8
0.6
511
0.4
0.2
0 2892
1.8
100
(c)
200
300
400
500
600
700
800
Wavenumber (cm-1)
2946
1.4 1.2
1.6
1
1.4
0.6 0.4 0.2 0
RamanIntensity (a.u.)
2961
2877
0.8 2915
RamanIntensity (a.u.)
1.6
1.2 1 512 0.8 0.6
2600 2700 2800 2900 3000 3100 3200 3300 3400 3500
0.4
Wavenumber (cm-1)
0.2
501
485
515
780
0
Fig. 9. Real infrared spectrum experiment. (a) Original spectrum. D(+)Glucopyranose (C6H12O6) (b) TRD method (kernel width estimated by DE-PBD is 10.5 cm1). (c) DE-PBD method.
100
200
300
400
500
600
700
800
600
700
800
Wavenumber (cm-1)
5.6. Parameters control
1.6
(c)
1.4
RamanIntensity (a.u.)
For the noisy spectrum (noise level 3%), we compute the parameters d = 0.005 (Eq. (22)), t = 2.5, thus, c1 = 102.5. To validate the effect of the parameter c1 for the restoration processing, we select smaller and larger parameters. The restoration results are shown in Fig. 8(a) (with c1 = 102, c2 = 1.5 103) and Fig. 8(b) (with c1 = 103, c2 = 1.5 103). The dark-line and solid-line represent the ground-truth spectrum and the recovered spectrum, respectively. It is observed that the solid-line is lower than dark-line in Fig. 8(a) and inversely in Fig. 8(b). That means both smaller and larger c1 will lead to a larger RMSE. For the noise level 3% case, we find that RMSE get the minimum value when c1 = 102.5, shown in Fig. 8(c). We also validate optimization parameter c1 values under other noise levels (noise-free and 5%) utilizing the curves of the average RMSE. Fig. 8(d) shows
(b)
1.2
511
1 0.8
501
0.6 0.4 0.2 0 100
200
300
400
500
Wavenumber (cm-1) Fig. 10. Real spectrum experiment. (a) L(+)-Arabinofuranose (C5H10O5) (b) TRD. (c) DE method. The kernel width estimated by this method is equal to 7.5.
490
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
function with width 10.5 cm1 (estimated by DE-PBD). It can be seen that the resolution of peak near 2892 and 2946 cm1 have been considerably improved. The peaks at 2892 and 2946 cm1 are split into two peaks at 2877, 2892 cm1 and 2946, 2961 cm1, respectively. Here, it computed d = 0.0015 and set c1 = 102.1. The kernel width estimated by DE-PBD method equals 10.5 cm1 (not shown). Although the recovered peak positions in Fig. 9(c) are the same as that in Fig. 9(b), it seems that they are more resolved than that in Fig. 9(b).
1
(a)
0.8
Intensity (a.u.)
0.6 0.4 0.2 0 -0.2 -0.4 750
800
850
900
950
1000 1050 1100 1150 1200 1250
Fig. 10(a) is the Infrared spectrum of L(+)-Arabinofuranose (C5H10O5) from 20 to 820 cm1 at resolution 1 cm1. Observed Infrared spectrum suffers heavy noise from 20 to 180 cm1, and overlap at 511 cm1. The kernel width estimated by DE-PBD is equal to 7.5 cm1. The result spectrum (Fig. 10(b)) was deconvoluted by TRD with 7.5 cm1 kernel. As well, a considerable reduction in noise and increase in peak height is observed. The overlap peak at 511 cm1 can be resolved to 501 and 512 cm1. But this method also produces some redundant peaks such as 485, 515, 780 and 800 cm1. Compared with Fig. 10(b), Fig. 10(c) appears more flat from 20 to 180 cm1, and the peaks between 501 and 512 cm1 are more resolved than that in Fig. 10(b). The DE-PBD method is also applied to IR spectrum. Fig. 11(a) demonstrates the ground-truth IR spectrum of methyl. Fig. 11(b) is the experimental IR spectrum provided by Prof. Griffiths of University of Idaho [4]. This spectrum contains negative noise, which cannot be processed by the SED method. The main task is to compute the spectral similarity between Fig. 11(a) and (b). Here, it computed d = 0.0105 and set c1 = 103.1. After processing, the SNR of overlap peaks between 1000 and 1100 cm1 has been significantly improved by the proposed method. The methyl peaks at 1032 and 1041 cm1 appear to be resolved. Additionally, the proposed method requires no a priori information about the kernel width. And the kernel width estimated by the proposed method is 5.5 cm1. It can conclude that the experimental spectra will be recognized more easily than its original.
-1
Wavenumber (cm )
6. Conclusion
1
(b)
0.8
Intensity (a.u.)
0.6 0.4 0.2 0 -0.2 -0.4 750
800
850
900
950
1000 1050 1100 1150 1200 1250
Wavenumber (cm-1)
1
1053
(c)
0.8 1041 Intensity (a.u.)
0.6
In this paper, we propose a powerful technique to estimate both of the desired signal and instrument function from a degraded spectrum simultaneously. The entropy expression is utilized to regularize the smoothness of desired spectrum and instrument function. The major novelty of the proposed algorithm is that it can estimate the instrument function and desired signal simultaneously. Comparative results with other deconvolution methods suggest that the proposed method can recover spectral details as well as suppress noise effectively. In particular, owing to no need for a known instrument function, the new method has considerable value in practice. It is noteworthy that although the application considered here is infrared spectroscopy, the method is generally applicable to any data, which can be modeled as a smooth underlying signal distorted by an instrument function and corrupted by random noise, such as NMR, fluorescence data, gamma ray and ultrasonic spectra. The recovered spectra are easily for extracting the spectral features and interpreting the unknown chemical mixture. Nevertheless, there may still be room for the improvement of our proposed method. In fact, the instrumental broadening usually varies with wavenumber in dispersive infrared spectrometers. Thus, the instrument function k(v) is not necessary unique. We will examine that in our future work.
1032 0.4
Conflict of interest
0.2
There is no conflict of interest. 0
Acknowledgments
-0.2 -0.4 750
800
850
900
950
1000 1050 1100 1150 1200 1250 -1
Wavenumber (cm )
Fig. 11. Real experiments. (a) Ground-truth spectrum of methyl. (b) Experimental spectrum. (c) Deconvolution results by the proposed method.
The authors would also like to thank the editor and anonymous reviewers for their valuable suggestions. This research was partially funded by the National Social Science Fund of China (14BGL131), the Project of the Program for National Key Technology Research and Development Program (2013BAH72B01, 2013BAH18F02, 2015BAH33F02), the Self-determined Research Funds of CCNU from
H. Liu et al. / Infrared Physics & Technology 71 (2015) 481–491
the Colleges’ basic Research and Operation of MOE (CCNU15A05009, CCNU15A05010), Scientific R & D Project of State Education Ministry and China Mobile (MCM20121061), New PhD Researcher Award from Chinese Ministry of Education, New Century Excellent Talents in University (NCET-11-0654) and the National Natural Science Foundation of China under Grant No. 60902060. Appendix A In this Appendix, we derive the differential entropy expression from the cross-entropy. The symbols o and b denote the observed spectrum and background spectrum. Therefore, the experimental spectrum can be achieved as
s ¼ o b: We define a new distance, Uðo; bÞ and assume it satisfies the following equation
Uðo; bÞ ¼ CEðo; cÞ þ CEðb; cÞ: Submitting Eq. (4) to it, thus,
Uðo; bÞ ¼
Xh
oi ln
o i X bi i þ c bi : þ c oi þ bi ln c c
The expression for Uðo; bÞ can be obtained from the expression of dU ¼ dU . do db The variation of U with respect to o and b can be expressed as
WE and the differentiability condition
dU X dU X ¼ ln oi ln c and ¼ ln bi ln c: do db i i Namely,
ob ¼ c2 ; Submitting, s ¼ o b get the new equation
o2 os c2 ¼ 0: Generally, o > 0, we obtain the solution,
o¼
sþ
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi . pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi s2 þ 4c2 and b ¼ 2c2 s þ s2 þ 4c2 : 2
Therefore, equation Uðo; bÞ can be written
Uðo; bÞ ¼
X
s ln
i
! pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi s2 þ 4c2 þ s þ 2c s2 þ 4c2 : 2c
For the equation only with respect to s and c, the new distance can be denoted as
Uðs; cÞ ¼
X s ln i
! pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi s2 þ 4c2 þ s þ 2c s2 þ 4c2 : 2c
The new distance Uðs; cÞ is called as differential-entropy.
491
References [1] H. Liu, M. Zhou, Z. Zhang, J. Shu, T. Liu, T. Zhang, Multi-order blind deconvolution algorithm with adaptive Tikhonov regularization for infrared spectroscopic data, Infrared Phys. Technol. 71 (2015) 63–69. [2] S. Boyd, M.F. Bertino, S.J. Seashols, Raman spectroscopy of blood samples for forensic applications, Foren. Sci. Int. 208 (2011) 124–128. [3] H. Liu, L. Yan, Y. Chang, H. Fang, T. Zhang, Spectral deconvolution and feature extraction with robust adaptive Tikhonov regularization, IEEE Trans. Instrum. Meas. 62 (2013) 315–327. [4] P.R. Griffiths, L. Shao, Self-weighted correlation coefficients and their application to measure spectral similarity, Appl. Spectrosc. 63 (2009) 916–919. [5] J.K. Kauppinen, D.J. Moffatt, H.H. Mantsch, D.G. Cameron, Fourier selfdeconvolution: a method for resolving intrinsically overlapped bands, Appl. Spectrosc. 35 (1981) 271–276. [6] E. Chouzenoux, S. Moussaoui, J. Idier, F. Mariette, Efficient maximum entropy reconstruction of nuclear magnetic resonance T1–T2 spectra, IEEE Trans. Sig. Process. 58 (2010) 6040–6051. [7] M.K. Emresoy, A. El-Jaroudi, Evolutionary spectrum estimation by positivity constrained deconvolution, IEEE Trans. Sig. Process. 47 (1999) 889–893. [8] L.J. Meng, D. Ramsden, An inter-comparison of three spectral-deconvolution algorithms for gamma-ray spectroscopy, IEEE Trans. Nucl. Sci. 47 (2000) 1329–1336. [9] H. Liu, Z. Zhang, J. Sun, S. Liu, Blind spectral deconvolution algorithm for Raman spectrum with Poisson noise, Photon. Res. 2 (2014) 168–171. [10] M. Jiayi, Z. Ji, T. Jinwen, A.L. Yuille, T. Zhuowen, Robust point matching via vector field consensus, IEEE Trans. Image Process. 23 (2014) 1706–1721. [11] D.K. Buslov, N.A. Nikonenko, Regularized method of spectral curve deconvolution, Appl. Spectrosc. 51 (1997) 666–672. [12] H. Liu, S. Liu, Z. Zhang, J. Sun, J. Shu, Adaptive total variation-based spectral deconvolution with the split Bregman method, Appl. Opt. 53 (2014) 8240–8248. [13] M. Jiayi, Q. Weichao, Z. Ji, M. Yong, A.L. Yuille, T. Zhuowen, Robust L2E estimation of transformation for non-rigid registration, IEEE Trans. Sig. Process. 63 (2015) 1115–1129. [14] H. Liu, T. Zhang, L. Yan, H. Fang, Y. Chang, A MAP-based algorithm for spectroscopic semi-blind deconvolution, Analyst 137 (2012) 3862–3873. [15] J. Ma, J. Zhao, Y. Ma, J. Tian, Non-rigid visible and infrared face registration via regularized Gaussian fields criterion, Pattern Recogn. 48 (2015) 772–784. [16] J. Shore, Minimum cross-entropy spectral analysis, IEEE Trans. Acoust. Speech Sig. Process. 29 (1981) 230–237. [17] L.S. Greek, H.G. Schulze, M.W. Blades, A.V. Bree, B.B. Gorzalka, R.F.B. Turner, SNR enhancement and deconvolution of raman spectra using a two-point entropy regularization method, Appl. Spectrosc. 49 (1995) 425–431. [18] V.A. Lórenz-Fonfría, E. Padrós, Maximum entropy deconvolution of infrared spectra: use of a novel entropy expression without sign restriction, Appl. Spectrosc. 59 (2005) 474–486. [19] J. Yuan, Z. Hu, High-order statistical blind deconvolution of spectroscopic data with a gauss-newton algorithm, Appl. Spectrosc. 60 (2006) 692–697. [20] J.G. Ables, Maximum entropy spectral analysis, Astron. Astrophys. Suppl. 15 (1974) 383–393. [21] S.K.A.R.A. Leibler, On information and sufficiency, Ann. Math. Stat. 22 (1951) 79–86. [22] H. Liu, Z. Zhang, S. Liu, T. Liu, L. Yan, T. Zhang, Richardson-Lucy blind deconvolution of spectroscopic data with wavelet regularization, Appl. Opt. 54 (2015) 1770–1775. [23] V.A. Lórenz-Fonfría, E. Padrós, The role and selection of the filter function in fourier self-deconvolution revisited, Appl. Spectrosc. 63 (2009) 791–799. [24] G. Vogman, Deconvolution of spectral Voigt profiles using inverse methods and Fourier transforms, in: Department of Mathematics, University of Washington; 2010. [25] P.L. Davies, A. Kovac, Local extremes, runs, strings and multiresolution, Ann. Stat. 20 (2001) 1–65. [26] S.B. Engelson, Raman Spectral of (D+)-Glucopyranose
. [27] F. van der Meer, The effectiveness of spectral similarity measures for the analysis of hyperspectral imagery, Int. J. Appl. Earth Obs. Geoinf. 8 (2006) 3–17.