Accepted Manuscript Qualitative and quantitative analysis of peanut adulteration in almond powder samples using multi-elemental fingerprinting combined with multivariate data analysis methods
Mahnaz Esteki, Yvan Vander Heyden, Bahman Farajmand, Yadollah Kolahderazi PII:
S0956-7135(17)30312-2
DOI:
10.1016/j.foodcont.2017.06.014
Reference:
JFCO 5667
To appear in:
Food Control
Received Date:
07 January 2017
Revised Date:
08 June 2017
Accepted Date:
11 June 2017
Please cite this article as: Mahnaz Esteki, Yvan Vander Heyden, Bahman Farajmand, Yadollah Kolahderazi, Qualitative and quantitative analysis of peanut adulteration in almond powder samples using multi-elemental fingerprinting combined with multivariate data analysis methods, Food Control (2017), doi: 10.1016/j.foodcont.2017.06.014
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
1
Qualitative and quantitative analysis of peanut adulteration in almond powder samples
2
using multi-elemental fingerprinting combined with multivariate data analysis methods
3 4
Mahnaz Esteki1,*, Yvan Vander Heyden2, Bahman Farajmand1, Yadollah Kolahderazi1
5
1Department
6
2Department
7
Pharmaceutical Research (CePhaR), Vrije Universiteit Brussel (VUB), Laarbeeklaan 103, B-
8
1090 Brussels, Belgium
of Chemistry, University of Zanjan, Zanjan 45195-313, Iran of Analytical Chemistry and Pharmaceutical Technology (FABI), Center for
9 10
Mahnaz Esteki (Corresponding author)
11
Department of Chemistry, University of Zanjan, Zanjan 45195-313, Iran
12
Phone: +982415152586
13
Fax: +982415152477
14
Email:
[email protected]
15 16 17
Yvan Vander Heyden
18
Phone: +32 2 477 47 34
19
Fax: +32 2 477 47 35
20
Email:
[email protected]
21 22 23
Bahman Farajmand
24
Department of Chemistry, University of Zanjan, Zanjan 45195-313, Iran 1
ACCEPTED MANUSCRIPT
25
Phone: +982415152586
26
Fax: +982415152477
27
Email:
[email protected]
28 29 30
Yadollah Kolahderazi
31
Department of Chemistry, University of Zanjan, Zanjan 45195-313, Iran
32
Phone: +982433052586
33
Fax: +982433052477
34
Email:
[email protected]
35 36 37 38 39 40 41 42 43 44 45
2
ACCEPTED MANUSCRIPT
46
Qualitative and quantitative analysis of peanut adulteration in almond powder samples
47
using multi-elemental fingerprinting combined with multivariate data analysis methods
48
Abstract
49
In this study, adulteration of almond powder samples with peanut was analysed using multi-
50
elemental fingerprinting based on inductively coupled plasma optical emission measurements
51
(ICP-OES) combined with chemometric methods. The ability of multivariate data analysis
52
approaches, such as principal component analysis (PCA) and principal component analysis-
53
linear discriminant analysis (PCA-LDA), to achieve differentiation of samples and as partial
54
least squares (PLS) and least squares support vector machine (LS-SVM), to quantify the
55
adulteration based on the elemental contents has been investigated. Ten variables i.e. the contents
56
of B, Na, Mg, K, Ca, Fe, Cu, Cu, Zn and Sr at µg g-1 level, determined by ICP-OES were used.
57
Different almond and peanut samples were then mixed at various ratios to obtain mixtures
58
ranging from 95/5 to 5/95 w/w and PCA-LDA was applied to classify the almonds, peanuts and
59
adulterated samples. This method was able to differentiate peanut and almond samples from the
60
adulterated samples. PLS and LS-SVM models were developed to quantify the adulteration
61
ratios of almond using a training set and the constructed models were evaluated using a
62
validation set. The root mean squared error of prediction (RMSEP) and the coefficient of
63
determination (R2) of the validation set for PLS and LS-SVM were 3.81, 0.986 and 1.66, 0.997,
64
respectively, which demonstrates the superiority of the LS-SVM model. The results show that
65
the combination of multi-elemental fingerprinting with multivariate data analysis methods can be
66
applied as an effective and feasible method for testing almond adulteration.
67 68
Keywords: adulteration, almond, peanut, multi-elemental fingerprinting, PCA-LDA, LS-SVM
3
ACCEPTED MANUSCRIPT
69 70
Introduction
71
The authentication of food products has been recognized as a worldwide topic of interest
72
covering many different aspects, from adulteration to mislabeling and misleading origin.
73
Economically motivated adulteration for economic gain of the seller is a process by which the
74
quality of a substance is reduced through the addition, substitution or removal of food
75
ingredients without the consumer's knowledge (Moore, Spink, & Lipp, 2012). Most food
76
products susceptible for fraud are high commercial cost products, often produced worldwide on a
77
large scale (Cordella, Moussa, Martel, Sbirrazzouli, & Lizzani-Cuvelier, 2002). Therefore, the
78
ability of the industry, governments, and standards-setting organizations to authenticate, to
79
control food constituents and to check for food fraud is increasingly important ( Ellis, Brewster,
80
Dunn, Allwood, Golovanov & Goodacre, 2012; Gupta & Panchal, 2009; Zhang, Zhang, Dediu,
81
& Victor, 2011).
82
Several methods have been proposed for the detection of the adulteration in food products, such
83
as PCR assay for meat adulteration ( Ali, Rahman, Hamid, Mustafa, Bhassu & Hashim, 2014;
84
Karabasanavar, Singh, Kumar, & Shebannavar, 2014), visible and near infrared hyperspectral
85
imaging for meat adulteration (Kamruzzaman, Makino, Oshita, & Liu, 2015), high performance
86
liquid chromatography with mass spectrometric detection for lemon juice adulteration (Wang &
87
Jablonski, 2016), solid-phase micro extraction, gas chromatography with mass spectrometry
88
detection (SPME-GC-MS) for cognac and brandy adulteration (Mozhayeva, Zhakupbekova,
89
Kenessov & Akmoldayeva, 2014), electrospray ionization mass spectrometry for meat
90
adulteration (Ruiz Orduna, Husby, Yang, Ghosh, & Beaudry, 2015), two-dimensional gas
91
chromatography for medicinal herb adulteration ( Welke, Damasceno, Nicolli, Mentz, Caramao,
92
Pulgatid, & Zini, 2015), an electronic nose for saffron adulteration (Heidarbeigi, Mohtasebi, 4
ACCEPTED MANUSCRIPT
93
Foroughirad, Ghasemi-Varnamkhasti, Rafiee & Rezaei, 2015),
94
attenuated total reflectance (Jiménez-Sotelo, Hernández-Martínez, Osorio-Revilla, Meza-
95
Márquez, García-Ochoa & Gallardo-Velázquez, 2016) and Raman spectroscopy for milk powder
96
characterization (Karunathilaka, Farris, Mossoba, Moore, & Yakes, 2016). Most methods are
97
time consuming, use large volumes of solvents and are not readily adaptable for rapid monitoring
98
through portable instrumentation. Meanwhile, spectroscopic techniques combined with
99
multivariate data analysis methods form a promising strategies to overcome the drawbacks and
100
can effectively be used to detect adulteration of different food products (López, Trullols, Callao,
101
& Ruisánchez, 2014).
102
Almond (Prunus amygdalus) with several unique features is one of the most popular nuts
103
worldwide. It is highly nutritious and classified as a drupe in which the edible seed is a
104
commercial product (Alonso, Kodad, & Gradziel, 2012). Almond powder is generally used in a
105
variety of processed foods, particularly in bakery and confectionery products (Dourado, Barros,
106
Mota, Coimbra, & Gama, 2004). Because of its high price, almond powder is a target of illegal
107
practices, such as mixing with cheaper nuts. One of the most common adulterations consists of
108
using peanut as an adulterant with very similar chemical composition and much lower prices. To
109
the best of our knowledge, adulteration studies for this kind of nut are not extended in the
110
literature. Thus, it is important to develop analytical procedures to verify the quality of almond
111
powder, identifying peanut fraud, motivated primarily by economic gain.
112
Food products are also usually tested for metal contents for a variety of reasons. Determination
113
of the mineral composition is an important approach reflecting the nutritional value and its
114
relationship with food quality. In fact, some elements are essential for numerous bodily functions
115
and have a metabolic role (Co, Cu, Fe, Se, Zn), while others have potentially toxic characteristics
5
Fourier transform infrared
ACCEPTED MANUSCRIPT
116
(As, Cd, Pb). Therefore, trace metal profiling can be used to authenticate food Chen, Fan, Chang,
117
Pang, Hu, Lu & Wang, 2014; D’Archivio, Giannitto, Incani, & Nisi, 2014; Drivelos, Higgins,
118
Kalivas, Haroutounian, & Georgiou, 2014; Laursen, Schjoerring, Kelly, & Husted, 2014) and it
119
is also expected that food adulteration could change the elemental profile of a particular sample,
120
such as almond.
121
Fingerprinting based on chemical composition and multivariate data analysis have become one
122
of the most powerful systematic approaches to determine authenticity (Coetzee, Van Jaarsveld,
123
& Vanhaecke, 2014). A fingerprint is a characteristic profile of a sample, which can be
124
established through common techniques such as chromatography and spectroscopy to
125
characterize food products based on their quality or origin (Gan, Yang, Li, Wen, Zhu, Jiang &
126
Ni, 2015). The most common procedures used for pattern recognition purposes include: (i)
127
principal component analysis (PCA) as an unsupervised technique, which provides uncorrelated
128
objective latent variables (principal components) capable of extracting valuable information from
129
the experimental data, exploring the relationship between objects in addition to the relationship
130
between variables, and between objects and variables, (ii) linear discriminant analysis (LDA) as
131
a supervised pattern recognition technique which is based on the determination of linear
132
discriminant functions through maximizing the variance between classes and minimizing the
133
variance within the classes.
134
In this paper, multi-element fingerprints of almond and peanut samples were determined by
135
inductively coupled plasma optical emission spectrometry (ICP-OES) and subsequently two
136
methodologies were proposed for multivariate classification. First, PCA was used to investigate
137
the ability of elemental fingerprinting to discriminate the samples and then LDA was applied to
138
detection of pure and adulterated almond samples. The second main objective of this study was
6
ACCEPTED MANUSCRIPT
139
to build a reliable model using elemental fingerprinting coupled with multivariate calibration
140
methods for the quantification of peanut adulteration in almond powder samples. In this context,
141
we have focused on the development and validation of calibration models using PLS and LS-
142
SVM for the quantitative determination of adulteration in binary mixtures of peanut/almond
143
powder samples in the concentration range of 5–95% (w/w).
144 145
Materials and method
146
Reagents and samples
147
Analytical reagent-grade materials were used for all experiments. All solutions were prepared
148
using high-purity deionized water (>18MΩ) (TKASmart2pure Water Purification Systems,
149
Niederelbert, Germany). Nitric acid 65% w.v−1 and reference solutions of Al, B, Ba, Be, Bi, Ca,
150
Cd, Co, Cr, Cu, Fe, Ga, K, Li, Mg, Mn, Na, Ni, Pb, Se, Te and Zn at 1000 mg L−1 were obtained
151
from Merck (Darmstadt, Germany).
152
Due to the large variation in available almonds and peanuts as a result of flavoring and
153
processing, only raw, unflavored samples were used to minimize the sample variability. Almond
154
samples were collected from Shahr-e-kord, which is the main producer of almonds with the
155
highest quality in Iran. Peanut sampling was performed from Astaneh-ye-ashrafiyeh orchards.
156
This region covers more than 80% of the peanut production in Iran. All samples were collected
157
during the harvesting periods of 2013–2014. The adulterated samples were prepared as different
158
ratios of almond-peanut mixtures. Almonds and peanuts were oven dried at 30°C for at least 5
159
days. Afterwards, the samples were stored in a refrigerator at 4 °C till the preparation process.
160 161
Sample preparation and digestion 7
ACCEPTED MANUSCRIPT
162
The raw unshelled samples were powdered using a mortar and pestle and were used for the
163
preparation of the pure and the adulterated samples (ranging from 95/5 to 5/95 w/w). A total of
164
150 samples was measured, including 25 pure almond samples, 25 pure peanut samples and 100
165
adulterated almond samples. Almond powder was adulterated with peanut powder at a variety of
166
levels from 5 to 95%, w/w. The finely ground kernels (1.0 g) (pure or adulterated with specific
167
weight fraction of peanut) were introduced into a 50-mL PTFE (polytetrafluoroethylene) closed
168
vessel. A volume of 20.0 mL concentrated nitric acid and 5 mL of concentrated hydrogen
169
peroxide were added to the vessel. The bomb was sealed and put in a furnace set at 180 ± 10 ◦C
170
and remained at this temperature during 2h. The addition of H2O2 to the nitric acid was necessary
171
to increase the oxidation efficiency. After cooling down to room temperature, solutions were
172
quantitatively transferred to glass volumetric flasks and volumes were made up to 50.0 mL with
173
deionized water.
174 175
Analysis by ICP-OES
176
An ICP-OES (Perkin Elmer, Optima 7300 DV, Shelton, USA) was used for the Al, B, Ba, Be,
177
Bi, Ca, Cd, Co, Cr, Cu, Fe, Ga, K, Li, Mg, Mn, Na, Ni, Pb, Se, Te and Zn determination. The
178
sample introduction system was composed of a Scott spray chamber and a Gem-cone nebulizer.
179
The operational parameters are tabulated in Table 1. A multi-element calibration curve was
180
prepared by diluting standard solutions at 100 mg L−1 of each metal (CertiPur, Merck). The
181
instrument is operated with computer software WinLab32. All measurements were performed in
182
triplicate.
183
Multivariate data analysis
8
ACCEPTED MANUSCRIPT
184
Different multivariate analysis techniques were applied on the ICP-OES data. Pure almond
185
samples (25 samples) and pure peanut samples (25 samples) were used for PCA and PCA-LDA
186
analysis in order to investigate the ability of the elemental composition to differentiate both. Half
187
of the adulterated samples (50 samples) were also used for classification purposes using PCA-
188
LDA, while the remaining adulterated samples (50 samples) were used to evaluate the
189
classification accuracy of PCA-LDA model.
190
In order to perform the quantification, the adulterated almond samples data (100 samples) were
191
randomly divided into calibration (50 samples) and prediction (evaluation) set (50 samples). The
192
calibration samples were used for model construction (applying PLS and LS-SVM), while the
193
prediction samples were utilized to evaluate the model performance.
194
The LDA and PLS models were in-house programmed (Mousavi, Esteki, Mostafazadeh-Fard,
195
Dehghani, & Khorvash, 2012) using MATLAB (the MathWorks, Version 6.0, Natick, MA) and
196
the SVM software package ChemSVM, including SVR, was programmed by Suykens ( 2001).
197
This software has been validated in previous works (Esteki, Rezayat, Ghaziaskar, & Khayamian,
198
2010).
199 200
Linear Discriminant analysis (LDA)
201
Linear Discriminant analysis (LDA) is a supervised pattern recognition technique which is based
202
on maximizing the variance between the groups and minimizing the variance within them, by
203
means of a data projection from a high dimensional space to a low dimensional space. The
204
transformation tends to pivot the space in the way which maximizes the differences between the
205
groups when the observations are projected on to the new space (Varmuza & Filzmoser, 2010).
9
ACCEPTED MANUSCRIPT
206
order In to achieve a binary classification (two classes), suppose two classes with n-dimensional
207
samples x1, …, xn , where n1 and n2 represent the number of samples of the first and the second
208
class, respectively. The Fisher discriminant can be defined as the linear function 𝑽𝒕𝒙 that
209
maximizes the principle function: (𝝁𝟏 ‒ 𝝁𝟐)𝟐
210
𝑱(𝑽) =
(4) where 𝜇1 and 𝜇2
211
indicate the means of the projected points of classes 1 and 2, and 𝑠1 and 𝑠2 the scatter in classes 1
212
and 2, respectively. Now, the within-class scatter matrix is defined as:
213
𝑺𝒘 = 𝑺𝟏 + 𝑺𝟐
214
In this case, the separate class scatter matrices, 𝑺𝟏 and 𝑺𝟐, for classes 1 and 2 can be described
215
as:
𝒔𝟏𝟐 + 𝒔𝟐𝟐
∑
𝑺𝟏 =
(5)
(𝒙𝒊 ‒ 𝝁𝟏)(𝒙𝒊 ‒ 𝝁𝟏)𝒕
(6)
(𝒙𝒊 ‒ 𝝁𝟐)(𝒙𝒊 ‒ 𝝁𝟐)𝒕
(7)
𝒙𝒊𝝐𝑪𝒍𝒂𝒔𝒔 𝟏
216
∑
𝑺𝟐 =
𝒙𝒊𝝐𝑪𝒍𝒂𝒔𝒔 𝟐
217
The scatter of the projection y can then be defined as a function of the scatter matrix in feature
218
space x:
219
𝒔𝟏𝟐 = 𝑽𝒕𝑺𝟏𝑽
220
Considering the fact that 𝒔𝟐𝟐 = 𝑽𝒕𝑺𝟐𝑽, and using eq. 5, the sum can be written as
221
𝒔𝟐𝟏 + 𝒔𝟐𝟐 = 𝑽𝒕𝑺𝑾𝑽
222
The between class scatter matrix can be defined as:
223
𝑺𝑩 = (𝝁𝟏 ‒ 𝝁𝟐)(𝝁𝟏 ‒ 𝝁𝟐)𝒕
(8)
(9)
(10)
10
ACCEPTED MANUSCRIPT
224
Similarly, the difference between the projected means (in y-space) can be represented in terms of
225
the means in the original feature space (x-space):
226
(𝝁𝟏 ‒
𝝁𝟐)𝟐 = 𝑽𝒕𝑺𝑩𝑽
(11)
227 228
Finally the Fisher criterion (eq. 4) can be expressed in terms of 𝑆𝑊 and 𝑆𝐵 as:
229
𝑱(𝑽) =
230
the
𝑽𝒕𝑺𝑩𝑽
(12) In order to find
𝑽𝒕𝑺𝑾𝑽
maximum
of 𝑱(𝑽),
the
derivative
of
𝑱(𝑽)
should
be
equal
to
zero:
231 232
𝒅 𝑱(𝑽) = 𝟎 𝒅𝑽
233
generalized eigen value problem:
234
𝑺 ‒𝑾𝟏𝑺𝑩𝑽 = 𝝀𝑽
235
Where 𝝀 = 𝑱(𝑽)
236
It can be shown that for c classes, the most favorable projection matrix 𝑽 is the one whose
237
columns are the eigenvectors corresponding to the largest eigen values of the generalized eigen
238
value problem (eq. 13).
(13) To solve the
(14)
239 240
Partial least squares (PLS)
241
Partial least squares (PLS) is a multivariate statistical linear regression technique, based on
242
estimated latent variables which extracts the relationship between dependent variables (output
243
array) and independent variables (input array). The aim of the PLS regression is to build a linear
244
model enabling the prediction of a desired characteristic (Y) from a measured data matrix (X). 11
ACCEPTED MANUSCRIPT
245
In this method, dimension reduction of the raw data is based on the independent variable matrix
246
(X) as well as on the dependent variable matrix (Y). The key point of the method is the
247
simultaneous decomposition of X and Y which can be performed as follows (Geladi, 1988):
248
𝑿 = 𝑻𝑷' + 𝑫
(15)
249
𝒀 = 𝑼𝑸' + 𝑭
(16)
250
where T and U are the X- and Y-block score matrices; P and Q are the X and Y loadings; and D
251
and F are the residuals. The construction of a PLS model involves the creation of a relationship
252
between projections of the input and output variables, U and T, respectively, as below:
253
𝑼 = 𝑩𝑻
254
In other words, through the PLS modeling process the regression model is build between scores
255
of the input variables and scores of the output variables.
(17)
256 257
Support Vector Machine (SVM)
258
Support vector machine (SVM) was introduced by Vapnik ( 1998) and has been the focus of
259
many studies in recent years in a wide range of applications. To apply SVM in a regression
260
problem one should find the optimal hyperplane, in such a way that the distances between the
261
data points and the hyperplane are minimal (Smola & Schölkopf, 2004). The next paragraphs
262
describe the basic fundamental aspects of the LS-SVM modeling.
263
G ( x i , y i ) iN1 represents a data set including N data points where each independent variable,
264
xi , should be linked with the corresponding dependent variable, y i . Suppose that the data set
265
is generated by an unknown function g (x) , then a function f that approximates g (x) should
266
be build based on the knowledge of data set G . Therefore, xi is first mapped into a higher 12
ACCEPTED MANUSCRIPT
267
dimensional space, F , by using a nonlinear mapping and then making a linear regression in
268
this space. The SVM approximates the function as: n f ( x) i i ( x) b with : F , F
269
(18)
i 1
270
where i are coefficients and b is the value of threshold. This approximation can be
271
assumed as a hyperplane in the feature space F (with D-dimensions) which can be identified
272
by the functions i (x) . Since is fixed, the coefficients ( i ) can be obtained from the data
273
by minimizing the sum of the empirical risk and a complexity term, which have been defined
274
in the following risk function:
R
275
1 N
N
y i 1
i
f ( xi )
1 2
2
(19)
276
where is a parameter that has to be determined based on earlier information, so error
277
values less than may be ignored, as it is shown in the following error function.
278
0 if yi f ( xi ) yi f ( xi ) otherwise. yi f ( xi )
279
(20)
280 281
The first term in Eq. (19) describes the loss function and the second is related to the function
282
flatness. The SVMs construct a linear regression in a high-dimensional feature space by
283
using the loss function and reduce model complexity by minimizing . The regularization
284
parameter ( ) is a constant which determines the trade-off between the model flatness and
285
the calibration error. Then the SVM regression model is build by minimizing the following
286
term:
2
13
ACCEPTED MANUSCRIPT
287 288
N 1 2 ( i i* ) 2 i 1
(21)
289
290
f ( xi ) b y i i * y i f ( xi ) b i Subject to i* 0 i ,
(22)
291 292
The above optimization problem could be solved by constructing a Lagrangian function and
293
transforming it into the dual problem (Vapnik, 1998):
294 N
f ( x, , ) ( i* i ) K ( x,xi ) b *
295
(23)
i 1
296
where i and i* are the Lagrange multipliers, related to each input data point xi , and which
297
are subject to the following constraints:
298
0 i* , i and
N
( i 1
* i
i ) 0
299
Support vectors are actually the training points with non-zero Lagrange multipliers and K (.)
300
is the kernel function describing an inner product in the D-dimensional space as identified
301
below:
302 D
303
K ( x, xi ) i ( x) i ( xi )
(24)
i 1
304 14
ACCEPTED MANUSCRIPT
305
The Lagrange coefficients , * are obtained by maximizing the following term subject to
306
the constraints stated earlier:
307 N
N
i 1
i 1
R( , * ) yi ( i* i ) ( i* i )
308
1 N ( i* i )( *j j ) K ( xi , x j ) 2 i , j 1
(25)
309
Whenever the coefficients are determined, the regression estimate is given by Eq. (23). The
310
value of threshold b can be calculated from the constraints in Eq. (21) considering the fact
311
that the first constraint becomes an equality with i 0
312
constraint becomes an equality with i* if 0 i* . The generalization performance is
313
influenced by the parameters , and the kernel type. Linear, radial basis function (RBF),
314
and polynomial kernels are the most common kernel types. The radial basis function,
315
exp xi x j
316
Gaussian function. In order to obtain the support vectors, 2 and should be optimized.
2
/ 2 2
if 0 i , and the second
is the most frequently used kernel function with
2
the width of a
317 318
Results and discussion
319
To develop a robust model for detection and quantification of adulteration, the concentrations of
320
different elements were selected as variables. Analytical characteristics of the employed ICP-
321
OES procedure after the sample digestion via a Teflon lined stainless steel autoclave are
322
tabulated in Table 2. The limit of detection (LOD) was determined as the lowest concentration of
323
analyte that produces a signal equal to three times the standard deviation of ten replicates of the
324
blank solution. Additionally, LODs were calculated in the original samples (µg g-1) taking into
325
account the amount of sample and the volume of solvents employed during the procedure. The
326
LOD values of the ICP-OES are satisfactory for determination of the elements in almond and 15
ACCEPTED MANUSCRIPT
327
peanut samples, ranging from 0.3 µg L-1 (0.015 µg g-1) to 103 µg L-1 (5.15 µg g-1) for Be and Sr,
328
respectively. Relative standard deviation (%RSD) values were also calculated for five
329
independent analyses of multi-elemental standard solution to characterize the repeatability of the
330
method; obtaining RSD values ranging from 0.03% for Ni to 3.29% for Te.
331
Table 3 summarizes the mean concentration values (µg g-1) of the element compositions found in
332
the almond and peanut samples. From the initial 22 elements investigated only ten, including B,
333
Na, Mg, K, Ca, Fe, Cu, Cu, Zn and Sr, are presented in detectable concentrations in peanut and
334
almond samples.
335 336 337
Differentiation of pure and adulterated almond samples with class-modeling methods
338
Principal component analysis (PCA) is the best known variable-reduction method which is
339
widely used in chemometrics. PCA decomposes the data matrix with m rows (samples) and p
340
columns (variables) into the product of a score and a loading matrix. The scores are the positions
341
of the samples in the space of the principal components (PCs), which gives information about the
342
similarity of samples. All PCs are orthogonal and each successive PC contains a smaller fraction
343
of the total variability of the data set. Principal component analysis was applied to provide an
344
overview of the ability of the elemental composition data to characterize almond and peanut
345
samples. The data set consisted of 50 samples, including 25 pure almond samples and 25 pure
346
peanut samples. Auto-scaling was applied as pretreatment technique. By using the first two
347
principal components, the total explained variation is equal to 97.96% (PC1 = 92.49%, PC2 =
348
5.47%) indicating that the amount of information which would be lost taking into account the
349
first two PCs, is not considerable. The scores for the first two PCs are plotted as a scatter
16
ACCEPTED MANUSCRIPT
350
diagram in Fig. 1a. It is clear that two distinct clusters are formed, corresponding to the peanut
351
and almond samples which indicates that the multi-elemental fingerprinting is able to
352
discriminate almond and peanut samples.
353
Since the data structure analysis using PCA gave a good sample characterization, PCA-LDA
354
analysis was applied in order to find a predictive classification model, which would be able to
355
separate the three described classes i.e. almond, peanut and adulterated samples. A PCA-LDA
356
model was constructed using 100 samples (25 pure almond, 25 pure peanut, 50 adulterated
357
samples) as calibration set for training. Classification was evaluated using the cross-validation
358
(leave-one-out) method. The ability of distinguishing adulterated samples by the LDA
359
classification model was considered based on the validation set consisting of 50 adulterated
360
samples. The scatter plot of projected samples of the second discriminate function vs. the first
361
shows a clear class separation (Fig. 1b). The pure almond group showed negative values in the
362
first dimension and positive in the second; the pure peanut had high positive values in the first
363
dimension and negative in the second, while the adulterated points are distributed around zero in
364
both dimensions. The almond samples formed a distinct group on the left side of the plot. The
365
peanut and adulterated group showed a significant difference in the first dimension but did not
366
show a big difference in the second. The relative contributions of the first and second
367
discriminant functions to the total variance are 81.2% and 16.2%, respectively. The eigenvalues
368
also showed that the first discriminant function is more effective for classifying the samples.
369
The likelihood of the model was evaluated using cross-validation of the training set and also by
370
diagnosing the test set that has not used to build the model. Finally percentages of correct
371
classification were calculated. In Table 4, the results of LDA classification are reported. The
372
LDA model, correctly classified 100% of the calibration, 100% of the cross-validated cases and
17
ACCEPTED MANUSCRIPT
373
the recognition percentage was 98% for the validation set with only one wrong assignment. This
374
means that PCA-LDA is an efficient technique for the qualitative analysis of almond samples
375
and distinguishing adulterated almond samples from the pure ones.
376 377
Quantitative determination of adulteration with PLS and LS-SVM
378
Besides the detection of almond samples adulterated with peanut, the other important problem
379
focuses on quantitative determination of peanut in adulterated almond samples.
380
The investigated range of relative adulterant concentrations in binary mixtures was 5–95 %
381
weight ratios of adulteration. In this study, auto-scaling was applied as the pretreatment
382
technique. The modeling and prediction of mixture composition have been done in a first
383
approach by partial least squares (PLS) regression, which is a linear calibration method widely
384
used in the chemometric literature for multivariate calibration. In this work, the elemental
385
compositions measured by ICP-OES were used for constructing the calibration matrix. Fifty of
386
the 100 adulterated almond samples were randomly selected as calibration set and the remaining
387
50 were used for validation. To select the optimal number of factors for the PLS models, the
388
leave-one-out cross-validation method was employed. The prediction error (RMSE) on the
389
amount of peanut in the samples of calibration and prediction sets was estimated as follows:
390
1 𝑅𝑀𝑆𝐸 = 𝑛
𝑛
∑ (𝐶 ‒ 𝐶 ) 𝑖
𝑖
2
𝑖=1
391
where 𝐶𝑖 is the percentage of the peanut in the ith sample, 𝐶𝑖 is the predicted percentage and n is
392
the number of samples considered. RMSE values were calculated for calibration, cross-
393
validation and prediction sets. The number of latent variables was subsequently optimized by
394
considering a compromise between the root mean squared error of calibration (RMSEC), which 18
ACCEPTED MANUSCRIPT
395
represents the fit of the model to the calibration points, and the root mean squared error of cross-
396
validation (RMSECV), to prevent overfitting (Fig. 2a). The PLS model selected used two latent
397
variables, resulting in an RMSEC of 1.30% (w/w), and RMSECV of 2.15%, which demonstrates
398
the adequacy of the model. The scatter plot of the actual peanut concentration as adulterant
399
versus the predicted peanut content for both the calibration and the test sets are illustrated in Fig.
400
2 (b,c). The determination coefficient (R2) of 0.995 obtained for the calibration set, as shown in
401
Fig. 2b, reflects the generally adequate fit of the PLS regression model. The data from the
402
external validation set consisting of fifty samples were analyzed using the model that was
403
obtained with the training set and the accuracy of the model was evaluated based on the root
404
mean square error of prediction (RMSEP) and determination coefficient of the validation set.
405
The predicted values of adulteration with peanut vs. the experimental ones are shown in Fig. 2c.
406
The numerical values of the statistical parameters of the PLS model are listed in Table 5. An
407
RMSEP of 3.81 (w/w) and R2 value of 0.986 were obtained for the validation set. The overall
408
results show that the proposed method has a suitable reliability for estimating the concentration
409
of peanut adulterant in almond samples.
410
In comparison to the linear model, a nonlinear approach was performed. LS-SVM was used in
411
this work as a nonlinear modeling method. The basic principle of LS-SVM regression is
412
mapping the nonlinearity of the input training data onto a higher dimensional linear feature space
413
via the mapping function. In this study, the data of the validation and the calibration sets are the
414
same as for PLS model building. Optimization of the parameters is an important issue in building
415
an LS-SVM model. Kernel selection and setting appropriate kernel parameters can greatly
416
improve the SVM regression performance. In this study, the radial basis function (RBF) was
417
used as the kernel function. The RBF kernel is often used for regression analysis because of its
19
ACCEPTED MANUSCRIPT
418
good generalization performance, effectiveness and speed in the training process (Janik & Lobos,
419
2006). This function is also able to model nonlinear relationships between the independent
420
parameter and the response variables (Qu, Shih, Jing, & Wang, 2003).
421
Furthermore, variations of the hyper-parameters, including the regularization parameter, γ, and
422
the kernel parameter, σ2, have influence on model performance. Grid search, random search
423
(Bergstra & Bengio, 2012) and genetic algorithms (Wu, Tzeng, Goo, & Fang, 2007) are the most
424
widely used strategies for hyper-parameter optimization. Grid search is a reliable and efficient
425
global optimization method which was used in this study. A robust model was attained by
426
selecting parameters that give the lowest RMSEC error. This grid search was performed on the
427
original training data set. To find the optimal values of the parameters, a grid search was
428
performed for different combinations of γ and σ2 parameters and then the levels with smallest
429
values of RMSECV were selected as the optimum.
430
Fig. 3(a) illustrates the optimization process. The first searching step is a 10×10 grid (.) and is
431
considered as large steps, while in the second search, the grids (×) is again 10×10 and the
432
searching step is smaller. Error contour lines determine the most favorable search area. The
433
results indicated that an LS-SVM with 𝛾 ≃ 860.71, and 𝜎2 ≃ 165.04 resulted in the best LS-
434
SVM implementation. The nonlinear regression method was trained using the objects in the
435
training set and it was evaluated by the test set data. Fig. 3 (b,c) shows the plots of observed
436
versus predicted values for the training and the test sets. The data are distributed around the
437
bisecting line with a determination coefficient of 0.999 for the calibration set and 0.997 for the
438
test set. The statistical parameters, such as R2 between the predicted and experimental values
439
obtained using the LS-SVM for training and test set also are listed in Table 5. The RMSEC and
440
RMSECV values reveal the good performance of the LS-SVM model for prediction of the
20
ACCEPTED MANUSCRIPT
441
adulteration ratio. The high value of R2 cross-validation, 0.996, and the low value of RMSECV
442
of 0.92, as compared with the RMSEC of 0.54 and RMSEP of 1.66, suggest a good internal
443
consistency and the high-quality predictive capability of the LS-SVM model. The estimated
444
adulteration ratios are in excellent quantitative agreement with the experimental values (Fig. 3c).
445
The values and statistical parameters shown in Table 5 allow a comparison between both
446
methods. According to the calibration and cross-validation statistics and the prediction
447
performance, the LS-SVM method is superior to the PLS model.
448 449
Effect of the variables
450
In order to investigate the effect of the different elements in differentiating the samples, the LS-
451
SVM and PLS models were evaluated. The models were constructed using ten variables
452
including B, Na, Mg, K, Ca, Fe, Cu, Cu, Zn and Sr, and then, the effect of each variable was
453
evaluated by omitting it from the model. The RMSECV was calculated for the constructed
454
models. Fig. 4 shows the results of this process. For both LS-SVM and PLS models, the
455
maximum increment of RMSECV was due to Ca, Mg and K followed by Na and B. This means
456
that Ca, Mg and K are the most effective parameters in differentiating almond and peanut
457
samples, while Cu, Zn, Mn and Sr have relatively little effect in this process.
458 459
Conclusion
460
This study has shown the ability of multi-elemental fingerprinting coupled with chemometric
461
methods to qualitatively and quantitatively analyze peanut adulteration in almond powder
462
samples. PCA has been performed using the contents of B, Na, Mg, K, Ca, Fe, Cu, Cu, Zn and
463
Sr (in µg g-1) as variables, in order to investigate the discrimination ability of multi-elemental
21
ACCEPTED MANUSCRIPT
464
fingerprinting for differentiation of the peanut and almond samples. On the other hand, PCA-
465
LDA discrimination analysis allows separating the almond, peanut and adulterated almond
466
samples. The first two weight vectors were used to show the ability of LDA to group three
467
classes. The adulteration of almond was detected using the PCA-LDA method showing 100%
468
correct classification for both calibration and cross-validation and 98% accuracy for validation
469
set samples. Whenever adulteration has been recognized, quantitative analysis of peanut
470
adulterant was carried out by PLS and LS-SVM models. A determination coefficient of 0.995 for
471
training and 0.986 for validation set, RMSEC, 1.30, and RMSEP, 3.81, were achieved by the
472
PLS model. The LS-SVM was also applied as a non-linear regression method to determine the
473
level of adulteration in mixture samples. The adulteration ratios were determined successfully in
474
the range of 5-95% (w/w) with higher determination coefficients (R2 calibration, 0.999, and
475
validation, 0.997) and lower RMSE values (RMSEC, 0.54, and RMSEP, 1.66) when using the
476
LS-SVM model.
477
The proposed multi-elemental fingerprinting method combined with multivariate analysis
478
methods is a straightforward and effective tool for the determination and identification of
479
adulteration of almond powder with peanut.
480
In order to be able to verify the authenticity of almond powders originating from different areas
481
or the food industry by using the above-discussed sensitive, easy and rapid method, we will
482
develop the models for a wider variety of almond and peanut samples, covering the relevant
483
geographic origins and harvesting times.
484 485
Acknowledgement
486
The authors gratefully acknowledge the University of Zanjan for financial support of this work.
22
ACCEPTED MANUSCRIPT
487 488 489 490 491
Ali, M. E., Rahman, M. M., Hamid, S. B. A., Mustafa, S., Bhassu, S., & Hashim, U. (2014).
492
Canine-Specific PCR Assay Targeting Cytochrome b Gene for the Detection of Dog Meat
493
Adulteration in Commercial Frankfurters. Food Analytical Methods, 7(1), 234–241.
494
http://doi.org/10.1007/s12161-013-9672-y
495 496 497
Bergstra, J., & Bengio, Y. (2012). Random Search for Hyper-Parameter Optimization. Journal of Machine Learning Research, 13, 281–305. Chen, H., Fan, C., Chang, Q., Pang, G., Hu, X., Lu, M., & Wang, W. (2014). Chemometric
498
determination of the botanical origin for Chinese honeys on the basis of mineral elements
499
determined by ICP-MS. Journal of Agricultural and Food Chemistry, 62(11), 2443–2448.
500
http://doi.org/10.1021/jf405045q
501
Coetzee, P. P., Van Jaarsveld, F. P., & Vanhaecke, F. (2014). Intraregional classification of wine
502
via ICP-MS elemental fingerprinting. Food Chemistry, 164, 485–492.
503
http://doi.org/10.1016/j.foodchem.2014.05.027
504
Cordella, C., Moussa, I., Martel, A. C., Sbirrazzouli, N., & Lizzani-Cuvelier, L. (2002). Recent
505
developments in food characterization and adulteration detection: Technique-oriented
506
perspectives. Journal of Agricultural and Food Chemistry. http://doi.org/10.1021/jf011096z
507
D’Archivio, A. A., Giannitto, A., Incani, A., & Nisi, S. (2014). Analysis of the mineral
508
composition of Italian saffron by ICP-MS and classification of geographical origin. Food
509
Chemistry, 157, 485–489. http://doi.org/10.1016/j.foodchem.2014.02.068
23
ACCEPTED MANUSCRIPT
510
Dourado, F., Barros, A., Mota, M., Coimbra, M. a, & Gama, F. M. (2004). Anatomy and cell
511
wall polysaccharides of almond (Prunus dulcis D. A. Webb) seeds. Journal of Agricultural
512
and Food Chemistry, 52(5), 1364–1370. http://doi.org/10.1021/jf030061r
513
Drivelos, S. A., Higgins, K., Kalivas, J. H., Haroutounian, S. A., & Georgiou, C. A. (2014). Data
514
fusion for food authentication. Combining rare earth elements and trace metals to
515
discriminate “fava Santorinis” from other yellow split peas using chemometric tools. Food
516
Chemistry, 165, 316–322. http://doi.org/10.1016/j.foodchem.2014.03.083
517
Ellis, D. I., Brewster, V. L., Dunn, W. B., Allwood, J. W., Golovanov, A. P., & Goodacre, R.
518
(2012). Fingerprinting food: current technologies for the detection of food adulteration and
519
contamination. Chemical Society Reviews, 41(17), 5706. http://doi.org/10.1039/c2cs35138b
520
Esteki, M., Rezayat, M., Ghaziaskar, H. S., & Khayamian, T. (2010). Application of QSPR for
521
prediction of percent conversion of esterification reactions in supercritical carbon dioxide
522
using least squares support vector regression. Journal of Supercritical Fluids, 54(2), 222–
523
230. http://doi.org/10.1016/j.supflu.2010.04.007
524
Gan, Z., Yang, Y., Li, J., Wen, X., Zhu, M., Jiang, Y., & Ni, Y. (2015). Using sensor and
525
spectral analysis to classify botanical origin and determine adulteration of raw honey.
526
Journal of Food Engineering. http://doi.org/10.1016/j.jfoodeng.2016.01.016
527
Geladi, P. (1988). Notes on the history and nature of partial least squares (PLS) modelling.
528
Journal of Chemometrics, 2(January), 231–246.
529
http://doi.org/http://doi.wiley.com/10.1002/cem.1180020403
530
Gupta, N., & Panchal, P. (2009). Extent of awareness and food adulteration detection in selected
531
food items purchased by home makers. Pakistan Journal of Nutrition.
532
http://doi.org/10.3923/pjn.2009.660.667
24
ACCEPTED MANUSCRIPT
533
Heidarbeigi, K., Mohtasebi, S. S., Foroughirad, A., Ghasemi-Varnamkhasti, M., Rafiee, S., &
534
Rezaei, K. (2015). Detection of adulteration in saffron samples using electronic nose.
535
International Journal of Food Properties, 18(7), 1391–1401.
536
http://doi.org/10.1080/10942912.2014.915850
537
Janik, P., & Lobos, T. (2006). Automated classification of power-quality disturbances using
538
SVM and RBF networks. IEEE Transactions on Power Delivery, 21(3), 1663–1669.
539
http://doi.org/10.1109/TPWRD.2006.874114
540
Jiménez-Sotelo, P., Hernández-Martínez, M., Osorio-Revilla, G., Meza-Márquez, O. G., García-
541
Ochoa, F., & Gallardo-Velázquez, T. (2016). Use of ATR-FTIR spectroscopy coupled with
542
chemometrics for the authentication of avocado oil in ternary mixtures with sunflower and
543
soybean oils. Food Additives & Contaminants: Part A, 33(7), 1105–1115.
544
http://doi.org/10.1080/19440049.2016.1203073
545
Kamruzzaman, M., Makino, Y., Oshita, S., & Liu, S. (2015). Assessment of Visible Near-
546
Infrared Hyperspectral Imaging as a Tool for Detection of Horsemeat Adulteration in
547
Minced Beef. Food and Bioprocess Technology, 8(5), 1054–1062.
548
http://doi.org/10.1007/s11947-015-1470-7
549
Karabasanavar, N. S., Singh, S. P., Kumar, D., & Shebannavar, S. N. (2014). Detection of pork
550
adulteration by highly-specific PCR assay of mitochondrial D-loop. Food Chemistry (Vol.
551
145). Elsevier Ltd. http://doi.org/10.1016/j.foodchem.2013.08.084
552
Karunathilaka, S. R., Farris, S., Mossoba, M. M., Moore, J. C., & Yakes, B. J. (2016).
553
Characterising variances of milk powder and instrumentation for the development of a non-
554
targeted, Raman spectroscopy and chemometrics detection method for the evaluation of
555
authenticity. Food Additives & Contaminants: Part A, 33(6), 921–932.
25
ACCEPTED MANUSCRIPT
556 557
http://doi.org/10.1080/19440049.2016.1188437 Laursen, K. H., Schjoerring, J. K., Kelly, S. D., & Husted, S. (2014). Authentication of
558
organically grown plants - advantages and limitations of atomic spectroscopy for multi-
559
element and stable isotope analysis. TrAC - Trends in Analytical Chemistry.
560
http://doi.org/10.1016/j.trac.2014.04.008
561
López, M. I., Trullols, E., Callao, M. P., & Ruisánchez, I. (2014). Multivariate screening in food
562
adulteration: Untargeted versus targeted modelling. Food Chemistry, 147, 177–181.
563
http://doi.org/10.1016/j.foodchem.2013.09.139
564
Moore, J. C., Spink, J., & Lipp, M. (2012). Development and Application of a Database of Food
565
Ingredient Fraud and Economically Motivated Adulteration from 1980 to 2010. Journal of
566
Food Science. http://doi.org/10.1111/j.1750-3841.2012.02657.x
567
Mousavi, S.-F., Esteki, M., Mostafazadeh-Fard, B., Dehghani, S., & Khorvash, M. (2012).
568
Linear and Nonlinear Modeling for Predicting Nickel Removal from Aqueous Solutions.
569
Environmental Engineering Science, 29(8), 765–775. http://doi.org/10.1089/ees.2011.0233
570
Mozhayeva D, Zhakupbekova A, Kenessov B, Akmoldayeva S, C. L. (2014). New method for
571
the discovery of adulterated cognacs and brandies based on solid-phase, 76, 53–66.
572
Qu, M., Shih, F. Y., Jing, J., & Wang, H. (2003). Automatic solar flare detection using MLP,
573
RBF, and SVM. Solar Physics, 217(1), 157–172. http://doi.org/10.1023/A:1027388729489
574
Ruiz Orduna, A., Husby, E., Yang, C. T., Ghosh, D., & Beaudry, F. (2015). Assessment of meat
575
authenticity using bioinformatics, targeted peptide biomarkers and high-resolution mass
576
spectrometry. Food Additives & Contaminants: Part A, 32(10), 1709–1717.
577
http://doi.org/10.1080/19440049.2015.1064173
578
Smola, A. J., & Schölkopf, B. (2004). A Tutorial on Support Vector Regression. Statistics and
26
ACCEPTED MANUSCRIPT
579 580
Computing, 14(3), 199–222. http://doi.org/10.1023/B:STCO.0000035301.49549.88 Alonso, J. M., Kodad, A., & Gradziel, T. M. (2012). Almond. In M. L. Badenes & D. H. Byrne
581
(Eds) Fruit Breeding (p. 875) Springer, New York. http://doi.org/10.1007/978-1-4419-
582
0763-9
583
Suykens, J. A. K. (2001). Nonlinear modelling and support vector machines. IMTC 2001
584
Proceedings of the 18th IEEE Instrumentation and Measurement Technology Conference
585
Rediscovering Measurement in the Age of Informatics Cat No01CH 37188, 1, 287–294.
586
http://doi.org/10.1109/IMTC.2001.928828
587
Vapnik, V. N. (1998). Statistical Learning Theory. Adaptive and Learning Systems for Signal
588
Processing, Communications and Control, 2, 1–740. http://doi.org/10.2307/1271368
589
Varmuza, K., & Filzmoser, P. (2010). Introduction to multivariate statistical analysis in
590
chemometrics. Applied Spectroscopy (Vol. 64).
591
http://doi.org/10.1366/000370210791114185
592
Wang, Z., & Jablonski, J. E. (2016). Targeted and non-targeted detection of lemon juice
593
adulteration by LC-MS and chemometrics. Food Additives & Contaminants: Part A, 33(3),
594
560–573. http://doi.org/10.1080/19440049.2016.1138547
595
Welke, J. E., Damasceno, F. C., Nicolli, K. P., Mentz, L., Caramao, E. B., Pulgatid, F. H., &
596
Zini, C. A. (2015). Preliminary investigation of medicinal herb adulteration using
597
comprehensive two-dimensional gas chromatography and chemometric analysis. Journal of
598
the Brazilian Chemical Society, 26(6), 1232–1246. http://doi.org/10.5935/0103-
599
5053.20150089
600
Wu, C. H., Tzeng, G. H., Goo, Y. J., & Fang, W. C. (2007). A real-valued genetic algorithm to
601
optimize the parameters of support vector machine for predicting bankruptcy. Expert
27
ACCEPTED MANUSCRIPT
602 603
Systems with Applications, 32(2), 397–408. http://doi.org/10.1016/j.eswa.2005.12.008 Zhang, J., Zhang, X., Dediu, L., & Victor, C. (2011). Review of the current application of
604
fingerprinting allowing detection of food adulteration and fraud in China. Food Control.
605
http://doi.org/10.1016/j.foodcont.2011.01.019
606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623
28
ACCEPTED MANUSCRIPT
624 625 626 627
Figure captions
628
Fig. 1 a) PC1-PC2 score plot, b) Discriminant scores on the two-dimension space defined by the
629
first and second discriminant functions of pure samples and mixtures of almond and peanut (in
630
the range 30/70 – 70/30, m/m).
631
Fig. 2 PLS modeling based on the multi-elemental fingerprinting to predict adulteration of
632
almond with peanut a) RMSEC and RMSECV versus the number of latent variables in the PLS
633
model; Prediction results of best PLS model for calibration (b) and prediction (c) sets.
634
Fig. 3 LS-SVM modeling based on the multi-elemental fingerprinting to predict adulteration of
635
almond with peanut. (a) Grid search on γ and σ2; Prediction results of the best LS-SVM model
636
for calibration (b) and prediction (c) sets.
637
Fig. 4 RMSECV values after omitting different variables from the PLS and SVM models.
638 639 640 641 642 643
29
ACCEPTED MANUSCRIPT Highlights
Detection of peanut adulteration in almond powder by multi-elemental fingerprinting. PCA and LDA were able to differentiate the adulterated samples. Detection of adulteration up to the level of 5% was possible using LS-SVM model. The effective elements in differentiating almond and peanut samples were determined.
ACCEPTED MANUSCRIPT
Table 1. The ICP-OES operating parameters.
Instrumental Parameter
Level
Generator Frequency (MHz)
40
RF Power (kW)
1.3
Plasma gas flow rate (L min-1)
15
Auxiliary gas flow rate (L min-1)
0.2
Nebulizer gas flow rate (mL min-1)
800
Integration time (s)
3
Stabilization time (s)
15
ACCEPTED MANUSCRIPT
Table 2. Analytical figures of merit from the ICP-OES determination of the elements in almond samples, digested in a teflon lined stainless steel autoclave. Spectral
RSD(%)a
LOD Spectral RSD(%)a LOD In solution line In solution In sample In sample Element line (nm) Element (µg g-1)c (µg L-1)b (nm) (µg L-1)b (µg g-1)c Li 413.256 0.14 50 2.5 Ni 221.648 0.03 10 0.5 Be 234.86 0.08 0.3 0.015 Cu 213.597 1.24 12 0.6 B 182.528 1.40 50 2.5 Zn 202.548 0.70 4.0 0.2 Na 588.995 2.41 29 1.45 Ga 294.364 1.68 46 2.3 Mg 279.077 0.88 30 1.5 Se 196.026 1.47 75 3.75 Al 308.215 1.14 45 2.25 Sr 232.235 0.52 103 5.15 K 766.490 0.27 40 2 Cd 214.44 0.32 2.5 0.125 Ca 315.88 3.06 10 0.5 Te 214.281 3.29 41 2.05 Cr 205.560 0.98 6.1 0.30 Ba 230.42 0.56 4.1 2.05 Mn 257.610 1.54 1.4 0.07 Tl 190.801 3.12 40 2.0 Fe 234.349 1.09 13 0.65 Pb 217.000 0.98 40 2.0 Co 228.616 0.88 7 0.35 Bi 190.171 1.17 34 1.7 a RSD data were calculated as percentage corresponding to (s/x ) * 100 mean b LODs were calculated as the concentration corresponding to signals equal to three times the standard deviation of a blank solution. c LODs data in µg g-1 of almond sample were calculated taken into consideration the amount of almond digested and the appropriate dilution.
ACCEPTED MANUSCRIPT
Table 3. Mean concentration and the range of the elements (µg g-1) in almond and peanut samples. Almond Element Mean Range SD Al
Peanut Mean
Range 16.4 - 22.3
673.0 - 787.1
17.3 - 22.7 20.7 - 29.2 4931.1 - 5432.0 1874.7 - 2048.8 11.6 - 13.3
11.1 - 14.7 27.8 - 32.4
SD
ACCEPTED MANUSCRIPT
Table 4. The calibration, cross-validation and validation results of the LDA classification model. Classes Almond Peanut Adulterated
Predicted/Assigned Almond Peanut 25 0 0 25 0 0
Adulterated 0 0 50
Total 25 25 50
Cross-validation
Almond Peanut Adulterated
25 0 0
0 25 0
0 0 50
25 25 50
Validation set
Adulterated
0
1
49
50
Calibration
ACCEPTED MANUSCRIPT
Table 5. Calibration and validation results of PLS and LS-SVM models from using multielemental fingerprints. Method
RMSEC
RMSECV
RMSEP
R2calibration
R2 cross-validation
R2 validation
PLS LS-SVM
1.30 0.54
2.15 0.92
3.81 1.66
0.995 0.999
0.989 0.996
0.986 0.997