Discrimination of extra-virgin-olive oils from different cultivars and geographical origins by untargeted metabolomics

Discrimination of extra-virgin-olive oils from different cultivars and geographical origins by untargeted metabolomics

Accepted Manuscript Discrimination of extra-virgin-olive oils from different cultivars and geographical origins by untargeted metabolomics Silvia Ghi...

11MB Sizes 0 Downloads 79 Views

Accepted Manuscript Discrimination of extra-virgin-olive oils from different cultivars and geographical origins by untargeted metabolomics

Silvia Ghisoni, Luigi Lucini, Federica Angilletta, Gabriele Rocchetti, Daniela Farinelli, Sergio Tombesi, Marco Trevisan PII: DOI: Reference:

S0963-9969(18)30999-2 https://doi.org/10.1016/j.foodres.2018.12.052 FRIN 8177

To appear in:

Food Research International

Received date: Revised date: Accepted date:

17 September 2018 4 December 2018 23 December 2018

Please cite this article as: Silvia Ghisoni, Luigi Lucini, Federica Angilletta, Gabriele Rocchetti, Daniela Farinelli, Sergio Tombesi, Marco Trevisan , Discrimination of extravirgin-olive oils from different cultivars and geographical origins by untargeted metabolomics. Frin (2018), https://doi.org/10.1016/j.foodres.2018.12.052

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Discrimination of extra-virgin-olive oils from different cultivars and geographical origins by untargeted metabolomics

Silvia Ghisonia, Luigi Lucinia* , Federica Angillettaa, Gabriele Rocchettia,b, Daniela Farinellic,

Department for Sustainable Food Process, Università Cattolica del Sacro Cuore, Via Emilia

RI

a

PT

Sergio Tombesid, Marco Trevisana

b

SC

Parmense 84, 29122 Piacenza, Italia

Department of Animal Science, Food and Nutrition, Università Cattolica del Sacro Cuore, Via

c

Department of Agricultural, Food and Environmental Sciences, Università degli Studi di Perugia,

MA

Via Borgo 20 giugno 74, 06154, Perugia, Italy d

NU

Emilia Parmense 84, 29122 Piacenza, Italia

Department for Sustainable Crop Production, Università Cattolica del Sacro Cuore, Via Emilia

AC C

EP T

ED

Parmense 84, 29122 Piacenza, Italia

*

Corresponding author: via Emilia Parmense 84, 29122 Piacenza, Italy. Telephone: +39 0523 599156. E-mail: [email protected] 1

ACCEPTED MANUSCRIPT Abstract In

this preliminary study,

quadrupole-time-of-flight

ultra-high-pressure liquid

chromatography (UHPLC)

mass spectrometry (QTOF) metabolomics followed

coupled

to

by multivariate

statistics was applied to discriminate nine extra-virgin olive oil (EVOO) cultivars according to their phenolic and sterolic fingerprints. The same approach was then used to discriminate EVOO samples

PT

from different geographical origins, namely six blends representative of the main growing regions

RI

in Italy (i.e., Sicily, Puglia, Umbria, Liguria, Lombardy and Tuscany).

SC

This approach allowed to putatively identify more than 1000 compounds, considering both polyphenols and sterols. The unsupervised hierarchical cluster analysis (HCA) and the orthogonal

NU

projections to latent structures discriminant analysis (OPLS-DA) discriminated EVOO samples according to both cultivar and geographical origin. In particular, flavonoids (i.e., anthocyanins and

MA

flavonols), hydroxycinnamic acids and cholesterol derivatives were found to be the most representative classes of compounds discriminating EVOO samples according to the two

ED

parameters (cultivar or geographical origin) selected. However, the following Venn analysis allowed to point out the discriminant markers being exclusive for cultivar or origin discrimination.

EP T

In this regard, only the 9.6% of phenolics and 13.6% of sterols were in common, thus indicating that several of these discriminant compounds were exclusive of a single condition. Indeed,

AC C

considering that most of the commercial EVOOs are blended, the contribution of both the cultivars used and the geographical origin must be taken into account.

Keywords: Extra-Virgin Olive Oil; Food metabolomics; Polyphenols; Sterols; Traceability.

2

ACCEPTED MANUSCRIPT 1. Introduction Extra-Virgin Olive Oil (EVOO) is a main component of the Mediterranean diet, displaying health-promoting effects thanks to both its unsaturated fatty acids profile and the presence of bioactive compounds such as polyphenols and tocopherols (Blasi et al., 2018; Suarez, Caimani, del Bas, & Arola, 2017). EVOO is produced in all the countries in the Mediterranean basin, with Spain,

PT

Italy, Tunisia and Greece being the most important producers worldwide (Kalogiouri, Aalizadeh, &

RI

Thomaidis, 2018).

SC

The quality and organoleptic properties of EVOOs depend on many factors such as cultivar, geographical origin, climatic conditions, agronomic techniques and processing technologies, with

NU

consequences also in the final price of the product in the market (Cubero-Leon, Penalver, & Maquet, 2014). Therefore, the protection of olive oils according to their origin, such as protected

MA

designation of origin (PDO), protected geographical indication (PGI) and traditional specialties guaranteed (TSG), has become an essential information in the label, thus entailing the adoption of

ED

methods for authentication (Merchak, et al., 2017).

EVOO is considered a common target for economically motivated adulteration, because of its

EP T

high demand and profitability potential (Esteki et al., 2018). As recently reviewed, chromatographic methods coupled to chemometrics, including untargeted metabolite profiling have been applied for

AC C

authentication purposes in foods, including olive oils (Esteki et al., 2018). Efforts have been done in the area of PDOs and PGIs to avoid fraudulent labelling, but a definitive and robust solution to this problem is still not present. Both PDO and PGI designate products that are linked to the area of which they bear the name and include niche products from small-scale processes in limited areas where growth conditions confer a particular quality to the product (Cubero-Leon et al., 2014). In the last years, several research attempts tried to use the chemical fingerprint of EVOO samples for authentication purposes. For example, according to Aparicio, Morales, Aparicio-Ruiz, Tena and Garcìa-Gonzàlez (2013), sterols represent a very useful tool for the authentication of olive oils. These authors used sterols as index of fraudulent oil mixtures, underlying the importance of 3

ACCEPTED MANUSCRIPT chromatographic techniques to quantify the free and esterified forms in short time. Different studies have been carried out also to authenticate olive oils from Italy (Alonso-Salces et al., 2010), Greece (Longobardi et al., 2012; Alonso-Salces et al., 2010) and Spain (Becerra-Herrera et al., 2018). Furthermore, Ben Mohamed and co-authors (2018) have recently used both sterol and phenolic fingerprints to discriminate Tunisian and Italian EVOOs, outlining the potential of untargeted

PT

metabolomics for authenticity evaluations. Indeed, metabolomics, defined as “the unbiased, global

RI

screening approach to classify samples based on metabolite patterns or fingerprints that change in response to disease, environmental or genetic perturbations with the ultimate goal to identify

SC

discriminating metabolites” (Sales, et al., 2017) might represents a powerful approach to ensure the

NU

authenticity of olive oils in relation with the geographical origin (Esteki et al., 2018) or sensorial properties (Sales, Portolés, Johnsen, Danielsen & Beltran, 2019).

MA

However, to the best of our knowledge, the application of untargeted metabolomics for authenticity markers discovering in different EVOO cultivars from different origins is still scarce.

ED

Therefore, the aim of this work was to apply a metabolomic approach followed by multivariate statistics to identify sterol and phenolic biomarkers for authenticity purposes. Most of the

EP T

commercial oils are sold as blends, where a contribution of both the cultivars used and the geographical origin is expected. On this basis, the possibility to discriminate different EVOOs as a

AC C

function of the cultivar and according to the actual geographical origin has been investigated.

2. Materials and methods 2.1. Samples

Fifteen EVOO samples were included in this study. Six of them are blends representative of the main growing regions in Italy (i.e., Sicily, Puglia, Umbria, Liguria, Lombardy, Tuscany) were kindly provided by COOP Italia SpA (Casalecchio di Reno, Bologna, Italy) a commercial supplier who owns the supply chain and could ensure geographical origins. The oil from each origin was

4

ACCEPTED MANUSCRIPT provided in triplicate; two additional samples (from Sicily and Tuscany, respectively) have been provided to be used as quality control for validation purposes. The remaining nine samples Mediterranean

basin

(i.e.,

represented

‘Leccino’,

some widely diffused

‘Picual’,

‘Picholine marocaine’,

olive cultivars of the ‘Frantoio’,

‘Sourani’,

‘Kalamon’, ‘Manzanilla de Sevilla’, ‘Arbequina’ and ‘Koroneiki’), and were obtained by olives

PT

harvested in the germplasm repository of the University of Perugia (Italy). EVOO was obtained

RI

from 2 kg of fruit samples harvested on November 15th , 2016 from three trees per each cultivar. Oil

SC

was extracted within 24 hours by a custom-built plant, allowing to process small amount of fruits. Oil samples were immediately placed in cold storage (10 ± 2 °C) until further analyses. Three

NU

replicate samples were obtained per accession.

MA

2.2. Untargeted profiling of phenolic and sterol compounds

Each oil sample was analysed in duplicate, with the exception of quality controls that were

ED

analysed in triplicate. For the extraction step, the method reported by Ben Mohamed and co-authors (2018) was used. Briefly, three individual sample replicates from each sample (3 g each) were

EP T

extracted in 3 mL of 80% methanol solution (v/v) (LCMS grade, VWR, Milan, Italy) using a vortex mixer. The suspension was kept at room temperature for 30 minutes and then centrifuged at 6,000 x

analysis.

AC C

g for 10 min at 4 °C. The extracts were collected in 2 mL amber glass vials and stored at 4 °C until

Phenolic compounds and

sterols were then screened

through ultra-high-pressure liquid

chromatography (UHPLC) coupled to an electrospray quadrupole-time-of-flight hybrid mass spectrometer (UHPLC-ESI/QTOF-MS), as previously described (Rocchetti et al., 2017). With this purpose, a 1290 series liquid chromatograph and a G6550 iFunnel mass spectrometer detector equipped with a JetStream dual electrospray (all from Agilent Technologies, Santa Clara, CA, USA), were used. Chromatographic separation was achieved on an Agilent Zorbax Extend-C18 column (75 × 2.1 mm i.d., 1.8 μm) in reverse phase mode (gradient elution from 5% to 90% LC-MS 5

ACCEPTED MANUSCRIPT grade methanol). The mass spectrometer data acquisition was MS-only mode in positive polarity (range 100–1000 m/z) and injection volume was 3 μL. Source conditions were as follows: nitrogen was used as both sheath gas (10 L min-1 and 350 °C) and drying gas (8 L min-1 and 330 °C); nebulizer pressure was 60 psi, nozzle voltage was 300 V, and capillary voltage was 3.5 kV. Raw data were processed using Profinder B.07 (from Agilent Technologies), for mass and

PT

retention time alignment and for the following annotations based on the ‘find-by-formula’ algorithm

RI

(Pretali, Bernardo, Butterfield, Trevisan & Lucini, 2016). According this approach, the entire

SC

isotopic pattern (monoisotopic accurate mass, isotope spacing, and isotope ratio - mass accuracy < 5 ppm) was considered for compounds identification by linking the isotopic profile to a custom

NU

database obtained by combining Phenol-Explorer 3.6 (http://phenol-explorer.eu/) and LIPID MAPS

2.3. Multivariate statistical analysis

MA

(Lipidomics Gateway, http://www.lipidmaps.org/).

ED

Metabolomics data on phenolic and sterol profiling were interpreted in Agilent Mass Profiler Professional B.12.06 as previously reported (Lucini et al., 2016). Compounds were filtered by

EP T

abundance (only those compounds with an area > 10000 counts were considered), normalized at the 75th percentile and baselined to their median in the dataset. Post-acquisition processing included

AC C

also filtering by frequency: only those compounds identified within 100% of replications in at least one treatment were retained. Statistics were finally carried out on this latter dataset; unsupervised hierarchical cluster analysis (HCA) was carried out setting the similarity measure as ‘Euclidean’ and ‘Wards’ as the linkage rule. Thereafter, the raw dataset was exported into SIMCA 13 (Umetrics, Malmo, Sweden), Pareto scaled, and elaborated for orthogonal partial least squares discriminant analysis (OPLS-DA) supervised modelling. Herein, the variation between the groups was separated into predictive and orthogonal (i.e., ascribable to technical and biological variation) components. The presence of outliers was investigated according to Hotelling’s T2 using 95% and 99% confidence limits for 6

ACCEPTED MANUSCRIPT suspect and strong outliers respectively. Cross-validation of the OPLS-DA model was carried out using CV-ANOVA (p < 0.01) and permutation testing carried out to exclude overfitting (N=300), after inspecting model parameters (goodness-of-fit R2 Y and

goodness-of-prediction Q 2 Y).

Regarding Q 2 Y prediction ability, a value > 0.5 was adopted as a threshold to identify acceptable models, according to software recommendation and as set out in the literature (Rombouts et al.,

PT

2017). Finally, the variables importance in projection (VIP analysis) selection was then used to

RI

evaluate the importance of metabolites and to select those having the highest discrimination

SC

potential (VIP score > 1.2).

Finally, to discern among the markers being specifically responsible for origin discrimination

NU

from those related to cultivars, a Venn analysis has been carried out using an online tool (Venny 2.1

MA

- http://bioinfogp.cnb.csic.es/tools/venny/index.html).

3. Results and discussion

ED

3.1. Discrimination of different EVOO cultivars according to phenolic and sterols profiles

EP T

In the first part of this work, the untargeted profiling of EVOO samples was carried out in order to shed light onto the differences in secondary metabolites as a sole function of the cultivar, since

conditions.

AC C

the olive trees were grown in the same field and were therefore affected by the same pedo-climatic

Overall, the metabolomics-based approach allowed us to putatively annotate 1354 compounds (444 polyphenols and 910 sterols; supplementary material). The metabolomic profiling allowed to discriminate the nine different EVOO cultivars according to their sterols and phenolic fingerprints. In particular, both unsupervised and supervised multivariate statistics (i.e., HCA and OPLS-DA) let us to discriminate the different cultivars analyzed. In this regard, the HCA provided good clusters, considering that biological replicates belonging to the same variety grouped together and no mixed clusters could be observed (supplementary material). In more detail, two main clusters could be identified, suggesting a very similar metabolites profile between ‘Manzanilla’ and ‘Kalamon’ 7

ACCEPTED MANUSCRIPT cultivars when comparing to the others. These results were confirmed through the supervised OPLS discriminant analysis, using the cultivar as class membership criterion. As shown in the OPLS score plot (Figure 1), a clear discrimination of the cultivars was obtained, thus suggesting a strong relationship with the secondary metabolites’ profile. Looking in detail the score plot outlined by OPLS-DA, three main groups were obtained: ‘Manzanilla’ and ‘Kalamon’ cultivars clustered

PT

together, while a second major cluster comprising ‘Frantoio’, ‘Arbequina’, ‘Picual’ and ‘Koroneiki’

RI

oils was noticed. However, ‘Leccino’, ‘Sourani’ and ‘Picholine marocaine’ showed a more distinct metabolites profile when compared to the others. The model cross-validation parameters were

SC

found to be very robust, being the goodness-of-fit (R2 Y) and the goodness-of-prediction (Q2 Y) of

NU

0.97 and 0.91, respectively. Besides, no outliers could be observed by Hotelling’s T2, whereas both CV-ANOVA cross-validation (p < 0.01 for regression) and permutation test (N = 100) for

MA

overfitting were more than adequate (supplementary material). The most representative compounds for discrimination purposes were selected by using the VIP

ED

variable selection method, following OPLS-DA supervised modeling, and then grouped into chemical classes and subclasses (Table 1). The most represented class of polyphenols was

EP T

flavonoids (28 compounds), with anthocyanins such as delphinidin 3-O-sambubioside, peonidin 3O-(6''-acetyl-glucoside) and petunidin 3-O-galactoside, possessing a VIP score > 1.3. When

AC C

considering the other flavonoid-subclasses namely dihydrochalcones, flavonols and flavones, phloretin, quercetin 3-O-glucosyl-xyloside and tetramethylscutellarein possessed the highest VIP score (1.44, 1.33 and 1.28, respectively). Interestingly, among the phenolic acids outlined by VIP analysis, hydroxycinnamic acids were more discriminating than hydroxybenzoic acids, with the isomeric forms of feruloylquinic acid possessing the highest VIP score (1.35). The most discriminating sterols outlined by VIP were exported and classified in subclasses, by exploiting the information provided by the LIPID MAPS database (Lipidomics Gateway, http://www.lipidmaps.org/)

(Table

1).

Several sterol-derivatives characterized

the differences

among the cultivars considered, with an abundance of cholesterols, cardanolides, furostanols, 8

ACCEPTED MANUSCRIPT spirostanols, stigmasterols and other compounds. The highest discrimination potential was found for a stigmasterol (ajugalactone; VIP score: 1.40) followed by a cholesterol derivative (3α,7α,12αTrihydroxy-5β-cholestanoyl-CoA; VIP score: 1.37) and two furostanols (i.e., isomeric forms of pseudoprotodioscin; VIP scores: 1.36). The cultivars analyzed in this work are generally suitable for the production of table olive,

PT

further than for olive oil extraction. Interestingly, ‘Manzanilla’ and ‘Kalamon’, two of the most

RI

important cultivars for oil production, clustered together into the OPLS score plot, whereas

SC

‘Frantoio’, ‘Arbequina’ and ‘Koroneiki’, i.e., those cultivars characterized by a smaller fruit size, where found to be very close into the OPLS score plot according to their sterols and phenolic

NU

fingerprints. A recent work used QTOF-based metabolomics followed by multivariate statistics to discriminate EVOO cultivars from Greece (Kalogiouri et al., 2018). Notably, despite these authors

MA

used a wider database for annotations (i.e., the food database FooDB), phenolic compounds and flavonoids in particular were selected as the compounds displaying the highest discrimination

ED

potential. Conversely, Piccinonna et al. (2016) reported oleic acid and fatty chains as discriminants of Apulian olive oils. However, the metabolomic approach they applied was based on NMR

EP T

spectroscopy, and likely had a compounds coverage complementary to our QTOF screening.

AC C

3.2. UHPLC/QTOF-MS discrimination of EVOOs from different geographical origin The previous approach was used also to discriminate EVOO samples from different regions of Italy. The hierarchical cluster analysis (supplementary material) showed a clear separation between each oil sample, with all the biological replicates clustering together. In particular, it is possible to notice that “Regionale del Garda” was clearly separated from all the other samples, thus suggesting a different secondary metabolites profile. These findings are quite interesting considering that the main cultivars used for producing this EVOO are ‘Casaliva’, ‘Leccino’ and ‘Frantoio’ and that ‘Casaliva’ is considered as a possible synonym of ‘Taggiasca’ and ‘Frantoio’ (Muzzalupo, Vendramin, & Chiappetta, 2014). However, considering that ‘Taggiasca’ is the main cultivar 9

ACCEPTED MANUSCRIPT characterizing ‘Regionale ligure” EVOO, it was surprising to observe a so clear segregation of these two samples. However, it must be considered that ‘Regionale ligure” is produced from olives grown on the seaside, whereas “Regionale del Garda” is produced in the Northern area among those considered, hundred kilometers far from the coast, in a very particular climatic niche. Given the HCA outcomes from unsupervised analysis, the following OPLS-DA supervised

PT

modeling was carried out to identify markers for geographical origin. The score scatter plot (Figure

RI

2) confirmed the results obtained from the HCA, thus proving that the metabolomic profiling proposed was able to discriminate EVOO samples according to the geographical origin. In this

SC

regard, the model parameters were found to be excellent, being R2 Y = 0.97 and Q 2 Y = 0.93. No

NU

outlier samples could be observed by Hotelling’s T2, whereas both CV-ANOVA cross-validations (p < 0.01 for regression) and test for overfitting (permutation test, N = 100) were adequate

MA

(supplementary material). In order to perform an external validation of the chemometric approach applied, two quality controls of known origin (namely Tuscany and Sicily) were also analyzed

ED

using the same method, and their metabolomic profiles included into the OPLS-DA model. The resulting misclassification table (a data elaboration displaying class prediction accuracy), allowed

EP T

including both quality controls within the correct class, with a Fisher’s P of 3.6 E-26 (supplementary material). Therefore, this further investigation confirmed the prediction ability of

AC C

the multivariate approach we developed. The variable selection methods applied to OPLS-DA model (VIP analysis) allowed identifying the most discriminant phenolic and sterols among the compounds annotated. These compounds are reported in Table 2. Overall, 47 phenolic compounds possessed a VIP score > 1.2. In particular, flavonoids were found to be the most common class (21 compounds), with anthocyanins (7 compounds) and flavonols (10 compounds) being the most frequent compounds. Notably, glycosidic forms of kaempferol and cyanidin possessed very represented among these two latter phenolic subclasses. Phenolic acids were the second most frequent class (17 compounds), with a clear abundance of hydroxycinnamic acids, such as isomeric forms of caffeoylquinic and 10

ACCEPTED MANUSCRIPT sinapoylquinic acids. In addition, other phenolics such as lignans and stilbenes were reported. However, the compounds possessing the highest VIP score were found to be 6’’-O-Acetylglycitin (belonging to isoflavonoids) and p-Coumaroyl glucose (hydroxycinnamic acid) with VIP scores of 1.43 and 1.40, respectively. Moving to the sterols identified by VIP analysis, cholesterol (5 compounds), spirostanols (4 compounds), ergosterol (4 compounds), steryl esters (3 compounds)

PT

together with stigmasterol, furostanol and cycloartanol (2 compounds) derivatives were the most

RI

represented compounds. In this case, iokundjoside (i.e., cardanolides) and solaspigenin (i.e.,

SC

spirostanols) were characterized by the highest VIP scores (i.e., 1.41 and 1.35, respectively). It is possible that sterols profile in EVOO is affected also by the oil microbiota, typically present in solid

NU

particles and micro drops of vegetation water, where yeasts are predominant (Ciafardini & Zullo, 2018).

MA

The importance of a metabolomics-based approach to discriminate and authenticate EVOOs from different geographical origin was recently reported. For example, Gil-Solsona and co-authors

ED

(2016) used an UHPLC-QTOF MS approach to identify possible biomarkers able to discriminate different EVOO samples, while Ben Mohamed et al. (2018) using the same analytical conditions

EP T

were able to discriminate Tunisian and Italian EVOOs according to their sterols and phenolic fingerprints. The need to authenticate and differentiate EVOOs olive oils from different origins is

AC C

an essential procedure for both the industry and the final consumer. In our work, a metabolomic profiling approach has been proposed, focusing on two classes of secondary metabolites that are affected by the environment and are widely represented in EVOO. A previous work successfully using metabolomics to authenticate geographical origin of EVOOs from Spain (Gil-Solsona et al., 2016), selected twelve markers, seven of which could be identified; all these markers were lipids, even though no sterols were selected. These differences can be likely ascribed to the differences in acquisition and annotation between the present and the previous work. Conversely, Peršurić, Saftić, Mašek & Kraljević Pavelić (2018), used a combination of MALDI/TOF, GC/MS and NIRS to

11

ACCEPTED MANUSCRIPT authenticate geographical origin of Croatian EVOOs. According to the approach proposed, triacylglycerols were reported to be the most discriminating compounds. Therefore, several markers have been proposed for geographical origin of EVOOs, each of them having its peculiarities. Considering that the very most of commercial EVOOs are blend of different cultivars, it is important to discern whether the proposed markers are specifically related to the

PT

origin rather than being the result of the cultivar(s) considered. With this purpose, we carried out a

RI

Venn analysis to point out to which extent the markers for cultivars discrimination are shared with

SC

those for origin discrimination (Figure 3). As it can be noted, 9.6% of phenolics and 13.6% of sterols were in common, thus indicating that the very most of discriminant compounds are exclusive

NU

of a single condition. Likely, this is the result of using OPLS-DA supervised modeling approaches, where the predictive covariance related to a given class membership (either cultivar or origin) is

MA

separated from orthogonal non-predictive covariance.

Despite the novelty and encouraging findings of the approach proposed, based on untargeted

ED

phenolic and sterolic profiling associated to supervised multivariate OPLS-DA, further research is still needed. In fact, it is known that several factors might affect secondary metabolite profile in

EP T

olive fruit and consequently also in oil; therefore, broader investigations based on a higher number

AC C

of samples and geographical origins might be advisable.

4. Conclusions

Adulteration and misleading label information are a serious problem in the food sector; with this regard, particular attention is being paid to olive oils, thus demanding robust and effective techniques for authentication purposes. In this work, an untargeted metabolomics-based approach was used to identify phenolic and sterol compounds able to discriminate nine samples of EVOOs from different cultivars (grown in the same conditions) and blend samples of EVOOs having different geographical origin. The results showed that the profile of these secondary metabolites can be a powerful tool for discrimination purposes, with regards to cultivars and geographical origin, 12

ACCEPTED MANUSCRIPT confirming that UHPLC-MS metabolomic profiling represents an effective approach in food traceability. In particular, although a clear effect of the cultivar can be identified (thus indicating a strong genetic basis), the interaction with the environment results in a distinctive profile in phenolics and sterols. Holistic approaches such as untargeted profiling, associated to supervised multivariate analyses, allowed identifying the markers specifically related to geographical origin.

PT

Notwithstanding the preliminary nature of our research, the two classes of secondary metabolites

RI

considered in this work are entitled to represent a complementary and promising solution for plant

SC

foods traceability, thus deserving further investigations.

From a practical point of view, it is advisable that the markers proposed will be included in a

NU

validated targeted quantification method, e.g. based on less sophisticated and cheaper mass spectrometers such as triple quadrupole MS/MS. This would result in offering a simple, robust and

MA

cost-effective way to improve authentication of EVOOs on routine basis. Furthermore, given the promising outcomes of metabolomics, research aimed to investigate whether metabolomics could

ED

help to identify fraudulent activities (e.g., mixing lower quality oils in EVOOs) is worthwhile in the

EP T

next future.

AC C

Funding

This research did not receive any specific grant from funding agencies in the public, commercial and not-for-profit sectors.

Acknowledgements SG and GR were recipient of a Ph.D. fellowship from the Doctoral School AgriSystem, Piacenza, Italy. The authors thank the “Romeo ed Enrica Invernizzi” foundation for kindly supporting the metabolomic platform.

13

ACCEPTED MANUSCRIPT

References Alonso-Salces, R. M., Moreno-Rojas, J. M., Holland, M. V., Reniero, F., Guillou, C., & Héberger, K. (2010). Virgin olive oil authentication by multivariate analyses of 1 H NMR fingerprints and

PT

 13 C and  2 H data. Journal of Agricultural and Food Chemistry, 58(9), 5586-5596.

RI

Aparicio, R., Morales, M., Aparicio-Ruiz, R., Tena, N., & Garcìa-Gonzàlez, D. (2013). Authenticity of olive oil: Mapping and comparing official methods and promising alternative. Food Research

SC

International, (54), 2025-2038.

NU

Becerra-Herrera, M., Vélez-Martín, A., Ramos-Merchante, A., Richter, P., Beltrán, R., & Sayago, A. (2018). Characterization and evaluation of phenolic profiles and color as potential

ED

origin. Food Chemistry, 241, 328-337.

MA

discriminating features among Spanish extra virgin olive oils with protected designation of

Ben Mohamed, M., Rocchetti, G., Montesano, D., Ben Ali, S., Guasmi, F., Grati-Kamoun, N., &

EP T

Lucini, L. (2018). Discrimination of Tunisian and Italian extra-virgin olive oils according to their phenolic and sterolic fingerprints. Food Research International, 106, 920-927.

AC C

Blasi, F., Rocchetti, G., Montesano, D., Lucini, L., Chiodelli, G., Ghisoni, S., Baccolo, G., Simonetti, M. S., & Cossignani, L. (2018). Changes in extra-virgin olive oil added with Lycium barbarum L. carotenoids during frying: Chemical analyses and metabolomic approach. Food Research International, 105, 507-516. Ciafardini, G., & Zullo, B. A. (2018). Virgin olive oil yeasts: A review. Food Microbiology, 70, 245-253. Cubero-Leon, E., Penalver, R., & Maquet, A. (2014). Review on metabolomics for food authentication. Food Research International, 60, 95-107.

14

ACCEPTED MANUSCRIPT Esteki, M., Simal-Gandara, J., Shahsavari, Z., Zandbaaf, S., Dashtaki, E., & Vander Heyden, Y. (2018). A review on the application of chromatographic methods, coupled to chemometrics, for food authentication. Food Control, 93, 165-182. Gil-Solsona, R., Raro, M., Sales, C., Lacalle, L., Díaz, R., Ibánez, M., Beltran, J., Sancho, J. V., & Hernández, F. J. (2016). Metabolomic approach for Extra virgin olive oil origin discrimination

PT

making use of ultra-high performance liquid chromatography - Quadrupole time-of-flight mass

RI

spectrometry. Food Control, (70), 350-359.

SC

Kalogiouri, N. O., Aalizadeh, R., & Thomaidis, N. S. (2018). Application of an advanced and wide scope non-target screening workflow with LC-ESI-QTOF-MS and chemometrics for the

NU

classification of the Greek olive oil varieties. Food Chemistry, 256, 53-61.

MA

Longobardi, F., Ventrella, A., Casiello, G., Sacco, D., Tasioula-Margari, M., Kiritsakis, A., & Kontominas, M. (2012). Characterisation of the geographicasl origin of Western Greek virgin

ED

olive oils based on instrumental and multivariate statistical analysis. Food Chemistry, 133, 169-

EP T

175.

Lucini, L., Borgognone, D., Rouphael, Y., Cardarelli, M., Bernardi, J. & Colla, G. (2016). Mild Potassium Chloride Stress Alters the Mineral Composition, Hormone Network, and Phenolic

AC C

Profile in Artichoke Leaves. Frontiers in Plant Scence, 7, 948. Merchak, N., El Bacha, E., Khouzam, R. B., Rizk, T., Akoka, S., & Bejjan, J. (2017). Geoclimatic, morphological, and temporal effects on Lebanese olive oils composition and classification: A 1 H NMR metabolomic study. Food Chemistry, 217, 379-388. Muzzalupo, I., Vendramin, G. G., & Chiappetta, A. (2014). Gemetic biodiversity of Italian olives (Olea europaea) germplasm analyzed by SSR markers. The Scientific World Journal, vol. 2014, Article ID 296590, 12 pages. https://doi.org/10.1155/2014/296590.

15

ACCEPTED MANUSCRIPT Peršueić, Z., Saftić, L., Mašek, T., & Pavelić, S. K. (2018). Comparison of triacylglycerol analysis by MALDI-TOF/MS, fatty acid analysis by GC-MS and non-selective analyis by NIRS in combination with chemometrics for determination of extra virgin olive oil geographical origin. A case study. LWT, 95, 326-332. Piccinonna, S., Ragone, R., Stocchero, M., Del Coco, L., De Pascali, S. A., Schena, F. P., &

PT

Fanizzi, F. P. (2016). Robustness of NMR-based metabolomics to generate comparable data sets

RI

for olive oil cultivar classification. An inter-laboratory study on Apulian olive oils. Food

SC

Chemistry, 199, 675-683.

Pretali, L., Bernardo, L., Butterfield, T. S., Trevisan, M., & Lucini, L. (2016). Botanical and elicit

a

similar

Induced

Systemic Response in tomato

NU

biological pesticides

(Solanum

MA

lycopersicum) secondary metabolism. Phytochemistry, 130, 56-63. Rocchetti, G., Chiodelli, G., Giuberti, G., Masoero, F., Trevisan, M., & Lucini, L. (2017).

ED

Evaluation of phenolic profile and antioxidant capacity in gluten-free flours. Food Chemistry, 228, 367-373.

EP T

Rombouts, C., Hemeryck, L. Y., Van Hecke, T., De Smet, S., De Vos, W. H., & Vanhaecke, L. (2017). Untargeted metabolomics of colonic digests reveals kynurenine pathway metabolites,

AC C

dityrosine and 3-dehydroxycarnitine as red versus white meat discriminating metabolites. Scientific Reports, 7, 42514. Sales, C., Cervera, M., Gil, R., Portolés, T., Pitarch, E., & Beltran, J. (2017). Quality classification of Spanish olive oils by untargeted gas chromatography coupled to hybrid quadrupole-time of flight mass spectrometry with atmospheric pressure chemical ionization and metabolomics-based statistical approach. Food Chemistry, (216), 365-373. Sales, C., Portolés, T., Johnsen, L. G., Danielsen, M., & Beltran, J. (2019). Olive oil quality classification and measurement of its organoleptic attributes by untargeted GC-MS and multivariate statistical-based approach. Food Chemistry, 271, 488-496. 16

ACCEPTED MANUSCRIPT Suarez, M., Caimani, A., del Bas, J., & Arola, L. (2017). Metabolomics: An emerging tool to evaluate the impact of nutritional and physiological challenges. Tends in Analiytical Chemistry, (96), 79-88.

PT

FIGURE CAPTIONS Figure 1. Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) on

RI

EVOO samples phenolic and sterolic profile, using the ‘cultivar-type’ as class membership

SC

criterion. Individual replications are given in the class prediction model score plot.

NU

Figure 2. Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) on EVOO samples phenolic and sterolic profile according to their geographical origin. Individual

MA

replications are given in the class prediction model score plot. Figure 3. Venn diagrams considering phenolic and sterolic markers outlined by VIP selection

ED

method according to both geographical origin and cultivar-type. The upper diagram [A] represents

AC C

TABLE CAPTIONS

EP T

the output for phenolic compounds, while the lower one [B] is related to sterols.

Table 1. Discriminant phenolic and sterolic compounds identified by VIP (Variable Importance in Projection) selection method, following supervised OPLS-DA of different EVOO samples and using the ‘cultivar-type’ as class membership criterion. Compounds are provided together with VIP scores (measure of variables importance in the OPLS-DA model). Table 2. Discriminant phenolic and sterolic compounds identified by VIP (Variable Importance in Projection) selection method, following supervised OPLS-DA of EVOO samples from different

17

ACCEPTED MANUSCRIPT geographical origin. Compounds are provided together with VIP scores (measure of variables importance in the OPLS-DA model).

SUPPLEMENTARY MATERIAL

PT

Supplementary Table 1. Comprehensive dataset of identified phenolic and sterolic compounds,

RI

together with abundances, annotations (identification scores and raw formulas) and composite

SC

spectrum (masses and their abundances).

Supplementary Table 2. External validation of the OPLS-DA model for prediction of EVOO

NU

origin. Quality Controls QC oils from Tuscany and Sicily were included in the analysis and a misclassification table was then produced. Such table is presented together with the corresponding

MA

OPLS-DA score plot.

Supplementary Figure 1. Non-averaged unsupervised cluster analysis on the phenolic and sterolic

ED

profile of different EVOO samples representative of nine olive cultivars (i.e., Leccino, Picual,

EP T

Picholine marocaine, Frantoio, Sourani, Kalamon, Manzanilla de Sevilla, Arbequina and Koroneiki) (similarity: Euclidean; linkage rule: Ward). Compounds intensity was used to build up heat map, on

AC C

the basis of which the clusters were generated. Supplementary Figure 2. Validation of OPLS-DA discriminant model on phenolic and sterolic profile as a function of ‘cultivar-type’; Hotelling's T2 using 95% and 99% confidence limits is given in upper pane [A], whereas outcome of permutation test (N = 100) is given in the lower one [B]. Supplementary Figure 3. Non-averaged unsupervised cluster analysis on the phenolic and sterolic profile of different EVOO samples from Sicily, Puglia, Umbria, Liguria, Lombardy and Tuscany (similarity: Euclidean; linkage rule: Ward). Compounds intensity was used to build up heat map, on the basis of which the clusters were generated. 18

ACCEPTED MANUSCRIPT Supplementary Figure 4. Validation of OPLS-DA discriminant model on phenolic and sterolic profile as a function of geographical origin; Hotelling's T2 using 95% and 99% confidence limits is given in upper pane [A], whereas outcome of permutation test (N = 100) is given in the lower one

Class

Subclass

Marker

Flavonoids

Anthocyanins

Delphinidin 3-O-sambubioside

PT

[B].

SC

Petunidin 3-O-galactoside

RI

Peonidin 3-O-(6''-acetyl-glucoside)

Delphinidin 3-O-(6''-acetyl-glucoside)

NU

Delphinidin 3-O-(6''-acetyl-galactoside) Cyanidin 3-O-xyloside/arabinoside

MA

Pelargonidin 3-O-arabinoside Peonidin 3-O-(6''-acetyl-galactoside)

ED

Delphinidin 3-O-arabinoside/xyloside

EP T

Petunidin 3-O-glucoside

Dihydrochalcones

Phloridzin Phloretin

AC C

Flavanols

Flavones

(+)-Catechin 3-O-gallate (-)-Epicatechin 3-O-gallate Tetramethylscutellarein Chrysoeriol 7-O-glucoside Rhoifolin/Isorhoifolin

Flavonols

Quercetin 3-O-glucosyl-xyloside Quercetin 3-O-arabinoside/xyloside Isorhamnetin 3-O-rutinoside Kaempferol 3-O-(6''-acetyl-galactoside) 7-O-rhamnoside

VIP score 1.34  0.09 1.33  0.16 1.30  0.09 1.27  0.41 1.27  0.41 1.27  0.10 1.27  0.08 1.27  0.13 1.26  0.13 1.25  0.09 1.25  0.11 1.44  0.20 1.25  0.10 1.25  0.10 1.28  0.40 1.24  0.17 1.23  0.13 1.33  0.07 1.27  0.13 1.26  0.18 1.24  19

ACCEPTED MANUSCRIPT

Spinacetin 3-O-glucosyl-(1-6)-[apiosyl(1-2)]-glucoside Spinacetin 3-O-glucosyl-(1-6)-glucoside Phenolic acids

Hydroxybenzoics

Galloyl glucose

Hydroxycinnamics

3/4/5-Feruloylquinic acid

PT

Cinnamic acid

Stigmastanol ferulate

SC

Sinapic acid

RI

Avenanthramide 2f

1,2-Disinapoylgentiobiose

-

Isolariciresinol

MA

Lignans

NU

1,2'-Disinapoyl-2-feruloylgentiobiose

Pinoresinol

Matairesinol

ED

Secoisolariciresinol

Sterols

Phenolic terpenes

Rosmadial

Curcuminoids

Curcumin

Calysterols and cyclopropyl sidechain derivatives Cardanolides and derivatives

(24S)-isocalysterol

AC C

Other polyphenols

EP T

Anhydro-secoisolariciresinol

3beta-(6-deoxy-4-O-beta-D-glucopyranosyl-3-O-methylbeta-D-galactopyranosyloxy)-14,16beta-dihydroxy-5betacard-20(22)-enolide Adonitoxin Convallatoxin

Cholesterol and derivatives

3alpha,7alpha,12alpha-Trihydroxy-5beta-cholestanoylCoA Ecdysone 25-O-D-glucopyranoside Cholesterol-5-hydroperoxide

0.24 1.30  0.26 1.28  0.29 1.29  0.19 1.34  0.19 1.34  0.17 1.26  0.08 1.26  0.13 1.26  0.18 1.24  0.09 1.22  0.20 1.34  0.19 1.29  0.10 1.29  0.10 1.26  0.24 1.23  0.24 1.23  0.24 1.38  0.22 1.30  0.14 1.30  0.14 1.26  0.26 1.26  0.26 1.37  0.14 1.33  0.13 1.30  0.20 20

ACCEPTED MANUSCRIPT 24S,25-dihydroxycholesterol Ianostane 20-hydroxyecdysone 22-phosphate

RI

PT

3beta-hydroxy-4beta-methyl-5alpha-cholest-7-ene4alpha-carboxylic acid 4alpha-carboxy-4beta-methyl-5alpha-cholesta-8-en-3betaol 3beta-hydroxy-4alpha-methyl-5alpha-cholest-7-ene4beta-carboxylic acid Pinnasterol 7alpha-hydroxy-4-cholesten-3-one-d7

Nebrosteroid L

NU

Ergosterols and C24-methyl derivatives

SC

7-oxocholesterol (d7)

Pseudoprotodioscin

EP T

Furostanols and derivatives

ED

MA

campest-22E-en3beta,4beta,5alpha,6alpha,8beta,14alpha,15alpha,25R,26nonol campest-22E-en3beta,4beta,5alpha,6alpha,8beta,14alpha,15alpha,25,28nonol Conicasterol D

AC C

3-O-(Rhaa1-4(Rhaa1-2)Glcb)-26-O-(Glcb)-(25R)-furost-5en-3beta,22,26-triol Pseudoprotogracillin

Solanidines and alkaloid derivatives

Spirostanols and derivatives

26-O-[beta-D-glucopyranosyl]-25R-furostan3beta,22alpha,26-triol Solasodine Etioline Diospolysaponin A Trillin 3-O-(Galb)-(25R)-12-oxo-5alpha-spirostan-3beta-ol Alliogenin 6-O-(Glcb)-(25R)-5alpha-spirostan-3beta,6alpha-diol Karatavegenin C

1.30  0.20 1.25  0.51 1.24  0.14 1.24  0.31 1.24  0.31 1.24  0.31 1.23  0.17 1.22  0.79 1.22  0.79 1.26  0.18 1.25  0.26 1.25  0.26 1.24  0.31 1.36  0.15 1.36  0.14 1.30  0.23 1.29  0.21 1.23  0.35 1.23  0.35 1.29  0.44 1.29  0.12 1.27  0.15 1.25  0.29 1.24  0.16 1.23  21

ACCEPTED MANUSCRIPT

Protometagenin Steroid lactones

4beta-Hydroxywithanolide E

Steryl esters

15:0 Cholesteryl ester 17:1 Cholesteryl ester

PT

16:1 Campesteryl ester 14:1 Cholesteryl ester Ajugalactone

RI

Stigmasterols and C24-ethyl derivatives

SC

24Z-ethylidene-cholesta-5,7-dien-3beta-ol

Withanolide

1.30  0.14 1.22  0.18 1.22 0.09

AC C

EP T

ED

MA

Withanolides and derivatives

NU

Stigmastanyl glucoside

0.17 1.23  0.17 1.22  0.12 1.31  0.28 1.29  0.13 1.29  0.13 1.24  0.62 1.40  0.14

22

ACCEPTED MANUSCRIPT Subclass Anthocyanins

Marker Cyanidin 3-O-xylosylrutinoside Cyanidin 3-Osambubioside 5-Oglucoside Delphidin 3-Oarabinoside/xyloside Cyanidin 3-O-(6’’malonyl-glucoside) Peonidin 3-O-(6’’-acetylglucoside/galactoside) Phloretin 2’-O-xylosylglucoside Apigenin 7-O-glucoside Kaempferol 3-O-glucoside Kampferol 3-O-(2’’rhamnosyl-galactoside) 7O-rhamnoside Kaempferol 3-Orhamnosyl-rhamnosylglucoside Kaempferol 3-O-xylosylrutinoside Kaempferol 3-O-(2’’rhamnosyl-6’’acetylgalactoside) 7-Orhamnoside Quercetin 3-O-glucoside Patuletin 3-O-glucosyl-(16)-[apiosyl(1-2)]glucoside Spinacetin 3-O-glucosyl(1-6)-glucoside Quercetin 3-O-xylosylrutinoside Quercetin 3-O-galactoside 6’’-O-Acetylglycitin 3,5-Dihydroxybenzoic acids Protocatechuic acid p-Coumaroyl glucose p-Coumaric acid 4-Oglucoside 24-Methylcholesterol ferulate 24-Methyllathosterol ferulate 24-Methylenecholestanol ferulate p-Coumaroyl glycolic acid 3/4/5-Caffeoylquinic acid 3/4/5-Sinapoylquinic acid 1,2’-Disinapoyl-2feruloylgentiobiose

PT

Class Flavonoids

Dihydrochalcones

EP T

ED

MA

NU

SC

RI

Flavones Flavonols

Isoflavonoids Hydroxybenzoics

AC C

Phenolic acids

Hyddroxycinnamics

VIP score 1.36  0.38 1.35  0.11 1.33  0.13 1.31  0.31 1.28  0.08 1.35  0.13 1.33  0.17 1.35  0.21 1.35  0.25 1.34  0.24 1.34  0.24 1.32  0.15

1.32  0.17 1.35  0.19 1.34  0.16 1.30  0.14 1.27  0.16 1.43  0.15 1.31  0.38 1.31  0.38 1.40  0.32 1.39  0.32 1.36  0.19 1.36  0.19 1.36  0.19 1.36  0.29 1.32  0.17 1.28  0.20 1.28  0.12

23

ACCEPTED MANUSCRIPT

Ergosterols and C24-methyl derivatives

Furanostanols

Solanidines and alkaloid derivatives Spirostanols

PT

RI

1.31  0.13 1.34  0.14 1.28  0.13 1.33  0.19 1.29  0.18 1.31  0.50

Iokundjoside 3alpha,7alpha,12alphaTrihydroxy-5betacholestanoyl-CoA 3beta-hydroxy-4betamethyl-5alpha-cholest-7ene-4alpha-carboxylic acid 4alpha-carboxy-4betamethyl-5alpha-cholesta-8en-3beta-ol 3beta-hydroxy-4alphamethyl-5alpha-cholest-7ene-4beta-carboxylic acid Cholestanyl glucoside Trihydroxycoprostanoic acid Ganosporeric acid A Cimicifoetiside A Cycloartenol Campesteryl glucoside

1.41  0.27 1.31  0.48

Conicasterol D Mutasterol Castasterone 26-O-[beta-Dglucopyranosyl]-25Rfurostan-3beta,22alpha,26triol Pseudoprotogracillin Tomatine

1.29  0.37 1.29  0.18 1.28  0.13 1.29  0.21

Epimetagenin Solaspigenin Trillenogenin

1.35  0.22 1.35  0.22 1.31  0.20

NU

Cycloartanols and derivatives

1.36  0.17 1.36  0.17 1.36  0.17 1.34  0.17 1.31  0.12 1.28  0.13 1.29  0.27 1.31  0.12 1.28  0.25 1.34  0.17

Aragusteroketal

MA ED EP T

Stilbenes Sterols

Brassinolides and derivatives Bufanolides and derivatives C24-propyl sterols and derivatives Calysterols and cyclopropyl sidechain derivatives Cardanolides and derivatives Cholesterol and derivatives

AC C

Lignans

Homovanillic acid Dihydrocaffeic acid Syringaldehyde Rosmadial 3-Methoxyacetophenone Esculetin Estragole 4-Vinylguaiacol 4-Ethylphenol Anhydrosecoisolariciresinol Syringaresinol Piceatannol 3-O-glucoside 7-oxotyphasterol Scillaren A 24-Isopropenylcholesterol

SC

Other polyphenols

Hydroxyphenylacetics Hydroxyphenylpropanoics Hydroxybenzaldehydes Phenolic terpenes Hydroxybenzoketones Hydroxycoumarins Hydroxyphenylpropenes Alkylmehtoxyphenols Alkylphenols -

1.29  0.37

1.29  0.37 1.29  0.37 1.29  0.13 1.28  0.13 1.27  0.20 1.30  0.18 1.29  0.18 1.30  0.17

1.28  0.43 1.35  0.20

24

ACCEPTED MANUSCRIPT

Stigmasterols and C24-ethyl derivatives

18:0 Cholesteryl ester 16:0 Sitosteryl ester 15:0 Cholesteryl ester Cyasterone

1.31  0.08 1.30  0.26 1.31  0.08 1.31  0.23

(20S,24S)-24ethylthornasterol Minabeolide-7

1.30  0.24

1.34  0.26

1.28  0.22

SC

RI

Withanolides and derivatives

1.28  0.13 1.27  0.11

PT

Steroids and steroid derivatives Steryl esters

Hongguanggenin 3-O-(Galb)-(25R)-12-oxo5alpha-spirostan-3beta-ol Withaperuvin B

NU

Discrimination of extra-virgin-olive oils from different cultivars and geographical origins by untargeted

MA

metabolomics

ED

Silvia Ghisoni, Luigi Lucini, Federica Angilletta, Gabriele Rocchetti, Daniela Farinelli, Sergio Tombesi, Marco Trevisan

EP T

HIGHLIGHTS

Sterolic and phenolic profiles in EVOO accounted for more than 1000 compounds.



The chemical fingerprints of each EVOO cultivar analyzed showed differences.



Multivariate statistics discriminated EVOO according to geographical origin.



Flavonoids, hydroxycinnamics and cholesterol derivatives were the best markers.

AC C



25

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5

Figure 6

Figure 7