Accepted Manuscript Investigation of the global protein content from healthy human tears Marianne Dor, Simone Eperon, Patrice H. Lalive, Yan Guex-Crosier, Mehrad Hamedani, Cindy Salvisberg, Natacha Turck PII:
S0014-4835(18)30356-7
DOI:
10.1016/j.exer.2018.10.006
Reference:
YEXER 7504
To appear in:
Experimental Eye Research
Received Date: 11 May 2018 Revised Date:
1 September 2018
Accepted Date: 11 October 2018
Please cite this article as: Dor, M., Eperon, S., Lalive, P.H., Guex-Crosier, Y., Hamedani, M., Salvisberg, C., Turck, N., Investigation of the global protein content from healthy human tears, Experimental Eye Research (2018), doi: https://doi.org/10.1016/j.exer.2018.10.006. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
ACCEPTED MANUSCRIPT
Investigation of the global protein content from healthy human
2
tears
3
Authors: Marianne Dor1,2, Simone Eperon3, Patrice H. Lalive4, Yan Guex-Crosier3,
4
Mehrad Hamedani3, Cindy Salvisberg2, Natacha Turck1,2*.
RI PT
1
5 6
Affiliations:
7
1
8
University of Geneva, Geneva, Switzerland
9
2
SC
OPTICS Group, Department of Internal Medicine Specialties, Faculty of Medicine,
M AN U
OPTICS Group, Department of Human Protein Science, Faculty of Medicine,
10
University of Geneva, Geneva, Switzerland
11
3
12
Fondation Asile des Aveugles, Lausanne, Switzerland
13
4
14
Switzerland
Department of Ophthalmology, University of Lausanne; Jules-Gonin Eye Hospital,
15
TE D
Division of Neurology, Geneva University Hospitals, Faculty of Medicine, Geneva,
*Corresponding author:
17
Email:
[email protected].
19 20 21
AC C
18
EP
16
22 23 24 25 1
ACCEPTED MANUSCRIPT 26
Abstract
28
Considering absence of invasiveness and side effects, tears emerge as a particularly
29
attractive fluid for biomarker discovery and therefore for daily clinical use. However,
30
to date this fluid remains poorly studied in healthy condition. Here, we present an
31
updated in-depth characterisation of the human healthy tear protein composition
32
using proteomics approach. Both eyes of 8 healthy controls (4 men and 4 women,
33
average age: 38 ± 18) were collected using the Schirmer’s strip method. After liquid
34
digestion and off-gel electrophoresis fractionation, three independent proteomics
35
analyses were performed on a LTQ-Orbitrap Velos Pro. Globally, 1351 proteins were
36
identified with 2 unique peptides and 1% FDR. Gene ontology analyses showed up
37
that 39% of the tear proteins were enzymes, with high numbers of dehydrogenases,
38
phosphatases, kinases and ligases. Immunoglobulins, serpins and 14-3-3 domains
39
proteins also emerged as the main tear protein families. The glycolysis and the
40
coagulation and complement cascades, which were already shown in tears as
41
involved in ocular and systemic diseases, were highlighted performing pathway
42
analyses. Our study therefore complements the existing data on healthy tears
43
proteome. Nevertheless, extensive studies for deeply and definitively characterise
44
this promising fluid are required in the near future in order to be able to routinely use
45
this fluid in clinics. A better understanding of its protein content will probably open
46
new avenues in the biomarker discovery and clinical practice in the near future.
AC C
EP
TE D
M AN U
SC
RI PT
27
47
48
Keywords: tears; proteome; Off-Gel Electrophoresis; Gas-Phase Fractionation;
49
integrative pathways. 2
ACCEPTED MANUSCRIPT 50
1. Introduction
52
It is now well known that tears play a key role in the correct function and health status
53
of the eye by providing oxygen and nutriments to the ocular surface cells and
54
improving the optical properties of the eye (lubricating eye surface). Furthermore,
55
tear fluid protects the eye’s epithelium thanks to antibacterial properties and by
56
flushing contaminants from the ocular surface. Surprisingly, until now tears and their
57
clinical relevance have been relatively poorly studied. This is particularly illustrated by
58
searching the keywords “Eye tears and biomarkers” in Pubmed. Only 268 items were
59
found during the two last decades (between 1998/01 and 2018/01). In the same
60
period, 264’224 publications containing the words “blood” and “biomarkers” were
61
published. However, with non-invasive, easy and rapidly collected samples, tear-
62
based approaches open up new routes for diagnostic methods and for deeper
63
understanding both ocular and systemic diseases. Differences in the tear protein
64
pattern of patients suffering from diabetic retinopathy (1-3), Graves’ Orbitopathy (4,
65
5), dry eye disease (6-11), multiple sclerosis (12, 13) and even breast cancer (14) or
66
renal failure (15) have already been highlighted. Consequently, the ability to measure
67
modifications in human tear content offers promising opportunities for screening not
68
only ocular but also systemic diseases and for discovering potentially new
69
biomarkers for these clinical situations.
70
At this stage, in order to be as performing as possible, the use of tears for biomarker
71
discovery and clinics requires well knowing and understanding their global protein
72
content, mainly in healthy subjects. To the best of our knowledge, to date, only three
73
major studies (16-18) deeply investigated the proteome of human healthy tears. Even
AC C
EP
TE D
M AN U
SC
RI PT
51
3
ACCEPTED MANUSCRIPT if these studies are technically and biologically relevant, this is probably not enough
75
to definitively close the question of the complex composition of this fluid in healthy
76
condition. However, in the past three years, there was an increasing number of
77
published papers about tear proteomics, in both ocular and systemic diseases (10,
78
11, 19-22). It is not easy to understand or explain why tears, unlike other biological
79
fluids, just start to arouse real clinical interest. It is likely that the relatively low
80
volumes available for collection contributed significantly to the low numbers of ‘omics’
81
studies reported so far. Indeed, an average between 5 and 15 µl of tears can be
82
collected per eye, which is not sufficient for some ‘omics’ technics (for example,
83
Nuclear Magnetic Resonance usually required around 100µl). Due to their
84
localisation close to the eye, for a long time tears were probably not considered as a
85
relevant biofluid to study systemic diseases. Moreover, such as albumin within blood,
86
tears contain some abundant proteins that makes more difficult the detection of other
87
proteins. In previous studies investigating tear proteome, either the pool of samples
88
were rather small (one sample for De Souza et al.(17) and four samples for Zhou et
89
al.(18)) or the protein list was obtained with several extraction methods from the
90
Schirmer strip (Aass et al.(16)). In this context, we designed our study with eight
91
samples divided in three pools for our experiments and we used only one extraction
92
method from the Schirmer. Moreover, we fractionated our peptides thanks to the off-
93
gel electrophoresis method, which was not well described for tear investigation yet.
94
To conclude, compared to the previous articles, we made improvements that allowed
95
to enhance tear understanding. In this context, using mass spectrometry-based
96
approaches, our study proposes an updated and extended list of tear proteins which
97
could be used as additional reference list for clinical applications in the field of the
98
biomarker research.
AC C
EP
TE D
M AN U
SC
RI PT
74
4
ACCEPTED MANUSCRIPT 99
100
2. Material and methods
101
2.1.Subjects.
103
Healthy subjects (N=8; experiment 1: 2 women aged 59 and 61 years; experiment 2:
104
1 woman, 2 men, aged 21, 24 and 60 years respectively; experiment 3: 1 woman, 2
105
men aged 26, 24 and 26 years respectively) were included in this study. Inclusion
106
criteria were: subjects aged ≥ 18 and ≤ 70 years, any gender and ethnicity, mm of
107
Schirmer >10mm in 5 minutes. Exclusion criteria were: any eye allergy, conjunctivitis,
108
any eye pathology, acute infectious disease, make-up or contact lenses, cancer
109
and/or chronic autoimmune disease. In accordance with the Declaration of Helsinki,
110
written informed consents were obtained from these subjects. The cantonal ethics
111
committee for research on human beings has approved the patient’s informed
112
consent form and the use of biological material (approbation N°516/12, Dec 2012).
SC
M AN U
TE D
EP
113
RI PT
102
2.2. Sample collection.
115
Tears were collected using Schirmer-Plus® paper strip (Biotech Vision Care PVT
116
LTD, Gujarat, India). To avoid any discomfort to subjects, collection was restricted to
117
a maximum of 5 minutes. No external stimulation was done to collect the basal fluid.
118
Topical anaesthesia should be avoided because it reduces tear production. For each
119
patient, tear samples were taken from both eyes. Persons collecting tears wore
120
gloves in order to avoid any contamination. Care was taken to avoid damage to the
121
conjunctive surface and local eye irritation. External factors such as harsh lighting,
AC C
114
5
ACCEPTED MANUSCRIPT background noise and extreme room temperature, all known to affect the content of
123
samples, were strictly supervised in order to ensure satisfactory reproducibility. The
124
strip was then inserted in a tube on ice and centrifuged at 7840 g for 7 min at 4°C
125
without any additional buffer, as described elsewhere (21, 23, 24). After
126
centrifugation, tear samples were immediately stored at -80°C until analysis. In order
127
to verify the absence of cellular contamination in our samples, we performed a
128
haematoxylin and eosin (H&E) stain in order to color the potential cells. 5µl of tears
129
were deposited on a slide. The slide was heated 5 minutes at 60°C then fixed with 50
130
µl of 10% formalin (10 minutes). 3 washes were done with H2O and another stain
131
was done with 20µl of Hemalun (5 min). The slide was washed 3 times with H2O,
132
stained with 20µl of Eosin (5 min), washed 3 times with H2O then 3 times with 10%
133
alcohol. The slide was finally washed 3 times with the Neo-Clear solution (Merck,
134
Darmstadt, Deutschland). The same protocol was applied to the positive control, i.e.
135
human astrocytes (around 5’000 cells, suspended in PBS 1X, added on a Schirmer
136
strip and centrifuged under the same conditions as for control tear strip) and negative
137
control, i.e. PBS. The conclusion was that the Schirmer’s method did not induce cell
138
contamination in our samples.
EP
TE D
M AN U
SC
RI PT
122
139
Table 1: Moisture length (mm) of Schirmer strip for the cohort.
AC C
140
Subject
Age
Sex*
mm Schirmer (right eye)
mm Schirmer (left eye)
C1
59
W
12
13
C2
61
W
14
19
C3
21
W
15
18
C4
24
M
25
25
C5
60
M
25
15
C6
26
W
16
14
C7
24
M
32
23
6
ACCEPTED MANUSCRIPT C8
26
M
13
12
*: M: man; W: woman
142
2.3. Total protein assay.
143
The protein concentration of the pooled tear sample (from both eyes of each subject)
144
was determined by performing a Bradford Assay according to manufacturer’s
145
recommendations (Protein assay Dye reagent concentrate, Bio-Rad, Hercules, US-
146
CA). The absorbance was measured at 595 nm with a spectrophotometer (Ultrospec
147
2100 Pro, Amersham Biosciences) and the protein concentrations of the three pools
148
were determined using a bovine serum albumin (BSA) calibration curve. They were
149
estimated at 12.9 µg/µl, 11.06 µg/µl and 9.4 µg/µl for the experiments 1, 2 and 3
150
respectively.
M AN U
SC
RI PT
141
151
2.4. Reduction, alkylation and digestion.
153
Three independent proteomics experiments were done. Experiment 1 was performed
154
with a pool of two healthy subjects, experiments 2 and 3 were done using two
155
different pools of three healthy subjects (both eyes of the subjects in all three
156
experiments). 60 µg of proteins were used for all experiments. They were dried under
157
speed-vacuum; then urea (33 µl of 6M; Merck, Darmstadt, Deutschland) diluted in
158
Triethylammonium bicarbonate buffer (0.1M; TEAB, Sigma-Aldrich, Saint Louis, US-
159
MO) and tris-(2-carboxyethyl)phosphine (2 µl of 50 mM; TCEP, Sigma-Aldrich, Saint
160
Louis, US-MO) were added in each tube. After incubation at 37°C during 1 hour,
161
iodoacetamid (1 µl of 400 mM; Sigma-Aldrich, Saint Louis, US-MO) was added and
162
tubes were incubated 30 minutes of in the dark. TEAB (67 µl of 0.1M) was added,
163
then a liquid trypsin digestion (1:20 ratio, 1 microgram of enzyme to 20 micrograms of
164
protein, porcine origin, Promega Corporation, Madison, US-WI) was done overnight
165
at 37°C.
AC C
EP
TE D
152
7
ACCEPTED MANUSCRIPT 166
2.5. Off-gel electrophoresis (OGE).
168
Before OGE, samples were dried under speed-vacuum then purified by using
169
Macrospin columns (Harvard Apparatus, Holliston, US-MA) according to
170
manufacturer’s recommendations. Tubes were dried under speed-vacuum and A
171
3100 OFFGEL Fractionator (Agilent technologies, Santa Clara, US-CA) was
172
performed over night to separate the sample. Guidelines available in Agilent
173
datasheet were followed, using a 13cm IPG strip (Immobiline DryStrip pH 3-10, 13cm
174
GE Healthcare, Little Chalfont, UK) and 12 OGE wells (25, 26). After fractionation,
175
microspin columns (Harvard Apparatus, Holliston, US-MA) were used according to
176
the manufacturer’s recommendations and the 12 fractions of each experiment were
177
dried under speed-vacuum. Peptide concentration of the fractions was theoretically
178
approximated, considering that 1/12 of the pooled sample was found in each fraction
179
after OGE.
SC
M AN U
TE D
180
RI PT
167
2.6. Mass spectrometry (MS) analyses.
182
The fractions resulting from OGE were dissolved in 94.9% H2O / 5% Acetonitrile
183
(ACN) / 0.1% Formic acid (FA). For the three experiments, 1µg of fraction was
184
injected in GPF4 mode (27), meaning that each fraction of each experiment was
185
injected four times (GPF1, GPF2, GPF3 and GPF4). They were analysed by tandem
186
MS (Liquid Chromatography-MS/MS) using a Linear Trap Quadrupole (LTQ) Orbitrap
187
Velos Pro (ThermoFisher instruments, San Jose, US-CA) coupled to a nanoflow high
188
pressure liquid chromatography (HPLC, NanoAcquity system from Waters, Milford,
189
US-MA). Peptides were trapped on a home-made 5 µm 200 Å Magic C18 AQ
190
(Michrom) 0.1 × 20 mm pre-column and separated on a commercial 0.075 x 150 mm
AC C
EP
181
8
ACCEPTED MANUSCRIPT Nikkyo (Nikkyo Technology, Tokyo, JPN) analytical nanocolumn (C18, 5 µm, 100 Å).
192
More precisely, trapping was done during 15 minutes with a flow rate of 3 µl/min
193
using a gradient of H2O/FA 99.9%/0.1% (solvent A) and CH3CN/FA 99.9%/0.1%
194
(solvent B), where 95% of solvent A were mixed with 5% of solvent B. Then the
195
analytical separation was run for 85 min with a flow rate of 220 nl/min as follows: 0–1
196
min 95% A and 5% B, 1-55 min 65% A and 35% B, 55-65 min 20% A and 80% B, 65-
197
67 min 20% A and 80% B, 67-69 min 95% A and 5% B and 69-85 min 95% A and
198
5% B. min. For MS survey scans, the OT resolution was set to 60000 and the ion
199
population was set to 5 × 105 with an m/z window from 400 to 2000. Five precursor
200
ions were selected for collision-induced dissociation (CID) in the LTQ. The
201
normalised collision energies were set to 35% for CID. The different m/z windows for
202
the gas-phase fractions were set as following: 400-520 for GPF1, 515-690 for GPF2,
203
685-979 for GPF3 and 974-2000 for GPF4. Exclusion duration was set at 45
204
seconds, with a repeat count of 1. Exclusion was done by mass, with a low m/z = 0.1
205
and a high m/z = 1.1. Peak lists and resulting files, combined from the different
206
experiments, were searched against the uniprot_sprot (2017_05_10) database using
207
Thermo Proteome Discoverer (version 2.2.0388; Thermo Fisher Scientific, San Jose,
208
US-CA). Oxidised methionine was set as variable amino acid modifications and
209
carbamidomethylation of cysteines were set as fixed modification. Trypsin was
210
selected as the enzyme, with one potential missed cleavage. The precursor mass
211
tolerance and the fragment mass tolerance were 10 ppm and 0.6 Da respectively. A
212
false discovery rate (FDR) of 1% was selected at peptide and protein levels and only
213
the Master proteins were kept. The list of identified protein was generated containing
214
proteins matching with two different and unique peptide sequences. The datasets
215
supporting the conclusions of this article are available in the ProteomeXchange
AC C
EP
TE D
M AN U
SC
RI PT
191
9
ACCEPTED MANUSCRIPT 216
Consortium repository via the PRIDE database (submission identifier: PXD008702).
217
The reviewer account details are the following: username:
[email protected];
218
password: ycE67P81.
RI PT
219
2.7. Process and pathway analyses.
221
Database for Annotation, Visualization and Integrated Discovery (DAVID), a web-
222
accessible program, was used to perform the Gene Ontology (GO) Analysis
223
classification of the identified proteins (28). Options were the following: high
224
classification stringency and p-Value for the GO term > 0.05. Some subcategories
225
were grouped in order to reduce the numbers of GO categories and simplify the pie
226
charts.
227
Search Tool for the Retrieval of Interacting Genes/Proteins /STRING) database was
228
used to classify the proteins into protein families (29). We considered a count in gene
229
set > 5 and a p-Value > 0.05 for relevant protein families.
230
Kyoto Encyclopedia of Genes and Genomes (KEGG) was also used to study and
231
visualise the pathways in which the proteins were involved (30). We set a threshold
232
count of 10, with a p-Value and adjusted p-Value (Benjamini-Hochberg correction)
233
>0.05.
M AN U
TE D
EP
AC C
234
SC
220
A manual merge of the protein lists obtained with the three experiments was done
235
using Venny (http://bioinfogp.cnb.csic.es/tools/venny/index.html).
236
Top 10, 20, 50, 100 and 200 of the most abundant proteins for the three experiments
237
were obtained by classifying the proteins according to their Peptide Spectrum
238
Matches (PSM). PSM of a protein corresponds to the number of identified peptide
239
spectra matched for this protein. The same classification was done to obtain the top-
240
100 and top-200 of the less abundant proteins of the different experiments. 10
ACCEPTED MANUSCRIPT 241
3. Results
243
In order to obtain a robust in-depth characterisation, three independent proteomics
244
experiments were performed on three different pools of healthy tears. Individual and
245
general lists of proteins were then generated, analysed and compared to the
246
literature. The study design is summarised in the graphical abstract. In addition,
247
comparisons between the tear protein content with other ocular fluids will be
248
proposed.
M AN U
SC
RI PT
242
249
3.1. Characterisation of the human tear proteome
251
By combining all the experiments, 1351 proteins with 2 unique peptides were
252
identified. By excluding the keratins, which are suspected to come from
253
contamination during the tear collect, the original list was finally reduced to 1337
254
proteins with 2 unique peptides (874, 837 and 1143 proteins for experiment 1, 2, and
255
3 respectively; global and detailed lists in S1 dataset).
256
Interestingly, 45.5% (608 proteins) were in common between the three different
257
experiments, which is a good overlapping considering the variabilities induced by
258
intervariability. Other variabilities could also have been induced during trypsin
259
digestion (31), off-gel fractionation and mass spectrometry injections (32). For further
260
analyses, only list of proteins without keratins was considered.
AC C
EP
TE D
250
261 262
Top-10 of the most abundant proteins for each experiment was highly stable with 7
263
proteins always present whatever the experiment observed (Table 2). Interestingly, 11
ACCEPTED MANUSCRIPT the PSM corresponding to these 7 proteins represented between 27.7% (experiment
265
3) and 36.7% (experiment 1) of the total PSM detected. As expected, the rank one
266
protein was the lactotransferrin, one of the major proteins of the tears, involved in the
267
anti-inflammatory and antimicrobial processes (33). This point highlighted the
268
importance of tears to maintain a very clean environment for the eyes. The other six
269
common proteins between our three experiments were lipocalin-1, also known for its
270
antimicrobial activity (34), serum albumin, lysozyme C, immunoglobulin heavy
271
constant alpha 1, immunoglobulin kappa constant and polymeric immunoglobulin
272
receptor. Moreover, even if some differences in term of abundance (PSM number)
273
have been observed, the presence of the 5 first proteins (lactotransferrin, lipocalin-1,
274
lysozyme C, serum albumin, immunoglobulin heavy constant alpha 1) were largely
275
conserved in all experiments. The principal tear proteins are directly secreted by
276
lacrimal glands (lactotransferrin, epidermal growth factor (EGF), tear lipocalin,
277
secretory immunoglobulin A (sIgA)) or by lysosomes (lysozyme, which was found in
278
the three experiments in the top 10 more abundant proteins). However, some serum
279
proteins such as albumin, transferrin, IgG and IgM were also found in the tear fluid
280
probably as a result of passive transport from the blood and are suggestive of serum
281
leakage. It is also known that cell infiltrating conjunctiva (T cells, B cells among
282
others) secrete Igs and cytokines in various conditions (35). Finally, local contribution
283
of the neighbouring cells releasing proteins in tears cannot be excluded.
284
By expanding the comparison to the top-20, top-50 or top-100, the percentages of
285
proteins in common were still relatively high (64%, 45.1% and 52.42%, respectively).
286
At the opposite, top-100 and top-200 of the less abundant proteins presented only
287
0.1% and 1.6% of proteins in common between the three experiments. The relatively
AC C
EP
TE D
M AN U
SC
RI PT
264
12
ACCEPTED MANUSCRIPT 288
low numbers of unique peptides of these less abundant proteins may explain why
289
they were found in some experiments but not in all three.
290
Table 2: Top 10 proteins of the three experiments. Protein names Lactotransferrin Lipocalin-1 Lysozyme C Serum albumin Immunoglobulin heavy constant alpha 1 Immunoglobulin alpha-2 heavy chain Polymeric immunoglobulin receptor Immunoglobulin heavy constant alpha 2 Prolactin-inducible protein Immunoglobulin kappa constant
PSMs
AC P02788 P02768 P61626 P01876 P31025 P01834 P01833 P0DOX2 P25311 P01024
Protein names Lactotransferrin Serum albumin Lysozyme C Immunoglobulin heavy constant alpha 1 Lipocalin-1 Immunoglobulin kappa constant Polymeric immunoglobulin receptor Immunoglobulin alpha-2 heavy chain Zinc-alpha-2-glycoprotein Complement C3
PSMs
EP
TE D
Protein names Lactotransferrin Lysozyme C Serum albumin Lipocalin-1 Immunoglobulin heavy constant alpha 1 Prolactin-inducible protein Polymeric immunoglobulin receptor Immunoglobulin kappa constant Zinc-alpha-2-glycoprotein Immunoglobulin kappa light chain
AC C
Pep
3074 1252 1109 1021 857 546 541 521 505 389
AC P02788 P61626 P02768 P31025 P01876 P12273 P01833 P01834 P25311 P0DOX7
293
294
M AN U
292
PSMs
RI PT
AC P02788 P31025 P61626 P02768 P01876 P0DOX2 P01833 P01877 P12273 P01834
SC
291
60 14 12 54 13 8 28 2 9 2
Pep 1570 906 751 649 558 487 385 344 268 244
60 13 57 12 14 12 33 2 21 3 Pep
1913 1358 1179 677 586 432 353 336 305 288
54 52 10 11 10 2 27 8 20 75
295
From top to bottom: experiment 1,2 and 3. Bold: same position in the three lists.
296
Italic: present in the three lists. AC: accession number on UniProt; PSMs: Peptide
297
Spectrum Matches for each protein; Pep: number of unique peptides for each protein.
298 13
ACCEPTED MANUSCRIPT 3.2. Biological analyses of the tear proteome
300
Using the functional annotation-clustering tool from DAVID, the 1337 proteins were
301
classified according to their molecular functions, cellular components and biological
302
processes (Figs 1a, b and c respectively). The cadherin-binding involved in cell-cell
303
adhesion (62%, with for instance some eukaryotic translation elongation factors and
304
several capping actin proteins) and the enzyme activity (16%, containing a lot of
305
alcohol dehydrogenases and aldo-keto reductase) were the two main subcategories
306
emerging from GO molecular functions. For cellular components, half of the proteins
307
were linked to the cell-cell adherens junction (56%), an observation that was
308
confirmed by the biological process classification (cell-cell adhesion 46%).
M AN U
SC
RI PT
299
309
Around 39% (518 proteins) of the 1337 proteins appeared to be part of the enzyme
311
protein classes (detailed lists are available as S2 dataset). By classifying these
312
enzymes using UniProt, the three major subclasses were hydrolases (41%, 212
313
proteins), transferases (27%, 138 proteins) and oxidoreductases (15%, 80 proteins)
314
(Fig 2). In the hydrolases category, we highlighted for example the phospholipase A2
315
(26, 15 and 15 PSMs in the three experiments) but also the lactotransferrin and the
316
lysozyme C, which were present in the three top 10 proteins. Several mitogen-
317
activated protein kinases (mitogen-activated protein kinase 1, 3, 13, 14) were found
318
in the transferases category and the oxidoreductases contained a lot of
319
dehydrogenases such as 4 alcohol dehydrogenases (alcohol dehydrogenase, alcohol
320
dehydrogenase 1B, alcohol dehydrogenase 1C, alcohol dehydrogenase class 4,
321
alcohol dehydrogenase class 3) and retinal dehydrogenase 1.
AC C
EP
TE D
310
322 14
ACCEPTED MANUSCRIPT After noticing the large amount of enzymes in our proteins, we were interested by
324
studying the other major protein families. Using STRING, which is a biological
325
database for protein-protein interactions, it appeared that serpins (10 proteins) and
326
14-3-3 domain (7 proteins) were among the top five protein families. Both families
327
have been extensively studied for various cellular functions. 14-3-3 family has been
328
shown as a potential marker for ocular hypertension induced by glaucoma (36)
329
andseveral serpins such as serpinA1 (serpin peptidase inhibitor clade A member 1)
330
or serpinB1 (plasminogen activator inhibitor-1) were proposed as targets for
331
diagnosis but also therapy in cancer (37-39). Tracing these proteins in tears, in
332
response to a treatment or to predict patient outcome for examples, could be
333
clinically relevant and open new way to manage cancer patients. Until now, no
334
information related to the eye can be found concerning these protein families, but we
335
may suggest that it is related to the weak overall investigation of tears.
M AN U
SC
RI PT
323
TE D
336
Furthermore, KEGG analyses highlighted two notable biological pathways, the
338
glycolysis/gluconeogenesis (31 proteins, p-Value = 4.6E-14, with phosphoglycerate
339
kinase (EC 2.7.2.3) and the ATP-dependent 6-phosphofructokinase (EC 2.7.1.11))
340
(Fig 3) and the coagulation and complement cascades (38 proteins, p-Value = 1.7E-
341
20
342
glycoprotein) (Fig 4). Alterations of these pathways have been already shown to be
343
involved in some eye diseases such as Age-Related Macular Degeneration (AMD) or
344
diabetic retinopathy (40, 41). Yokosako et al.(40) described the urinary levels of
345
lactate and pyruvate as deregulated in AMD patients. In the present study, we were
346
able to identify the lactate dehydrogenase and the pyruvate kinase, which may play a
347
role in the deregulation observed by Yokosako et al.(40). As the AMD is an ocular
AC C
EP
337
, among other the complement factor H, the complement factor 3 and the CD59
15
ACCEPTED MANUSCRIPT disease, it could be relevant to investigate if the levels of these two proteins are also
349
deregulated in tears. Karamichos et al.(42) highlighted the
350
glycolysis/gluconeogenesis as part of the most significantly affected pathways in
351
case of keratoconus (KC), which is a non-inflammatory corneal disease. They noticed
352
the 1,3 diphosphoglycerate and the 3-phosphoglycerate as significantly up-regulated
353
in patients suffering from KC compared to healthy controls. Interestingly, we
354
identified in tears the glyceraldehyde-3- phosphate dehydrogenase and the
355
phosphoglycerate kinase, which may have an influence on the metabolites studied by
356
Karamichos et al.(42). De Souza et al.(17) also reported in their tear study a high
357
number of hydrolases involved in the glycolysis. Since the glycolysis takes place
358
within the cells, the source of these proteins might be damaged cells. These cells
359
could be epithelial cells covering the eyes but we cannot exclude that lacrimal cells
360
released them. The fact that these proteins were deregulated in a disease context
361
emphasises their role or importance. Regarding the complement and coagulation
362
pathways, proteins that are involved in play a major role in the protection of the
363
ocular surface (43, 44) but also in several diseases (41). As one of the tear main
364
functions is protection and lubrication of the eye, it is completely relevant to find some
365
proteins related to these processes. Detecting in patient tears involved at different
366
levels in these pathways is clinically relevant and offer new therapeutic strategies.
SC
M AN U
TE D
EP
AC C
367
RI PT
348
368
3.3. Consistency with the existing knowledge
369
To date, only three major publications focused on healthy tear proteome and
370
reported 491, 1543 and 1526 identified proteins respectively (16-18). Summary of
371
these studies compared to our data were described in Table 3. In order to compare
372
them with our study, the same stringent identification criteria that we used were 16
ACCEPTED MANUSCRIPT applied: identification of proteins with at least 2 unique peptides, only reviewed
374
proteins and exclusion of keratins. Subsequent new lists of identified proteins were
375
generated for each study, greatly reducing their lists at 478, 1026 and 662 proteins.
376
By merging all the information (the 3 published studies and our study), 1620 proteins
377
were identified in tears (Fig 5). Only 197 proteins (11.2%; S3 dataset) were in
378
common across all 4 studies. Among them, a mean of 55% were found in the top-200
379
of the most abundant proteins of our experiments (53.6%, 53.3% and 55.3% in the
380
experiment 1, 2 and 3 respectively). At the opposite, between 1% and 7.1% of these
381
197 proteins were found in the 200 less abundant proteins of the experiments (4.6%,
382
7.1% and 1% in the experiment 1, 2 and 3 respectively). Therefore, the less
383
abundant proteins are probably the main cause of variability between these studies.
384
This result is in correlation with the low number of shared proteins between the three
385
top-100 and top-200 less abundant proteins of our own experiments. It is then
386
important to enhance the identification of these low abundant proteins, as they are
387
more likely to contain potential biomarkers for diseases than abundant proteins.
SC
M AN U
TE D
388
RI PT
373
Furthermore, 425 proteins (24.1%) were described for the first time in our study (S4
390
dataset), including some translation initiation factors (eukaryotic translation initiation
391
factors 2A and subunit 2, factor 3 subunits A, B, C, D, E, F, M, factors 4 gamma 1
392
and 2, factor 5), ribosomal proteins (60S ribosomal protein L14, L17, L18a, L22, L23,
393
L24, L3, L30, L32, L9, 40S ribosomal protein S2, S21, S4, S5, S7 and ribosomal
394
protein S6 kinase alpha-1, alpha-3) and mitogen-activated protein kinases 1, 13 and
395
14. An important point is that around 40% of these 425 proteins were identified at
396
least in two out of our three experiments (S4 dataset), reinforcing our results. If we
397
exclude these 425 specific proteins from our three lists of top-100 and three lists of
AC C
EP
389
17
ACCEPTED MANUSCRIPT top-200 less abundant proteins, it does not greatly change the low percentages of
399
shared proteins which were previously found (2% instead of 0.1% between the three
400
top-100 and 5.7% instead of 1.6% between the three top-200). We specifically
401
identified some ribosomal proteins, which are part of small guanosine
402
trisphosphatases (GTPases) (45). Depending on the binding to guanosine
403
trisphosphate (GTP) or guanosine diphosphate (GDP), Rab proteins (Ras-related in
404
brain) are active or inactive, respectively (46). They are involved in the membrane
405
traffic and vesicles formation and transport. Several mitogen-activated protein
406
kinases which are in charge of the signal transduction by the phosphorylation of
407
serine and threonine protein residues were also found (47). They could be released
408
after apoptosis of the cells in contact with tears. Why did we find these proteins
409
specifically in our study and how could we explain the low number of proteins in
410
common (192) between the studies? The use of different technical workflows (Table
411
3) may bring some answers to these questions.
SC
M AN U
TE D
412
RI PT
398
Firstly, the method of sample collection has an impact on tear composition (23). De
414
Souza et al.(17) used microcapillary glass tubes whereas the other studies sampled
415
tears with Schirmer’s strip. Schirmer method is reported to be more comfortable and
416
pleasant for the collected persons, easier to handle and can not wound neither
417
cornea, nor conjunctiva. On the other side, capillary tube was described as requiring
418
more technical expertise from the person who collected tears. The good angle and
419
position of the capillary on the ocular surface are not trivial, which frequently need
420
capillary tube repositioning. Thereby, more risks to seriously damage the conjunctiva
421
and/or the cornea are associated to this technique. Finally, collection with a capillary
422
tube takes more time since it must be stopped when the patient moves or blinks.
AC C
EP
413
18
ACCEPTED MANUSCRIPT Even if this technique is suspected to limit cell contamination from conjunctival or
424
epithelial cells compared to Schirmer collection, this cannot be completely excluded.
425
Posa et al. (23) concluded that both methods were suitable for protein analyses. Yet,
426
they reported that all subjects experienced the Schirmer strip much more pleasant
427
compared to the capillary tube. Our personal data also support the conclusion of
428
Posa et al. (23), so regarding the balances between various advantages and few
429
limitations, we chose the Schirmer test. However, the use of capillary tubes in the
430
study of De Souza et al.(17) could be part of the reasons why only 197 proteins were
431
commonly found in the 4 studies. Indeed, in the literature higher protein contents and
432
concentrations were obtained using Schirmer strip method (48-50). Another
433
parameter that was different between the studies is the Schirmer extraction method.
434
Whereas we only centrifuged the Schirmer’s strip to collect tears, Zhou et al.(18)
435
used extraction buffer comprising 100 mM of ammonium bicarbonate and protease
436
inhibitor. Aass et al. (16) worked with different buffers containing NH4HCO3, NaCl, a
437
surfactant, or a combination of the three, the consequences on recovered proteins
438
following these extraction methods were not investigated yet.
439
Another different technical point could be the off-gel electrophoresis fractionation
440
(based on the isoelectric point of the peptides) while Aass et al.(16) and Zhou et
441
al.(18) used strong cation exchange (SCX) chromatography. It has been proven that
442
both methods were suitable to separate a complex sample and enhance the number
443
of identified proteins (51, 52). A few studies compared these two methods (53, 54).
444
The off-gel electrophoresis provides the isoelectric point of the peptides, a useful
445
information, and shows a high resolution, a high sample loading and a flexibility
446
concerning the choice of the pH gradient (25, 51, 55). This allows to focus on a
447
specific pH range, to better separate the samples in this pH gradient. The SCX
AC C
EP
TE D
M AN U
SC
RI PT
423
19
ACCEPTED MANUSCRIPT method is described to be a faster method (25), but evidences about enhancing the
449
number of identified proteins are contradictory (53, 55, 56). As expected, the
450
methods lead to different protein profiles (55, 57). We also would like to mention that
451
we did not measure peptide concentration of the fractions after OG. Regarding the
452
difficulty in quantifying peptides (58, 59) and the low quantity of material per fraction,
453
we estimated the concentration by considering a homogeneous quantity of peptides
454
in each fraction.
455
Peptide length and tryptic missed cleavage sites could also explain the differences
456
between the 4 studies that we observed. While we used LTQ-Orbitrap like Aass et
457
al.(16), Zhou et al.(18) performed their analyses with a TripleTOF 5600. On another
458
hand, Ass et al.(16) supplemented the trypsin with Lys-C enzyme. This combination
459
of two proteases was described to enhance the efficiency of the digestion, and
460
influences the identified proteins (60).
461
Like Zhou et al.(18) and Aass et al.(16), we pooled different collected samples, while
462
De Souza et al.(17) analysed one single subject. Pooling different samples permitted
463
to reach a sufficient quantity and work with an average sample in term of peptide
464
content (61) in the analysed sample. However, pooling loses the sample identity and
465
the inter-variability information. In our case, we were interested in the global tear
466
proteome so we chose to pool samples.
SC
M AN U
TE D
EP
AC C
467
RI PT
448
468
In our pie charts, we did not obtain the same category names as Zhou and al. (18),
469
probably because of a different software version. Nevertheless, the main categories
470
seemed similar for each GO classification, which was completely relevant with the
471
fact that almost 50% (776) of proteins were found in common in their study and ours. 20
ACCEPTED MANUSCRIPT This observation strengthened our results but also emphasised the need to analyse
473
more samples, to consider the potential effects of physiological parameters (age,
474
gender, circadian rhythm and potential drugs) and to standardise the proteomics
475
workflow.
RI PT
472
476
Our method presents the advantage not to use extraction buffer, compared to Zhou
478
et al.(18) and Aass et al.(16). Therefore, the samples will be available for any kind of
479
experiments and analysis methods, without any interferences. Moreover, no study
480
checked whether the extraction buffer well preserved the samples. We recommend
481
including an equal number of men and women of various ages, to properly cover tear
482
proteome. Indeed, some recent studies started to investigate how both age and sex
483
could influence both tear composition and characteristics (62-64). The use of OGE
484
was also an added value, as this method was only described for tears in another
485
article from our team in a disease context (13). This alternative fractionation method
486
probably contributed to the specifically identified proteins of our study. However, the
487
current methodology has limitations. First, Schirmer strip extraction without buffer will
488
not allow studying ocular diseases leading to dry eye. Indeed, Schirmer moisturing
489
could be too short, resulting in 1 or 2-µl sample, not suitable for further analyses. In
490
these cases, using extraction buffers or other collecting methods such as a polyester
491
fiber rod or microcapillary tube would be more efficient (10, 11). Moreover, due to its
492
multiple steps, this methodology is not easy to handle in a clinical study with many
493
samples. However, our goal was to describe a methodology which can be used in the
494
first step of research projects, usually done with limited number of samples.
AC C
EP
TE D
M AN U
SC
477
495 496
Table 3: Comparison of the four tears proteome studies. 21
ACCEPTED MANUSCRIPT De Souza et al.
Zhou et al.
Aass et al.
Dor et al.
1 healthy patient
4 healthy subjects
3 healthy subjects
8 healthy subjects
Microcapillary tube
Schirmer's strip
Schirmer's strip
Schirmer's strip
Protein quantity (µg)
NM (13 µl)
400
NM
180
Pool
/
Yes
Yes
Yes
In-gel digestion
In-solution
In-solution
In-solution
Trypsin
Trypsin
Lys-C+ trypsin
Trypsin
SDS-PAGE
Offline SCX
Offline SCX
Digestion
RI PT
Collection
Off-Gel
Fractionation
NM LTQ-Orbitrap +
Mass spectrometer
2% ACN
Triple TOF 5600
LTQ-FT FDR <0.1%, 2
NM
5% ACN
LTQ-Orbitrap
LTQ-Orbitrap
M AN U
Injection MS buffer
SC
Electrophoresis
FDR <1%,
Identification
unique peptides
conditions
(Orbitrap), 1 peptide
FDR <1%, 1 peptide unspecified number FDR <1%, 1 peptide of peptide
identification
†
Stringent
497
*
‡
1543
1526
1351
‡
478
1026
662
1337
AC C
identification
491
EP
Maximum
TE D
(FT)
498
NM: not mentioned; Lys-C: proteinase lys-C; FT: Fourier transform; †: with 1 peptide
499
and non-reviewed proteins; ‡: with 2 peptides; *: with 2 unique peptides, without
500
keratins and reviewed proteins.
501 502
3.4. The tear proteome and the other ocular fluids
22
ACCEPTED MANUSCRIPT The tear proteome generated in our study was then compared to the proteome of
504
vitreous and aqueous humors (VH and AH respectively), investigated among others
505
by Murphy et al. (65, 66). So far, VH and AH were not very much studied in term of
506
proteomics studies. VH is a transparent and gelatinous substance situated between
507
the lens and the retina (67). Its global proteome, meaning in healthy subjects, was
508
described in only two main studies (65, 68). This fluid should contain some
509
information about the physiological condition of the retina. AH is situated at the
510
anterior and posterior chambers of the eye and plays a crucial role in cleaning the
511
lens and the cornea, but also permitting the distribution of nutriments and drugs to
512
several ocular structures (69). Interestingly, 291 (21.8%) and 197 (14.7%) tear
513
proteins were also found in VH and AH respectively (% of proteins we found in tears;
514
with two peptides, only reviewed and without keratins; S5 and S6 datasets). Proteins
515
found in common were represented by clusterin, recently demonstrated in VH as a
516
potential biomarker of the AMD (70) and suspected to be a marker of Alzheimer
517
disease in plasma (71), but also proteins involved in the glycolysis (among others the
518
lactate dehydrogenases A and B chains and the enolase 1) and complement and
519
coagulation pathways (6 proteins of the serpin family and 16 complement factors).
520
Several cathepsins (B, D, L1 and Z) were also commonly found in both tears, VH and
521
AH. Cathepsin D, identified in the three different ocular fluids, was also recently
522
proposed as biomarker candidate in AH for neovascular AMD (72). Finding in tears
523
some proteins also present in VH and AH and related to certain diseases is very
524
promising. Indeed collecting VH and AH requires surgery, which is an invasive
525
process. Using tears to find the same information could be a real advantage for both
526
clinicians and patients. Moreover, it could allow easier and better identification of
527
proteins of interest in certain pathologies.
AC C
EP
TE D
M AN U
SC
RI PT
503
23
ACCEPTED MANUSCRIPT The comparisons between tears, AH and VH have some limitations. Indeed, two
529
studies of Murphy (65, 66) were done with AH and VH of patients with cataract,
530
because the need to a surgery does not permit to collect complete healthy persons.
531
We do not know how the cataract status could affect the results obtained in these
532
studies, and so the comparisons that we made. However, we are aware that these
533
remarks are somewhat speculative and that more experiments will be needed to
534
support our hypothesis.
RI PT
528
SC
535
5. Conclusion
537
To conclude, the merge of all tear protein lists has enabled the identification of 1620
538
proteins (with stringent conditions), suggesting that there is probably room for
539
improvement in tear protein identification. Indeed, the fact that the combination of
540
different studies greatly enhances our knowledge about this fluid clearly
541
demonstrates that additional experiments will be required for the establishment of a
542
reliable proteome for the tears. Through this study, we first complemented the
543
knowledge on tear proteome but we also proposed a global overview of their great
544
potential for clinical research and biomarker discovery.
TE D
EP
AC C
545
M AN U
536
546
Acknowledgments
547
Authors thank Patrizia Arboit, Carla Pasquarello, Alexandre Hainard (Proteomics
548
Core Facility of the University of Geneva), Catherine Fouda, Christophe Tati and
549
Hortense Delacretaz for their technical assistance.
550
24
ACCEPTED MANUSCRIPT 551
Figure captions
552
Graphical Abstract. Summary of the experimental workflow followed in this
554
study. GPF: gas-phase fractionation mode; LTQ: linear trap quadrupole; IPG:
555
immobilized pH gradient. DAVID: Database for Annotation, Visualization and
556
Integrated Discovery; STRING: Search Tool for the Retrieval of Interacting
557
Genes/Proteins; KEGG: Kyoto Encyclopedia of Genes and Genomes.
SC
RI PT
553
M AN U
558
Fig 1. Pie charts of molecular function (a), cellular component (b) and
560
biological process (c) obtained with the 1337 proteins using DAVID software.
561
Some subcategories were grouped to reduce the number of GO categories
562
TE D
559
Fig 2. Classification of the 518 enzymes identified in our study. Data were
564
obtained using the “enzymes classes” available on UniProt. Details of the content of
565
each category could be found in S2 dataset.
AC C
566
EP
563
567
Fig 3. Glycolysis/Gluconeogenesis pathway obtained using KEGG database
568
(30). Number of genes that are involved in is higher than the number of bold boxes
569
on the figures due to the isoforms of the proteins. It means that one bold box can
570
encompass several proteins. 5.4.2.2: phosphoglucomutase 1, 2; 5.3.1.9: glucose-6-
571
phosphate isomerase; 5.1.3.3: galactose mutarotase; 3.1.3.11: fructose-
25
ACCEPTED MANUSCRIPT bisphosphatase 1; 2.7.1.11: phosphofructokinase muscle, platelet, liver types;
573
4.1.2.13: aldolase fructose-bisphosphate C, A; 5.3.1.1: triosephosphate isomerase 1;
574
1.2.1.12: glyceraldehyde-3-phosphate dehydrogenase; 2.7.2.3: phosphoglycerate
575
kinase 1; 5.4.2.11: phosphoglycerate mutase 1; enolase 1; 2.7.1.40: pyruvate kinase
576
(muscle); 1.1.1.27: lactate dehydrogenase A, B; 6.2.1.1: acyl-CoA synthetase short-
577
chain family member 2; 1.2.1.3: aldehyde dehydrogenase 9 family member A1, 7
578
family member A1; 1.2.1.5: aldehyde dehydrogenase 1 family member A3, 3 family
579
member A1; 1.1.1.1: Alcohol dehydrogenase 1B, 1C, 5, 7; 1.1.1.2: aldo-keto
580
reductase family 1 member A1.
M AN U
SC
RI PT
572
581
Fig 4. Coagulation and complement cascades obtained using KEGG database
583
(30). Number of genes that are involved in is higher than the number of bold boxes
584
on the figures due to the isoforms of the proteins. It means that one bold box can
585
encompass several proteins. AT3: serpin family C member 1; A1AT: serpin family A
586
member 1; A2M: alpha-2-macroglobulin; CD59: CD59 molecule; CPB2:
587
carboxypeptidase B2; CR3: fibrinogen gamma chain; CR4: fibrinogen gamma chain;
588
C1INH: serpin family G member; C1qrs: complement C1q B chain, C1q C chain, C1r,
589
C1s; C2: complement factor 2; C3: complement C3; C4: complement C4A; C5:
590
complement C5; C6, 7, 8, 9: complement C6, C7, C8 alpha chain, C8 beta chain, C9;
591
DAF: CD55 molecule; FB: complement factor B; FD: complement factor D; FH:
592
complement factor H; FI: complement factor I; Fibrinogen: fibrinogen gamma chain,
593
beta chain, alpha chain; F2: coagulation factor 2 (thrombin); F5: coagulation factor 5;
594
F12: coagulation factor 12; F13: coagulation factor 13 B chain; HCII: serpin family D
595
member 1; kallikrein: kallikrein B1; kininogen: kininogen 1; PCI: serpin family A
596
member 5; PLG: plasminogen; α2AP: serpin family F member 2.
AC C
EP
TE D
582
26
ACCEPTED MANUSCRIPT 597
Fig 5. Comparison between the studies of De Souza et al.(17), Zhou et al.(18),
599
Aass et al.(16) and us. The following criteria were applied to the four protein lists:
600
only reviewed proteins, 2 unique peptides and exclusion of keratins. For each
601
category, percentage represents the ratio between the proteins contain in this
602
category and the total number of proteins. Major information is the proteins in
603
common to the four studies (197; 11.2%) and our specific proteins (425; 24.1%).
SC
RI PT
598
604
References
606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632
1. Csosz E, Boross P, Csutak A, Berta A, Toth F, Poliska S, et al. Quantitative analysis of proteins in the tear fluid of patients with diabetic retinopathy. J Proteomics. 2012;75(7):2196-204. 2. Kim HJ, Kim PK, Yoo HS, Kim CW. Comparison of tear proteins between healthy and early diabetic retinopathy patients. Clin Biochem. 2012;45(1-2):60-7. 3. Park KS, Kim SS, Kim JC, Kim HC, Im YS, Ahn CW, et al. Serum and tear levels of nerve growth factor in diabetic retinopathy patients. Am J Ophthalmol. 2008;145(3):432-7. 4. Pieragostino D, D'Alessandro M, di Ioia M, Di Ilio C, Sacchetta P, Del Boccio P. Unraveling the molecular repertoire of tears as a source of biomarkers: beyond ocular diseases. Proteomics Clin Appl. 2015;9(1-2):169-86. 5. Ujhelyi B, Gogolak P, Erdei A, Nagy V, Balazs E, Rajnavolgyi E, et al. Graves' orbitopathy results in profound changes in tear composition: a study of plasminogen activator inhibitor-1 and seven cytokines. Thyroid. 2012;22(4):407-14. 6. Perumal N, Funke S, Pfeiffer N, Grus FH. Proteomics analysis of human tears from aqueousdeficient and evaporative dry eye patients. Sci Rep. 2016;6:29629. 7. Zhou L, Beuerman RW, Chan CM, Zhao SZ, Li XR, Yang H, et al. Identification of tear fluid biomarkers in dry eye syndrome using iTRAQ quantitative proteomics. J Proteome Res. 2009;8(11):4889-905. 8. Aluru SV, Agarwal S, Srinivasan B, Iyer GK, Rajappa SM, Tatu U, et al. Lacrimal proline rich 4 (LPRR4) protein in the tear fluid is a potential biomarker of dry eye syndrome. PLoS One. 2012;7(12):e51979. 9. Postnikoff CK, Huisingh C, McGwin G, Nichols KK. Leukocyte Distribution in the Open Eye Tears of Normal and Dry Eye Subjects. Curr Eye Res. 2018. 10. Aluru SV, Shweta A, Bhaskar S, Geetha K, Sivakumar RM, Utpal T, et al. Tear Fluid Protein Changes in Dry Eye Syndrome Associated with Rheumatoid Arthritis: A Proteomic Approach. Ocul Surf. 2017;15(1):112-29. 11. Jung JH, Ji YW, Hwang HS, Oh JW, Kim HC, Lee HK, et al. Proteomic analysis of human lacrimal and tear fluid in dry eye disease. Sci Rep. 2017;7(1):13363.
AC C
EP
TE D
M AN U
605
27
ACCEPTED MANUSCRIPT
EP
TE D
M AN U
SC
RI PT
12. Del Boccio P, Rossi C, di Ioia M, Cicalini I, Sacchetta P, Pieragostino D. Integration of metabolomics and proteomics in multiple sclerosis: From biomarkers discovery to personalized medicine. Proteomics Clin Appl. 2016;10(4):470-84. 13. Salvisberg C, Tajouri N, Hainard A, Burkhard PR, Lalive PH, Turck N. Exploring the human tear fluid: discovery of new biomarkers in multiple sclerosis. Proteomics Clin Appl. 2014;8(3-4):185-94. 14. Lebrecht A, Boehm D, Schmidt M, Koelbl H, Schwirz RL, Grus FH. Diagnosis of breast cancer by tear proteomic pattern. Cancer Genomics Proteomics. 2009;6(3):177-82. 15. Terekhina NA, Petrovich Iu A. [Diagnostic value of tear analysis in uremia, uricemia and cholesterolemia]. Klin Lab Diagn. 1994(6):17-8. 16. Aass C, Norheim I, Eriksen EF, Thorsby PM, Pepaj M. Single unit filter-aided method for fast proteomic analysis of tear fluid. Anal Biochem. 2015;480:1-5. 17. de Souza GA, Godoy LM, Mann M. Identification of 491 proteins in the tear fluid proteome reveals a large number of proteases and protease inhibitors. Genome Biol. 2006;7(8):R72. 18. Zhou L, Zhao SZ, Koh SK, Chen L, Vaz C, Tanavde V, et al. In-depth analysis of the human tear proteome. J Proteomics. 2012;75(13):3877-85. 19. Gerber-Hollbach N, Plattner K, O'Leary OE, Jenoe P, Moes S, Drexler B, et al. Tear Film Proteomics Reveal Important Differences Between Patients With and Without Ocular GvHD After Allogeneic Hematopoietic Cell Transplantation. Invest Ophthalmol Vis Sci. 2018;59(8):3521-30. 20. Kallo G, Emri M, Varga Z, Ujhelyi B, Tozser J, Csutak A, et al. Changes in the Chemical Barrier Composition of Tears in Alzheimer's Disease Reveal Potential Tear Diagnostic Biomarkers. PLoS One. 2016;11(6):e0158000. 21. Kishazi E, Dor M, Eperon S, Oberic A, Hamedani M, Turck N. Thyroid-associated orbitopathy and tears: A proteomics study. J Proteomics. 2018;170:110-6. 22. Tang Q, Zhang C, Wu X, Duan W, Weng W, Feng J, et al. Comprehensive Proteomic Profiling of Patients' Tears Identifies Potential Biomarkers for the Traumatic Vegetative State. Neurosci Bull. 2018;34(4):626-38. 23. Posa A, Brauer L, Schicht M, Garreis F, Beileke S, Paulsen F. Schirmer strip vs. capillary tube method: non-invasive methods of obtaining proteins from tear fluid. Ann Anat. 2013;195(2):137-42. 24. Remington SG, Crow JM, Nelson JD. Antibodies to endogenous tear protein in normal human tears. Curr Eye Res. 2009;34(10):819-23. 25. Moreda-Pineiro A, Garcia-Otero N, Bermejo-Barrera P. A review on preparative and semipreparative offgel electrophoresis for multidimensional protein/peptide assessment. Anal Chim Acta. 2014;836:1-17. 26. Dayon L, Turck N, Kienle S, Schulz-Knappe P, Hochstrasser DF, Scherl A, et al. Isobaric taggingbased selection and quantitation of cerebrospinal fluid tryptic peptides with reporter calibration curves. Anal Chem. 2010;82(3):848-58. 27. Scherl A, Shaffer SA, Taylor GK, Kulasekara HD, Miller SI, Goodlett DR. Genome-specific gasphase fractionation strategy for improved shotgun proteomic profiling of proteotypic peptides. Anal Chem. 2008;80(4):1182-91. 28. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44-57. 29. Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue):D447-52. 30. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45(D1):D353-D61. 31. Walmsley SJ, Rudnick PA, Liang Y, Dong Q, Stein SE, Nesvizhskii AI. Comprehensive analysis of protein digestion using six trypsins reveals the origin of trypsin as a significant source of variability in proteomics. J Proteome Res. 2013;12(12):5666-80. 32. Tabb DL, Vega-Montoto L, Rudnick PA, Variyath AM, Ham AJ, Bunk DM, et al. Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry. J Proteome Res. 2010;9(2):761-76.
AC C
633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684
28
ACCEPTED MANUSCRIPT
EP
TE D
M AN U
SC
RI PT
33. Flanagan JL, Willcox MD. Role of lactoferrin in the tear film. Biochimie. 2009;91(1):35-43. 34. Fluckinger M, Haas H, Merschak P, Glasgow BJ, Redl B. Human tear lipocalin exhibits antimicrobial activity by scavenging microbial siderophores. Antimicrob Agents Chemother. 2004;48(9):3367-72. 35. Offiah I, Calder VL. Immune mechanisms in allergic eye diseases: what is new? Curr Opin Allergy Clin Immunol. 2009;9(5):477-81. 36. Tezel G. A proteomics view of the molecular mechanisms and biomarkers of glaucomatous neurodegeneration. Prog Retin Eye Res. 2013;35:18-43. 37. Farshchian M, Kivisaari A, Ala-Aho R, Riihila P, Kallajoki M, Grenman R, et al. Serpin peptidase inhibitor clade A member 1 (SerpinA1) is a novel biomarker for progression of cutaneous squamous cell carcinoma. Am J Pathol. 2011;179(3):1110-9. 38. Kwon CH, Park HJ, Lee JR, Kim HK, Jeon TY, Jo HJ, et al. Serpin peptidase inhibitor clade A member 1 is a biomarker of poor prognosis in gastric cancer. Br J Cancer. 2014;111(10):1993-2002. 39. Zheng D, Chen H, Davids J, Bryant M, Lucas A. Serpins for diagnosis and therapy in cancer. Cardiovasc Hematol Disord Drug Targets. 2013;13(2):123-32. 40. Yokosako K, Mimura T, Funatsu H, Noma H, Goto M, Kamei Y, et al. Glycolysis in patients with age-related macular degeneration. Open Ophthalmol J. 2014;8:39-47. 41. Jha P, Bora PS, Bora NS. The role of complement system in ocular diseases including uveitis and macular degeneration. Mol Immunol. 2007;44(16):3901-8. 42. Karamichos D, Zieske JD, Sejersen H, Sarker-Nag A, Asara JM, Hjortdal J. Tear metabolite changes in keratoconus. Exp Eye Res. 2015;132:1-8. 43. Cocuzzi E, Szczotka LB, Brodbeck WG, Bardenstein DS, Wei T, Medof ME. Tears contain the complement regulator CD59 as well as decay-accelerating factor (DAF). Clin Exp Immunol. 2001;123(2):188-95. 44. Willcox MD, Morris CA, Thakur A, Sack RA, Wickson J, Boey W. Complement and complement regulatory proteins in human tears. Invest Ophthalmol Vis Sci. 1997;38(1):1-8. 45. Stenmark H, Olkkonen VM. The Rab GTPase family. Genome Biol. 2001;2(5):REVIEWS3007. 46. Hutagalung AH, Novick PJ. Role of Rab GTPases in membrane traffic and cell physiology. Physiol Rev. 2011;91(1):119-49. 47. Wada T, Penninger JM. Mitogen-activated protein kinases in apoptosis regulation. Oncogene. 2004;23(16):2838-49. 48. Farias E, Yasunaga KL, Peixoto RVR, Fonseca MP, Fontes W, Galera PD. Comparison of two methods of tear sampling for protein quantification by Bradford method. Pesquisa Veterinária Brasileira. 2013;33:261-4. 49. Green-Church KB, Nichols KK, Kleinholz NM, Zhang L, Nichols JJ. Investigation of the human tear film proteome using multiple proteomic approaches. Mol Vis. 2008;14:456-70. 50. Stuchell RN, Feldman JJ, Farris RL, Mandel ID. The effect of collection technique on tear composition. Invest Ophthalmol Vis Sci. 1984;25(3):374-7. 51. Horth P, Miller CA, Preckel T, Wenz C. Efficient fractionation and improved protein identification by peptide OFFGEL electrophoresis. Mol Cell Proteomics. 2006;5(10):1968-74. 52. Chenau J, Michelland S, Sidibe J, Seve M. Peptides OFFGEL electrophoresis: a suitable preanalytical step for complex eukaryotic samples fractionation compatible with quantitative iTRAQ labeling. Proteome Sci. 2008;6:9. 53. Mostovenko E, Hassan C, Rattke J, Deelder AM, van Veelen PA, Palmblad M. Comparison of peptide and protein fractionation methods in proteomics. EuPA Open Proteomics. 2013;1:30-7. 54. Slebos RJ, Brock JW, Winters NF, Stuart SR, Martinez MA, Li M, et al. Evaluation of strong cation exchange versus isoelectric focusing of peptides for multidimensional liquid chromatographytandem mass spectrometry. J Proteome Res. 2008;7(12):5286-94. 55. Waller LN, Shores K, Knapp DR. Shotgun proteomic analysis of cerebrospinal fluid using offgel electrophoresis as the first-dimension separation. J Proteome Res. 2008;7(10):4577-84.
AC C
685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734
29
ACCEPTED MANUSCRIPT
772 773
EP
TE D
M AN U
SC
RI PT
56. Manadas B, English JA, Wynne KJ, Cotter DR, Dunn MJ. Comparative analysis of OFFGel, strong cation exchange with pH gradient, and RP at high pH for first-dimensional separation of peptides from a membrane-enriched protein fraction. Proteomics. 2009;9(22):5194-8. 57. Antberg L, Cifani P, Sandin M, Levander F, James P. Critical comparison of multidimensional separation methods for increasing protein expression coverage. J Proteome Res. 2012;11(5):2644-52. 58. Tammen H, Hess R. Collection and handling of blood specimens for peptidomics. Methods Mol Biol. 2013;1023:161-8. 59. Sapan CV, Lundblad RL. Review of methods for determination of total protein and peptide concentration in biological samples. Proteomics Clin Appl. 2015;9(3-4):268-76. 60. Saveliev S, Bratz M, Zubarev R, Szapacs M, Budamgunta H, Urh M. Trypsin/Lys-C protease mix for enhanced protein mass spectrometry analysis. Nat Meth. 2013;10(11). 61. Oberg AL, Vitek O. Statistical design of quantitative mass spectrometry-based proteomic experiments. J Proteome Res. 2009;8(5):2144-56. 62. Gibson EJ, Stapleton F, Wolffsohn JS, Golebiowski B. Local synthesis of sex hormones: are there consequences for the ocular surface and dry eye? Br J Ophthalmol. 2017;101(12):1596-603. 63. Micera A, Di Zazzo A, Esposito G, Longo R, Foulsham W, Sacco R, et al. Age-Related Changes to Human Tear Composition. Invest Ophthalmol Vis Sci. 2018;59(5):2024-31. 64. Ozdemir M, Temizdemir H. Age- and gender-related tear function changes in normal population. Eye (Lond). 2010;24(1):79-83. 65. Murthy KR, Goel R, Subbannayya Y, Jacob HK, Murthy PR, Manda SS, et al. Proteomic analysis of human vitreous humor. Clin Proteomics. 2014;11(1):29. 66. Murthy KR, Rajagopalan P, Pinto SM, Advani J, Murthy PR, Goel R, et al. Proteomics of human aqueous humor. OMICS. 2015;19(5):283-93. 67. Bishop PN. Structural macromolecules and supramolecular organisation of the vitreous gel. Prog Retin Eye Res. 2000;19(3):323-44. 68. Aretz S, Krohne TU, Kammerer K, Warnken U, Hotz-Wagenblatt A, Bergmann M, et al. Indepth mass spectrometric mapping of the human vitreous proteome. Proteome Sci. 2013;11(1):22. 69. Goel M, Picciani RG, Lee RK, Bhattacharya SK. Aqueous humor dynamics: a review. Open Ophthalmol J. 2010;4:52-9. 70. Nobl M, Reich M, Dacheva I, Siwy J, Mullen W, Schanstra JP, et al. Proteomics of vitreous in neovascular age-related macular degeneration. Exp Eye Res. 2016;146:107-17. 71. Jongbloed W, van Dijk KD, Mulder SD, van de Berg WD, Blankenstein MA, van der Flier W, et al. Clusterin Levels in Plasma Predict Cognitive Decline and Progression to Alzheimer's Disease. J Alzheimers Dis. 2015;46(4):1103-10. 72. Kang GY, Bang JY, Choi AJ, Yoon J, Lee WC, Choi S, et al. Exosomal proteins in the aqueous humor as novel biomarkers in patients with neovascular age-related macular degeneration. J Proteome Res. 2014;13(2):581-95.
AC C
735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771
774
Supporting information
775
S1 dataset: Proteins identified in experiment 3 (2 peptides, no keratins)
776
S2 dataset: details of the six enzyme categories
777
S3 dataset: Common proteins between the four studies
778
S4 dataset: Proteins specifically identified in this study 30
ACCEPTED MANUSCRIPT S5 dataset: Proteins in common between tears and vitreous humor
780
S6 dataset: Proteins in common between tears and aqueous humor
AC C
EP
TE D
M AN U
SC
RI PT
779
31
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
ACCEPTED MANUSCRIPT Highlights Tear fluid contains a high variety of proteins
•
A lot of enzymes, more precisely hydrolases and transferases, were identified
•
Pathways involved in different diseases can be greatly mapped in tears
•
Combination of different studies is needed to establish a reliable tear
RI PT
•
AC C
EP
TE D
M AN U
SC
proteome