Investigation of the global protein content from healthy human tears

Investigation of the global protein content from healthy human tears

Accepted Manuscript Investigation of the global protein content from healthy human tears Marianne Dor, Simone Eperon, Patrice H. Lalive, Yan Guex-Cros...

782KB Sizes 0 Downloads 25 Views

Accepted Manuscript Investigation of the global protein content from healthy human tears Marianne Dor, Simone Eperon, Patrice H. Lalive, Yan Guex-Crosier, Mehrad Hamedani, Cindy Salvisberg, Natacha Turck PII:

S0014-4835(18)30356-7

DOI:

10.1016/j.exer.2018.10.006

Reference:

YEXER 7504

To appear in:

Experimental Eye Research

Received Date: 11 May 2018 Revised Date:

1 September 2018

Accepted Date: 11 October 2018

Please cite this article as: Dor, M., Eperon, S., Lalive, P.H., Guex-Crosier, Y., Hamedani, M., Salvisberg, C., Turck, N., Investigation of the global protein content from healthy human tears, Experimental Eye Research (2018), doi: https://doi.org/10.1016/j.exer.2018.10.006. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

ACCEPTED MANUSCRIPT

Investigation of the global protein content from healthy human

2

tears

3

Authors: Marianne Dor1,2, Simone Eperon3, Patrice H. Lalive4, Yan Guex-Crosier3,

4

Mehrad Hamedani3, Cindy Salvisberg2, Natacha Turck1,2*.

RI PT

1

5 6

Affiliations:

7

1

8

University of Geneva, Geneva, Switzerland

9

2

SC

OPTICS Group, Department of Internal Medicine Specialties, Faculty of Medicine,

M AN U

OPTICS Group, Department of Human Protein Science, Faculty of Medicine,

10

University of Geneva, Geneva, Switzerland

11

3

12

Fondation Asile des Aveugles, Lausanne, Switzerland

13

4

14

Switzerland

Department of Ophthalmology, University of Lausanne; Jules-Gonin Eye Hospital,

15

TE D

Division of Neurology, Geneva University Hospitals, Faculty of Medicine, Geneva,

*Corresponding author:

17

Email: [email protected].

19 20 21

AC C

18

EP

16

22 23 24 25 1

ACCEPTED MANUSCRIPT 26

Abstract

28

Considering absence of invasiveness and side effects, tears emerge as a particularly

29

attractive fluid for biomarker discovery and therefore for daily clinical use. However,

30

to date this fluid remains poorly studied in healthy condition. Here, we present an

31

updated in-depth characterisation of the human healthy tear protein composition

32

using proteomics approach. Both eyes of 8 healthy controls (4 men and 4 women,

33

average age: 38 ± 18) were collected using the Schirmer’s strip method. After liquid

34

digestion and off-gel electrophoresis fractionation, three independent proteomics

35

analyses were performed on a LTQ-Orbitrap Velos Pro. Globally, 1351 proteins were

36

identified with 2 unique peptides and 1% FDR. Gene ontology analyses showed up

37

that 39% of the tear proteins were enzymes, with high numbers of dehydrogenases,

38

phosphatases, kinases and ligases. Immunoglobulins, serpins and 14-3-3 domains

39

proteins also emerged as the main tear protein families. The glycolysis and the

40

coagulation and complement cascades, which were already shown in tears as

41

involved in ocular and systemic diseases, were highlighted performing pathway

42

analyses. Our study therefore complements the existing data on healthy tears

43

proteome. Nevertheless, extensive studies for deeply and definitively characterise

44

this promising fluid are required in the near future in order to be able to routinely use

45

this fluid in clinics. A better understanding of its protein content will probably open

46

new avenues in the biomarker discovery and clinical practice in the near future.

AC C

EP

TE D

M AN U

SC

RI PT

27

47

48

Keywords: tears; proteome; Off-Gel Electrophoresis; Gas-Phase Fractionation;

49

integrative pathways. 2

ACCEPTED MANUSCRIPT 50

1. Introduction

52

It is now well known that tears play a key role in the correct function and health status

53

of the eye by providing oxygen and nutriments to the ocular surface cells and

54

improving the optical properties of the eye (lubricating eye surface). Furthermore,

55

tear fluid protects the eye’s epithelium thanks to antibacterial properties and by

56

flushing contaminants from the ocular surface. Surprisingly, until now tears and their

57

clinical relevance have been relatively poorly studied. This is particularly illustrated by

58

searching the keywords “Eye tears and biomarkers” in Pubmed. Only 268 items were

59

found during the two last decades (between 1998/01 and 2018/01). In the same

60

period, 264’224 publications containing the words “blood” and “biomarkers” were

61

published. However, with non-invasive, easy and rapidly collected samples, tear-

62

based approaches open up new routes for diagnostic methods and for deeper

63

understanding both ocular and systemic diseases. Differences in the tear protein

64

pattern of patients suffering from diabetic retinopathy (1-3), Graves’ Orbitopathy (4,

65

5), dry eye disease (6-11), multiple sclerosis (12, 13) and even breast cancer (14) or

66

renal failure (15) have already been highlighted. Consequently, the ability to measure

67

modifications in human tear content offers promising opportunities for screening not

68

only ocular but also systemic diseases and for discovering potentially new

69

biomarkers for these clinical situations.

70

At this stage, in order to be as performing as possible, the use of tears for biomarker

71

discovery and clinics requires well knowing and understanding their global protein

72

content, mainly in healthy subjects. To the best of our knowledge, to date, only three

73

major studies (16-18) deeply investigated the proteome of human healthy tears. Even

AC C

EP

TE D

M AN U

SC

RI PT

51

3

ACCEPTED MANUSCRIPT if these studies are technically and biologically relevant, this is probably not enough

75

to definitively close the question of the complex composition of this fluid in healthy

76

condition. However, in the past three years, there was an increasing number of

77

published papers about tear proteomics, in both ocular and systemic diseases (10,

78

11, 19-22). It is not easy to understand or explain why tears, unlike other biological

79

fluids, just start to arouse real clinical interest. It is likely that the relatively low

80

volumes available for collection contributed significantly to the low numbers of ‘omics’

81

studies reported so far. Indeed, an average between 5 and 15 µl of tears can be

82

collected per eye, which is not sufficient for some ‘omics’ technics (for example,

83

Nuclear Magnetic Resonance usually required around 100µl). Due to their

84

localisation close to the eye, for a long time tears were probably not considered as a

85

relevant biofluid to study systemic diseases. Moreover, such as albumin within blood,

86

tears contain some abundant proteins that makes more difficult the detection of other

87

proteins. In previous studies investigating tear proteome, either the pool of samples

88

were rather small (one sample for De Souza et al.(17) and four samples for Zhou et

89

al.(18)) or the protein list was obtained with several extraction methods from the

90

Schirmer strip (Aass et al.(16)). In this context, we designed our study with eight

91

samples divided in three pools for our experiments and we used only one extraction

92

method from the Schirmer. Moreover, we fractionated our peptides thanks to the off-

93

gel electrophoresis method, which was not well described for tear investigation yet.

94

To conclude, compared to the previous articles, we made improvements that allowed

95

to enhance tear understanding. In this context, using mass spectrometry-based

96

approaches, our study proposes an updated and extended list of tear proteins which

97

could be used as additional reference list for clinical applications in the field of the

98

biomarker research.

AC C

EP

TE D

M AN U

SC

RI PT

74

4

ACCEPTED MANUSCRIPT 99

100

2. Material and methods

101

2.1.Subjects.

103

Healthy subjects (N=8; experiment 1: 2 women aged 59 and 61 years; experiment 2:

104

1 woman, 2 men, aged 21, 24 and 60 years respectively; experiment 3: 1 woman, 2

105

men aged 26, 24 and 26 years respectively) were included in this study. Inclusion

106

criteria were: subjects aged ≥ 18 and ≤ 70 years, any gender and ethnicity, mm of

107

Schirmer >10mm in 5 minutes. Exclusion criteria were: any eye allergy, conjunctivitis,

108

any eye pathology, acute infectious disease, make-up or contact lenses, cancer

109

and/or chronic autoimmune disease. In accordance with the Declaration of Helsinki,

110

written informed consents were obtained from these subjects. The cantonal ethics

111

committee for research on human beings has approved the patient’s informed

112

consent form and the use of biological material (approbation N°516/12, Dec 2012).

SC

M AN U

TE D

EP

113

RI PT

102

2.2. Sample collection.

115

Tears were collected using Schirmer-Plus® paper strip (Biotech Vision Care PVT

116

LTD, Gujarat, India). To avoid any discomfort to subjects, collection was restricted to

117

a maximum of 5 minutes. No external stimulation was done to collect the basal fluid.

118

Topical anaesthesia should be avoided because it reduces tear production. For each

119

patient, tear samples were taken from both eyes. Persons collecting tears wore

120

gloves in order to avoid any contamination. Care was taken to avoid damage to the

121

conjunctive surface and local eye irritation. External factors such as harsh lighting,

AC C

114

5

ACCEPTED MANUSCRIPT background noise and extreme room temperature, all known to affect the content of

123

samples, were strictly supervised in order to ensure satisfactory reproducibility. The

124

strip was then inserted in a tube on ice and centrifuged at 7840 g for 7 min at 4°C

125

without any additional buffer, as described elsewhere (21, 23, 24). After

126

centrifugation, tear samples were immediately stored at -80°C until analysis. In order

127

to verify the absence of cellular contamination in our samples, we performed a

128

haematoxylin and eosin (H&E) stain in order to color the potential cells. 5µl of tears

129

were deposited on a slide. The slide was heated 5 minutes at 60°C then fixed with 50

130

µl of 10% formalin (10 minutes). 3 washes were done with H2O and another stain

131

was done with 20µl of Hemalun (5 min). The slide was washed 3 times with H2O,

132

stained with 20µl of Eosin (5 min), washed 3 times with H2O then 3 times with 10%

133

alcohol. The slide was finally washed 3 times with the Neo-Clear solution (Merck,

134

Darmstadt, Deutschland). The same protocol was applied to the positive control, i.e.

135

human astrocytes (around 5’000 cells, suspended in PBS 1X, added on a Schirmer

136

strip and centrifuged under the same conditions as for control tear strip) and negative

137

control, i.e. PBS. The conclusion was that the Schirmer’s method did not induce cell

138

contamination in our samples.

EP

TE D

M AN U

SC

RI PT

122

139

Table 1: Moisture length (mm) of Schirmer strip for the cohort.

AC C

140

Subject

Age

Sex*

mm Schirmer (right eye)

mm Schirmer (left eye)

C1

59

W

12

13

C2

61

W

14

19

C3

21

W

15

18

C4

24

M

25

25

C5

60

M

25

15

C6

26

W

16

14

C7

24

M

32

23

6

ACCEPTED MANUSCRIPT C8

26

M

13

12

*: M: man; W: woman

142

2.3. Total protein assay.

143

The protein concentration of the pooled tear sample (from both eyes of each subject)

144

was determined by performing a Bradford Assay according to manufacturer’s

145

recommendations (Protein assay Dye reagent concentrate, Bio-Rad, Hercules, US-

146

CA). The absorbance was measured at 595 nm with a spectrophotometer (Ultrospec

147

2100 Pro, Amersham Biosciences) and the protein concentrations of the three pools

148

were determined using a bovine serum albumin (BSA) calibration curve. They were

149

estimated at 12.9 µg/µl, 11.06 µg/µl and 9.4 µg/µl for the experiments 1, 2 and 3

150

respectively.

M AN U

SC

RI PT

141

151

2.4. Reduction, alkylation and digestion.

153

Three independent proteomics experiments were done. Experiment 1 was performed

154

with a pool of two healthy subjects, experiments 2 and 3 were done using two

155

different pools of three healthy subjects (both eyes of the subjects in all three

156

experiments). 60 µg of proteins were used for all experiments. They were dried under

157

speed-vacuum; then urea (33 µl of 6M; Merck, Darmstadt, Deutschland) diluted in

158

Triethylammonium bicarbonate buffer (0.1M; TEAB, Sigma-Aldrich, Saint Louis, US-

159

MO) and tris-(2-carboxyethyl)phosphine (2 µl of 50 mM; TCEP, Sigma-Aldrich, Saint

160

Louis, US-MO) were added in each tube. After incubation at 37°C during 1 hour,

161

iodoacetamid (1 µl of 400 mM; Sigma-Aldrich, Saint Louis, US-MO) was added and

162

tubes were incubated 30 minutes of in the dark. TEAB (67 µl of 0.1M) was added,

163

then a liquid trypsin digestion (1:20 ratio, 1 microgram of enzyme to 20 micrograms of

164

protein, porcine origin, Promega Corporation, Madison, US-WI) was done overnight

165

at 37°C.

AC C

EP

TE D

152

7

ACCEPTED MANUSCRIPT 166

2.5. Off-gel electrophoresis (OGE).

168

Before OGE, samples were dried under speed-vacuum then purified by using

169

Macrospin columns (Harvard Apparatus, Holliston, US-MA) according to

170

manufacturer’s recommendations. Tubes were dried under speed-vacuum and A

171

3100 OFFGEL Fractionator (Agilent technologies, Santa Clara, US-CA) was

172

performed over night to separate the sample. Guidelines available in Agilent

173

datasheet were followed, using a 13cm IPG strip (Immobiline DryStrip pH 3-10, 13cm

174

GE Healthcare, Little Chalfont, UK) and 12 OGE wells (25, 26). After fractionation,

175

microspin columns (Harvard Apparatus, Holliston, US-MA) were used according to

176

the manufacturer’s recommendations and the 12 fractions of each experiment were

177

dried under speed-vacuum. Peptide concentration of the fractions was theoretically

178

approximated, considering that 1/12 of the pooled sample was found in each fraction

179

after OGE.

SC

M AN U

TE D

180

RI PT

167

2.6. Mass spectrometry (MS) analyses.

182

The fractions resulting from OGE were dissolved in 94.9% H2O / 5% Acetonitrile

183

(ACN) / 0.1% Formic acid (FA). For the three experiments, 1µg of fraction was

184

injected in GPF4 mode (27), meaning that each fraction of each experiment was

185

injected four times (GPF1, GPF2, GPF3 and GPF4). They were analysed by tandem

186

MS (Liquid Chromatography-MS/MS) using a Linear Trap Quadrupole (LTQ) Orbitrap

187

Velos Pro (ThermoFisher instruments, San Jose, US-CA) coupled to a nanoflow high

188

pressure liquid chromatography (HPLC, NanoAcquity system from Waters, Milford,

189

US-MA). Peptides were trapped on a home-made 5 µm 200 Å Magic C18 AQ

190

(Michrom) 0.1 × 20 mm pre-column and separated on a commercial 0.075 x 150 mm

AC C

EP

181

8

ACCEPTED MANUSCRIPT Nikkyo (Nikkyo Technology, Tokyo, JPN) analytical nanocolumn (C18, 5 µm, 100 Å).

192

More precisely, trapping was done during 15 minutes with a flow rate of 3 µl/min

193

using a gradient of H2O/FA 99.9%/0.1% (solvent A) and CH3CN/FA 99.9%/0.1%

194

(solvent B), where 95% of solvent A were mixed with 5% of solvent B. Then the

195

analytical separation was run for 85 min with a flow rate of 220 nl/min as follows: 0–1

196

min 95% A and 5% B, 1-55 min 65% A and 35% B, 55-65 min 20% A and 80% B, 65-

197

67 min 20% A and 80% B, 67-69 min 95% A and 5% B and 69-85 min 95% A and

198

5% B. min. For MS survey scans, the OT resolution was set to 60000 and the ion

199

population was set to 5 × 105 with an m/z window from 400 to 2000. Five precursor

200

ions were selected for collision-induced dissociation (CID) in the LTQ. The

201

normalised collision energies were set to 35% for CID. The different m/z windows for

202

the gas-phase fractions were set as following: 400-520 for GPF1, 515-690 for GPF2,

203

685-979 for GPF3 and 974-2000 for GPF4. Exclusion duration was set at 45

204

seconds, with a repeat count of 1. Exclusion was done by mass, with a low m/z = 0.1

205

and a high m/z = 1.1. Peak lists and resulting files, combined from the different

206

experiments, were searched against the uniprot_sprot (2017_05_10) database using

207

Thermo Proteome Discoverer (version 2.2.0388; Thermo Fisher Scientific, San Jose,

208

US-CA). Oxidised methionine was set as variable amino acid modifications and

209

carbamidomethylation of cysteines were set as fixed modification. Trypsin was

210

selected as the enzyme, with one potential missed cleavage. The precursor mass

211

tolerance and the fragment mass tolerance were 10 ppm and 0.6 Da respectively. A

212

false discovery rate (FDR) of 1% was selected at peptide and protein levels and only

213

the Master proteins were kept. The list of identified protein was generated containing

214

proteins matching with two different and unique peptide sequences. The datasets

215

supporting the conclusions of this article are available in the ProteomeXchange

AC C

EP

TE D

M AN U

SC

RI PT

191

9

ACCEPTED MANUSCRIPT 216

Consortium repository via the PRIDE database (submission identifier: PXD008702).

217

The reviewer account details are the following: username: [email protected];

218

password: ycE67P81.

RI PT

219

2.7. Process and pathway analyses.

221

Database for Annotation, Visualization and Integrated Discovery (DAVID), a web-

222

accessible program, was used to perform the Gene Ontology (GO) Analysis

223

classification of the identified proteins (28). Options were the following: high

224

classification stringency and p-Value for the GO term > 0.05. Some subcategories

225

were grouped in order to reduce the numbers of GO categories and simplify the pie

226

charts.

227

Search Tool for the Retrieval of Interacting Genes/Proteins /STRING) database was

228

used to classify the proteins into protein families (29). We considered a count in gene

229

set > 5 and a p-Value > 0.05 for relevant protein families.

230

Kyoto Encyclopedia of Genes and Genomes (KEGG) was also used to study and

231

visualise the pathways in which the proteins were involved (30). We set a threshold

232

count of 10, with a p-Value and adjusted p-Value (Benjamini-Hochberg correction)

233

>0.05.

M AN U

TE D

EP

AC C

234

SC

220

A manual merge of the protein lists obtained with the three experiments was done

235

using Venny (http://bioinfogp.cnb.csic.es/tools/venny/index.html).

236

Top 10, 20, 50, 100 and 200 of the most abundant proteins for the three experiments

237

were obtained by classifying the proteins according to their Peptide Spectrum

238

Matches (PSM). PSM of a protein corresponds to the number of identified peptide

239

spectra matched for this protein. The same classification was done to obtain the top-

240

100 and top-200 of the less abundant proteins of the different experiments. 10

ACCEPTED MANUSCRIPT 241

3. Results

243

In order to obtain a robust in-depth characterisation, three independent proteomics

244

experiments were performed on three different pools of healthy tears. Individual and

245

general lists of proteins were then generated, analysed and compared to the

246

literature. The study design is summarised in the graphical abstract. In addition,

247

comparisons between the tear protein content with other ocular fluids will be

248

proposed.

M AN U

SC

RI PT

242

249

3.1. Characterisation of the human tear proteome

251

By combining all the experiments, 1351 proteins with 2 unique peptides were

252

identified. By excluding the keratins, which are suspected to come from

253

contamination during the tear collect, the original list was finally reduced to 1337

254

proteins with 2 unique peptides (874, 837 and 1143 proteins for experiment 1, 2, and

255

3 respectively; global and detailed lists in S1 dataset).

256

Interestingly, 45.5% (608 proteins) were in common between the three different

257

experiments, which is a good overlapping considering the variabilities induced by

258

intervariability. Other variabilities could also have been induced during trypsin

259

digestion (31), off-gel fractionation and mass spectrometry injections (32). For further

260

analyses, only list of proteins without keratins was considered.

AC C

EP

TE D

250

261 262

Top-10 of the most abundant proteins for each experiment was highly stable with 7

263

proteins always present whatever the experiment observed (Table 2). Interestingly, 11

ACCEPTED MANUSCRIPT the PSM corresponding to these 7 proteins represented between 27.7% (experiment

265

3) and 36.7% (experiment 1) of the total PSM detected. As expected, the rank one

266

protein was the lactotransferrin, one of the major proteins of the tears, involved in the

267

anti-inflammatory and antimicrobial processes (33). This point highlighted the

268

importance of tears to maintain a very clean environment for the eyes. The other six

269

common proteins between our three experiments were lipocalin-1, also known for its

270

antimicrobial activity (34), serum albumin, lysozyme C, immunoglobulin heavy

271

constant alpha 1, immunoglobulin kappa constant and polymeric immunoglobulin

272

receptor. Moreover, even if some differences in term of abundance (PSM number)

273

have been observed, the presence of the 5 first proteins (lactotransferrin, lipocalin-1,

274

lysozyme C, serum albumin, immunoglobulin heavy constant alpha 1) were largely

275

conserved in all experiments. The principal tear proteins are directly secreted by

276

lacrimal glands (lactotransferrin, epidermal growth factor (EGF), tear lipocalin,

277

secretory immunoglobulin A (sIgA)) or by lysosomes (lysozyme, which was found in

278

the three experiments in the top 10 more abundant proteins). However, some serum

279

proteins such as albumin, transferrin, IgG and IgM were also found in the tear fluid

280

probably as a result of passive transport from the blood and are suggestive of serum

281

leakage. It is also known that cell infiltrating conjunctiva (T cells, B cells among

282

others) secrete Igs and cytokines in various conditions (35). Finally, local contribution

283

of the neighbouring cells releasing proteins in tears cannot be excluded.

284

By expanding the comparison to the top-20, top-50 or top-100, the percentages of

285

proteins in common were still relatively high (64%, 45.1% and 52.42%, respectively).

286

At the opposite, top-100 and top-200 of the less abundant proteins presented only

287

0.1% and 1.6% of proteins in common between the three experiments. The relatively

AC C

EP

TE D

M AN U

SC

RI PT

264

12

ACCEPTED MANUSCRIPT 288

low numbers of unique peptides of these less abundant proteins may explain why

289

they were found in some experiments but not in all three.

290

Table 2: Top 10 proteins of the three experiments. Protein names Lactotransferrin Lipocalin-1 Lysozyme C Serum albumin Immunoglobulin heavy constant alpha 1 Immunoglobulin alpha-2 heavy chain Polymeric immunoglobulin receptor Immunoglobulin heavy constant alpha 2 Prolactin-inducible protein Immunoglobulin kappa constant

PSMs

AC P02788 P02768 P61626 P01876 P31025 P01834 P01833 P0DOX2 P25311 P01024

Protein names Lactotransferrin Serum albumin Lysozyme C Immunoglobulin heavy constant alpha 1 Lipocalin-1 Immunoglobulin kappa constant Polymeric immunoglobulin receptor Immunoglobulin alpha-2 heavy chain Zinc-alpha-2-glycoprotein Complement C3

PSMs

EP

TE D

Protein names Lactotransferrin Lysozyme C Serum albumin Lipocalin-1 Immunoglobulin heavy constant alpha 1 Prolactin-inducible protein Polymeric immunoglobulin receptor Immunoglobulin kappa constant Zinc-alpha-2-glycoprotein Immunoglobulin kappa light chain

AC C

Pep

3074 1252 1109 1021 857 546 541 521 505 389

AC P02788 P61626 P02768 P31025 P01876 P12273 P01833 P01834 P25311 P0DOX7

293

294

M AN U

292

PSMs

RI PT

AC P02788 P31025 P61626 P02768 P01876 P0DOX2 P01833 P01877 P12273 P01834

SC

291

60 14 12 54 13 8 28 2 9 2

Pep 1570 906 751 649 558 487 385 344 268 244

60 13 57 12 14 12 33 2 21 3 Pep

1913 1358 1179 677 586 432 353 336 305 288

54 52 10 11 10 2 27 8 20 75

295

From top to bottom: experiment 1,2 and 3. Bold: same position in the three lists.

296

Italic: present in the three lists. AC: accession number on UniProt; PSMs: Peptide

297

Spectrum Matches for each protein; Pep: number of unique peptides for each protein.

298 13

ACCEPTED MANUSCRIPT 3.2. Biological analyses of the tear proteome

300

Using the functional annotation-clustering tool from DAVID, the 1337 proteins were

301

classified according to their molecular functions, cellular components and biological

302

processes (Figs 1a, b and c respectively). The cadherin-binding involved in cell-cell

303

adhesion (62%, with for instance some eukaryotic translation elongation factors and

304

several capping actin proteins) and the enzyme activity (16%, containing a lot of

305

alcohol dehydrogenases and aldo-keto reductase) were the two main subcategories

306

emerging from GO molecular functions. For cellular components, half of the proteins

307

were linked to the cell-cell adherens junction (56%), an observation that was

308

confirmed by the biological process classification (cell-cell adhesion 46%).

M AN U

SC

RI PT

299

309

Around 39% (518 proteins) of the 1337 proteins appeared to be part of the enzyme

311

protein classes (detailed lists are available as S2 dataset). By classifying these

312

enzymes using UniProt, the three major subclasses were hydrolases (41%, 212

313

proteins), transferases (27%, 138 proteins) and oxidoreductases (15%, 80 proteins)

314

(Fig 2). In the hydrolases category, we highlighted for example the phospholipase A2

315

(26, 15 and 15 PSMs in the three experiments) but also the lactotransferrin and the

316

lysozyme C, which were present in the three top 10 proteins. Several mitogen-

317

activated protein kinases (mitogen-activated protein kinase 1, 3, 13, 14) were found

318

in the transferases category and the oxidoreductases contained a lot of

319

dehydrogenases such as 4 alcohol dehydrogenases (alcohol dehydrogenase, alcohol

320

dehydrogenase 1B, alcohol dehydrogenase 1C, alcohol dehydrogenase class 4,

321

alcohol dehydrogenase class 3) and retinal dehydrogenase 1.

AC C

EP

TE D

310

322 14

ACCEPTED MANUSCRIPT After noticing the large amount of enzymes in our proteins, we were interested by

324

studying the other major protein families. Using STRING, which is a biological

325

database for protein-protein interactions, it appeared that serpins (10 proteins) and

326

14-3-3 domain (7 proteins) were among the top five protein families. Both families

327

have been extensively studied for various cellular functions. 14-3-3 family has been

328

shown as a potential marker for ocular hypertension induced by glaucoma (36)

329

andseveral serpins such as serpinA1 (serpin peptidase inhibitor clade A member 1)

330

or serpinB1 (plasminogen activator inhibitor-1) were proposed as targets for

331

diagnosis but also therapy in cancer (37-39). Tracing these proteins in tears, in

332

response to a treatment or to predict patient outcome for examples, could be

333

clinically relevant and open new way to manage cancer patients. Until now, no

334

information related to the eye can be found concerning these protein families, but we

335

may suggest that it is related to the weak overall investigation of tears.

M AN U

SC

RI PT

323

TE D

336

Furthermore, KEGG analyses highlighted two notable biological pathways, the

338

glycolysis/gluconeogenesis (31 proteins, p-Value = 4.6E-14, with phosphoglycerate

339

kinase (EC 2.7.2.3) and the ATP-dependent 6-phosphofructokinase (EC 2.7.1.11))

340

(Fig 3) and the coagulation and complement cascades (38 proteins, p-Value = 1.7E-

341

20

342

glycoprotein) (Fig 4). Alterations of these pathways have been already shown to be

343

involved in some eye diseases such as Age-Related Macular Degeneration (AMD) or

344

diabetic retinopathy (40, 41). Yokosako et al.(40) described the urinary levels of

345

lactate and pyruvate as deregulated in AMD patients. In the present study, we were

346

able to identify the lactate dehydrogenase and the pyruvate kinase, which may play a

347

role in the deregulation observed by Yokosako et al.(40). As the AMD is an ocular

AC C

EP

337

, among other the complement factor H, the complement factor 3 and the CD59

15

ACCEPTED MANUSCRIPT disease, it could be relevant to investigate if the levels of these two proteins are also

349

deregulated in tears. Karamichos et al.(42) highlighted the

350

glycolysis/gluconeogenesis as part of the most significantly affected pathways in

351

case of keratoconus (KC), which is a non-inflammatory corneal disease. They noticed

352

the 1,3 diphosphoglycerate and the 3-phosphoglycerate as significantly up-regulated

353

in patients suffering from KC compared to healthy controls. Interestingly, we

354

identified in tears the glyceraldehyde-3- phosphate dehydrogenase and the

355

phosphoglycerate kinase, which may have an influence on the metabolites studied by

356

Karamichos et al.(42). De Souza et al.(17) also reported in their tear study a high

357

number of hydrolases involved in the glycolysis. Since the glycolysis takes place

358

within the cells, the source of these proteins might be damaged cells. These cells

359

could be epithelial cells covering the eyes but we cannot exclude that lacrimal cells

360

released them. The fact that these proteins were deregulated in a disease context

361

emphasises their role or importance. Regarding the complement and coagulation

362

pathways, proteins that are involved in play a major role in the protection of the

363

ocular surface (43, 44) but also in several diseases (41). As one of the tear main

364

functions is protection and lubrication of the eye, it is completely relevant to find some

365

proteins related to these processes. Detecting in patient tears involved at different

366

levels in these pathways is clinically relevant and offer new therapeutic strategies.

SC

M AN U

TE D

EP

AC C

367

RI PT

348

368

3.3. Consistency with the existing knowledge

369

To date, only three major publications focused on healthy tear proteome and

370

reported 491, 1543 and 1526 identified proteins respectively (16-18). Summary of

371

these studies compared to our data were described in Table 3. In order to compare

372

them with our study, the same stringent identification criteria that we used were 16

ACCEPTED MANUSCRIPT applied: identification of proteins with at least 2 unique peptides, only reviewed

374

proteins and exclusion of keratins. Subsequent new lists of identified proteins were

375

generated for each study, greatly reducing their lists at 478, 1026 and 662 proteins.

376

By merging all the information (the 3 published studies and our study), 1620 proteins

377

were identified in tears (Fig 5). Only 197 proteins (11.2%; S3 dataset) were in

378

common across all 4 studies. Among them, a mean of 55% were found in the top-200

379

of the most abundant proteins of our experiments (53.6%, 53.3% and 55.3% in the

380

experiment 1, 2 and 3 respectively). At the opposite, between 1% and 7.1% of these

381

197 proteins were found in the 200 less abundant proteins of the experiments (4.6%,

382

7.1% and 1% in the experiment 1, 2 and 3 respectively). Therefore, the less

383

abundant proteins are probably the main cause of variability between these studies.

384

This result is in correlation with the low number of shared proteins between the three

385

top-100 and top-200 less abundant proteins of our own experiments. It is then

386

important to enhance the identification of these low abundant proteins, as they are

387

more likely to contain potential biomarkers for diseases than abundant proteins.

SC

M AN U

TE D

388

RI PT

373

Furthermore, 425 proteins (24.1%) were described for the first time in our study (S4

390

dataset), including some translation initiation factors (eukaryotic translation initiation

391

factors 2A and subunit 2, factor 3 subunits A, B, C, D, E, F, M, factors 4 gamma 1

392

and 2, factor 5), ribosomal proteins (60S ribosomal protein L14, L17, L18a, L22, L23,

393

L24, L3, L30, L32, L9, 40S ribosomal protein S2, S21, S4, S5, S7 and ribosomal

394

protein S6 kinase alpha-1, alpha-3) and mitogen-activated protein kinases 1, 13 and

395

14. An important point is that around 40% of these 425 proteins were identified at

396

least in two out of our three experiments (S4 dataset), reinforcing our results. If we

397

exclude these 425 specific proteins from our three lists of top-100 and three lists of

AC C

EP

389

17

ACCEPTED MANUSCRIPT top-200 less abundant proteins, it does not greatly change the low percentages of

399

shared proteins which were previously found (2% instead of 0.1% between the three

400

top-100 and 5.7% instead of 1.6% between the three top-200). We specifically

401

identified some ribosomal proteins, which are part of small guanosine

402

trisphosphatases (GTPases) (45). Depending on the binding to guanosine

403

trisphosphate (GTP) or guanosine diphosphate (GDP), Rab proteins (Ras-related in

404

brain) are active or inactive, respectively (46). They are involved in the membrane

405

traffic and vesicles formation and transport. Several mitogen-activated protein

406

kinases which are in charge of the signal transduction by the phosphorylation of

407

serine and threonine protein residues were also found (47). They could be released

408

after apoptosis of the cells in contact with tears. Why did we find these proteins

409

specifically in our study and how could we explain the low number of proteins in

410

common (192) between the studies? The use of different technical workflows (Table

411

3) may bring some answers to these questions.

SC

M AN U

TE D

412

RI PT

398

Firstly, the method of sample collection has an impact on tear composition (23). De

414

Souza et al.(17) used microcapillary glass tubes whereas the other studies sampled

415

tears with Schirmer’s strip. Schirmer method is reported to be more comfortable and

416

pleasant for the collected persons, easier to handle and can not wound neither

417

cornea, nor conjunctiva. On the other side, capillary tube was described as requiring

418

more technical expertise from the person who collected tears. The good angle and

419

position of the capillary on the ocular surface are not trivial, which frequently need

420

capillary tube repositioning. Thereby, more risks to seriously damage the conjunctiva

421

and/or the cornea are associated to this technique. Finally, collection with a capillary

422

tube takes more time since it must be stopped when the patient moves or blinks.

AC C

EP

413

18

ACCEPTED MANUSCRIPT Even if this technique is suspected to limit cell contamination from conjunctival or

424

epithelial cells compared to Schirmer collection, this cannot be completely excluded.

425

Posa et al. (23) concluded that both methods were suitable for protein analyses. Yet,

426

they reported that all subjects experienced the Schirmer strip much more pleasant

427

compared to the capillary tube. Our personal data also support the conclusion of

428

Posa et al. (23), so regarding the balances between various advantages and few

429

limitations, we chose the Schirmer test. However, the use of capillary tubes in the

430

study of De Souza et al.(17) could be part of the reasons why only 197 proteins were

431

commonly found in the 4 studies. Indeed, in the literature higher protein contents and

432

concentrations were obtained using Schirmer strip method (48-50). Another

433

parameter that was different between the studies is the Schirmer extraction method.

434

Whereas we only centrifuged the Schirmer’s strip to collect tears, Zhou et al.(18)

435

used extraction buffer comprising 100 mM of ammonium bicarbonate and protease

436

inhibitor. Aass et al. (16) worked with different buffers containing NH4HCO3, NaCl, a

437

surfactant, or a combination of the three, the consequences on recovered proteins

438

following these extraction methods were not investigated yet.

439

Another different technical point could be the off-gel electrophoresis fractionation

440

(based on the isoelectric point of the peptides) while Aass et al.(16) and Zhou et

441

al.(18) used strong cation exchange (SCX) chromatography. It has been proven that

442

both methods were suitable to separate a complex sample and enhance the number

443

of identified proteins (51, 52). A few studies compared these two methods (53, 54).

444

The off-gel electrophoresis provides the isoelectric point of the peptides, a useful

445

information, and shows a high resolution, a high sample loading and a flexibility

446

concerning the choice of the pH gradient (25, 51, 55). This allows to focus on a

447

specific pH range, to better separate the samples in this pH gradient. The SCX

AC C

EP

TE D

M AN U

SC

RI PT

423

19

ACCEPTED MANUSCRIPT method is described to be a faster method (25), but evidences about enhancing the

449

number of identified proteins are contradictory (53, 55, 56). As expected, the

450

methods lead to different protein profiles (55, 57). We also would like to mention that

451

we did not measure peptide concentration of the fractions after OG. Regarding the

452

difficulty in quantifying peptides (58, 59) and the low quantity of material per fraction,

453

we estimated the concentration by considering a homogeneous quantity of peptides

454

in each fraction.

455

Peptide length and tryptic missed cleavage sites could also explain the differences

456

between the 4 studies that we observed. While we used LTQ-Orbitrap like Aass et

457

al.(16), Zhou et al.(18) performed their analyses with a TripleTOF 5600. On another

458

hand, Ass et al.(16) supplemented the trypsin with Lys-C enzyme. This combination

459

of two proteases was described to enhance the efficiency of the digestion, and

460

influences the identified proteins (60).

461

Like Zhou et al.(18) and Aass et al.(16), we pooled different collected samples, while

462

De Souza et al.(17) analysed one single subject. Pooling different samples permitted

463

to reach a sufficient quantity and work with an average sample in term of peptide

464

content (61) in the analysed sample. However, pooling loses the sample identity and

465

the inter-variability information. In our case, we were interested in the global tear

466

proteome so we chose to pool samples.

SC

M AN U

TE D

EP

AC C

467

RI PT

448

468

In our pie charts, we did not obtain the same category names as Zhou and al. (18),

469

probably because of a different software version. Nevertheless, the main categories

470

seemed similar for each GO classification, which was completely relevant with the

471

fact that almost 50% (776) of proteins were found in common in their study and ours. 20

ACCEPTED MANUSCRIPT This observation strengthened our results but also emphasised the need to analyse

473

more samples, to consider the potential effects of physiological parameters (age,

474

gender, circadian rhythm and potential drugs) and to standardise the proteomics

475

workflow.

RI PT

472

476

Our method presents the advantage not to use extraction buffer, compared to Zhou

478

et al.(18) and Aass et al.(16). Therefore, the samples will be available for any kind of

479

experiments and analysis methods, without any interferences. Moreover, no study

480

checked whether the extraction buffer well preserved the samples. We recommend

481

including an equal number of men and women of various ages, to properly cover tear

482

proteome. Indeed, some recent studies started to investigate how both age and sex

483

could influence both tear composition and characteristics (62-64). The use of OGE

484

was also an added value, as this method was only described for tears in another

485

article from our team in a disease context (13). This alternative fractionation method

486

probably contributed to the specifically identified proteins of our study. However, the

487

current methodology has limitations. First, Schirmer strip extraction without buffer will

488

not allow studying ocular diseases leading to dry eye. Indeed, Schirmer moisturing

489

could be too short, resulting in 1 or 2-µl sample, not suitable for further analyses. In

490

these cases, using extraction buffers or other collecting methods such as a polyester

491

fiber rod or microcapillary tube would be more efficient (10, 11). Moreover, due to its

492

multiple steps, this methodology is not easy to handle in a clinical study with many

493

samples. However, our goal was to describe a methodology which can be used in the

494

first step of research projects, usually done with limited number of samples.

AC C

EP

TE D

M AN U

SC

477

495 496

Table 3: Comparison of the four tears proteome studies. 21

ACCEPTED MANUSCRIPT De Souza et al.

Zhou et al.

Aass et al.

Dor et al.

1 healthy patient

4 healthy subjects

3 healthy subjects

8 healthy subjects

Microcapillary tube

Schirmer's strip

Schirmer's strip

Schirmer's strip

Protein quantity (µg)

NM (13 µl)

400

NM

180

Pool

/

Yes

Yes

Yes

In-gel digestion

In-solution

In-solution

In-solution

Trypsin

Trypsin

Lys-C+ trypsin

Trypsin

SDS-PAGE

Offline SCX

Offline SCX

Digestion

RI PT

Collection

Off-Gel

Fractionation

NM LTQ-Orbitrap +

Mass spectrometer

2% ACN

Triple TOF 5600

LTQ-FT FDR <0.1%, 2

NM

5% ACN

LTQ-Orbitrap

LTQ-Orbitrap

M AN U

Injection MS buffer

SC

Electrophoresis

FDR <1%,

Identification

unique peptides

conditions

(Orbitrap), 1 peptide

FDR <1%, 1 peptide unspecified number FDR <1%, 1 peptide of peptide

identification



Stringent

497

*



1543

1526

1351



478

1026

662

1337

AC C

identification

491

EP

Maximum

TE D

(FT)

498

NM: not mentioned; Lys-C: proteinase lys-C; FT: Fourier transform; †: with 1 peptide

499

and non-reviewed proteins; ‡: with 2 peptides; *: with 2 unique peptides, without

500

keratins and reviewed proteins.

501 502

3.4. The tear proteome and the other ocular fluids

22

ACCEPTED MANUSCRIPT The tear proteome generated in our study was then compared to the proteome of

504

vitreous and aqueous humors (VH and AH respectively), investigated among others

505

by Murphy et al. (65, 66). So far, VH and AH were not very much studied in term of

506

proteomics studies. VH is a transparent and gelatinous substance situated between

507

the lens and the retina (67). Its global proteome, meaning in healthy subjects, was

508

described in only two main studies (65, 68). This fluid should contain some

509

information about the physiological condition of the retina. AH is situated at the

510

anterior and posterior chambers of the eye and plays a crucial role in cleaning the

511

lens and the cornea, but also permitting the distribution of nutriments and drugs to

512

several ocular structures (69). Interestingly, 291 (21.8%) and 197 (14.7%) tear

513

proteins were also found in VH and AH respectively (% of proteins we found in tears;

514

with two peptides, only reviewed and without keratins; S5 and S6 datasets). Proteins

515

found in common were represented by clusterin, recently demonstrated in VH as a

516

potential biomarker of the AMD (70) and suspected to be a marker of Alzheimer

517

disease in plasma (71), but also proteins involved in the glycolysis (among others the

518

lactate dehydrogenases A and B chains and the enolase 1) and complement and

519

coagulation pathways (6 proteins of the serpin family and 16 complement factors).

520

Several cathepsins (B, D, L1 and Z) were also commonly found in both tears, VH and

521

AH. Cathepsin D, identified in the three different ocular fluids, was also recently

522

proposed as biomarker candidate in AH for neovascular AMD (72). Finding in tears

523

some proteins also present in VH and AH and related to certain diseases is very

524

promising. Indeed collecting VH and AH requires surgery, which is an invasive

525

process. Using tears to find the same information could be a real advantage for both

526

clinicians and patients. Moreover, it could allow easier and better identification of

527

proteins of interest in certain pathologies.

AC C

EP

TE D

M AN U

SC

RI PT

503

23

ACCEPTED MANUSCRIPT The comparisons between tears, AH and VH have some limitations. Indeed, two

529

studies of Murphy (65, 66) were done with AH and VH of patients with cataract,

530

because the need to a surgery does not permit to collect complete healthy persons.

531

We do not know how the cataract status could affect the results obtained in these

532

studies, and so the comparisons that we made. However, we are aware that these

533

remarks are somewhat speculative and that more experiments will be needed to

534

support our hypothesis.

RI PT

528

SC

535

5. Conclusion

537

To conclude, the merge of all tear protein lists has enabled the identification of 1620

538

proteins (with stringent conditions), suggesting that there is probably room for

539

improvement in tear protein identification. Indeed, the fact that the combination of

540

different studies greatly enhances our knowledge about this fluid clearly

541

demonstrates that additional experiments will be required for the establishment of a

542

reliable proteome for the tears. Through this study, we first complemented the

543

knowledge on tear proteome but we also proposed a global overview of their great

544

potential for clinical research and biomarker discovery.

TE D

EP

AC C

545

M AN U

536

546

Acknowledgments

547

Authors thank Patrizia Arboit, Carla Pasquarello, Alexandre Hainard (Proteomics

548

Core Facility of the University of Geneva), Catherine Fouda, Christophe Tati and

549

Hortense Delacretaz for their technical assistance.

550

24

ACCEPTED MANUSCRIPT 551

Figure captions

552

Graphical Abstract. Summary of the experimental workflow followed in this

554

study. GPF: gas-phase fractionation mode; LTQ: linear trap quadrupole; IPG:

555

immobilized pH gradient. DAVID: Database for Annotation, Visualization and

556

Integrated Discovery; STRING: Search Tool for the Retrieval of Interacting

557

Genes/Proteins; KEGG: Kyoto Encyclopedia of Genes and Genomes.

SC

RI PT

553

M AN U

558

Fig 1. Pie charts of molecular function (a), cellular component (b) and

560

biological process (c) obtained with the 1337 proteins using DAVID software.

561

Some subcategories were grouped to reduce the number of GO categories

562

TE D

559

Fig 2. Classification of the 518 enzymes identified in our study. Data were

564

obtained using the “enzymes classes” available on UniProt. Details of the content of

565

each category could be found in S2 dataset.

AC C

566

EP

563

567

Fig 3. Glycolysis/Gluconeogenesis pathway obtained using KEGG database

568

(30). Number of genes that are involved in is higher than the number of bold boxes

569

on the figures due to the isoforms of the proteins. It means that one bold box can

570

encompass several proteins. 5.4.2.2: phosphoglucomutase 1, 2; 5.3.1.9: glucose-6-

571

phosphate isomerase; 5.1.3.3: galactose mutarotase; 3.1.3.11: fructose-

25

ACCEPTED MANUSCRIPT bisphosphatase 1; 2.7.1.11: phosphofructokinase muscle, platelet, liver types;

573

4.1.2.13: aldolase fructose-bisphosphate C, A; 5.3.1.1: triosephosphate isomerase 1;

574

1.2.1.12: glyceraldehyde-3-phosphate dehydrogenase; 2.7.2.3: phosphoglycerate

575

kinase 1; 5.4.2.11: phosphoglycerate mutase 1; enolase 1; 2.7.1.40: pyruvate kinase

576

(muscle); 1.1.1.27: lactate dehydrogenase A, B; 6.2.1.1: acyl-CoA synthetase short-

577

chain family member 2; 1.2.1.3: aldehyde dehydrogenase 9 family member A1, 7

578

family member A1; 1.2.1.5: aldehyde dehydrogenase 1 family member A3, 3 family

579

member A1; 1.1.1.1: Alcohol dehydrogenase 1B, 1C, 5, 7; 1.1.1.2: aldo-keto

580

reductase family 1 member A1.

M AN U

SC

RI PT

572

581

Fig 4. Coagulation and complement cascades obtained using KEGG database

583

(30). Number of genes that are involved in is higher than the number of bold boxes

584

on the figures due to the isoforms of the proteins. It means that one bold box can

585

encompass several proteins. AT3: serpin family C member 1; A1AT: serpin family A

586

member 1; A2M: alpha-2-macroglobulin; CD59: CD59 molecule; CPB2:

587

carboxypeptidase B2; CR3: fibrinogen gamma chain; CR4: fibrinogen gamma chain;

588

C1INH: serpin family G member; C1qrs: complement C1q B chain, C1q C chain, C1r,

589

C1s; C2: complement factor 2; C3: complement C3; C4: complement C4A; C5:

590

complement C5; C6, 7, 8, 9: complement C6, C7, C8 alpha chain, C8 beta chain, C9;

591

DAF: CD55 molecule; FB: complement factor B; FD: complement factor D; FH:

592

complement factor H; FI: complement factor I; Fibrinogen: fibrinogen gamma chain,

593

beta chain, alpha chain; F2: coagulation factor 2 (thrombin); F5: coagulation factor 5;

594

F12: coagulation factor 12; F13: coagulation factor 13 B chain; HCII: serpin family D

595

member 1; kallikrein: kallikrein B1; kininogen: kininogen 1; PCI: serpin family A

596

member 5; PLG: plasminogen; α2AP: serpin family F member 2.

AC C

EP

TE D

582

26

ACCEPTED MANUSCRIPT 597

Fig 5. Comparison between the studies of De Souza et al.(17), Zhou et al.(18),

599

Aass et al.(16) and us. The following criteria were applied to the four protein lists:

600

only reviewed proteins, 2 unique peptides and exclusion of keratins. For each

601

category, percentage represents the ratio between the proteins contain in this

602

category and the total number of proteins. Major information is the proteins in

603

common to the four studies (197; 11.2%) and our specific proteins (425; 24.1%).

SC

RI PT

598

604

References

606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632

1. Csosz E, Boross P, Csutak A, Berta A, Toth F, Poliska S, et al. Quantitative analysis of proteins in the tear fluid of patients with diabetic retinopathy. J Proteomics. 2012;75(7):2196-204. 2. Kim HJ, Kim PK, Yoo HS, Kim CW. Comparison of tear proteins between healthy and early diabetic retinopathy patients. Clin Biochem. 2012;45(1-2):60-7. 3. Park KS, Kim SS, Kim JC, Kim HC, Im YS, Ahn CW, et al. Serum and tear levels of nerve growth factor in diabetic retinopathy patients. Am J Ophthalmol. 2008;145(3):432-7. 4. Pieragostino D, D'Alessandro M, di Ioia M, Di Ilio C, Sacchetta P, Del Boccio P. Unraveling the molecular repertoire of tears as a source of biomarkers: beyond ocular diseases. Proteomics Clin Appl. 2015;9(1-2):169-86. 5. Ujhelyi B, Gogolak P, Erdei A, Nagy V, Balazs E, Rajnavolgyi E, et al. Graves' orbitopathy results in profound changes in tear composition: a study of plasminogen activator inhibitor-1 and seven cytokines. Thyroid. 2012;22(4):407-14. 6. Perumal N, Funke S, Pfeiffer N, Grus FH. Proteomics analysis of human tears from aqueousdeficient and evaporative dry eye patients. Sci Rep. 2016;6:29629. 7. Zhou L, Beuerman RW, Chan CM, Zhao SZ, Li XR, Yang H, et al. Identification of tear fluid biomarkers in dry eye syndrome using iTRAQ quantitative proteomics. J Proteome Res. 2009;8(11):4889-905. 8. Aluru SV, Agarwal S, Srinivasan B, Iyer GK, Rajappa SM, Tatu U, et al. Lacrimal proline rich 4 (LPRR4) protein in the tear fluid is a potential biomarker of dry eye syndrome. PLoS One. 2012;7(12):e51979. 9. Postnikoff CK, Huisingh C, McGwin G, Nichols KK. Leukocyte Distribution in the Open Eye Tears of Normal and Dry Eye Subjects. Curr Eye Res. 2018. 10. Aluru SV, Shweta A, Bhaskar S, Geetha K, Sivakumar RM, Utpal T, et al. Tear Fluid Protein Changes in Dry Eye Syndrome Associated with Rheumatoid Arthritis: A Proteomic Approach. Ocul Surf. 2017;15(1):112-29. 11. Jung JH, Ji YW, Hwang HS, Oh JW, Kim HC, Lee HK, et al. Proteomic analysis of human lacrimal and tear fluid in dry eye disease. Sci Rep. 2017;7(1):13363.

AC C

EP

TE D

M AN U

605

27

ACCEPTED MANUSCRIPT

EP

TE D

M AN U

SC

RI PT

12. Del Boccio P, Rossi C, di Ioia M, Cicalini I, Sacchetta P, Pieragostino D. Integration of metabolomics and proteomics in multiple sclerosis: From biomarkers discovery to personalized medicine. Proteomics Clin Appl. 2016;10(4):470-84. 13. Salvisberg C, Tajouri N, Hainard A, Burkhard PR, Lalive PH, Turck N. Exploring the human tear fluid: discovery of new biomarkers in multiple sclerosis. Proteomics Clin Appl. 2014;8(3-4):185-94. 14. Lebrecht A, Boehm D, Schmidt M, Koelbl H, Schwirz RL, Grus FH. Diagnosis of breast cancer by tear proteomic pattern. Cancer Genomics Proteomics. 2009;6(3):177-82. 15. Terekhina NA, Petrovich Iu A. [Diagnostic value of tear analysis in uremia, uricemia and cholesterolemia]. Klin Lab Diagn. 1994(6):17-8. 16. Aass C, Norheim I, Eriksen EF, Thorsby PM, Pepaj M. Single unit filter-aided method for fast proteomic analysis of tear fluid. Anal Biochem. 2015;480:1-5. 17. de Souza GA, Godoy LM, Mann M. Identification of 491 proteins in the tear fluid proteome reveals a large number of proteases and protease inhibitors. Genome Biol. 2006;7(8):R72. 18. Zhou L, Zhao SZ, Koh SK, Chen L, Vaz C, Tanavde V, et al. In-depth analysis of the human tear proteome. J Proteomics. 2012;75(13):3877-85. 19. Gerber-Hollbach N, Plattner K, O'Leary OE, Jenoe P, Moes S, Drexler B, et al. Tear Film Proteomics Reveal Important Differences Between Patients With and Without Ocular GvHD After Allogeneic Hematopoietic Cell Transplantation. Invest Ophthalmol Vis Sci. 2018;59(8):3521-30. 20. Kallo G, Emri M, Varga Z, Ujhelyi B, Tozser J, Csutak A, et al. Changes in the Chemical Barrier Composition of Tears in Alzheimer's Disease Reveal Potential Tear Diagnostic Biomarkers. PLoS One. 2016;11(6):e0158000. 21. Kishazi E, Dor M, Eperon S, Oberic A, Hamedani M, Turck N. Thyroid-associated orbitopathy and tears: A proteomics study. J Proteomics. 2018;170:110-6. 22. Tang Q, Zhang C, Wu X, Duan W, Weng W, Feng J, et al. Comprehensive Proteomic Profiling of Patients' Tears Identifies Potential Biomarkers for the Traumatic Vegetative State. Neurosci Bull. 2018;34(4):626-38. 23. Posa A, Brauer L, Schicht M, Garreis F, Beileke S, Paulsen F. Schirmer strip vs. capillary tube method: non-invasive methods of obtaining proteins from tear fluid. Ann Anat. 2013;195(2):137-42. 24. Remington SG, Crow JM, Nelson JD. Antibodies to endogenous tear protein in normal human tears. Curr Eye Res. 2009;34(10):819-23. 25. Moreda-Pineiro A, Garcia-Otero N, Bermejo-Barrera P. A review on preparative and semipreparative offgel electrophoresis for multidimensional protein/peptide assessment. Anal Chim Acta. 2014;836:1-17. 26. Dayon L, Turck N, Kienle S, Schulz-Knappe P, Hochstrasser DF, Scherl A, et al. Isobaric taggingbased selection and quantitation of cerebrospinal fluid tryptic peptides with reporter calibration curves. Anal Chem. 2010;82(3):848-58. 27. Scherl A, Shaffer SA, Taylor GK, Kulasekara HD, Miller SI, Goodlett DR. Genome-specific gasphase fractionation strategy for improved shotgun proteomic profiling of proteotypic peptides. Anal Chem. 2008;80(4):1182-91. 28. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44-57. 29. Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue):D447-52. 30. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45(D1):D353-D61. 31. Walmsley SJ, Rudnick PA, Liang Y, Dong Q, Stein SE, Nesvizhskii AI. Comprehensive analysis of protein digestion using six trypsins reveals the origin of trypsin as a significant source of variability in proteomics. J Proteome Res. 2013;12(12):5666-80. 32. Tabb DL, Vega-Montoto L, Rudnick PA, Variyath AM, Ham AJ, Bunk DM, et al. Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry. J Proteome Res. 2010;9(2):761-76.

AC C

633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684

28

ACCEPTED MANUSCRIPT

EP

TE D

M AN U

SC

RI PT

33. Flanagan JL, Willcox MD. Role of lactoferrin in the tear film. Biochimie. 2009;91(1):35-43. 34. Fluckinger M, Haas H, Merschak P, Glasgow BJ, Redl B. Human tear lipocalin exhibits antimicrobial activity by scavenging microbial siderophores. Antimicrob Agents Chemother. 2004;48(9):3367-72. 35. Offiah I, Calder VL. Immune mechanisms in allergic eye diseases: what is new? Curr Opin Allergy Clin Immunol. 2009;9(5):477-81. 36. Tezel G. A proteomics view of the molecular mechanisms and biomarkers of glaucomatous neurodegeneration. Prog Retin Eye Res. 2013;35:18-43. 37. Farshchian M, Kivisaari A, Ala-Aho R, Riihila P, Kallajoki M, Grenman R, et al. Serpin peptidase inhibitor clade A member 1 (SerpinA1) is a novel biomarker for progression of cutaneous squamous cell carcinoma. Am J Pathol. 2011;179(3):1110-9. 38. Kwon CH, Park HJ, Lee JR, Kim HK, Jeon TY, Jo HJ, et al. Serpin peptidase inhibitor clade A member 1 is a biomarker of poor prognosis in gastric cancer. Br J Cancer. 2014;111(10):1993-2002. 39. Zheng D, Chen H, Davids J, Bryant M, Lucas A. Serpins for diagnosis and therapy in cancer. Cardiovasc Hematol Disord Drug Targets. 2013;13(2):123-32. 40. Yokosako K, Mimura T, Funatsu H, Noma H, Goto M, Kamei Y, et al. Glycolysis in patients with age-related macular degeneration. Open Ophthalmol J. 2014;8:39-47. 41. Jha P, Bora PS, Bora NS. The role of complement system in ocular diseases including uveitis and macular degeneration. Mol Immunol. 2007;44(16):3901-8. 42. Karamichos D, Zieske JD, Sejersen H, Sarker-Nag A, Asara JM, Hjortdal J. Tear metabolite changes in keratoconus. Exp Eye Res. 2015;132:1-8. 43. Cocuzzi E, Szczotka LB, Brodbeck WG, Bardenstein DS, Wei T, Medof ME. Tears contain the complement regulator CD59 as well as decay-accelerating factor (DAF). Clin Exp Immunol. 2001;123(2):188-95. 44. Willcox MD, Morris CA, Thakur A, Sack RA, Wickson J, Boey W. Complement and complement regulatory proteins in human tears. Invest Ophthalmol Vis Sci. 1997;38(1):1-8. 45. Stenmark H, Olkkonen VM. The Rab GTPase family. Genome Biol. 2001;2(5):REVIEWS3007. 46. Hutagalung AH, Novick PJ. Role of Rab GTPases in membrane traffic and cell physiology. Physiol Rev. 2011;91(1):119-49. 47. Wada T, Penninger JM. Mitogen-activated protein kinases in apoptosis regulation. Oncogene. 2004;23(16):2838-49. 48. Farias E, Yasunaga KL, Peixoto RVR, Fonseca MP, Fontes W, Galera PD. Comparison of two methods of tear sampling for protein quantification by Bradford method. Pesquisa Veterinária Brasileira. 2013;33:261-4. 49. Green-Church KB, Nichols KK, Kleinholz NM, Zhang L, Nichols JJ. Investigation of the human tear film proteome using multiple proteomic approaches. Mol Vis. 2008;14:456-70. 50. Stuchell RN, Feldman JJ, Farris RL, Mandel ID. The effect of collection technique on tear composition. Invest Ophthalmol Vis Sci. 1984;25(3):374-7. 51. Horth P, Miller CA, Preckel T, Wenz C. Efficient fractionation and improved protein identification by peptide OFFGEL electrophoresis. Mol Cell Proteomics. 2006;5(10):1968-74. 52. Chenau J, Michelland S, Sidibe J, Seve M. Peptides OFFGEL electrophoresis: a suitable preanalytical step for complex eukaryotic samples fractionation compatible with quantitative iTRAQ labeling. Proteome Sci. 2008;6:9. 53. Mostovenko E, Hassan C, Rattke J, Deelder AM, van Veelen PA, Palmblad M. Comparison of peptide and protein fractionation methods in proteomics. EuPA Open Proteomics. 2013;1:30-7. 54. Slebos RJ, Brock JW, Winters NF, Stuart SR, Martinez MA, Li M, et al. Evaluation of strong cation exchange versus isoelectric focusing of peptides for multidimensional liquid chromatographytandem mass spectrometry. J Proteome Res. 2008;7(12):5286-94. 55. Waller LN, Shores K, Knapp DR. Shotgun proteomic analysis of cerebrospinal fluid using offgel electrophoresis as the first-dimension separation. J Proteome Res. 2008;7(10):4577-84.

AC C

685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734

29

ACCEPTED MANUSCRIPT

772 773

EP

TE D

M AN U

SC

RI PT

56. Manadas B, English JA, Wynne KJ, Cotter DR, Dunn MJ. Comparative analysis of OFFGel, strong cation exchange with pH gradient, and RP at high pH for first-dimensional separation of peptides from a membrane-enriched protein fraction. Proteomics. 2009;9(22):5194-8. 57. Antberg L, Cifani P, Sandin M, Levander F, James P. Critical comparison of multidimensional separation methods for increasing protein expression coverage. J Proteome Res. 2012;11(5):2644-52. 58. Tammen H, Hess R. Collection and handling of blood specimens for peptidomics. Methods Mol Biol. 2013;1023:161-8. 59. Sapan CV, Lundblad RL. Review of methods for determination of total protein and peptide concentration in biological samples. Proteomics Clin Appl. 2015;9(3-4):268-76. 60. Saveliev S, Bratz M, Zubarev R, Szapacs M, Budamgunta H, Urh M. Trypsin/Lys-C protease mix for enhanced protein mass spectrometry analysis. Nat Meth. 2013;10(11). 61. Oberg AL, Vitek O. Statistical design of quantitative mass spectrometry-based proteomic experiments. J Proteome Res. 2009;8(5):2144-56. 62. Gibson EJ, Stapleton F, Wolffsohn JS, Golebiowski B. Local synthesis of sex hormones: are there consequences for the ocular surface and dry eye? Br J Ophthalmol. 2017;101(12):1596-603. 63. Micera A, Di Zazzo A, Esposito G, Longo R, Foulsham W, Sacco R, et al. Age-Related Changes to Human Tear Composition. Invest Ophthalmol Vis Sci. 2018;59(5):2024-31. 64. Ozdemir M, Temizdemir H. Age- and gender-related tear function changes in normal population. Eye (Lond). 2010;24(1):79-83. 65. Murthy KR, Goel R, Subbannayya Y, Jacob HK, Murthy PR, Manda SS, et al. Proteomic analysis of human vitreous humor. Clin Proteomics. 2014;11(1):29. 66. Murthy KR, Rajagopalan P, Pinto SM, Advani J, Murthy PR, Goel R, et al. Proteomics of human aqueous humor. OMICS. 2015;19(5):283-93. 67. Bishop PN. Structural macromolecules and supramolecular organisation of the vitreous gel. Prog Retin Eye Res. 2000;19(3):323-44. 68. Aretz S, Krohne TU, Kammerer K, Warnken U, Hotz-Wagenblatt A, Bergmann M, et al. Indepth mass spectrometric mapping of the human vitreous proteome. Proteome Sci. 2013;11(1):22. 69. Goel M, Picciani RG, Lee RK, Bhattacharya SK. Aqueous humor dynamics: a review. Open Ophthalmol J. 2010;4:52-9. 70. Nobl M, Reich M, Dacheva I, Siwy J, Mullen W, Schanstra JP, et al. Proteomics of vitreous in neovascular age-related macular degeneration. Exp Eye Res. 2016;146:107-17. 71. Jongbloed W, van Dijk KD, Mulder SD, van de Berg WD, Blankenstein MA, van der Flier W, et al. Clusterin Levels in Plasma Predict Cognitive Decline and Progression to Alzheimer's Disease. J Alzheimers Dis. 2015;46(4):1103-10. 72. Kang GY, Bang JY, Choi AJ, Yoon J, Lee WC, Choi S, et al. Exosomal proteins in the aqueous humor as novel biomarkers in patients with neovascular age-related macular degeneration. J Proteome Res. 2014;13(2):581-95.

AC C

735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771

774

Supporting information

775

S1 dataset: Proteins identified in experiment 3 (2 peptides, no keratins)

776

S2 dataset: details of the six enzyme categories

777

S3 dataset: Common proteins between the four studies

778

S4 dataset: Proteins specifically identified in this study 30

ACCEPTED MANUSCRIPT S5 dataset: Proteins in common between tears and vitreous humor

780

S6 dataset: Proteins in common between tears and aqueous humor

AC C

EP

TE D

M AN U

SC

RI PT

779

31

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

ACCEPTED MANUSCRIPT Highlights Tear fluid contains a high variety of proteins



A lot of enzymes, more precisely hydrolases and transferases, were identified



Pathways involved in different diseases can be greatly mapped in tears



Combination of different studies is needed to establish a reliable tear

RI PT



AC C

EP

TE D

M AN U

SC

proteome