Accepted Manuscript Transcriptome profiling of red swamp crayfish (Procambarus clarkii) hepatopancreas in response to lipolysaccharide (LPS) infection Miao Zhou, Muhammad Nadeem Abbas, Saima Kausar, Cheng-Xi Jiang, Li-Shang Dai PII:
S1050-4648(17)30637-X
DOI:
10.1016/j.fsi.2017.10.030
Reference:
YFSIM 4901
To appear in:
Fish and Shellfish Immunology
Received Date: 29 July 2017 Revised Date:
5 October 2017
Accepted Date: 16 October 2017
Please cite this article as: Zhou M, Abbas MN, Kausar S, Jiang C-X, Dai L-S, Transcriptome profiling of red swamp crayfish (Procambarus clarkii) hepatopancreas in response to lipolysaccharide (LPS) infection, Fish and Shellfish Immunology (2017), doi: 10.1016/j.fsi.2017.10.030. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
Transcriptome profiling of red swamp crayfish (Procambarus clarkii)
2
hepatopancreas in response to lipolysaccharide (LPS) infection
3
Miao Zhou a,1 , Muhammad Nadeem Abbas b,1 , Saima Kausar b,1, Cheng-Xi Jiang c,*,
4
Li-Shang Dai a,* a
5
325035, P.R. China
Department of Zoology and Fisheries, University of Agriculture, Faisalabad 38000,
SC
b
7 8
School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou
Pakistan c
9
M AN U
6
RI PT
1
Life Sciences Institute, Wenzhou University, Wenzhou 325035, P.R. China
10
TE D
11
12
13
* Corresponding author: Cheng-xi Jiang and Li-Shang Dai
AC C
15
EP
14
Tel/fax: +86 577 86699572
16
E-mail:
[email protected] (Cheng-Xi Jiang),
[email protected]
17 18
(Li-Shang Dai)
Address: School of Pharmaceutical Sciences, Wenzhou Medical University,
19
Wenzhou 325035, P.R. China.
20 21
22
1
These authors contributed equally to this work.
ACCEPTED MANUSCRIPT Abstract
23
The RNA-sequencing followed by de novo assembly generated 61,912 unigene
25
sequences of P. clarkii hepatopancreas. Comparison of gene expression between LPS
26
challenged and PBS control samples revealed 2,552 differentially expressed genes
27
(DEGs). Of these sequences, 1,162 DEGs were differentially up-regulated and 1,360
28
DEGs differentially down-regulated. The DEGs were then annotated against gene
29
ontology (GO) database and Kyoto Encyclopedia of Genes and Genomes (KEGG)
30
database. Some immune-related pathways such as PPAR signaling pathway, lysosome,
31
Chemical carcinogenesis, Peroxisome were predicted by canonical pathways analysis.
32
The reliability of transcriptome data was validated by quantitative real time
33
polymerase chain reaction (qRT-PCR) for the selected genes. The data presented here
34
shed light into antibacterial immune responses of crayfish. In addition, these results
35
suggest that transcriptomic data provides valuable sequence resource for
36
immune-related gene identification and helps to understand P. clarkii immune
37
functions.
38
Keywords: Crayfish; LPS; Immunity; Differential expressed genes; Arthropods
40 41 42 43 44
SC
M AN U
TE D
EP
AC C
39
RI PT
24
ACCEPTED MANUSCRIPT 45
1. Introduction The red-swamp crayfish (Procambarus clarkii) is a rich source of high quality
47
proteins, which constitutes all the essential amino acids required for human nutrition.
48
P. clarkii is a fresh water species, native to Mexico and United States [1]. It is
49
extremely tolerant to adverse environmental conditions such as temperature
50
fluctuations, poor water quality, and low oxygen concentrations. Therefore, the
51
species is considered highly suitable for culture and extraordinary farm production
52
[2,3]. Thus far, this species has been introduced and successfully cultured in many
53
countries world-wide [4]. In China, it was introduced from Japan in 1929 [5], since
54
then it has become one of the most economically important species in China.
M AN U
SC
RI PT
46
Vertebrates and invertebrates through the process of evolution have developed an
56
innate immune system, and innate immune responses are considered first line of
57
defense against variety of pathogens [6-8]. The innate immune system is stimulated
58
by microbial infections, thereby transcribing variety of immune related genes to
59
perform their specific roles in host defense [9]. Hence, to understand the immune
60
mechanisms in an organism, it is crucially important to explore the genes that are
61
probably implicated in the immune functions.
EP
AC C
62
TE D
55
In spite of crayfish economic importance and being cultured worldwide, our
63
knowledge on its innate immune system is limited. In crayfish hepatopancreas is a
64
vital organ for nutrient storage and it is also involved in a variety of immune
65
responses and detoxification processes [10]. Hepatopancreas in crustaceans
66
transcribes various immune related genes and proteins, which play crucial role to
ACCEPTED MANUSCRIPT maintain physiological processes. For instance, hemocyanin that is converted to
68
antimicrobial proteins, antiviral agent, hemolysin, agglutinin and phenoxidase like
69
enzyme, and is considered to be involved in immune responses of crustaceans,
70
mollusks and arthropods [11-13]. In addition, many immune related genes such as
71
cathepsin B, cathepsin L, interferongamma inducible lysosomal thiol reductases
72
(GILT) are also produced in hepatopancreas [14,15]. Besides this, lectins are
73
important immune responsive genes, and known to have tissue specific expression
74
and some lectins (hsL and FLec2) are only expressed in hepatopancreas [16-18].
75
Although, these fragmentary studies are available on the immune responses of
76
hepatopancreas in crustacean, however, the anti-bacterial immune mechanism of
77
crayfish hepatopancreas has not been explored previously. Therefore study on its
78
immune responses and exploration of immune-related genes will contribute to
79
understand anti-bacterial immune mechanism in the crayfish.
TE D
M AN U
SC
RI PT
67
The transcriptome is an entire set of RNA transcripts produced by the genome at
81
any one time, and the knowledge of the transcriptome is critically important for
82
interpreting the functional factors of the genome [5]. In the present study, we prepared
83
and sequenced two libraries of P. clarkii hepatopancreas after LPS stimulation and
84
PBS administration using Illumina sequencing platform. First, the obtained reads were
85
assembled to unigene sequences by RNA-Seq de novo programs. Subsequently, the
86
transcriptome data was subjected to GO, KOG, and KEGG databases to investigate
87
the involvement of unigene sequences in different canonical pathways. Furthermore,
88
enrichment analysis was executed following the identification of differentially
AC C
EP
80
ACCEPTED MANUSCRIPT expressed genes (DEGs) from two P. clarkii libraries. The main purpose of this study
90
was to identify and align immune-related genes in the hepatopancreas. The
91
transcriptomic data will also provide sequence resources for future studies on
92
crustaceans and may help to understand antibacterial immune mechanisms of P.
93
clarkii.
94
2. Materials and methods
95
2.1. Experimental animal and LPS treatment
SC
RI PT
89
P. clarkii individuals were obtained from local market; the specimens were
97
ranged 7-8 cm in size and approximately an average weight of 10 g. They were kept
98
in spacious tanks, supplied with high-quality freshwater, and fed at 23±2 °C with
99
natural (small invertebrates) and artificial diet. The specimens were divided into
100
experimental group and control group, each of which contained 10 individuals. The
101
experimental group was administrated with LPS (1 µg/g of body weight) in abdominal
102
part and PBS was used as control group. Following 12 h of treatment hepatopancreas
103
samples were collected from both the experimental and control group and
104
immediately frozen in liquid nitrogen and then stored at −80 °C till the extraction of
105
RNA.
106
2.2. RNA extraction, library preparation and sequencing
AC C
EP
TE D
M AN U
96
107
The collected hepatopancreas samples were used to isolate RNA by the Trizol
108
Reagent (Invitrogen, USA) according to manufacturer’s instructions (Supplementary
109
Fig. 1). The integrity and quality of extracted RNA was analyzed by agarose gel
110
electrophoresis
(1%),
and
NanoDrop
1000
spectrophotometer
(NanoDrop
ACCEPTED MANUSCRIPT Technologies, Wilmington, DE, USA), respectively. After DNase I treatment, the
112
messenger RNA (mRNA) was purified from 20 µg mixed total RNA using oligo (dT)
113
magnetic beads. Following purification, the mRNA was fragmented into short
114
fragments by mixing in fragmentation buffer in a thermomixer at a suitable
115
temperature. First-strand cDNA was then synthesized using the mRNA fragments as
116
templates, followed by second-strand cDNA synthesis using DNA polymerase I and
117
RNase H. After the end repair process and connection with adapters, suitable
118
fragments were used as templates for PCR amplification to create the final cDNA
119
library. cDNA libraries were constructed using SMART cDNA library construction kit
120
(Clontech, CA, USA) in accordance with the manufacturer’s protocol. Following the
121
examination of quality and quantity, the libraries for transcriptome sequencing was
122
prepared
123
Solexa/Illumina sequencing was carried out by Novogene, Beijing, China.
124
2.3. Quality control and assembly of reads
Illumina’s
kit
following
manufacturer’s
recommendations.
TE D
using
M AN U
SC
RI PT
111
The assembly of LPS and PBS administrated hepatopancreas samples were
126
executed using RNA-Seq de novo programs Trinity and SOAP de novo assembler [19]
127
with default parameters. To obtain clean reads, short reads (Read length >10bp) and
128
low quality reads (Quality score >20) were discarded. The clean reads were processed
129
to form contigs, and then these contigs were clustered to obtain unigene sequences,
130
which were finally connected to obtain transcript [20].
131
2.4. Annotation and gene ontology of transcriptome
132
AC C
EP
125
After transcriptome assembly, the obtained unigene and transcript sequences
ACCEPTED MANUSCRIPT were annotated on the basis of sequence similarity with known genes. Briefly, the
134
contigs were aligned against available sequence in the National Center of
135
Biotechnology and Information (NCBI) database [21]. The unigene sequences were
136
aligned with NCBI protein data bases such as Nr, KOG, KEGG and Swiss-Prot using
137
BLASTx [22]. While the remaining unigene sequences (none of the BLASTx hits)
138
were annotated with Nt database of NCBI. To determine sequence homology,
139
alignments were executed with previously known genes (cutoff E-value≤ 10−5). The
140
alignments results were used to identify sequence direction and prediction of protein
141
coding region (CDS). Functional annotation was performed with gene function
142
classification
143
(http://www.blast2go.com/b2ghome) [23]. The pathway annotation was executed to
144
determine the biological behavior of the genes using the KEGG database.
145
2.5. Identification of differentially expressed genes
M AN U
SC
RI PT
133
(www.geneontology.org)
using
Blast2GO
software
TE D
(GO)
To identify differentially expressed genes from hepatopancreas samples of P.
147
clarkii following LPS treatment (PBS as control). cDNA libraries were constructed,
148
sequenced and transcriptome was assembled according to sections 2.2 and 2.3. The
149
differentially expressed genes (DEGs) were identified based on the number of
150
fragments per kilo base of exon per million fragments mapped (FPKM) of the genes,
151
and False Discovery Rate (FDR) was used to find the P-value threshold [24]. The
152
DEGs were characterized as those having FDR less than 0.05 and a fold change
153
between treated and control groups of greater than 2 (log2 fold change P1). Further,
154
the DEGs were mapped to the GO database (http://www.geneontology.org/) and then
AC C
EP
146
ACCEPTED MANUSCRIPT their functions were determined using WEGO and Blast2GO programs [25]. The
156
KEGG public database was used to predict enriched metabolic and signal transduction
157
pathways [26]. The pathway categories classification was determined using Fisher’s
158
exact test, and the FDR was used to correct the P values. Pathways with a P-value less
159
than 0.05 were considered as significantly enriched [27].
160
2.6. Quantitative RT-PCR assay
RI PT
155
Quantitative real time PCR (qRT-PCR) was performed to analyze mRNA levels
162
of immune related genes. Hepatopancreas samples were collected from crayfish (7-8
163
cm in size, and average weight of 10 g) purchased from local market, further the
164
number of specimens in each experiment and time of hepatopancreas collection was
165
same as mentioned in the section 2.1. Total RNA was isolated from LPS treated and
166
control group (PBS), then reverse transcribed into cDNA. The gene specific primers
167
were designed based on the identified transcript sequences by primer premier 5.0
168
software (Table 1). The primers were used to analyze the expression of immune
169
related genes, and 18S RNA was used as internal control. The qRT-PCR was
170
performed in 25 µL reactions including 12.5 µL of 2× SYBR Premix Ex TaqII
171
(Takara), 1 µL each of the forward and reverse primers, 2 µL of template (cDNA), and
172
8.5 µL of RNase-free H2O. The amplification program was as follows: 95°C for 30 s;
173
39 cycles of 95°C for 15 s, 58°C for 30 s, and 72°C for 30 s. At the end of the reaction,
174
a melting curve was produced by monitoring fluorescence continuously, while slowly
175
heating the sample from 65°C to 95°C. The relative expression levels of the selected
176
genes were calculated according to the 2−△△Ct method [28]. All qRT-PCR experiments
AC C
EP
TE D
M AN U
SC
161
ACCEPTED MANUSCRIPT 177
were repeated five times.
178
3. Results and Discussion
179
3.1. Sequencing and assembly of trancriptome To determine gene expression profile of P. clarkii, total RNA was extracted and
181
cDNA libraries were constructed from hepatopancreas following LPS treatment and
182
PBS was used as control. The Illumina sequencing of two libraries generated
183
52,414,336 and 47,592,460 raw reads for treated and control group, respectively.
184
After discarding ambiguous nucleotides, low quality reads and short reads, treated
185
group contained 50,310,480 clean reads, representing 7.55G nucleotides. The Q20
186
percentage and Q30 percentage (percentage of nucleotide whose quality was more
187
than 20 and 30 clean reads) were 96.21% and 90.78%, respectively and GC contents
188
was47.53%. The control group comprised 45,706,720 clean reads representing 6.86G
189
nucleotides. The Q20 percentage, Q30 percentage and GC percentage were 96.63%
190
and 91.77%, and 43.99%, respectively (Table 2).
TE D
M AN U
SC
RI PT
180
The ‘de novo’ transcriptome assembly strategy is most commonly used to
192
assemble RNA-seq data. It does not depend on reference genome and can provide an
193
initial set of transcripts, allowing for RNA-seq expression studies [29]. Currently,
194
Trinity, SOAP de novo, trans-ABySS, ABySS, and Oases assemblers have been
195
documented to be used for the de novo assemblage of short read RNA-seq data into
196
transcripts [30]. In the present study, we used Trinity and SOAP de novo assemblers
197
to assemble high quality clean reads of two libraries [19]. Trinity is an important de
198
novo assembly program that has been reported to be sensitive and efficient in
AC C
EP
191
ACCEPTED MANUSCRIPT recovering full-length transcripts/isoforms in various organisms. It constructs de
200
Bruijn graph from huge amounts of short-read sequence data, then use an enumeration
201
algorithm to score all possible paths and branches, and retain only those plausible
202
ones as transcripts and isoforms. Additionally, it is specially programmed to recover
203
paths supported by actual reads and eradicate ambiguous edges, thus ensure correct
204
transcript reconstruction [31]. Likewise, the SOAP de novo program is a novel
205
short-read assembler that can build draft assemblies de novo for large sized genomes
206
such as animal and plant genomes. It builds full-length transcripts/isoforms and
207
allowing to carry out accurate analyses of unexplored genomes [32].
208
novo programs (SOAP de novo and Trinity) are highly reliable and widely used to
209
recover transcripts and unigenes of unexplored species. In the present study, the
210
assembly of high quality clean reads generated 61,912 unigene sequences and 11,3398
211
transcripts. Both the unigene sequences and transcripts were ranged in size from <301
212
bp to > 2000 bp. The maximum number of unigene sequences were between 500
213
-1000 bp range, and transcripts were in <301 bp (Fig. 1).
214
3.2. Functional annotation of clustered unigene sequences
SC
M AN U
EP
TE D
Both of the de
AC C
215
RI PT
199
To evaluate the functions of unigene sequences, a total of 61,912 unigene
216
sequences were aligned with six public databases (NR, Swiss-Port, KO, GO, KOG
217
and PFAM). All of these sequences are annotated in at least one Database except
218
36,091 unigene sequences. The results revealed that 20,363 unigene sequences were
219
significantly best hit with the GO database, 20,264 unigene sequences were
220
significantly annotated with the PFAM database, and 19,672 unigene sequences were
ACCEPTED MANUSCRIPT significantly aligned with the NR database. While 16,369, 14,515 and 10,174 unigene
222
sequences were mapped against SwissProt database, KO database and KOG database,
223
respectively (Table 3). Of these aligned sequences, almost 32% unigene sequences
224
had more than 1e-30 E-value (Fig. 2A). The similarity distribution pattern revealed
225
that 61.8% unigene sequences shared greater than 60% homology (Fig. 2B). The
226
species specific distribution patterns revealed that 16.7% of the unigene sequences
227
most closely matched with Zootermopsis nevadensis, 6.7% of the unigene sequences
228
resembled Daphnia pulex, 3.5% unigene sequences best hits with Tribolium
229
castaneum (3.5%) and so on (Fig. 2C).
230
3.3. Functional analysis of transcriptome
M AN U
SC
RI PT
221
The putative functions of unigene sequences in LPS treated and control group
232
were analyzed using Gene Ontology (GO) and Eukaryotic Orthologous Groups of
233
proteins (KOG). The analysis of GO classification revealed that most of unigene
234
sequences mapped to biological process, followed by cellular component and
235
molecular function. The three categories were further divided into 55 functional
236
groups. In the two libraries, most of the corresponding biological process unigene
237
sequences were implicated in cellular processes, metabolic processes and single
238
organism process. Most of the cellular component unigene sequences were associated
239
with cell, cell junction, organelle, macromolecular complex and membrane. Most of
240
the molecular function unigene sequences encoded protein involved in binding,
241
followed by catalytic activity, and transporter activity (Fig. 3).
242
AC C
EP
TE D
231
Eukaryotic Orthologous Groups of proteins (KOG) database classifies gene
ACCEPTED MANUSCRIPT products into different categories, and these categories are critically important to
244
understand gene functions and their evolutionary relationships [33]. A total of 10,174
245
genes were aligned with KOG database and divided into 26 functional categories.
246
Among the 26 KOG categories, general function prediction only and signal
247
transduction mechanism were the most abundant category, followed by post
248
translational modification, protein turnover, chaperones and transcription categories
249
(Fig. 4).
SC
RI PT
243
Kyoto Encyclopedia of Genes and Genomes (KEGG) is a valuable tool for
251
canonical pathways based classification of orthologous genes, and provides important
252
information to understand biological functional profiles of gene [34]. In the present
253
study, 14,515 genes were annotated against the 229 biological pathways in the KEGG
254
database. Among these biological pathways, the top 20 statistically significant
255
pathways are shown in Table 4. Of these some important immune related biological
256
pathways were predicted in the KEGG database, including Rap1 signaling pathway,
257
Focal adhesion, MAPK signaling pathway PI3K-Akt signaling pathway, cAMP
258
signaling pathway and Ubiquitin mediated proteolysis. In addition, lysosome,
259
endocytosis, RNA transport, spliceosome, protein processing endoplasmic reticulumn,
260
and so on were predicted (Table 4).
TE D
EP
AC C
261
M AN U
250
Furthermore, the unigene sequences were classified into environmental
262
information processing, genetic information processing, metabolism, cellular process
263
and organismal system (Fig. 5). Among the 12 categories of metabolism, carbohydrate
264
metabolism (456 sequences), lipid metabolism (449 sequences) and amino acid
ACCEPTED MANUSCRIPT metabolism (328 sequences) unigene sequences were most abundant. In organismal
266
system, Endocrine system and Immune system were the most abundant clusters with
267
654 sequences and 478 sequences, respectively. Interestingly, in environmental
268
information processing, Signal transduction (1197 sequences) was the most abundant
269
cluster in all categories. While, folding, sorting and degradation (538 sequences) was
270
the most unigene sequences cluster among the genetic information processing (Fig. 5).
271
These biological processes play vital role in the physiological processes of P. clarkii
272
and will help to understand its immune mechanisms.
273
3.4. Identification and analysis of DEGs
M AN U
SC
RI PT
265
The gene expression pattern is utilized to predict the gene functions, and
275
generally, the genes exhibiting similar expression pattern mostly have same functions,
276
or involved in the same biochemical pathways. Hence, clustering of genes can be
277
useful to evaluate the function of unknown genes [35]. Here, we performed gene
278
clustering to determine the functional enrichment of genes exhibiting similar
279
expression pattern (Fig. 6). To determine the unigene sequences expression levels of P.
280
clarkii hepatopancreas following LPS treatment and PBS was used as control. Here
281
we
clean
reads
using
RSEM
282
|log2(foldchange)|>1)
and
then
screened
283
down-regulated unigene sequences in response to LPS challenge. A 2,522 DEGs were
284
identified, of these 1,162 unigene sequences significantly upregulated and 1,360
285
sequences down-regulated (Fig. 7).
286
3.5. GO classification of immune-related DEGs
AC C
EP
TE D
274
aligned
software
(q-value
significantly
<
0.005
up-regulated
& and
ACCEPTED MANUSCRIPT The unigene sequences were mapped against the GO and KEGG database. The
288
DEGs were categorized into biological process, cellular component and molecular
289
function, and then further classified into 31 subcategories (Fig. 8). The biological
290
process and molecular function categories represented most of the subcategories.
291
Metabolic process and single organism metabolic process were the most abundant
292
among the biological process. The hepatopancreas tissues metabolic oxygen demands
293
are relatively high in crustaceans similar across different species [36]. Further, in this
294
study, following the LPS treatment, we observed obvious transcription changes in
295
hepatopancrease under different GO terms including catalytic activity and
296
oxidoreductase (Fig. 8). Furthermore the DEGs were aligned to the KEGG database to
297
determine the biological pathway enrichment. Many biological pathways were (P = <
298
0.05) changed in LPS treated sample compared to control sample (PBS). Among the
299
biological pathways, some immune related pathways including PPAR signaling
300
pathway, Rheumatoid arthritis, lysosome, Chemical carcinogenesis, Peroxisome, and
301
so on were observed in the DEGs. The knowledge on crayfish antibacterial innate
302
immune mechanism is limited, and the above canonical pathways information will be
303
useful to explore and understand the immune mechanisms (Fig. 9 & Table 5).
SC
M AN U
TE D
EP
AC C
304
RI PT
287
Crayfish have an efficient innate immune system to combat various microbial
305
infections. Based on the KEGG classification, we identified some of the important
306
mechanisms of the crayfish innate immune system that are involved in the
307
antimicrobial defense, including lysosome and antibacterial peptides production.
308
These results are in agreement with study of Adema et al. [37], they performed
ACCEPTED MANUSCRIPT microarray analysis and reported the up-regulation of antibacterial peptides and
310
lysosome transcripts after the treatment of Echinostoma paraensei and Schistosoma
311
mansoni in Biomphalaria glabrata. Further, these results are also comparable with the
312
studies of Byadgi et al. [38]. Phagocytosis is the most common mechanism in animals
313
(crayfish) to overcome microbial infection. Upon the attachment of pathogen to the
314
surface of phagocyte, the phagocyte forms a vesicle (Phagosome) after engulfing
315
pathogen. Then lysosome containing different enzymes inside the phagocyte, fuse
316
with the phagosome resulting in the degradation of pathogen [39, 40].
M AN U
SC
RI PT
309
Further, the synthesis of antibacterial peptides in organisms is also an important
318
innate immune defense mechanism against microbial agents. Antibacterial peptides
319
attach to bacterial surface and break the bacterial cell membrane and consequently
320
bacterial cell is lysed. In insects, antibacterial peptides have been well studied, and
321
approximately 15 different antibacterial peptides have been reported that are induced
322
by bacteria or bacterial products exposure in insects. For instance, cecropins are
323
produced in insects and have a broad spectrum of activity against both gram negative
324
and gram positive bacteria [41, 42]. However, little information is available on the
325
antibacterial peptide production in crayfish [43], and the present study will provide a
326
foundation to further explore formation and possible role of antibacterial peptides in
327
crayfish.
328
3.6. Validation of transcriptome data by qRT-PCR
AC C
EP
TE D
317
329
To validate the expression levels of differentially expressed genes in our
330
transcriptome analysis, eleven genes were randomly selected based on altered
ACCEPTED MANUSCRIPT expression pattern after LPS treatment, and gene of interest through functional
332
enrichment and canonical pathways results, later investigated by qRT-PCR.
333
Quantitative RT-PCR analysis revealed that the most of differentially expressed genes
334
were in agreement with the RNA-seq data, however some genes such as cubilin had
335
relatively little lower expression in RNA-seq data compared to qRT-PCR analysis.
336
Although these genes expression was relatively lower, but overall both the analysis
337
showed approximately similar trend (Fig. 10). This characteristic feature has
338
previously been reported in transcriptomic studies of invertebrates such as
339
Biomphalaria glabrata [37], Homarus americanus [44], Exopalaemon carinicauda
340
[45] and so on.
341
Expression analysis of immune related genes
M AN U
SC
RI PT
331
Red swamp crayfish is one of the most economically important crustaceans. It is
343
used as model organism in various research fields such as toxicity, environmental
344
stress, animal behavior and studies on pathogen infection. Despite its importance,
345
little is known about crayfish genome particularly on immune related genes, immune
346
mechanisms and pathways, which are involved to combat microbial infection [46].
347
The identification of immune-related genes can contribute to understand the immune
348
mechanism of crayfish. Hepatopancreas in crustaceans has been reported to be
349
involved in metabolism, nutrient absorption and immune functions [47]. In this study,
350
four immune-related DEGs with clearly changed expression after LPS administration
351
were investigated by qRT-PCR assay. All of the immune-related genes were
352
significantly up-regulated in the LPS treated sample compared to PBS control (Fig.
AC C
EP
TE D
342
ACCEPTED MANUSCRIPT 11). The interferon (IFN) system is an important cellular defense mechanism in
354
mammals, and play crucial role to suppress viral infections by inhibiting multiple
355
steps of viral replication, increasing antigen-presentation process and activating
356
macrophages [48] (Cebulla and Miller, 1999). In the present study, component of IFN
357
system was up-regulated like gamma-interferon-inducible lysosomal thiol reductase
358
(GILT) in the LPS treated sample compared to control. The GILT is widely expressed
359
in a variety of cell types particularly in antigen presenting cells [49-51]. Where, it
360
catalyzes the reduction of disulfide bonds in exogenous antigens and facilitate in the
361
unfolding of the protein and then that is cleared by cellular proteases [52, 53], in
362
addition, GILT play important role in the neutralization of pathogens and removal of
363
cellular debris resulting from pathogen (bacterial) infection [54]. Further, several
364
studies reported the up-regulation of GILT in response to bacterial or LPS
365
administration, and its involvement in immune response [55, 56]. Our data is also
366
consistent with above mentioned studies, and suggest that GILT may be regulating the
367
immune function following LPS or bacterial challenge.
EP
TE D
M AN U
SC
RI PT
353
Anti-lipopolysaccharide factor (ALF) is an important member of antimicrobial
369
peptides and a key effector molecule in the crustaceans innate immune system. ALFs
370
can neutralize LPS, mediate degradation and activate coagulation cascade in cell [57].
371
So far a number of ALFs have been described in different species of crustaceans [58,
372
59], and bacterial and viral infection or LPS treatment can induce the expression of
373
ALFs in them [59, 60]. ALFs contain amphipatic loop structure that can bind a single
374
fatty acid with the phosphoglucosamine portion of lipid A that helps to bind with LPS
AC C
368
ACCEPTED MANUSCRIPT and subsequently neutralize it [61]. In this study, ALF-3 appeared as second most
376
greatly expressed factor. These results are consistent with the study of Clark et al. [44],
377
they reported up-regulation of various immune related factors including ALFs in
378
hepatopancrease of Homarus americanus after bacterial infection. Likewise,
379
Cathepsin L (CL) was significantly induced in hepatopancreas following LPS
380
challenge. Lysosomal cathepsins belonging to the family of cysteine protease have
381
been documented to be involved in the innate immunity of invertebrates. In P.
382
monodon, Microarray analysis showed that cathepsins B and L expression enhanced
383
after WSSVtreatment and the upregulation in expression of cathepsin probably reflect
384
the activation of lysosomal functions against viral infection. In the hepatopancreas of
385
F. chinensis mRNA levels of cathepsin B, C and L were upregulated following V.
386
anguillarum challenge, indicating that these cathepsins were involved in the innate
387
immunity of F. chinensis [62-64]. Hu and Leung [65] reported the expression of
388
Cathepsin L in the B cells of the hepatopancreas of Metapenaeus ensis, and suggested
389
the involvement of Cathepsin L in immune responses. Our data suggest
390
exist in the crayfish and is an important effector molecule in the innate immune
391
system, which helps to deal bacterial infection. In addition, peroxiredoxin 6 (Per6)
392
also showed significant increase compared to PBS control. It belongs to the
393
peroxiredoxin family that is broadly distributed in prokaryotes and eukaryotes [66,
394
67]. Serveral studies on invertebrates reported the increase in Per6 expression
395
following the treatment of bacteria, virus and pathogen-associated molecular pattern,
396
and has been shown to have important role in host defense [68, 69]. The members of
cathepsin L
AC C
EP
TE D
M AN U
SC
RI PT
375
ACCEPTED MANUSCRIPT peroxiredoxin family play crucial protective role in tissues such as detoxification and
398
reduction of hydrogen peroxide and so on [70]. In crustaceans, hepatopancreas is the
399
site for several redox reactions, thereby producing various free radicals, and the
400
increase of Per6 may have protective role in the crayfish hepatopancraes.
401
4. Conclusion
402
In this study, we sequenced the entire hepatopancreas transcriptome from LPS
403
administrated red swamp crayfish using Illumina sequencing technology. The
404
transcriptome information acquired in the present study makes substantial
405
contribution to a sequence source for future studies on P. clarkii. The functional
406
analysis of differentially expressed genes predicted the involvement of genes in
407
various immune pathways. The explored innate immunity-related genes and
408
biological pathways make a comprehensive picture of the immune network that
409
provides valuable information to understand immune mechanisms of P. clarkii against
410
LPS. Further, qRT-PCR analysis on the selected genes indicated that these immunity
411
related genes may contribute in the anti-bacterial (LPS) immune responses of P.
412
clarkii. The sequence information provided in the present study should help to
413
understand anti-bacterial immune responses in an important cultured species;
414
moreover these results could provide a foundation to clarify the complex interaction
415
between the P. clarkii and pathogenic bacteria.
416
Acknowledgements
AC C
EP
TE D
M AN U
SC
RI PT
397
417
This work was supported by Ph.D. programs in Wenzhou Medical University
418
(89216023), the Opening Project of Zhejiang Provincial Top Key Discipline of
ACCEPTED MANUSCRIPT Pharmaceutical Sciences (201721). The authors declare no competing interests.
420
References
421
[1] H.H. Hobbs, An illustrated checklist of the American crayfish (Decapoda:
422
Astacidae, Cambaridae & Parastacidae). Smithsonian Contributions to Zoology.
423
Smithsonian Institute Press, Washington, D.C. 480 (1989) 1-236.
424
[2] J.V. Huner, O.V. Lindqvist, Physiological adaptations of freshwater crayfishes that
425
permit successful aquaculture enterprises, American Zoologist 35 (1995) 12-19.
426
[3] C. Souty-Grosset, D.M. Holdich, P.Y. Noel, J.D. Reynolds, P. Haffner, Atlas of
427
crayfish in Europe. Paris, Publications Scientifiques du Muséum national d'Histoire
428
naturelle, Patrimoines Naturels 64 (2006) 188p.
429
[4] P. Henttonen, J.V. Huner, The introduction of alien species of crayfish in Europe:
430
A historical introduction. In: Gherardi F. and Holdich D.M., (Eds). Crayfish in Europe
431
as alien Species (How to make the best of a bad situation?). Crustacean issuses, 11,
432
A.A. Balkema, Rotterdam, Nethrlands (1999) 13-22.
433
[5] G.H. Yue, J. Li, Z. Bai, C.M. Wang, F. Feng, Genetic diversity and population
434
structure of the invasive alien red swamp crayfish, Biol. Invasions 12 (2010)
435
2697-2706.
436
[6] D.A. Kimbrell, B. Beutler, The evolution and genetics of innate immunity, Nat.
437
Rev. Genet. 2 (2001) 256-267.
438
[7] J.A. Hoffmann, The immune response of Drosophila, Nature 426 (2003) 33-38.
439
[8] E.S. Loker, C.M. Adema, S.M. Zhang, T.B. Kepler, Invertebrate immune
440
systems–not homogeneous, not simple, not well understood, Immunol. Rev. 198
AC C
EP
TE D
M AN U
SC
RI PT
419
ACCEPTED MANUSCRIPT (2004) 10-24
442
[9] C. Tafoni, N. Pujol, Mechanisms of innate immunity in C. elegans epidermis,
443
Tissue Barriers, 4 (2015) e1078432.
444
[10] P. Barker, R. Gibson, The decapod hepatopancreas, Oceanogr. Mar. Biol. 17
445
(1979) 19.
446
[11] K. Wei, J. Yang, Oxidative damage of hepatopancreas induced by pollution
447
depresses humoral immunity response in the freshwater crayfish Procambarus clarkii,
448
Fish Shellfish Immunol. 43 (2015) 510-519.
449
[12] J. Qiao, Z.H. Du, Y.L. Zhang, H. Du, L.L. Guo, M.Q. Zhong, J. Cao, X. Wang,
450
Proteomic identification of the related immune-enhancing proteins in shrimp
451
Litopenaeus vannamei stimulated with vitamin C and Chinese herbs, Fish Shellfish
452
Immunol. 31 (2011) 736-45.[13] H.H. Sung, Y.H. Lin, C.Y. Hsiao, Differential
453
immune responses of the green neon shrimp (Neocaridina denticulate) to dipropyl
454
phthalate, Fish Shellfish Immunol. 31 (2011) 511-5.
455
[14] X. Chen, D. Zeng, D. Xie, Y. Zhao, C. Yang, Y. Li, N. Ma, M. Li, Q. Yang, Z .
456
Liao, H. Wang, Transcriptome analysis of Litopenaeus vannamei in response to white
457
spot syndrome virus infection, PLoS ONE 8 (2013) e73218.
458
[15] D. Zhang, D. Pan, S. Cui, T. Su, L. Qiu, C. Zhu, S. Jiang, Molecular
459
characterization and expression analysis of interferon-gammainducible lysosomal
460
thiol reductase (GILT) gene from pearl oyster Pinctada fucata, Dev. Comp. Immunol.
461
34 (2010) 969-976.
462
[16] Y.D. Sun, L.D. Fu, Y.P. Jia, X.J. Du, Q. Wang, Y.H. Wang, X.F. Zhao, X.Q. Yu,
AC C
EP
TE D
M AN U
SC
RI PT
441
ACCEPTED MANUSCRIPT J.X. Wang, A hepatopancreas-specific C-type lectin from the Chinese shrimp
464
Fenneropenaeus chinensis exhibits antimicrobial activity, Mol. Immunol. 45 (2008)
465
348-361.
466
[17] T. Luo, H. Yang, F. Li, X. Zhang, X. Xu, Purification, characterization and cDNA
467
cloning of a novel lipopolysaccharide-binding lectin from the shrimp Penaeus
468
monodon, Dev. Comp. Immunol. 30 (2006) 607-617.
469
[18] X.W. Zhang, W.T. Xu, X.W. Wang, Y. Mu, X.F. Zhao, X.Q. Yu, J.X. Wang, A
470
novel C-type lectin with two CRD domains from Chinese shrimp Fenneropenaeus
471
chinensis functions as a pattern recognition protein, Mol. Immunol. 46 (2009)
472
1626-1637.
473
[19] M.G. Grabherr, B.J. Haas, M. Yassour, J.Z. Levin, D.A. Thompson, I. Amit,
474
Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq
475
data, Nat. Biotechnol. 29 (2013) 644-652.
476
[20] F. Li, J. Xiang, Signaling pathways regulating innate immune responses in
477
shrimp, Fish Shellfish Immunol. 4 (2013) 973-980.
478
[21] X. Leng, H. Jia, X. Sun, L. Shangguan, Q. Mu, B. Wang, J. Fang, Comparative
479
transcriptome analysis of grapevine in response to copper stress, Sci. Rep. 5 (2015)
480
17749.
481
[22] D. Li, Y. Liang, X. Wang, L. Wang, M. Qi, Y. Yu, Y. Luan, Transcriptomic
482
Analysis
483
Prophenoloxidase-Activating System, G3: Genes, Genome, Genetics 9 (2015)
484
1827-1841.
AC C
EP
TE D
M AN U
SC
RI PT
463
of
Musca
domestica
to
Reveal
Key
Genes
of
the
ACCEPTED MANUSCRIPT [23] A. Conesa, S. Götz, J.M. García-Gómez, J. Terol, M. Talón, M Robles, Blast2GO:
486
a universal tool for annotation, visualization and analysis in functional genomics
487
research, Bioinformatics 21(2005) 3674-6.
488
[24] A. Mortazavi, B.A. Williams, , K. McCue, L. Schaeffer, B. Wold, Mapping and
489
quantifying mammalian transcriptomes by RNASeq, Nat. Methods 7 (2008) 621-628.
490
[25] J. Ye, L. Fang, H.K. Zheng, Y. Zhang, J. Chen, Z.J. Zhang, J. Wang, S.T. Li, R.Q.
491
Li, L. Bolund, J. Wang, WEGO: a web tool for plotting GO annotations, Nucleic
492
Acids Res. 24 (2006) W293-W297.
493
[26] X.Z. Mao, T. Cai, J.G. Olyarchuk, L.P. Wei, Automated genome annotation and
494
pathway identification using the KEGG Orthology (KO) as a controlled vocabulary,
495
Bioinformatics 21 (2005)3787-3793.
496
[27] S. Audic, J.M. Claverie, The significance of digital gene expression profiles,
497
Genome Res. 7 (1997) 986-995.
498
[28] K.J. Livak, T.D. Schmittgen, Analysis of relative gene expression data using
499
real-time quantitative PCR and the 2(T)(-Delta Delta C) method, Methods 25 (2001)
500
402-408.
501
[29] A.J. Martin, Z. Wang, Next-generation transcriptome assembly, Nat. Rev.
502
Genetics 12 (2011) 671-682.
503
[30] Q.Y. Zhao, Y. Wang, Y.M. Kong, D. Luo, X. Li, P. Hao, Optimizing de novo
504
transcriptome assembly from short-read RNA-Seq data: a comparative study, BMC
505
Bioinformatics 12 (2011) S14-S2.
506
[31] M.G. Grabherr, B.J. Haas, M. Yassour, J.Z. Levin, D.A. Thompson, I. Amit, X.
AC C
EP
TE D
M AN U
SC
RI PT
485
ACCEPTED MANUSCRIPT Adiconis, L. Fan, R. Raychowdhury, Q. Zeng, et al., Full-length transcriptome
508
assembly from RNA-Seq data without a reference genome, Nat. Biotechnol. 29 (2011)
509
644-652.
510
[32] R. Luo , B. Liu , Y. Xie, Z. Li, W. Huang, J. Yuan, G. He, Y. Chen, Q. Pan, Y.
511
Liu, J. Tang, G. Wu, H. Zhang, Y. Shi, Y. Liu, et al., SOAPdenovo2: an empirically
512
improved memory-efficient short-read de novo assembler,
513
[33] M. Ashburner, C.A. Ball, J.A. Blake, D. Botstein, H. Butler, J.M. Cherry, A.P.
514
Davis, K. Dolinski, S.S. Dwight, T. Eppig, M.A. JHarris, D.P. Hill,
515
A. Kasarskis, S. Lewis, J.C. Matese, J.E. Richardson, M. Ringwald, G.M. Rubin, G.
516
Sherlock, Gene Ontology: tool for the unification of biology, Nat. Genet. 25(2000),
517
25-29.
518
[34] D. Garcia-Seco, Y. Zhang, F.J. Gutierrez-Mañero, C. Martin, B. Ramos-Solano,
519
RNA-Seq analysis and transcriptome assembly for blackberry (Rubus sp. Var.
520
Lochness) fruit, BMC Genomics 16 (2015) 5.
521
[35] B. Bai, J. Wu, W.T. Sheng, B. Zhou, L.J. Zhou, W. Zhuang, D.P. Yao, Q.Y. Deng,
522
Comparative analysis of anther transcriptome profiles of two different rice male
523
sterile lines genotypes under cold stress, Int. J. Mol. Sci. 16 (2015) 11398-11416.
524
[36] N.J. Brown-Peterson, P. Larkin, N. Denslow, C. King, S. Manning, M. Brouwer,
525
Molecular indicators of hypoxia in the blue crab Callinectes sapidus, Mar. Ecolo.
526
Prog. Ser. 286 (2005) 203-15.
527
[37] C.M. Adema, P.C. Hanington, C.M. Lun, G.H. Rosenberg , A.D. Aragon, B.A.
528
Stout, M.L.L Richard, P.S. Gross, E.S. Loker, Differential transcriptomic responses of
RI PT
507
L. Issel-Tarver,
AC C
EP
TE D
M AN U
SC
GigaScience 1 (2012)18.
ACCEPTED MANUSCRIPT Biomphalaria glabrata (Gastropoda, Mollusca) to bacteria and metazoan parasites,
530
Schistosoma mansoni and Echinostoma paraensei (Digenea, Platyhelminthes), Mol.
531
Immunol. 47 (2010) 849-860.
532
[38] O. Byadgi, C.W. Chen, P.C. Wang, M.A. Tsai, S.C. Chen, De novo transcriptome
533
analysis of differential functional gene expression in largemouth Bass (Micropterus
534
salmoides) after challenge with Nocardia seriolae, J. Mol. Sci. 17 (2016) 1-16.
535
[39] E. Benjamini, G. Sunshine, S. Leskowitz, 1996. Immunology a short course.
536
Wiley-Liss, Inc. (1996) 19-39.
537
[40] C.A. Janeway, P. Travers, M. Walport, Immuno biology: The Immune system in
538
healthy and disease, Elsevier Science Ltd/Garland Publishing (1999) 365-390.
539
[41] W.E.R. Xylander, L. Nevermann, Antibacterial activity in the hemolymph of
540
myriapods (Arthropoda), J. Invertebr. Pathol. 56 (1990) 206-214.
541
[42] J.A. Hoffmann, C. Hetru, Insect defensins: inducible antibacterial peptides.
542
Immunol, Today 10 (1992) 411-415.
543
[43] W.R. Xylander, G. Ullrich, H.E. Kaiser, Antibacterial immune response in
544
Astacus leptodactylus (Crustacea, Decapoda), In Vivo. 11 (1997) 195-200.
545
[44] K.F. Clark, A.R. Acorn, S.J. Greenwood, A transcriptomic analysis of American
546
lobster (Homarus americanus) immune response during infection with the bumper car
547
parasite Anophryoides haemophila. Dev. Comp. Immunol. 40 (2013) 112-122.
548
[45] Q. Ge, J. Li, J. Wang, J. Li, H. Ge, Q. Zhai, Transcriptome analysis of the
549
hepatopancreas in Exopalaemon carinicauda infected with an AHPND-causing strain
550
of Vibrio parahaemolyticus. Fish Shellfish Immunol. 67 (2017) 620-633.
AC C
EP
TE D
M AN U
SC
RI PT
529
ACCEPTED MANUSCRIPT [46] H. Shen, Y. Hu, Y. Ma, X. Zhou, Z. Xu, Y. Shui, C. Li, P. Xu, X. Sun, In-Depth
552
transcriptome analysis of the red swamp crayfish Procambarus clarkii, PLoS One 9
553
(2014) e110548.
554
[47] T. Rőszer, The invertebrate midintestinal gland (“hepatopancreas”) is an
555
evolutionary forerunner in the integration of immunity and metabolism, Cell Tissue
556
Res. 358 (2014) 685-695.
557
[48] C.M. Cebulla, D.M. Miller, D.D. Sedmak, Viral inhibition of interferon signal
558
transduction, Intervirol. 42 (1999) 325-330.
559
[49] B. Arunachalam, U.T. Phan, H.J. Geuze, P. Cresswell, Enzymatic reduction of
560
disulfide bonds in lysosomes: characterization of a gamma-interferon-inducible
561
lysosomal thiol reductase (GILT), Proc. Natl .Acad. Sci. 97 (2000) 745-750.
562
[50] U.T. Phan, M. Maric, T.P. Dick, P. Cresswell, Multiple species express thiol
563
oxidoreductases related to GILT, Immunogenetics 53 (2001) 342-346.
564
[51] M. Maric, B. Arunachalam, U.T. Phan, D. Chen, W.S. Garrett, K.S. Cannon, et al.,
565
Defective antigen processing in GILT free mice, Science 294 (2001) 1361-1365.
566
[52] P. Li, M.A. Haque, J.S. Blum, Role of disulfide bonds in regulating antigen
567
processing and epitope selection, J. Immunol. 169 (2002) 2444-2450.
568
[53] R. Thai, G. Moine, M. Desmadril, D. Servent, J. Tarride, A. Menez, et al.,
569
Antigen stability controls antigen presentation, J. Biol. Chem. 279: (2004)
570
50257-50266.
571
[54] R.L. Lackman, P. Cresswell, Exposure of the promonocytic cell line THP-1 to
572
Escherichia coli induces IFN-c-inducible lysosomal thiol reductase expression by
AC C
EP
TE D
M AN U
SC
RI PT
551
ACCEPTED MANUSCRIPT 573
inflammatory cytokines, J. Immunol. 177 (2006) 4833-4840.
574
[55] M.D. Zoysa, J. Lee, Molecular cloning and expression analysis of interferon-c
575
inducible lysosomal thiol reductase (GILT)-like cDNA from disk abalone (Haliotis
576
discus discus), J. Invertebr. Pathol.
577
[56] W.B. Dan, S.L. Wang, J.Q. Liang, S.Q. Zhang, Molecular cloning and expression
578
analysis of porcine c-interferon-inducible lysosomal thiol reductase (GILT), Vet.
579
Immunol. Immunopathol. 126 (2008) 163–167.
580
[57] L. Wang, Y. Zhang, L. Wang, J. Yang, Z. Zhou, Y. Gai, L. Qiu, L. Song, A new
581
anti-lipopolysaccharide factor (EsALF-3) from Eriocheir sinensis with antimicrobial
582
activity, African J. Biotechnol. 10 (2011) 17678-17689.
583
[58] H. Liu, P. Jiravanichpaisal, I. Soderhall, L. Cerenius, K. Soderhall,
584
Antilipopolysaccharide factor interferes with white spot syndrome virus replication in
585
vitro and in vivo in the crayfish Pacifastacus leniusculus, J. Virol. 80 (2006)
586
10365-10371.
587
[59]
588
Antilipopolysaccharide factor (ALF) of mud crab Scylla paramamosain: molecular
589
cloning, genomic organization and the antimicrobial activity of its synthetic LPS
590
binding domain, Mol. Immunol. 44 (2007) 3195-3203.
591
[60] S. Tharntada, S. Ponprateep, K. Somboonwiwat, H. Liu, I. Soderhall, K.
592
Soderhall, A. Tassanakajon, Role of antilipopolysaccharide factor from the black tiger
593
shrimp, Penaeus monodon, in protection from white spot syndrome virus infection, J.
594
Gen. Virol. 90 (2009) 1491-1498.
P.
EP
Imjongjirak,
Amparyup,
A.
Tassanakajon,
S.
Sittipraneed,
AC C
C.
TE D
M AN U
SC
RI PT
96 (2007) 221-229.
ACCEPTED MANUSCRIPT [61] H. Nagoshi, H. Inagawa, K. Morii, H. Harada, C. Kohchi, T. Nishizawa, Y.
596
Taniguchi, M. Uenobe, T. Honda, M. Kondoh, Cloning and characterization of a
597
LPS-regulatory gene having an LPS binding domain in kuruma prawn Marsupenaeus
598
japonicas, Mol. Immunol. 43 (2006) 2061-2069.
599
[62] X. Li, X. Meng, J. Kong, K. Luo, S. Luan, B. Cao, N. Liu, J. Pang, X. Shi,
600
Molecular cloning and characterization of a cathepsin B gene from the Chinese
601
shrimp Fenneropenaeus chinensis, Fish Shellfish Immunol. 35 (2013) 1604-1612.
602
[63] S. Wang, L.J. Shi, N. Liu, A.J. Chen, X.F. Zhao, J.X. Wang, Involvement of
603
Fenneropenaeus chinensis Cathepsin C in antiviral immunity, Fish Shellfish Immunol.
604
33 (2012) 821-828.
605
[64] Q. Ren, X.W. Zhang, Y.D. Sun, S.S. Sun, J. Zhou, Z.H. Wang, X.F. Zhao, J.X.
606
Wang, Two cysteine proteinases respond to bacterial and WSSV challenge in Chinese
607
white shrimp Fenneropenaeus chinensis, Fish Shellfish Immunol. 29 (2010) 551-556
608
[65] H. Kejin, P.C. Leung, Shrimp cathepsin L encoded by an intronless gene has
609
predominant expression in hepatopancreas, and occurs in the nucleus of oocyte, Comp.
610
Biochem. Physiol. B Biochem. Mol. Biol. 137 (2004) 21-33.
611
[66] G. Leyens, I. Donnay, B. Knoops, Cloning of bovine peroxiredoxins-gene
612
expression in bovine tissues and amino acid sequence comparison with rat, mouse and
613
primate peroxiredoxins, Comp. Biochem. Physiol. B: Biochem. Mol. Biol. 136 (2003)
614
943-955.
615
[67] Z.A. Wood, E. Schroder, J.R. Harris, L.B. Poole, Structure, mechanism and
616
regulation of peroxiredoxins. Trends Biochem. Sci. 28 (2003) 32-40.
AC C
EP
TE D
M AN U
SC
RI PT
595
ACCEPTED MANUSCRIPT [68] W.J. Zheng, Y.H. Hu, M. Zhang, L. Sun, Analysis of the expression and
618
antioxidative property of a peroxiredoxin 6 from Scophthalmus maximus, Fish
619
Shellfish Immunol. 29 (2010) 305-311.
620
[69] M. De Zoysa, J.H. Ryu, H.C. Chung, C.H. Kim, C. Nikapitiya, C. Oh, et al.,
621
Molecular characterization, immune responses and DNA protection activity of rock
622
bream (Oplegnathus fasciatus), peroxiredoxin 6 (Prx6). Fish Shellfish Immunol. 33
623
(2012) 28-35.
624
[70] S.N. Radyuk, V.I. Klichko, B. Spinola, R.S. Sohal, W.C. Orr, The peroxiredoxin
625
gene family in Drosophila melanogaster, Free Radic. Biol. Med. 31 (2001)
626
1090-1100.
M AN U
SC
RI PT
617
627
631 632 633 634 635 636 637 638
EP
630
AC C
629
TE D
628
ACCEPTED MANUSCRIPT Figure legends
640
Fig. 1 Distribution pattern of unigene sequences and transcript length. Blue and
641
Red bars indicate unigenes and transcripts, respectively.
642
Fig. 2 Homology analysis of hepatopancreas specific unigene sequences of
643
crayfish (A) E-value distribution, (B) similarity distribution and (C) species
644
classification.
645
Fig. 3 Gene functional classification (GO) of crayfish hepatopancreas. The results
646
are summarized into three ontologies (cellular component, biological function and
647
molecular function). Y-axis indicates the total number genes in each category.
648
Fig. 4 The Eukaryotic Orthologous Groups (KOG) classification of putative
649
proteins.
650
Fig. 5 The Kyoto Encyclopedia of Genes and Genomes (KEGG) classification
651
based on assembled unigene sequences.
652
Fig. 6 Hierarchical cluster analysis of differentially expressed based on the data
653
of log ratio fold change. The colour scale shows the level of differentially expressed
654
gene: red colour shows increased expression level of mRNA and blue colour shows
655
decrease in expression level. L_1 and D_1 represent LPS treated and control PBS
656
treated, respectively.
657
Fig. 7 Distribution pattern of differentially expressed unigene sequences of
658
crayfish. X-axis exhibits change in fold between two sets of samples (PBS and LPS),
659
and Y-axis shows significance of DEGs. Red (up-regulation) and green
660
(down-regulation) dots represent significantly different expression (q-value<0.005,
AC C
EP
TE D
M AN U
SC
RI PT
639
ACCEPTED MANUSCRIPT |log2 (fold change)|>1), respectively, and dots in blue colour shows no significant
662
differences.
663
Fig. 8 Histogram and gene ontology analysis of crayfish differentially expressed
664
genes. The results are summarized into three ontologies (cellular component,
665
biological function and molecular function). Y-axis indicates the total number
666
differentially expressed genes in each category.
667
Fig. 9 The top 20 canonical pathways with maximum numbers of differentially
668
expressed genes based on KEGG classifications.
669
Fig. 10 Comparison of gene expression pattern between RNA-Seq and qRT-PCR
670
results in crayfish. The gene expression values were normalized to the 18S RNA
671
gene. Log2FC indicates the log2 fold change. Blue bars represent expression of genes
672
in RNA-seq data and orange bars show qRT-PCR results.
673
Fig. 11 Expression profiles of the key DEGs related immune responses after LPS
674
challenge. These DEGs were quantified in crayfish hepatopancreas followingLPS
675
infection. 18S RNA gene was used as an internal control.
EP
TE D
M AN U
SC
RI PT
661
AC C
676 677
Legends of Supplementary Data
678
Supplementary Fig. 1 1.0% agrose gel electrophoresis of Procambarus clarkii
679
total RNA.
ACCEPTED MANUSCRIPT Table 1 Primers used in this study Forward Primer (5’-3’)
Reverse Primer (5’-3’)
Purpose
alkaline phosphatase
CAGCAGTGGTGGAAGACAGC
GTCGCCGAGGAACAAGATG
qPCR
beta 1,4-endoglucanase
ATGTTGTGCAGTTCGCTTGG
CGGTGTGTATGGTGCTTCGT
qPCR
catalase
TGCTGAGGTGGAACAGATGG
CGATGGCGATGAGTGTCATT
qPCR
cellulase GHF9
TGCCGGTCAGACAGAATACG
CCGTGGTCAATACCTCCGTTA qPCR
cubilin
GATCTGGCGCTCAATGACAA
TAGGAGGCGGACACTCTTGG
qPCR
acyl-CoA AGCGCAAGCACTACATTCCA
GATGGAGTAGCGCAGGAAGC
qPCR
SC
delta-9 desaturase
RI PT
Primer name
GCCGACAGTGTGACGAAGTC
ACGTTCACCACCAGCAACAC
qPCR
hemocyanin 2
GCTCGTCCTGATGATGTCCA
CGGTGATGTAGCCAAGAGCA
qPCR
hemocyanin
TGGAACACAGCAAGGACTGG
AAGCACGATGTCTTCGGTGA
qPCR
serine carboxypeptidase 1
CCGTGGTTAAGCTGTTGACG
GGCCACTCCATGTTCTCGAT
qPCR
vitellogenin
CCGGCTGAGCTTCACTAACC
AGTCGATGCGGTTGCCTTAT
qPCR
Per6
GCCATCAAGCTGCACACCTA
TTTGCCACACATCCCAACTC
qPCR
GTLT
CTGGAAGGCCAGAACTTGCT
CCTGGCAATCTGTCACCGTA
qPCR
CL
GCAGTGTGGATCATGCTGGT
TGCCATACTTGGTGGAGCAG
qPCR
GCGCCTGCAACTACAGCTAC
TTTGGGAGTCGCCAGAGAA
qPCR
18 S
CTGTGATGCCCTTAGATGTT
GCGAGGGGTAGAACATCCAA
qPCR
AC C
ALF3
EP
TE D
M AN U
glutathione peroxidase
ACCEPTED MANUSCRIPT Table 2 Summary of the sequence analyses Sample
Raw Reads
Clean reads
Clean Bases
Error(%)
Q20(%)
Q30(%)
GC content(%)
PBS
47,592,460
45,706,720
6.86G
0.02
96.63
91.77
43.99
LPS
52,414,336
50,310,480
7.55G
0.02
96.21
90.78
47.53
RI PT
Clean reads: The number of reads after removing low-quality sequences. The subsequent analysis is based on clean reads. Error rate: Base error rate. Q20 and Q30, the percentage of bases with Phred values >20 and >30, respectively. GC content: the GC ratio of the total base number
Number of genes
Percentage (%)
M AN U
SC
Table 3 The success rate of gene annotation
20,363
32.89
14,515
23.44
10,174
16.43
20,264
32.73
19,672
31.77
Annotated in SwissProt
16,369
26.44
Total Unigenes
61,912
100
Annotated in GO Annotated in KO Annotated in KOG
EP
Annotated in NR
TE D
Annotated in PFAM
AC C
Annotated in GO: The unigene number and annotation rate in the GO database. Annotated in KO: The unigene number and annotation rate in the KO database. Annotated in KOG: The unigene number and annotation rate in the KOG database. Annotated in PFAM: The unigene number and annotation rate in the PFAM database. Annotated in NR: The unigene number and annotation rate in the NR database. Annotated in SwissProt: The unigene number and annotation rate in the SwissPort database.
ACCEPTED MANUSCRIPT Table 4 Top 20 statistically significant Kyoto Encyclopedia of Genes and Genomes (KEGG) classifications Number
Pathway definition
Pathway ID
Number of unigenes
Endocytosis
ko04144
264
2
Spliceosome
ko03040
238
3
Purine metabolism
ko00230
223
4
PI3K-Akt signaling pathway
ko04151
220
5
Focal adhesion
ko04510
205
6
Lysosome
7
Rap1 signaling pathway
8
Ubiquitin mediated proteolysis
9
SC
RI PT
1
205
ko04015
200
ko04120
191
Protein processing in endoplasmic reticulum
ko04141
177
10
Regulation of actin cytoskeleton
ko04810
174
11
Ras signaling pathway
ko04014
170
12
cAMP signaling pathway
ko04024
169
13
RNA transport
ko03013
163
14
MAPK signaling pathway
ko04010
161
15
Tight junction
ko04530
151
16
Insulin signaling pathway
ko04910
146
Thyroid hormone signaling pathway
ko04919
146
AMPK signaling pathway
ko04152
142
19
Hippo signaling pathway
ko04390
140
20
Carbon metabolism
ko01200
137
18
TE D
EP
AC C
17
M AN U
ko04142
ACCEPTED MANUSCRIPT Table 5 Top 20 differentially expressed pathways between normal crayfish and LPS challenged crayfish Pathway definition
P-value
1
Ribosome
3.05E-08
ko03010
46
2
Oxidative phosphorylation
5.89E-06
ko00190
33
3
Cardiac muscle contraction
7.56E-06
ko04260
17
4
Arachidonic acid metabolism
7.23E-05
ko00590
20
5
Longevity regulating pathway – worm
0.00015111
ko04212
27
6
Parkinson's disease
0.000678484
ko05012
31
7
Non-alcoholic fatty liver disease (NAFLD)
0.001235084
ko04932
29
8
Steroid biosynthesis
0.004219732
ko00100
8
9
Linoleic acid metabolism
0.006381182
ko00591
9
10
PPAR signaling pathway
0.006525988
ko03320
15
11
Glycolysis / Gluconeogenesis
0.006669947
ko00010
17
12
Lysosome
0.007820667
ko04142
39
13
Alzheimer's disease
0.010607122
ko05010
30
14
Carbohydrate digestion and absorption
0.016187143
ko04973
11
15
Ovarian steroidogenesis
0.020214647
ko04913
12
16
Metabolism of xenobiotics by cytochrome P450
0.02282741
ko00980
10
17
Other glycan degradation
0.023003679
ko00511
12
18
Retinol metabolism
0.024717722
ko00830
11
19
Rheumatoid arthritis
0.029414928
ko05323
12
20
Fatty acid degradation
0.03063433
ko00071
14
SC
M AN U
TE D
EP
AC C
DEG, differentially expressed gene
Pathway ID Number of DEGs
RI PT
No.
TE D
S
M AN U
ED
M AN U
ED
M AN
AC C
EP TE D
SC
M AN U
RI P
TE
D
S
M AN U
PT ED
S
M AN U
EP TE D
SC
M AN U
CE
ED
PT SC
M AN U
ED
M AN
TE
D
M AN U
EP TE D
SC
M AN U
ACCEPTED MANUSCRIPT Highlights 1. The
Hepatopancrease
transcriptome
of
P.
clarkii
following
Lipolysaccharide (LPS) challenge was constructed for the first time.
RI PT
2. The RNA-sequencing followed by assembly generated 61,912 unigenes. 3. In total, 2522 genes differentially expressed in LPS treated group compared with the control.
differentially expressed genes were identified and
SC
4. Immune-related
AC C
EP
TE D
M AN U
categorized.