Transcriptome analysis provides new insights into the growth superiority of a novel backcross variety, Megalobrama amblycephala ♀ × (M. amblycephala ♀ × Culter alburnus ♂) ♂

Transcriptome analysis provides new insights into the growth superiority of a novel backcross variety, Megalobrama amblycephala ♀ × (M. amblycephala ♀ × Culter alburnus ♂) ♂

Accepted Manuscript Transcriptome analysis provides new insights into the growth superiority of a novel backcross variety, Megalobrama amblycephala ♀ ...

917KB Sizes 0 Downloads 39 Views

Accepted Manuscript Transcriptome analysis provides new insights into the growth superiority of a novel backcross variety, Megalobrama amblycephala ♀ × (M. amblycephala ♀ × Culter alburnus ♂) ♂

Guodong Zheng, Chenbin Wu, Juan Liu, Jie Chen, Shuming Zou PII: DOI: Article Number: Reference:

S0044-8486(19)30779-3 https://doi.org/10.1016/j.aquaculture.2019.734317 734317 AQUA 734317

To appear in:

aquaculture

Received date: Revised date: Accepted date:

2 April 2019 30 June 2019 16 July 2019

Please cite this article as: G. Zheng, C. Wu, J. Liu, et al., Transcriptome analysis provides new insights into the growth superiority of a novel backcross variety, Megalobrama amblycephala ♀ × (M. amblycephala ♀ × Culter alburnus ♂) ♂, aquaculture, https://doi.org/10.1016/j.aquaculture.2019.734317

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Transcriptome analysis provides new insights into the growth superiority of a novel backcross variety, Megalobrama amblycephala ♀ × (M. amblycephala ♀ × Culter alburnus ♂) ♂

PT

Guodong Zheng, Chenbin Wu, Juan Liu, Jie Chen, Shuming Zou*

RI

Key Laboratory of Freshwater Aquatic Genetic Resources, Ministry of Agriculture,

SC

Genetics and Breeding Center for Blunt Snout Bream, National Demonstration Center

NU

for Experimental Fisheries Science Education, Shanghai Ocean University, Shanghai 201306, China

MA

*Correspondence Authors: Dr. Shu-Ming Zou

Tel.: +86-021-61900345

ED

Fax: +86-021-61900345

AC C

EP T

Email: [email protected]

1

ACCEPTED MANUSCRIPT

Abstract A novel backcross variety (BC) [Megalobrama amblycephala (MA) ♀ × (Megalobrama amblycephala ♀ × Culter alburnus (CA) ♂) ♂] was generated by artificial insemination. The BC exhibits growth superiority and enhanced digestive

PT

enzyme activity than that of its parents. We conducted the transcriptome analysis

RI

using hepatopancreas between the BC and its parents. Three hundred and twenty-five

SC

and 872 differentially expressed genes (DEGs) were identified in ‘BC vs. MA’ and

NU

‘BC vs. CA’, respectively. The GO and KEGG enrichment analysis of DEGs were classified. From the 134 overlapping DEGs and their pathway, the digestive enzyme

MA

genes, such as TRY, ELA1, CTRL1, CPA2, and BAL, and the IGF system genes, such as IGF1, IGF2a, and IGFBP2b, of the BC had significant difference in expression.

ED

Furthermore, the expression of downstream genes related to the protein and fatty acid

EP T

synthesis pathways, such as PI3KR, RAPTOR, EIF4E, CS, MDH, FAS, ELOVL1, ELOVL5, and ELOVL6, were significantly up-regulated. In addition, a total of 40732

AC C

SSRs and 52951 SNPs were identified from the transcriptome. The results revealed that the enhanced digestive enzymes activity and synthesis of proteins/fatty acids might be contributed to growth superiority of backcross BC. Key words: transcriptome; backcross; growth superiority; digestive enzyme; protein/fatty acid synthesis

2

ACCEPTED MANUSCRIPT

1. Introduction Blunt snout bream (Megalobrama amblycephala, MA) and topmouth culter (Culter alburnus, CA) are freshwater fish species of the subfamily Cultrinae. In China, both species are important in the fish polyculture system. Although the M.

PT

amblycephala grows faster than the C. alburnus, but its meat quality is inferior to that

RI

of the C. alburnus. To produce new varieties with growth superiority and high quality

SC

meat, hybridization has been introduced into the artificial breeding of these two

NU

species. According to statistics, the yield of the hybrids between M. amblycephala and C. alburnus has exceeded 50 thousand tons in China in 2018 (China Aquaculture

MA

Network, www.shuichan.cc). In previous study, a fertlie hybrid lines between M. amblycephala and C. alburnus were established (Xiao et al., 2014), this hybrid were

ED

also found to have obvious heterosis in growth performance, feed utilization and

EP T

muscle quality (Zheng et al., 2015; Zheng et al., 2019). Moreover, the total amino acid and essential amino acid content of hybrids between M. amblycephala and C.

AC C

alburnus were higher than those of their parents (He et al., 2014), SRAP marker analysis showed that the genetic diversity of this hybrids increased significantly (Jia et al., 2011) and the number of intermuscular bones decreased significantly (Jiang et al., 2016). In addition, a similar hybrid between M. terminalis and C. alburnus also showed growth advantage (Guo et al., 2018). RNA-Seq has been widely used as an effective tool for transcriptome analysis in order to discover, profile, and quantify the RNA transcripts (Wang et al., 2009).

3

ACCEPTED MANUSCRIPT

RNA-Seq combines the advantages of high throughput, low background noise and high sensitivity, making it feasible to map transcribed regions, quantify gene expression, and distinguish different isoforms and allelic expression (Wang et al., 2009; Ozsolak and Milos, 2011). Recently, the differently expressed genes (DEGs)

PT

and pathways related to heterosis have been identified through the transcriptome

RI

profiles analysis in some commercially crops and fishes, such as hybrid maize (Zea

SC

mays L.) (Bi et al., 2014), hybrid rice (Oryza sativa) (Zhai et al., 2013), hybrid

NU

soybean (Glycine max (L.) Merr) (Zhang et al., 2017), hybrid pufferfish (Takifugu) (Gao et al., 2013), hybrid grouper (Epinephelus spp.) (Sun et al., 2016a, b), etc. These

MA

studies could help us to further understand the mechanisms underlying the heterosis in crops and fishes.

ED

In order to reveal the molecular mechanism of the heterosis of M. amblycephala

EP T

and C. alburnus, transcriptome analysis was carried out for the two species and their hybrids. It is hoped that effective DEGs can be screened to reasonably explain the

AC C

heterosis of fish, and a large number of molecular markers can be developed for the study of QTL analysis and genetic maps.

2. Materials and methods 2.1. Experimental fish and feeding trial M. amblycephala (MA) and C. alburnus (CA) broodstocks were obtained from the Genetics and Breeding Center for Blunt Snout Bream of Ministry of Agriculture,

4

ACCEPTED MANUSCRIPT Shanghai, China. MC hybrids (MA♀ × CA♂) were produced first, when MC hybrids reached sexual maturity, the backcross BC (MA♀ × MC♂) was produced and parental selfbred strains of MA (♀ × ♂) and CA (♀ × ♂) were also produced by artificial insemination. All fish used in the study came from a single cross family, eggs from

PT

the same female used for the backcross with the male hybrid also used for maintaining

RI

the parental strains. These three crosses (BC, MA and CA) with the same age and

SC

similar sizes were randomly distributed into 3 concrete ponds (6 × 4 × 1.2 m, L : W :

NU

H) with water depth of 1m. Each pond was stocked all three crosses and each cross had 50 fry (total of 150 fish per pond). The feeding period was 90 days. During the

MA

feeding trial, fish were fed twice (06:30 and 17:30) each day with equal amounts of floating compound feed (containing 33% protein, 3% lipid and 8.5% fiber, Tech-bank

ED

Co., Ltd, Ningbo, China) and until the feed is eaten completely by the fish. One half

EP T

of the pond water was changed weekly to ensure fresh water quality. Temperature of

AC C

water ranged from 22~30 ℃ and dissolved oxygen was above 5.0 mg/L.

2.2. Growth measurement and sample collection At the end of feeding trial, after 24 hours of starvation, the weight gain rate (WGR), specific growth rate (SGR), hepatosomatic index (HSI) and condition factor (CF) were calculated. Ten individuals per cross were euthanized with 100 mg/L of MS222 (Sigma, USA) before tissues were collected. Hepatopancreases (for RNA extraction, sequencing and digestive enzyme activity assays) was collected and stored

5

ACCEPTED MANUSCRIPT

immediately at -80°C.

2.3. Digestive enzyme activity assays The whole hepatopancreas of three samples per cross was weighed and

PT

homogenized (dilution 1:10) in ice-cold buffer. The homogenization buffer solution

RI

was 0.02 M Tris/0.01 M phosphate, pH 7.0, in 50% (v/v) glycerol (Moro et al., 2010).

SC

The extract was centrifuged at 3000 rpm for 10 min at 4°C, and the supernatant was

NU

used as the enzyme source using BSA (Sigma, USA) as standard. The activity of total protease was detected using Folin-phenol method and the activity of total lipase was

MA

detected using commercial kits (ref. no. A054) produced by Jiancheng Bioengineering

ED

Institute (Nanjing, China) (Li et al., 2012).

EP T

2.4. RNA extraction, library construction and sequencing Total RNA from hepatopancreas samples was extracted using TRIzol reagent

AC C

(Invitrogen, Carlsbad, CA, USA) and purified using the RNeasy Mini Kit (Qiagen, Valencia, CA, USA). RNA degradation and contamination were detected by 1% agarose gels. The integrity and concentration of RNA samples were examined by Agilent 2100 Bioanalyzer (Agilent Technologies, CA, USA). Equal amounts of nine RNA samples (three samples per cross) were used for cDNA synthesis and RNA-seq. Before the construction of library, Magnetic Oligo-dT beads (Invitrogen, USA) were used to isolate poly (A) + mRNA, and then cDNA

6

ACCEPTED MANUSCRIPT libraries were constructed and sequenced by Illumina Hiseq™ 4000 platform (Majorbio Biotech Co., Ltd, Shanghai, China). Briefly, 8 μg of total RNA for each sample was used to construct libraries by using Truseq TM RNA sample prep Kit (Illumina, USA) according to the manufacturer’s protocol. The constructed DNA

PT

template was enriched by PCR amplification (15 cycles) . Amplicons were collected

RI

and purified by Certified Low Range Ultra Agarose (Bio-Rad, USA) gel

SC

electrophoresis. Before sequencing, the DNA libraries were quantified by using

NU

TBS-380 micro fluorometer with Picogreen® reagent (Invitrogen, USA). Clone clusters were generated on Illumina cBot, using Truseq PE Cluster Kit v3-cBot-HS,

MA

and high- throughput sequencing was performed on an Illumina Hiseq™ 4000

ED

sequencer, using Truseq SBS Kit (300 cycles).

EP T

2.5. De novo assembly and functional annotation of unigenes The raw paired end reads from all transcriptomes were cleaned by removing

AC C

adapter contamination, low quality sequences (reads with over 10% unknown base pairs‘N’) and empty reads. All clean reads were then assembled using the denovo assembly program Trinity. The assembled unigenes of three crosses were used for BlastX search and annotation against the NR, Swissprot, String (Franceschini et al., 2013), COG (Clusters of Orthologous Groups of proteins), KEGG (Kyoto Encyclopedia of Genes and Genomes) databases with an E-value cut-off of 10−5 . Functional annotation including biological process, molecular function and cellular

7

ACCEPTED MANUSCRIPT

component for gene ontology (GO) terms was analysed by using BLAST2GO software (Conesa et al., 2005).

2.6. Comparative expression analysis

PT

The read counts were further normalized into FPKM (Fragments Per Kilobase of

RI

exon model per Million mapped reads) values. The FPKM values from the three

SC

libraries were pairwise compared, and the fold changes (FC) of the genes were

NU

calculated by using RSEM software (version 1.2.7) (Li and Dewey, 2011). If |log2(FC)| ≥ 1 and false discovery rate (FDR) ≤ 0.05, these genes were considered as

MA

having significant differential expression (Anders and Huber, 2010). The expressions of unigenes were displayed intuitively by Volcano Plot. After DEGs in the

ED

hepatopancreas between backcrosses and their parents were screened, the enrichment

EP T

analysis of GO and KEGG pathways was carried out by Goatools software (version0.4.7) and KOBAS 2.0 software (Xie et al., 2011). The R program was used

AC C

to depict the heat maps for gene clustering in the present study.

2.7. Detection of SSRs and SNPs Assembled

transcriptomes

were

screened

for

microsatellites

using

Msatcommander program (Faircloth, 2008). The parameters were designed to identify perfect mono-hexa- nucleotide motifs with a minimum of ten repetitions for mononucleotides, six repetitions for dinucleotides, and five repetitions for tri- hexa

8

ACCEPTED MANUSCRIPT nucleotides. Potential SNPs were filtered using the program Samtools (Li, 2011) and VarScan v.2.2.7 (Koboldt et al., 2009 ) with the default parameters.

2.8. Qualitative real-time PCR (qRT-PCR)

PT

Total RNA of hepatopancreas was extracted with the method described above, and

RI

cDNA was synthesized by PrimeScript RT reagent Kit (Takara, Janpan). The cDNA

SC

was then used for qRT-PCR analysis using specific primers (Table 1). qRT-PCR was

NU

performed by using SYBR Green RT-PCR kit (Takara, Janpan) on the CFX96 Touch™ real- time PCR Detection System (BioRad, USA). The housekeeping gene

MA

β-actin had been proved to be stable between the comparison groups and then used as the internal control. All reactions were performed in triplicates. The relative

ED

expression ratio of the target genes versus β-actin gene was calculated using 2-ΔΔCT

EP T

method (Livak and Schmittgen, 2001).

AC C

2.9 Statistical analysis

Digestive enzyme activity and gene expression data were carried out by using SPSS version 17 (Michigan Avenue, Chicago, IL, USA) for significant differences. If significant (P < 0.05) differences were found in factors, Duncan's multiple range test (Duncan, 1955) was used to rank the means.

3. Result

9

ACCEPTED MANUSCRIPT

3.1. Growth performance and digestive enzyme activity The growth performances of the backcross BC, MA, and CA are presented in Table 2. The weight gain rate (WGR) of BC, MA, and CA was 680.00%, 589.38%, and 334.61%, respectively. The WGR of BC was 15.38% and 103.22% higher than

PT

that of its parents MA and CA, this result showed that the growth superiority of the

RI

BC was significant (P < 0.05). The condition factor (CF) of BC was slightly and

SC

significantly (P < 0.05) higher than that of MA and CA, respectively. However,

NU

hepatosomatic index (HSI) of BC was intermediated between their two parents. In addition, the protease and lipase activity in hepatopancreas of BC was significantly (P

MA

< 0.05) higher than that of their parents, protease activity of CA was slightly higher

ED

than that of MA and lipase activity of MA was higher than that of CA (P < 0.05).

EP T

3.2. High-throughput sequencing and mapping to the reference genome Over 7.45 Gb of raw data for each sample and 159 million 100-bp paired-end

AC C

reads with an average of 53 million reads for each of the three samples (Table 3) were obtained. The raw data were cleaned and quality checked, and then assembled. Approximately, 49 million (BC), 55 million (MA), and 51 million (CA) clean reads were produced; 83.28% (BC), 89.64% (MA), and 89.38% (CA) of these clean reads were mapped to the assembled sequences (≤ 5 base mismatches) (Table 3). An assembly of the reads generated 77102 (BC), 63367 (MA) and 67087 (CA) transcripts with an average length of 923 bp, 835 bp and 842 bp, respectively. And a

10

ACCEPTED MANUSCRIPT

total of 62248 (BC), 55182 (MA) and 57366 (CA) unigenes were generated, with an average length of 813 bp, 745 bp and 747 bp, respectively (Table 4). An average of 61.78% of the transcripts and 66.29% of the unigenes were < 600 bp in size for the three samples (Supplementary Fig. 1). The results showed that ORFs of 41713

PT

unigenes (67.01%) in BC, 34654 unigenes (62.79%) in MA and 34602 unigenes

RI

(60.32%) in CA were successfully predicted using Trinity respectively. Overall, there

SC

were 25 classifications produced from the COG/KOG/NOG-annotated putative

NU

proteins, including ‘metabolism’, ‘cellular processes a nd signaling’ and ‘information storage and processing’, in accordance with the GO annotation categories

MA

(Supplementary Fig. 2 and Supplementary Table 1).

Characterization of BC unigenes by searching against public databases: E-value

ED

distribution of the top hits in the databases showed 11205 (35.39%) unigenes with a

EP T

strong homology (<1.0e-50) (Fig. 1A), 27629 (87.28%) unigenes had a similarity higher than 80%, while 36754 (11.61%) showed a similarity between 60% and 80%

AC C

with respect to the identity distribution pattern. Therefore, 98.89% of the unigenes showing an identity higher than 60% along with a high-quality similar distribution supported the reliability of the de novo assembly (Fig. 1B). A total of 31657 putative known unigenes were found in all reference species, 16903 (53.39%) were found in Danio rerio (Fig. 1C).

3.3. Identification and analysis of the diff erently expressed genes (DEGs)

11

ACCEPTED MANUSCRIPT

FPKMs of each gene in the hepatopancreas of the backcross BC were compared with those of their parents. The DEGs of ‘BC vs. MA’ and ‘BC vs. CA’ were identified by filtering the data with the following criteria: absolute log2 (FC) ≥ 1, FDR < 0.05, and FPKM > 2. According to these criteria, volcano plots were used to

PT

describe the number of significant up and downregulated DEGs of ‘BC vs. MA’ (Fig.

RI

2A) and ‘BC vs. CA’ (Fig. 2B).

SC

As shown in Fig. 3, Venn diagram showed that there were 325 DEGs (130

NU

up-regulated and 195 down-regulated) in ‘BC vs. MA’ and 872 DEGs in (388 up-regulated and 484 down-regulated) ‘BC vs. CA’. Among those genes, 134

MA

overlapping DEGs were found in ‘BC vs. MA’ and ‘BC vs. CA’. Based on GO annotation, the hepatopancreas DEGs of ‘BC vs. MA’ and ‘BC vs. CA’ were enriched

ED

into 46 and 53 GO terms respectively, which provides an overview of ontology

EP T

content. Metabolic process, cell part and binding were the most three enriched GO terms in the biological process, cellular component and molecular function categories,

AC C

respectively (Fig. 4A and B). By KEGG analysis, the DEGs of ‘BC vs. MA’ were classified to 5 categories with 54 pathway and DEGs of ‘BC vs. CA’ were classified to 5 categories with 51 pathway (Supplementary Fig. 3-4), in which the protein digestion-absorption pathway and pancreatic secretion pathway were the most significant pathway in both ‘BC vs. MA’ and ‘BC vs. CA’.

3.4. Analysis of diff erently expressed genes related to growth

12

ACCEPTED MANUSCRIPT

By analyzing these 134 overlapping DEGs and their pathway, we found that 5 genes, including trypsinogen (TRY), pancreatic elastase 1 (ELA1), chymotrypsin- like 1 (CTRL1), carboxypeptidase 2 (CPA2) and bile salt-activated lipase (BAL) of BC had higher FPKMs value than both MA and CA, which provides further evidence that

PT

digestive enzyme genes were more active in the BC (Fig. 5 and Table 5).

RI

From these 134 overlapping DEGs and their pathway, we also analyzed the

SC

insulin- like growth factor (IGF) system related genes in hepatopancreases, such as

NU

IGF1, IGF2a, IGF binding protein1 (IGFBP1) and IGFBP2b were differentially expressed in BC when compared with those of its parents (Fig. 5 and Table 5). In

including

MA

addition, the DEGs were identified in protein and fatty acid synthesis pathways, phosphoinositide-3-kinase

regulatory

subunit

(PI3KR),

regulatory

ED

associated protein of mTOR (RAPTOR), eukaryotic translation initiation factor 4E

EP T

(EIF4E), citrate synthase (CS), malate dehydrogenase (MDH), fatty acid synthetase (FAS) and elongation of very long chain fatty acids protein (ELOVL1, ELOVL5 and

AC C

ELOVL6) (Fig. 5 and Table 5).

3.5. SSRs and SNP discovery Using the Msatcommander program, a total of 12718, 12922 and 15092 potential SSRs contained repeats of more than one nucleotides with a minimum of five repetitions were identified from MA, CA and BC, respectively (Fig. 6). The number of SSRs with ten repetitions is the largest in all three crosses and the BC group has

13

ACCEPTED MANUSCRIPT

more SSRs than MA and CA group. Among these SSRs, GT/GT, GAT and GATA were most common in dinucleotide, trinucleotide and quadnucleotide SSRs, respectively. A total of 52951 candidate SNPs were identified in 3 crosses. Of these SNP

PT

candidates, 38787 SNPs were putative transitions (Ts), and 14164 SNPs were putative

RI

transversions (Tv) with a mean Ts: Tv ratio of 2.73 (Fig. 7). The SNPs were then

SC

categorized into 4 classes, including class 1 (C/A, A/C, T/G and G/T) for 11.44%,

NU

class 2 (C/T, G/A, T/C and A/G) for 73.25%, class3 (C/G and G/C) for 6.84%, and

MA

class 4 (A/T and T/A) for 8.47%.

3.6. Validation of the RNA-seq analysis by qRT-PCR

ED

To validate the DEGs in FPKM values, ten DEGs were randomly selected for

EP T

qRT-PCR assay in hepatopancreas of these three crosses. The results were further compared with the FPKM values generated from RNA-seq. Our results showed that

AC C

the data of qRT-PCR were consistent with those of RNA-seq (Fig. 8).

4. Discussion

Hybridization is widely used to improve growth rate or other economic traits of animals and plants (Lafarga-De la Cruz et al., 2013; Zheng et al., 2017; Chen et al., 2018; Wang et al., 2019; Zheng et al., 2019). In recent years, many studies have focused on exploring the molecular basis of heterosis in hybrid species by

14

ACCEPTED MANUSCRIPT

transcriptome analysis (Zhang et al., 2008; Gao et al., 2013; Zhai et al., 2013; Sun et al., 2016a, b). In the present study, comparative transcriptomic analysis was performed on hepatopancreas of the backcross BC and their parents MA and CA. Among the annotated transcripts, 1197 DEGs (325 and 872 DEGs in ‘BC vs. MA’ and

PT

‘BC vs. CA’, respectively) were identified, and 134 overlapping DEGs were

RI

confirmed. By analyzing these 134 overlapping DEGs and their pathway, many

SC

growth superiority-related genes were found to be diff erentially expressed in BC

NU

compared with MA and CA. Based on the function of these DEGs and their location in the pathway, a overview map of the generation of growth superiority was produced

MA

(Fig. 9).

ED

4.1. Gene expression and activity of digestive enzyme

EP T

The growth and development of fish depends on nutrients, especially amino acids and fatty acids. High-quality digestive juices can break down food efficiently,

AC C

improving the feed utilization of fish (Wang and Hartsuck, 1993; Jayaraman et al., 2006). In the present study, we found that the expression of four protein digestive enzyme genes, such as TRY, ELA1, CTRL1 and CPA2, and one fat digestive enzyme gene BAL was higher in the BC than that of their parents (Fig. 9). Furthermore, we found that the activity of most protease and lipase also increased significantly in BC, which was consistent with digestive enzyme genes expression above. Some studies showed that the growth performance of fish was related to digestive enzyme activity

15

ACCEPTED MANUSCRIPT

(Salze et al., 2012; Lu et al., 2015). With increased activity of digestive enzymes, large molecules of protein and fat can be broken down more efficiently into small molecules of amino acids and fatty acids, which can provide more raw materials for

PT

protein/fatty acid synthesis.

RI

4.2. IGF system and protein synthesis

SC

The growth performance of vertebrates or fish is primarily effected by the

NU

growing axis (Duan 1997; Duan 1998). In this axis, IGF system is involved in cell regeneration, proliferation and differentiation (Stewart and Rotwein 1996; Mahardini

MA

et al., 2018). In the present study, four DEGs associated with IGF system were found between BC and their parents (Fig. 9). Among them, the expression of IGF1 and

ED

IGF2a in the hepatopancreas of BC was significantly up-regulated than that of their

EP T

parents, their main function was to amplify and activate downstream pathways (such as protein synthesis pathway mentioned in next paragraph) (Sun et al., 2016a).

AC C

However, IGFBP1 exhibited a significantly decreased expression in the BC compared with their parents. This might indicate that reduced IGFBP1 protein allow IGFs to be released from the IGFBP-IGFs conjugates, therefore, more separated IGFs could combine with IGFR on cell surface and exert their role, which is similar to previous study in Scophthalmus maximu (Hu et al., 2012). On the contrary, the expression of the IGFBP2b mRNA in the BC was up-regulated significantly, the same results were also found in hybrid grouper (Sun et al., 2016a).

16

ACCEPTED MANUSCRIPT

In present study, besides the increased expression of IGF1 and IGF2a, the genes PI3KR, RAPTOR and EIF4E in the protein synthesis pathway also had higher expression in the BC, indicating the strengthened ability of protein synthesis in the BC. This results are consistent with previous study of hybrid grouper (Sun et al.,

PT

2016a). Combined with the above paragraph, many phosphorylation events, including

RI

PI3K/AKT pathway, are activated by the binding of IGF-1 to its receptor IGFR1 (Stitt

SC

et al., 2004), thus leading to a series of anabolic effects, such as protein synthesis

NU

(Burgos and Cant, 2010) and glycogen synthesis (KallooHosein et al., 1997). When stimulated by IGF-1, the PI3K/AKT signal activates mTOR (Bibollet-Bahena and

MA

Almazan, 2009), mTORC1 is then formed by the combination of Raptor with mTOR, which can phosphorylate EIF4E-binding protein 1 (EIF4E-BP1) (Ma and Blenis,

ED

2009). The phosphorylated EIF4E-BP1 can promote the dissociation of EIF4E from

EP T

the inhibitory complex of EIF4E and EIF4E-BP1 (Kimball et al., 1999). Finally, the

AC C

increased available EIF4E improves the synthesis of protein (Kimball et al., 1999).

4.3. Fatty acid synthesis The expression of many genes involved in the fatty acid synthesis pathway, such as CS, MDH, FAS, ELOVL1, ELOVL5 and ELOVL6 was up-regulated in the hepatopancreas of backcross BC compared with those in their parents (Fig. 9). These up-regulated genes may lead to faster rate of fatty acid synthesis and their specific roles in fatty acid synthesis pathway were explained as follows. As the direct raw

17

ACCEPTED MANUSCRIPT

material for fatty acid synthesis, acetyl CoA can bind with oxaloacetate to generate citric acid by the catalysis of citrate synthetase (CS), and enter the cytoplasm from mitochondrion (Verma et al., 2018; Xu et al., 2017). Citric acid will be cleaved into acetyl CoA and oxaloacetate in cytoplasm. Oxaloacetate then has to enter the

PT

mitochondrion by the catalysis of MDH for the next cycle of combining acetyl CoA.

RI

In cytoplasm, acetyl CoA can turn into malonyl Co A by catalysis of acetyl CoA

SC

carboxylase (ACC), fatty acid synthetase (FAS) then catalyzes the transformation of

NU

both acetyl CoA and malonyl CoA into fatty acid (16C) (Smith et al., 2003; Leng et al., 2012). Additionally, the ELOVLs conduct the extension of very long chain fatty

MA

acid (18C or 20C or 24C) in endoplasmic reticulum (ER) to play a variety of

EP T

4.4. SSRs and SNPs

ED

biological roles (Morais et al., 2009; Gregory et al., 2010; Green and Olson, 2011).

By transcriptome analysis, a large number of SSRs and SNPs have been identified

AC C

in BC and their parents. We found that SSRs with 10 repeats and SNPs of transition (Ts) were the most numerous, respectively (Li et al., 2018). However, the number of SSRs and SNPs in BC was significantly higher than that of their parents, this might be because the backcross BC genome fused the genomes of both parents. This SSRs and SNPs will provide a helpful resource for marker-assisted selection (MAS), quantitative trait locus (QTL) association and population genetics analysis (Li et al., 2018).

18

ACCEPTED MANUSCRIPT

5. Conclusion In the present study, we performed a hepatopancreas transcriptome analysis for a backcross BC and their parents to explore the molecular mechanisms of growth

PT

superiority. The results revealed that the digestive enzymes activity and the synthesis

RI

of proteins/fatty acids might be important factors for growth superiority of BC.

SC

Moreover, a large number of SSRs were identified in hepatopancreas transcriptome,

NU

which would be beneficial to QTL analysis and genetic linkage. Overall, these

Acknowledgments

ED

growth superiority of hybrid fish.

MA

findings provide new insights into molecular and physiological basis underlying the

EP T

This work was supported by grants from the National Natural Sc ience Foundation of China (31572220), Project funded by China Postdoctoral Science Foundation

AC C

(2019M651473), and the Shanghai University Knowledge Service Platform (ZF1206).

19

SC

RI

PT

ACCEPTED MANUSCRIPT

NU

References Anders, S., Huber, W., 2010. Differential expression analysis for sequence count data. Genome

MA

Biol. 11, R106.

Bi, Y.M., Meyer, A., Downs, G.S., Shi, X.J., El-Kereamy, A., Lukens, L., Rothstein, S.J., 2014. High throughput RNA sequencing of a hybrid maize and its parents shows different mechanisms responsive to nitrogen limitation. Bmc Genomics 15, 77.

ED

Bibollet-Bahena, O., Almazan, G., 2009. IGF-1-stimulated protein synthesis in oligodendrocyte progenitors requires PI3K/mTOR/Akt and MEK/ERK pathways. J. Neurochem. 109,

EP T

1440-1451. Burgos, S.A., Cant, J.P., 2010. IGF-1 stimulates protein synthesis by enhanced signaling through mTORC1 in bovine mammary epithelial cells. Domest. Anim. Endocrin. 38, 211-221. Chen, J., Luo, M., Li, S.N., Tao, M., Ye, X.L., Duan, W., Zhang, C., Qin, Q.B., Xiao, J., Liu, S.J., 2018. A comparative study of distant hybridization in plants and animals. Sci. China Life Sci.

AC C

61, 285-309.

Conesa, A., Gotz, S., Garcia-Gomez, J.M., Terol, J., Talon, M., Robles, M., 2005. Blast2GO: a universal tool for annotation, visualization and analys is in functional genomics research. Bioinformatics 21, 3674-3676. Duan, C.M., 1997. The insulin-like growth factor system and its biological actions in fish. Am. Zool. 37, 491-503. Duan, C.M., 1998. Nutritional and developmental regulation of insulin-like growth factors in fish. J. Nutr .128, 306s-314s. Faircloth, B.C., 2008. Msatcommander: detection of microsatellite repeat arrays and automated, locus-specific primer design. Mol. Ecol. Resour. 8, 92-94. Firth, S.M., Baxter, R.C., 2002. Cellular actions of the insulin-like growth factor binding proteins. Endocr. Rev. 23, 824-854. Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., Lin, J.Y.,

20

ACCEPTED MANUSCRIPT Minguez, P., Bork, P., Von Mering C., Jensen, L.J., 2013. STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res. 41, D808-D815. Gao, Y., Zhang, H., Gao, Q., Wang, L., Zhang, F., Siva, V.S., Zhou, Z., Song, L., Zhang, S., 2013. Transcriptome analysis of artificial hybrid pufferfish Jiyan-1 and its parental species: implications for pufferfish heterosis. PLoS One 8, e58453. Green, C.D., Olson, L.K., 2011. Modulation of palmitate-induced endoplasmic reticulum stress and apoptosis in pancreatic beta-cells by stearoyl-CoA desaturase and Elovl6. Am. J. Physiol.

PT

Endocrinol. Metab. 300, E640-649. Gregory, M.K., See, V.H., Gibson, R.A., Schuller, K.A., 2010. Cloning and functional characterisation of a fatty acyl elongase from southern bluefin tuna (Thunnus maccoyii).

RI

Comp. Biochem. Phys. B 155, 178-185.

Guo, H.H., Zheng, G.D., Wu, C.B., Chen, J., Jiang, X.Y., Zou, S.M., 2018. Comparative analys is

SC

of the growth performance and intermuscular bone traits in F 1 hybrids of black bream (Megalobrama terminalis) (♀) × topmouth culter (Culter alburnus) (♂). Aquaculture 492, 15-23.

NU

He, Z.L., Liu, S.J., Xiao J., Hu, F.Z., Wen, M., Ye, L.H., Zhang, C., Xu, K., Tao, M., 2014. Muscle nutrients of the backcross progeny of female diploid F-1 hybrid (blunt snout bream ×

MA

topmouth culter) × male blunt snout bream and its parents. J. Fish. China 38, 1786-1792. Hu, J., Wen, H.S., Guan, J., Guan, S.G., He, F., Li, J.F., Shi, D., Ma, R.J., Liu, M., Mu, W.J., Zhang, Y.Q., 2012. Cloning of IGFBP-1, -2 and expression analys is during adult and early developmental stages in Scophthalmus maximus. Acta Oceanol. Sin. 34, 139-146.

ED

Hwa V, Oh Y, Rosenfeld RG (1999) The insulin-like growth factor-binding protein (IGFBP) superfamily. Endocr. Rev. 20, 761-787.

EP T

Jayaraman, G., Srimathi, S., Bjarnason, J.B., 2006. Conformation and stability of elastase from Atlantic cod, Gadus morhua. BBA-Gen Subjects. 1760, 47-54. Jia, Y. Y., Gu, Z.M., Ye, J.Y., Yang, Y.J., Zhu, J.J., Huang, X.M., 2011. Analys is on genetic variations of Erythroculter ilishaeformis (♀) × Megalobrama amblycephala (♂) hybrids F1 by SRAP markers. Journal of Shanghai Ocean University 20, 198-203.

AC C

Jiang, W.P., Jia, Y. Y., Liu, S.L., Li, Q., Li, T., Gu, Z.M., 2016. Comparative analysis of intermuscular bones in hybrid F1, F2 of (C. alburnus) (♀) × (M. amblycephala) (♂) and its parents. Acta Hydrobiologica Sinica 40, 277-286. Kalloohosein, H.E., Whitehead, J.P., Soos, M., Tavare, J.M., Siddle, K., Orahilly, S., 1997. Differential signaling to glycogen synthesis by the intracellular domain of the insulin versus the insulin-like growth factor-1 receptor. Evidence from studies of TrkC-chimeras. J. Biol. Chem. 272, 24325-24332. Kimball, S.R., Shantz, L.M., Horetsky, R.L., Jefferson, L.S., 1999. Leucine regulates translation of specific mRNAs in L6 myoblasts through mTOR-mediated changes in availability of eIF4E and phosphorylation of ribosomal protein S6. J. Biol. Chem. 274, 11647-11652. Koboldt, D.C., Chen, K., Wylie, T., Larson, D.E., McLellan, M.D., Mardis, E.R., Weinstock, G.M., Wilson, R.K., Ding, L., 2009. VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25, 2283-2285.

21

ACCEPTED MANUSCRIPT Lafarga-De, La., Cruz, F., Nunez-Acuna, G., Gallardo-Escarate, C., 2013. Hybridization between Haliotis rufescens and Haliotis discus hannai: evaluation of fertilization, larval development, growth and thermal tolerance. Aquac. Res. 44, 1206-1220. Leng, X.J., Wu, X.F., Tian, J., Li, X.Q., Guan, L., Weng, D.C., 2012. Molecular cloning of fatty acid synthase from grass carp (Ctenopharyngodon idella) and the regulation of its expression by dietary fat level. Aquacult. Nutr. 18, 551-558. Li, B., Dewey, C.N., 2011. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. Bmc Bioinformatics 12, 323.

PT

Li, H., 2011. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987-2993.

RI

Li, X.F., Jiang, Y.Y., Liu, W.B., Ge, X.P., 2012. Protein-sparing effect of dietary lipid in practical diets for blunt snout bream (Megalobrama amblycephala) fingerlings : effects on digestive

SC

and metabolic responses. Fish Physiol. Biochem. 38, 529-541.

Li, S.W., Wang, D., Cao, Y.S., Zhang, Y., Liu, H.B., Lu, T.Y., 2018. Transcriptome profile of amur sturgeon (Acipenser schrenckii) liver provides insights into immune modulation in response

NU

to Yersinia ruckeri infection. Aquaculture 492, 137-146.

Livak, K.J., Schmittgen, T.D., 2001. Analys is of relative gene expression data using real-time

MA

quantitative PCR and the 2(T)(-Delta Delta C) method. Methods 25, 402-408. Lu, Z.J., Xie, S.L., Wang, C., Zhou, A.G., Zou, J.X., 2015. Comparative study on the growing and digestion indexes of Snakehead with nine mating groups. Feed Industry 36, 25-30. Ma, X.M., Blenis, J., 2009. Molecular mechanisms of mTOR-mediated translational control. Nat.

ED

Rev. Mol. Cell Biol. 10, 307-318.

Mahardini, A., Yamauchi, C., Takeuchi, Y., Rizky, D., Takekata, H., Takemura, A., 2018. Changes

EP T

in mRNA abundance of insulin-like growth factors in the brain and liver of a tropical damselfish, Chrysiptera cyanea, in relation to seasonal and food-manipulated reproduction. Gen. Comp. Endocr. 269, 112-121. Morais, S., Monroig, O., Zheng, X., Leaver, M.J., Tocher, D.R., 2009. Highly unsaturated fatty acid synthesis in Atlantic salmon: characterization of ELOVL5- and ELOVL2-like elongases.

AC C

Mar. Biotechnol. 11, 627-639.

Moro, G.V., Camilo, R.Y., Moraes, G., Fracalossi, D.M., 2010. Dietary non-protein energy sources: growth, digestive enzyme activities and nutrient utilization by the catfish jundia, Rhamdia quelen. Aquac. Res. 41, 394-400. Ozsolak, F., Milos, P.M., 2011. RNA sequencing: advances, challenges and opportunities. Nat. Rev. Genet. 12, 87-98. Salze, G., Mclean, E., Craig, S.R., 2012. Dietary taurine enhances growth and digestive enzyme activities in larval cobia. Aquaculture 362-363, 44-49. Smith, S., Witkowski, A., Joshi, A.K., 2003. Structural and functional organization of the animal fatty acid synthase. Prog. Lipid Res. 42, 289-317. Stewart, C. E., Rotwein, P., 1996. Growth, differentiation, and survival: multiple physiological functions for insulin-like growth factors. Physiol. Rev. 76,1005-1026. Stitt, T.N., Drujan, D., Clarke, B. A., Panaro. F., Timofeyva, Y., Kline, W.O., Gonzalez, M.,

22

ACCEPTED MANUSCRIPT Yancopoulos, G.D., Glass, D.J., 2004. The IGF-1/PI3K/Akt pathway prevents expression of muscle atrophy-induced ubiquitin ligases by inhibiting FOXO transcription factors. Mol. Cell 14, 395-403. Sun, Y., Guo, C. Y., Wang, D. D., Li, X. F., Zhang, Y. H., You, X. X., Shi, Q., Hu, G. J., Fang, C., Lin, H.R., Zhang, Y., 2016a. Transcriptome analysis reveals the molecular mechanisms underlying growth superiority in a novel grouper hybrid (Epinephelus fuscogutatus♀ × E. lanceolatus♂). BMC Genetics 17, 24. Sun, Y., Huang, Y., Hu, G.J., Zhang, X.H., Ruan, Z.Q., Zhao, X.M., Guo, C.Y., Tang, Z.J., Li, X.F.,

PT

You, X.X., Lin, H.R., Zhang, Y., Shi, Q., 2016b. Comparative transcriptomic study of muscle provides new insights into the growth superiority of a novel grouper hybrid. Plos One 11, e0168802.

RI

Verma, E., Chakraborty, S., Tiwari, B., Mishra, A.K., 2018. Transcriptional regulation of acetyl CoA and lipid synthesis by PII protein in Synechococcus PCC 7942. J. Basic Microb. 58,

SC

187-197.

Wang, C.S., Hartsuck, J.A., 1993. Bile salt-activated lipase. A multiple function lipolyt ic enzyme. Biochim. Biophys. Acta. 1166, 1-19.

NU

Wang, S., Tang, C.C, Tao, M., Qin, Q.B., Zhang, C., Luo, K.K., Zhao, R.R., Wang, J., Ren, L., Xiao, J., Hu, F.Z., Zhou, R., Duan, W., Liu, S.J., 2019. Establishment and application of

MA

distant hybridization technology in fish. Sci. China Life Sci. 62, 22-45. Wang, Z., Gerstein, M., Snyder, M., 2009. RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 10, 57-63.

Xiao, J., Kang, X.W., Xie, L.H., Qin, Q.B., He, Z.L., Hu, F.Z., Zhang, C., Zhao, R.R., Wang, J.,

ED

Luo, K.K., Liu, Y., Liu, S.J., 2014. The fertility of the hybrid lineage derived from female Megalobrama amblycephala × male Culter alburnus. Anim. Reprod. Sci. 151, 61-70.

EP T

Xie, C., Mao, X.Z., Huang, J.J., Ding, Y., Wu, J.M., Dong, S., Kong, L., Gao, G., Li, C.Y., Wei, L.P., 2011. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 39, W316-W322. Xu, W.N., Qian, Y., Li, X.F., Li, J.Y., Li, P.F., Cai, D.S., Liu, W.B., 2017. Effects of dietary biotin on growth performance and fatty acids metabolism in blunt snout bream, Megalobrama

AC C

amblycephala fed with different lipid levels diets. Aquaculture 479, 790-797. Zhai, R.R., Feng, Y., Wang, H.M., Zhan, X.D., Shen, X.H., Wu, W.M., Zhang, Y.X., Chen, D.B., Dai, G.X., Yang, Z.L., Cao, L.Y., Cheng, S.H., 2013. Transcriptome analys is of rice root heterosis by RNA-Seq. Bmc Genomics 14, 19. Zhang, C., Lin, C., Fu, F., Zhong, X., Peng, B., Yan, H., Zhang, J., Zhang, W., Wang, P., Ding, X., Zhang, W., Ding, X., Zhang, W., Zhao, L., 2017. Comparative transcriptome analysis of flower heterosis in two soybean F1 hybrids by RNA-seq. PloS one 12, e0181061. Zhang, H. Y., He, H., Chen, L.B., Li, L., Liang, M.Z., Wang, X.F., Liu, X.G., He, G.M., Chen, R.S., Ma, L.G., Deng, X.W., 2008. A genome-wide transcription analysis reveals a close correlation of promoter INDEL polymorphism and heterotic gene expression in rice hybrids. Mol. Plant 1, 720-731. Zheng, G.D., Guo, D.D., Wu, C.B., Chen, J., Jiang, X.Y., Zou, S.M., 2019. The obvious heterosis and genetic characters of intergeneric cross and backcross juveniles between blunt snout

23

ACCEPTED MANUSCRIPT bream (Megalobrama amblycephala) and topmouth culter (Culter alburnus). Aquac. Res. 50, 1634-1643. Zheng, G.D., Wang, C.L., Guo, D.D., Jiang, X.Y., Zou, S.M., 2017. Ploidy level and performance in meiotic gynogenetic offsprings of grass carp using UV-irradiated blunt snout bream sperm. Aquaculture and Fisheries 2, 213-219. Zheng, G.D., Zhang, Q.Q., Li, F.G., Chen, J., Jiang, X.Y., Zou, S.M., 2015. Genetic characteris tics and growth performance of different Megalobrama amblycephala (♀) × Erythroculter

NU

SC

RI

PT

ilishaeformis (♂) hybrids. J. Fish. Sci. China 22, 402-409.

MA

Figure legends Fig. 1. Landscape of unigene distribution in the backcross BC. (A) E-value distribution of unigenes searched against public databases with an E-value cut-off of 1E-5. (B) Identity distribution of unigenes searched against public databases with an E-value cut-off of 1E-5. (C) Unigenes conserved in BC and model species. Unigenes of the BC were characterized by species by using BLASTX.

ED

searching against public databases. The number of BC homologous genes identified in other

EP T

Fig. 2. DEGs analysis and volcano plot for ‘BC vs. MA’ (A) and ‘BC vs. CA’ (B). The x-axis is the value of Log2 (Fold Change), and the y-axis is the value of Log2 (p-value). The red and yellow dots reveal the up-regulated DEGs, the blue and light blue dots reveal the down-regulated DEGs. MA, CA and BC denote M. amblycephala, C. alburnus and their backcrossing offsprings,

AC C

respectively.

Fig. 3. DEGs and the overlaps of DEGs between ‘BC vs. MA’ and ‘BC vs. CA’. Fig. 4. GO enrichment of DEGs in hepatopancreas of BC and their parents. (A) GO enrichment of DEGs of ‘BC vs. MA’. (B) GO enrichment of DEGs of ‘BC vs. CA’. Asterisk showed the most enriched GO terms in each category. Fig. 5. Hierarchical cluster analysis of DEGs involved in the growth superiority. The color key represents FPKM normalized log2 transformed counts in hepatopancreas of three fish species. Changes in expression levels are shown using color scales with saturation at > 2-fold changes. Red and green gradients indicate a increase and decrease of DEGs, respectively.

24

ACCEPTED MANUSCRIPT Fig. 6. A summary of the SSRs identified from the MA, CA and BC. Fig. 7. Distribution of putative single nucleotide polymorphisms (SNP) in the MA, CA and BC. Fig. 8. Validation of RNA-seq data of ten DEGs by qRT-PCR. β-actin was used as an internal control and each value represents average of three separate biological replicates. Fig. 9. The overview map of molecular mechanisms underlying growth superiority of the BC.

PT

Blue arrows denote the DEGs of ‘BC vs. MA’. Red arrows denote the DEGs ‘BC vs. CA’. Up or down arrows stand for up- or down-regulation in the BC compared with their parent. CM: cell membrane; MM: mitochondrial membrane; ERM: endoplasmic reticulum membrane; ACC: acetyl

RI

CoA carboxylase; oaa: oxaloacetate.

SC

Supplementary Information Supplementary Fig. 1. Length distribution of transcripts and unigenes in BC (A, D), MA (B, E)

NU

and CA (C, F) respectively.

Supplementary Fig. 2. COG functional classification of unigenes in BC. A total of 7,406

MA

assembled unigenes were annotated and assigned to 25 functional categories. Supplementary Fig. 3. KEGG pathway enrichment analysis of the DEGs of ‘BC vs. MA’. The x-axis is the significant enriched KEGG pathway classification and y-axis is P-value of

ED

enrichment.

enrichment.

EP T

Supplementary Fig. 4. KEGG pathway enrichment analysis of the DEGs of ‘BC vs. CA’. The x-axis is the significant enriched KEGG pathway classification and y-axis is P-value of

Supplementary Table 1. COG, KOG and NOG functional classification of unigenes in BC, MA

AC C

and CA, respectively.

25

ACCEPTED MANUSCRIPT Table 1. Genes and specific primers used for quantitative real-time PCR. Forward primer sequence (5’-3’)

Reverse primer sequence (5’-3’)

TRY

GGGTGTCTGATGACCTTAGTG

TGTCTGCTGCTCATTGCT

ELA1

TCTCCTCCTCCTTCAGGTTATG

CTTGTGGAGGCAGCCTTATT

CPA2

GCTTTCACTCACACCAATAACC

CCAGCATCCCAGTTCCTATT

BAL

GAGGAGATCGCAAAGAAGGTAG

ATCCACATTCCCAGCAAGAG

IGF1

CAAGAGAGGAGTTTGCAGTGA

GCAAGGGTTCCATCTGGTATAA

IGFBP2b

GCTGACAGAGGGCTAGATTTG

CTGCAGGCAAACTTGTGTTTAT

RAPTOR

CTGCCTCACTACACCAATCAA

GCCTGGGATCTTCTCAATCAA

EIF4E

CAGATGGGCTCTCTGGTATTTC

AGTTTACTGGGCTGCTGTATG

FAS

AGGTGTCCGAGAATGGAAATC

ELOVL1

CCGGCGTACAGTACAAAGAA

AGACCCAAGGTTGAAGGATTAC

β-actin

CGTGCTGTTTTCCCTTCCATT

CAATACCGTGCTCAAAGGATACTT

SC

RI

PT

Gene name

AC C

EP T

ED

MA

NU

GTCTGCAGGCATTGGTTTATTC

26

ACCEPTED MANUSCRIPT

Table 2. Growth performance and digestive enzyme activities (U g-1 tissue protein) Group

IW

FW

SGR

WGR

HSI

CF

Protease

Lipase

BC

6.50±0.35

50.70±3.47a

2.28a

680.00a

1.48±0.17ab

2.10±0.02a

31.60±1.34a

22.18±2.02a

MA

6.50±0.34

44.81±3.24b

2.14ab

589.38b

1.68±3.24a

1.86±0.01ab

23.50±1.10b

20.16±1.50b

CA

6.50±0.62

28.25±2.02c

1.63c

334.61c

1.07±0.11b

1.11±0.03c

25.43±1.15b

18.60±2.10b

PT

Notes: IW (g): initial body weight, FW (g): final body weight, Weight gain rate (WGR, %) = (FW − IW)×100 / IW, Specific growth rate (SGR, %) = (lnFW − lnIW)

RI

×100/number of culture days, Hepatosomatic index (HSI, %) = 100×hepatopancreas

SC

weight/body weight, Condition factor (CF, %) = 100×(body weight, g)/(body length, cm)3 . Digestive enzyme activities were expressed as micromoles of hydrolyzed

NU

substrate min-1 g-1 tissue protein (U g-1 tissue protein). Means in the same column with

AC C

EP T

ED

MA

different superscripts are significantly different (P < 0.05).

27

ACCEPTED MANUSCRIPT Table 3. Characterization of raw/clean data and mapping rate Total raw

# of clean

Total clean

Number of reads

Mapping

reads

read size (bp)

reads

read size (bp)

mapped to assembly

ratea (%)

BC

50,064,120

7,559,682,120

49,063,830

7,316,679,170

40,859,480

83.28

MA

56,209,354

8,487,612,454

55,225,738

8,244,390,148

49,405,514

89.46

CA

52,814,668

7,975,014,868

51,725,452

7,714,644,875

Average

53,029,381

8,007,436,481

52,005,007

7,758,571,398

Total

159,088,142

24,022,309,442

156,015,020 23,275,714,193

RI

SC

46,230,046

89.38

45,498,347

87.37

136,495,040

——

Reads with ≤ 5 base mismatches were counted when mapped to the reference

NU

a

PT

# of raw

EP T

ED

MA

sequences.

AC C

Sample

28

ACCEPTED MANUSCRIPT Table 4. Summary of cDNA sequences of the BC and their parents

MA

CA

BC

MA

CA

Total sequence No.

77102

63367

67087

62248

55182

57366

Total length (bp)

71136472

52919414

56482685

50596979

41110483

42828640

Largest length (bp)

16621

15752

15127

16621

15752

15127

Smallest length (bp)

201

201

201

201

201

201

Average length (bp)

923

835

842

813

745

747

N50

1799

1524

1601

1581

1288

1349

GC%

45.43

45.69

45.57

45.34

45.48

45.36

AC C

EP T

ED

MA

NU

SC

BC

PT

Unigenes

RI

Transcripts

29

ACCEPTED MANUSCRIPT

Gene name

BC-FPKM

MA-FPKM

CA-FKPM

Digestive

TRY

4614.88±65.81

411.07±6.83

1658.76±35.34

enzyme

ELA1

1327.78±25.27

145.72±5.47

400.73±11.26

CTRL1

564.14±18.29

6.51±0.84

125.65±11.31

CPA2

92.67±5.87

5.03±0.18

20.96±2.45

BAL

425.73±20.28

20.12±2.37

61.82±8.46

IGF1

28.02±1.05

2.73±0.84

3.11±0.93

IGF2a

82.29±5.86

27.62±3.63

13.79±2.86

IGFBP1

28.59±1.04

137.95±3.86

241.57±10.81

IGFBP2b

57.74±3.17

11.62±0.89

9.01±1.04

Protein

PI3KR

56.03±5.61

5.46±1.15

6.23±0.86

synthesis

RAPTOR

182.19±25.88

61.33±6.43

90.33±8.89

EIF4E

152.65±12.31

62.36±9.39

13.88±2.84

Fatty acid

CS

101.67±16.81

19.13±1.87

27.50±1.03

synthesis

MDH

979.76±32.12

404.63±15.23 469.19±21.13

1379.37±59.11

658.22±37.27 82.96±6.82

ELOVL1

33.86±5.45

14.14±4.72

6.65±1.37

ELOVL5

283.73±27.80

73.77±9.17

83.06±8.82

1213.94±55.88

604.20±30.28 35.71±3.91

AC C

EP T

FAS

ELOVL6

RI

SC

NU

ED

GH/IGF axis

PT

Function

MA

Table 5. FPKMs of digestive enzyme, IGF system and protein/fatty acid synthesis related genes in BC, MA and CA.

30

ACCEPTED MANUSCRIPT Highlights  Intergeneric backcross BC with growth superiority and enhanced digestive enzyme activity were obtained between Megalobrama amblycephala and Culter alburnus.

PT

 134 differentially expressed genes (DEGs) were identified between BC and

RI

their parents in hepatopancreas by transcriptome analysis.

SC

 The DEGs related to the IGF system, digestive enzyme, protein/fatty acid

AC C

EP T

ED

MA

NU

synthesis may contribute to growth advantage of backcross BC.

31