Accepted Manuscript Integrative analysis of mRNA and lncRNA profiles identified pathogenetic lncRNAs in esophageal squamous cell carcinoma
Weiwei Wang, Chengguo Wei, Pan Li, Li Wang, Wencai Li, Kuisheng Chen, Jianying Zhang, Weijia Zhang, Guozhong Jiang PII: DOI: Reference:
S0378-1119(18)30310-X doi:10.1016/j.gene.2018.03.066 GENE 42688
To appear in:
Gene
Received date: Revised date: Accepted date:
25 January 2018 2 March 2018 20 March 2018
Please cite this article as: Weiwei Wang, Chengguo Wei, Pan Li, Li Wang, Wencai Li, Kuisheng Chen, Jianying Zhang, Weijia Zhang, Guozhong Jiang , Integrative analysis of mRNA and lncRNA profiles identified pathogenetic lncRNAs in esophageal squamous cell carcinoma. The address for the corresponding author was captured as affiliation for all authors. Please check if appropriate. Gene(2017), doi:10.1016/j.gene.2018.03.066
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT Integrative analysis of mRNA and lncRNA profiles identified pathogenetic lncRNAs in esophageal squamous cell carcinoma Weiwei Wanga,b,c,#, Chengguo Weid,#, Pan Lia,b,c, Li Wanga,b,c, Wencai Lia,b,c, Kuisheng Chena,b,c, Jianying Zhange, Weijia Zhangd,*, Guozhong Jianga,b,c,* a. Department of Pathology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou
IP
T
450052, China
b. Department of Pathology, School of Basic Medicine, Zhengzhou University, Zhengzhou
CR
450002, China
US
c. Henan Key Laboratory for Tumor Pathology, Zhengzhou 450052, China. d. Divisions of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai,
AN
New York, New York.
e. Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, Henan,
M
China, 450052.
ED
# These authors contributed equally to this work. * Corresponding Authors: Dr. Guozhong Jiang, Department of Pathology, The First Affiliated
PT
Hospital of Zhengzhou University, No. 1, Jianshe East Road, Zhengzhou 450052, Henan Province, China, E-mail:
[email protected], Tel: +86-0371-67966157; Dr. Weijia
CE
Zhang, Divisions of Nephrology, Department of Medicine, Icahn School of Medicine at Mount
AC
Sinai, 1468 Madison Ave, Annenberge 23-16, New York NY 10029, E-mail:
[email protected], Tel: +1-212-241-2883 Abstract
Systems biology approaches can help understand pathogenesis of complex human diseases like cancers for identification of potential new therapeutic targets. Here in this study, we performed genome-wide screening for mRNA and lncRNA profiles in esophageal cancer to identify the novel cancer-related mRNA and lncRNA in esophageal squamous cell carcinoma (ESCC). We identified 1260 up-regulated/1445 down-regulated mRNAs and 402 up-regulated/741 downregulated lncRNAs. Further functional analysis revealed differentially expressed genes (DEGs) of
ACCEPTED MANUSCRIPT mRNA and lncRNA are related to different pathways. mRNA pathways are mainly involved in cell cycles while lncRNA pathways are for regulation and metabolic procession. Differentially expressed mRNAs/lncRNAs were validated with qPCR. At last, mRNA and lncRNA coexpression network were built and highly-connected networks were identified, which may provide a mechanism of mRNA expression regulation by lncRNA. In together, we used next generation sequencing data to explore the co-expression networks of lncRNA and mRNA, which
T
may elucidate the functions and mechanisms of lncRNA in ESCC.
IP
Keywords genome-wide screening; esophageal squamous cell carcinoma; mRNA; lncRNA
CR
1. Introduction
Esophageal cancer (EC) is one of the most commonly diagnosed cancer type in esophagus
US
and ranks as the sixth most common cause of death from cancer with an estimated 400,000 deaths (4.9% of the total worldwide)(Ferlay, Soerjomataram et al. 2015). EC comprises of two different
AN
histopathological forms: esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EAC). The underlying molecular mechanism of these two forms is very distinct.
M
As lack of specific symptoms and effective early diagnostic methods, esophageal cancer tends to be diagnosed at a late stage and its 5-year survival rate remains poor, ranging from 10 to 25 %
ED
(Su, Hu et al. 2011, Ohashi, Miyamoto et al. 2015). In China, ESCC is the predominant form of tumor arising from esophageal epithelial cells. Therefore, it is important to better understand the
PT
molecular mechanisms of ESCC tumorigenesis and screen biomarkers for the improvement of early diagnosis or treatment of ESCC in Chinese population. In past decades, expression profiling of coding genes and microRNAs have defined important signaling pathways involved in
CE
tumorigenesis (Su, Hu et al. 2011, Chen, Li et al. 2014, Song, Li et al. 2014). In recent 3 to 5 years, long non coding RNAs, like lncRNAs and circular RNAs have been gradually reported to
AC
drive many important cancer phenotypes by multiple ways(Beermann, Piccoli et al. 2016, Bhan, Soleimani et al. 2017, Han, Li et al. 2017, Yang, Gao et al. 2018). Thus, the latest knowledge of actively transcribed long noncoding RNAs from high-throughput sequencing reveals even greater complexity about cancer genome regulatory networks. Systems biology approaches can help understand pathogenesis and identify potential new targets for complex diseases such as ESCC. Many transcriptome analysis and files for ESCC(Ito, Shimada et al. 2008, Ma, Bao et al. 2012, Sawada, Niida et al. 2016) have been made available. Long noncoding RNAs (lncRNAs) have been reported to drive many important cancer phenotypes by multiple ways, including epigenetic modification, transcription regulation, RNA
ACCEPTED MANUSCRIPT decay, miRNA sponging and so on (Evans, Feng et al. 2016, Schmitt and Chang 2016). Several groups have identified ESCC-associated lncRNAs (Li, Wu et al. 2013, Li, Zheng et al. 2014, Huang, Chen et al. 2015, Wu, Hu et al. 2017). However the molecular mechanism of ESCC onset and development is still unclear despite of the progress from these reports. In this study, we performed genome-wide screening for mRNA and lncRNA profiles in 7 pairs of esophageal cancer and normal tissues to identify the novel cancer-related mRNA and
T
lncRNA in ESCC. Then we build a co-expression network for all of these mRNA and lncRNA to
IP
build the relationship between mRNA and lncRNA. To our knowledge, this is the first study to
CR
explore the association networks for mRNA and lncRNA in ESCC and the central networks could be therapeutic targets for ESCC treatments.
US
2. Materials and methods
AN
2.1. Study population and tissue samples
The research protocol was approved by the ethics committee of Zhengzhou University’s Institutional Ethical Review Boards, and all patients had signed informed consents before tissue
M
collection. Primary tumors and adjacent non-neoplastic tissues were obtained from 7 patients with ESCC who underwent surgical treatment at the First Affiliated Hospital of Zhengzhou University
ED
(Henan, China) during August 2016. Tissues were frozen in liquid nitrogen immediately after surgical resection. None of the patients had prior chemotherapy or radiotherapy, nor did they have
PT
any other serious diseases. All tissues from ESCC patients were histopathologically diagnosed by at least two independent senior pathologists. The clinicopathological characteristics of the
CE
patients are summarized in Table 1.
2.2. RNA Isolation for RNA Sequencing
AC
Total RNA from tissues was extracted using the Trizol reagent (Ambion, Catalog No.15596026, USA) according to the manufacturer’s protocol. RNA concentrations were quantified using a Nano-drop Spectrophotometer at a wavelength of 260 nm. RNA samples were analyzed by Bioanalyzer at a concentration of 100–200 ng/μl to verify the concentration and the purity of samples. Only the samples with RNA integrity values of >7.0 were used for mRNA and lncRNA sequencing at the Genomic Core Facility at Novogene Co., Ltd (Beijing, China). 2.3. Bioinformatics Analysis of RNA Sequencing (RNA-seq) Data
ACCEPTED MANUSCRIPT The RNA-seq data were analyzed by following the procedure described below. Briefly, after sequence quality filtering at a cutoff of a minimum quality score Q20 in at least 90% bases, the good-quality reads aligned to Reads were processed and aligned to the University of California Santa Cruz (UCSC) human reference genome and transcriptome (build hg19) using the BurrowsWheeler Aligner (BWA) (Li and Durbin 2009). The reads that are uniquely aligned to the exon and splicing junction sites for each transcript were combined to calculate an expression level for a
T
corresponding transcript and further normalized based on reads per kilobase per million reads in
IP
order to compare transcription levels among samples. The transcripts with a low raw read count <100 in all the samples were excluded for downstream analysis. Gene expression value was
CR
transformed to the log 2 base scale.
Principle Component Analysis (PCA) was firstly performed to assess the sample correlations
US
using the expression data of all the genes. The differentially expressed genes for mRNA and lncRNA in tumor compared with normal were identified by the R package and we selected the
AN
genes based on limma test adjusted P<0.05 and 1.5-fold change. The gene ontology (GO) and pathway analysis for the differentially expressed genes were then performed with fold change
M
cutoff of ≥1.5 using INGENUITY© IPA (www.ingenuity.com/products/ipa) and the online tool Enrichr. The read coverage of gene functional elements was also visualized by the Integrative
ED
Genome Viewer tool (www.broadinstitute.org/igv/) from the genome alignment file. Heatmap was used to visualize the top 50 differentially expressed genes using the limma test after the
PT
median center was transformed using Multi-Experiment Viewer software (Saeed, Sharov et al. 2003).
CE
2.4. Validation by Reverse Transcription-Polymerase Chain Reaction (RT-PCR) Total RNA was reverse-transcribed into complementary DNA (cDNA) using TIANScript II
AC
cDNA kit (Tiangen biotech, Beijing, China) according to the manufactory protocol. PCR analysis was performed on additional matched ESCC and adjacent non neoplastic tissues for selected mRNAs and lncRNAs. The housekeeping gene glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was used as internal control. Primers for RT-PCR were designed by using Primer-Blast (NCBI) and synthesized at Sangon Biotech Co., Ltd (Shanghai, China). The primer sequences for PCR used in this study were listed as follows: MMP1, forward 5'GATGAAGCAGCCCAGATGTG-3' and reverse 5'-GCTTGACCCTCAGAGACCTT3'; SOX11, forward 5'-CTGCAGATCAAACAGGAGCC-3' and reverse 5'TGAAGCTGTAGTAGAGGCGG-3'; LYPD2, forward 5'-CACCTGCACCACCAACGAAA-3' and
ACCEPTED MANUSCRIPT reverse 5'-CCACATCCGAGGGCTTACACTT-3'; MUC5B, forward 5'AGAGCGGGGACTACATCAAG-3' and reverse 5'-CGTTGTGGGCATAGAACTCG3'; LINC01518, forward 5'-CAGTGACGGAACAGTACCAG-3' and reverse 5'TCACAAACATCCCGCTCT-3'; LINC01614, forward 5'CTCTTTGGAGACTTCAATGTTCCTT-3' and reverse 5'CCTGTCATTGTATAGCCAGGAAATC-3'; PGM5-AS1, forward 5'-
T
GACTATGTTGTGAGCCTGCG-3' and reverse 5'-AAAAGGGGAGGGGCAATACA-
IP
3'; LINC01626, forward 5'-TTCTCCCCATCATCCCAGTG-3' and reverse 5'-
CACATCTCTCTTGGCGCAAA-3'; GAPDH, forward 5'-TTGGTATCGTGGAAGGACTCA-3'
CR
and reverse 5'-CAGTAGAGGCAGGGATGATGT-3'. GAPDH was used as internal control. The thermocycle conditions are as follows: initial denaturation at 95℃ for 10 minutes, followed by 95℃
US
for 20 seconds, 57℃ for 30 seconds, and 72℃ for 30 seconds for 27-29 cycles. Final extension was at 72℃ for 10 minutes. The amplicons were resolved in 1% agarose gel.
AN
2.5. Statistical analysis
All statistical analyses were performed on SPSS 16.0. Measurement data are presented as
M
mean ± standard deviation (SD). Statistical analyses were performed with Student’s t-test (two-
ED
tailed) and one-way ANOVA as appropriate. P<0.05 was considered as significant.
PT
3. Results
3.1. mRNA and lncRNA expression profile in ESCC
CE
We obtained tumor and adjacent normal tissues from ESCC patients for mRNA and lncRNA sequencing (RNA-seq). The principle component analysis (PCA) and the heatmap of
AC
differentially expressed genes (DEGs) are shown in Fig. 1. From the PCA, normal and cancer samples fell into distinct clusters, indicating that samples within each biologic group have more similarity. We identified 1260 up-regulated and 1445 down-regulated mRNAs (with a cutoff padj<0.05 & FC>2) and 402 up-regulated and 741 down-regulated lncRNAs (with cutoff padj<0.05). All DEGs for mRNA and lncRNA in tumor tissue are listed in Supplemental files (files 1 and 2). Heatmaps to show top 50 DEGs for mRNA (Fig. 1B, upper) and lncRNA (Fig. 1B, lower). Within the differently express genes, we identified some mRNAs (MMP1, SOX11, GST1, LYDP2, MUC5B, EN1, TMRRSS11, CRISP3 and CRNN) and some lncRNAs (LINC01518, LINC01614, PGM5-AS1 and LINC01626) are differentially expressed in ESCC tumors, some of
ACCEPTED MANUSCRIPT which were reported in other cancers but not in ESCC (Liu, Kato et al. 2012, Beltran, Graves et al. 2014, Dai, Li et al. 2017). 3.2. Validation of mRNA and lncRNA by RT-PCR We then found some of our upregulated lncRNAs such as CASC9, HOTAIR, and FOXD2AS1 (Fig. 1B and Supplemental file 2) are well-known tumor-associated lncRNA in ESCC and
T
other malignant tumor (Pan, Mao et al. 2016, Guo, Peng et al. 2017), which give us confidence
IP
for our sequencing results. Then further validate our DEGs, we selected 2 up-regulated lncRNAs (lnc01518, lnc01614), 2 down-regulated lncRNAs (PGM5-AS1, lnc01626), 2 up-regulated
CR
mRNAs (MMP1, SOX11), and 2 down-regulated mRNAs (LYDP2, MUC5B) to detect their expression in more independent ESCC and normal tissues by RT-PCR (Fig. 1C). The results
US
coincided with the RNA-sequencing results well, confirming the reliability of our RNAsequencing results.
AN
3.3. Enrichment analysis for differentially expressed mRNA and lncRNA Enrichment analysis (GO & Pathway) were performed to reveal the functions of hundreds of
M
differentially expressed mRNAs and lncRNAs. Firstly, we analyzed the differentially expressed mRNAs in ESCC tissues and conducted gene ontology analysis (Fig. 2 and 3). The gene ontology
ED
for the up-regulated mRNA functions are mitotic skeletal system development, cell cycle, nuclear division, down-regulated mRNA are involved in epithelial cell differentiation, porphyrin
PT
catabolic process, oxidation reduction (Fig. 2A, Supplemental files 3 and 4). For the mRNA DEGs, we found the most significant Kyoto Encyclopedia of Genes and Genomes (KEGG)
CE
pathways are cell cycle control of chromosomal replication, Hepatic Fibrosis / Hepatic Stellate Cell Activation, Dendritic Cell Maturation (Fig. 3A, Supplemental file 5). Interestingly, the
AC
functions of differentially expressed lncRNA are different. The gene ontology terms for upregulated lncRNA are folic acid biosynthetic process, positive regulation of G2/M transition of mitotic cell cycle involved in cellular response to nitrogen starvation, positive regulation of cellcell adhesion mediated by integrin. The down-regulated lncRNA are mainly involved in regulation of glycolysis involved in cellular glucose homeostasis, leukotriene metabolic process (Fig. 2B, Supplemental files 6 and 7). The pathways for lncRNA are Adenosine Nucleotides Degradation II, RAN Signaling, and Methylglyoxal Degradation III (Fig. 3B, Supplemental file 8). To our knowledge, this is the first time of transcriptome data revealing the mRNA and lncRNA expression profiling for ESCC, which will provide valuable information for researchers studying ESCC. Our results showed that the different functions between DEG mRNA and
ACCEPTED MANUSCRIPT lncRNA is that the lncRNA are mainly involved in regulation and metabolic procession (DNA, protein synthesis), however, the mRNA DEGs are for cell cycle, which might implicate that lncRNA are up-stream regulators for mRNA DEGs transcription in ESCC, and lncRNA may play key roles in the procession of the cancer development. 3.4. Establishment of correlation for mRNA and lncRNA in ESCC tumor tissues
T
The gene co-expression network between lncRNAs and mRNAs in ESCC tissues can reveal
IP
the relationship between mRNAs and lncRNA. So we performed the correlation analysis for mRNA and lncRNA with their expression values in ESCC tumor tissues and build the mRNA and
CR
lncRNA association networks (Fig. 4, Supplemental file 9). Many interesting interactions between lncRNA and mRNA were revealed, such as NBAT1 was reported up-regulated in
US
neuroblastomas (Pandey, Mitra et al. 2014). Several of the c-MYC–regulator lncRNAs (CCAT1L, CARLo-5, PCAT1, PRNCR1, CCAT2, PVT1) are localized to the same genomic region and
AN
co-amplified with c-MYC (Huarte 2015). In our study, we identified many lncRNA-mRNA coexpression interactions, by which lncRNA might regulate mRNA expression (Fig. 4). From the top co-expression networks in figure 4, we can see lncRNA FAM197Y5, TCONS_I2_00006579
M
and MIR548H2 are the cores of the networks, and they are associated with many mRNAs, like MMP2 (Xu, Wang et al. 2005), ANTXR1 (Chen, Bhat-Nakshatri et al. 2013), SPARC (Tai and
ED
Tang 2008), which are reported in previous studies playing roles in cancer. This finding indicated that the expression of mRNAs might be regulated by lncRNAs, which implied the prominent
PT
function of lncRNAs in ESCC carcinogenesis.
CE
4. Discussion
ESCC is one of the most fatal cancers worldwide, but the etiology remains poorly
AC
understood. To explore the pathogenesis of ESCC and find novel biomarker for ESCC, we conducted a NGS analysis to screen the whole transcript expression profile in ESCC. Our results reveal that both mRNA and lncRNA play a vital role in pathogenesis of ESCC. Intriguingly, we have identified an aberrantly overexpressed lncRNA in ESCC tissues, and we build the interaction network for mRNA and lncRNA. Life and disease are complex system that involves interactions among many factors of various levels of regulation, including mRNA, lncRNA and so on. Recent advances in various forms of omics technologies have generated a huge amount of data. Using the transcriptome data, we found lncRNA and mRNA DEGs are involved in different terms of GO biological process. The
ACCEPTED MANUSCRIPT mRNA DEGs are for cell cycle, which might implicate that lncRNA are up-stream regulators for mRNA DEGs transcription in ESCC, and lncRNA may play key roles in the procession of the cancer development. There are many studies of mRNA or lncRNA data to explore the pathogenesis of ESCC. Several groups have reported the aberrant lncRNA expression profile in ESCC and identified hundreds of ESCC-associated lncRNAs (Cao, Wu et al. 2013, Pan, Mao et al. 2016, Yao, Huang et al. 2016), some of which could be used as biomarkers for diagnosis or
T
prognosis of ESCC. However, all of these studies are from a single prospect, either lncRNA, or
IP
mRNA. Here in this study, we collected total RNA and performed RNA-seq for the mRNA and
CR
lncRNA for integrative analysis to build the interactions between both types of data. Many interesting interactions between lncRNA and mRNA were found, one of the novel interesting lncRNA PGM5 has very strong associating with many mRNA in ESCC tumor tissues
US
(Supplemental file 7). We found all the associated genes are functionally involved in ubiquitinprotein ligase activity involved in mitotic cell cycle and tumor necrosis factor-mediated signaling
AN
pathway(the genes are ITAG9, A2M, MYH11, PREX2) (Pastuszak-Lewandoska, Kordiak et al. 2016, Douet-Guilbert, Chauveau et al. 2017, Srijakotre, Man et al. 2017, Tanaka, Kohashi et al.
M
2017). Therefore the lncRNAs we identified may play vitriol roles in tumor development and merit further study on how they regulate the cancer-related gene expression.
ED
5. Conclusions
PT
In this study, the novel ESCC-related mRNA and lncRNA were identified by genome-wide screening for mRNA and lncRNA profiles in 7 pairs of esophageal cancer and normal tissues. And co-expression network for all of these mRNA and lncRNA were built to explore the
CE
relationship between mRNA and lncRNA, which may provide a mechanism of mRNA expression
AC
regulation by lncRNA.
ACCEPTED MANUSCRIPT Conflict of interest statement None of the authors have any relevant conflicts of interests to declare. Acknowledgements This study was supported by grant from National Natural Science Foundation of China (No. 81272371), and the Major Project of Science and Technology in Henan Province
T
(No.161100311400), and the Key Research Projects of Henan Higher Education (No.18A310033),
IP
and the Youth Innovation Fund of The First Affiliated Hospital of Zhengzhou University to
AC
CE
PT
ED
M
AN
US
CR
Weiwei Wang & Pan Li.
ACCEPTED MANUSCRIPT Legends: Table 1. Clinicopathological characteristics of 7 ESCC patient. Fig. 1. PCA analysis between tumor and normal tissues from ESCC patients, heatmaps showing top 50 DEGs for mRNA and lncRNA data and validation by RT-PCR. (A) The principle component analysis (PCA) was performed for transcriptomic data. Genes with
T
the highest loadings in the first three principal components were plotted in 3D visualization. PCA
IP
revealed that the tumor and normal samples form distinct clusters, indicating that samples within
CR
each biologic group have more similarity. Each symbol represents each biologic sample (black circles, normal tissues; red circles, tumor tissues). (B) Heatmaps to show top 50 differentially expressed genes (DEGs) for mRNA (upper) and lncRNA (lower). (C) Validation of 8 selected
US
differentially expressed mRNA and lncRNA, all of the RT-PCR-validated RNAs showed the
AN
same fold change tendencies as those in the RNA sequencing results (**P < 0.01). Fig. 2. GO terms for mRNA and lncRNA DEGs comparing tumor vs. normal tissues. (A) GO terms for mRNA DEGs comparing tumor vs. normal tissues. (B) GO terms for lncRNA
ED
regulated DEGs (lower, green color).
M
DEGs comparing tumor vs. normal tissues. Up-regulated DEGs (upper, red color) and down-
Fig. 3. KEGG Pathways of enrichment analysis for differentially expressed mRNA and
PT
lncRNA
(A) KEGG pathways for the differentially expressed mRNA. (B) KEGG pathways for the
CE
differentially expressed lncRNA.
AC
Fig. 4. Network of lncRNAs associated mRNAs. lncRNA are labeled as red rectangle, mRNA are labeled as green circle, positive association were indicated as red lines and negative association were indicated as green lines.
ACCEPTED MANUSCRIPT References
AC
CE
PT
ED
M
AN
US
CR
IP
T
Beermann, J., M. T. Piccoli, J. Viereck and T. Thum (2016). "Non-coding RNAs in Development and Disease: Background, Mechanisms, and Therapeutic Approaches." Physiol Rev 96(4): 1297-1325. Beltran, A. S., L. M. Graves and P. Blancafort (2014). "Novel role of Engrailed 1 as a prosurvival transcription factor in basal-like breast cancer and engineering of interference peptides block its oncogenic function." Oncogene 33(39): 4767-4777. Bhan, A., M. Soleimani and S. S. Mandal (2017). "Long Noncoding RNA and Cancer: A New Paradigm." Cancer Res 77(15): 3965-3981. Cao, W., W. Wu, F. Shi, X. Chen, L. Wu, K. Yang, F. Tian, M. Zhu, G. Chen, W. Wang, F. G. Biddle and J. Gu (2013). "Integrated analysis of long noncoding RNA and coding RNA expression in esophageal squamous cell carcinoma." Int J Genomics 2013: 480534. Chen, D., P. Bhat-Nakshatri, C. Goswami, S. Badve and H. Nakshatri (2013). "ANTXR1, a stem cellenriched functional biomarker, connects collagen signaling to cancer stem-like cells and metastasis in breast cancer." Cancer Res 73(18): 5821-5833. Chen, Z., J. Li, L. Tian, C. Zhou, Y. Gao, F. Zhou, S. Shi, X. Feng, N. Sun, R. Yao, K. Shao, N. Li, B. Qiu, F. Tan and J. He (2014). "MiRNA expression profile reveals a prognostic signature for esophageal squamous cell carcinoma." Cancer Lett 350(1-2): 34-42. Dai, D. N., Y. Li, B. Chen, Y. Du, S. B. Li, S. X. Lu, Z. P. Zhao, A. J. Zhou, N. Xue, T. L. Xia, M. S. Zeng, Q. Zhong and W. D. Wei (2017). "Elevated expression of CST1 promotes breast cancer progression and predicts a poor prognosis." J Mol Med (Berl) 95(8): 873-886. Douet-Guilbert, N., A. Chauveau, N. Gueganic, G. Guillerm, C. Tous, M. J. Le Bris, A. Basinko, F. Morel, V. Ugo and M. De Braekeleer (2017). "Acute myeloid leukaemia (FAB AML-M4Eo) with cryptic insertion of cbfb resulting in cbfb-Myh11 fusion." Hematol Oncol 35(3): 385-389. Evans, J. R., F. Y. Feng and A. M. Chinnaiyan (2016). "The bright side of dark matter: lncRNAs in cancer." J Clin Invest 126(8): 2775-2782. Ferlay, J., I. Soerjomataram, R. Dikshit, S. Eser, C. Mathers, M. Rebelo, D. M. Parkin, D. Forman and F. Bray (2015). "Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012." Int J Cancer 136(5): E359-386. Guo, L., Y. Peng, Y. Meng, Y. Liu, S. Yang, H. Jin and Q. Li (2017). "Expression profiles analysis reveals an integrated miRNA-lncRNA signature to predict survival in ovarian cancer patients with wild-type BRCA1/2." Oncotarget 8(40): 68483-68492. Han, D., J. Li, H. Wang, X. Su, J. Hou, Y. Gu, C. Qian, Y. Lin, X. Liu, M. Huang, N. Li, W. Zhou, Y. Yu and X. Cao (2017). "Circular RNA circMTO1 acts as the sponge of microRNA-9 to suppress hepatocellular carcinoma progression." Hepatology 66(4): 1151-1164. Huang, M. D., W. M. Chen, F. Z. Qi, M. Sun, T. P. Xu, P. Ma and Y. Q. Shu (2015). "Long noncoding RNA TUG1 is up-regulated in hepatocellular carcinoma and promotes cell growth and apoptosis by epigenetically silencing of KLF2." Mol Cancer 14: 165. Huarte, M. (2015). "The emerging role of lncRNAs in cancer." Nat Med 21(11): 1253-1261. Ito, T., Y. Shimada, T. Kan, S. David, Y. Cheng, Y. Mori, R. Agarwal, B. Paun, Z. Jin, A. Olaru, J. P. Hamilton, J. Yang, J. M. Abraham, S. J. Meltzer and F. Sato (2008). "Pituitary tumortransforming 1 increases cell motility and promotes lymph node metastasis in esophageal squamous cell carcinoma." Cancer Res 68(9): 3214-3224. Li, H. and R. Durbin (2009). "Fast and accurate short read alignment with Burrows-Wheeler transform." Bioinformatics 25(14): 1754-1760.
ACCEPTED MANUSCRIPT
AC
CE
PT
ED
M
AN
US
CR
IP
T
Li, W., J. Zheng, J. Deng, Y. You, H. Wu, N. Li, J. Lu and Y. Zhou (2014). "Increased levels of the long intergenic non-protein coding RNA POU3F3 promote DNA methylation in esophageal squamous cell carcinoma cells." Gastroenterology 146(7): 1714-1726 e1715. Li, X., Z. Wu, Q. Mei, X. Li, M. Guo, X. Fu and W. Han (2013). "Long non-coding RNA HOTAIR, a driver of malignancy, predicts negative prognosis and exhibits oncogenic activity in oesophageal squamous cell carcinoma." Br J Cancer 109(8): 2266-2278. Liu, H., Y. Kato, S. A. Erzinger, G. M. Kiriakova, Y. Qian, D. Palmieri, P. S. Steeg and J. E. Price (2012). "The role of MMP-1 in breast cancer growth and metastasis to the brain in a xenograft model." BMC Cancer 12: 583. Ma, S., J. Y. J. Bao, P. S. Kwan, Y. P. Chan, C. M. Tong, L. Fu, N. Zhang, A. H. Y. Tong, Y. R. Qin, S. W. Tsao, K. W. Chan, S. Lok and X. Y. Guan (2012). "Identification of PTK6, via RNA sequencing analysis, as a suppressor of esophageal squamous cell carcinoma." Gastroenterology 143(3): 675-686 e612. Ohashi, S., S. Miyamoto, O. Kikuchi, T. Goto, Y. Amanuma and M. Muto (2015). "Recent Advances From Basic and Clinical Studies of Esophageal Squamous Cell Carcinoma." Gastroenterology 149(7): 1700-1715. Pan, Z., W. Mao, Y. Bao, M. Zhang, X. Su and X. Xu (2016). "The long noncoding RNA CASC9 regulates migration and invasion in esophageal cancer." Cancer Med 5(9): 2442-2447. Pandey, G. K., S. Mitra, S. Subhash, F. Hertwig, M. Kanduri, K. Mishra, S. Fransson, A. Ganeshram, T. Mondal, S. Bandaru, M. Ostensson, L. M. Akyurek, J. Abrahamsson, S. Pfeifer, E. Larsson, L. Shi, Z. Peng, M. Fischer, T. Martinsson, F. Hedborg, P. Kogner and C. Kanduri (2014). "The risk-associated long noncoding RNA NBAT-1 controls neuroblastoma progression by regulating cell proliferation and neuronal differentiation." Cancer Cell 26(5): 722-737. Pastuszak-Lewandoska, D., J. Kordiak, A. Antczak, M. Migdalska-Sek, K. H. Czarnecka, P. Gorski, E. Nawrot, J. M. Kiszalkiewicz, D. Domanska-Senderowska and E. Brzezianska-Lasota (2016). "Expression level and methylation status of three tumor suppressor genes, DLEC1, ITGA9 and MLH1, in non-small cell lung cancer." Med Oncol 33(7): 75. Saeed, A. I., V. Sharov, J. White, J. Li, W. Liang, N. Bhagabati, J. Braisted, M. Klapa, T. Currier, M. Thiagarajan, A. Sturn, M. Snuffin, A. Rezantsev, D. Popov, A. Ryltsov, E. Kostukovich, I. Borisovsky, Z. Liu, A. Vinsavich, V. Trush and J. Quackenbush (2003). "TM4: a free, opensource system for microarray data management and analysis." Biotechniques 34(2): 374-378. Sawada, G., A. Niida, R. Uchi, H. Hirata, T. Shimamura, Y. Suzuki, Y. Shiraishi, K. Chiba, S. Imoto, Y. Takahashi, T. Iwaya, T. Sudo, T. Hayashi, H. Takai, Y. Kawasaki, T. Matsukawa, H. Eguchi, K. Sugimachi, F. Tanaka, H. Suzuki, K. Yamamoto, H. Ishii, M. Shimizu, H. Yamazaki, M. Yamazaki, Y. Tachimori, Y. Kajiyama, S. Natsugoe, H. Fujita, K. Mafune, Y. Tanaka, D. P. Kelsell, C. A. Scott, S. Tsuji, S. Yachida, T. Shibata, S. Sugano, Y. Doki, T. Akiyama, H. Aburatani, S. Ogawa, S. Miyano, M. Mori and K. Mimori (2016). "Genomic Landscape of Esophageal Squamous Cell Carcinoma in a Japanese Population." Gastroenterology 150(5): 1171-1182. Schmitt, A. M. and H. Y. Chang (2016). "Long Noncoding RNAs in Cancer Pathways." Cancer Cell 29(4): 452-463. Song, Y., L. Li, Y. Ou, Z. Gao, E. Li, X. Li, W. Zhang, J. Wang, L. Xu, Y. Zhou, X. Ma, L. Liu, Z. Zhao, X. Huang, J. Fan, L. Dong, G. Chen, L. Ma, J. Yang, L. Chen, M. He, M. Li, X. Zhuang, K. Huang, K. Qiu, G. Yin, G. Guo, Q. Feng, P. Chen, Z. Wu, J. Wu, L. Ma, J. Zhao, L. Luo, M. Fu, B. Xu, B. Chen, Y. Li, T. Tong, M. Wang, Z. Liu, D. Lin, X. Zhang, H. Yang, J. Wang and Q. Zhan (2014). "Identification of genomic alterations in oesophageal squamous cell cancer." Nature 509(7498): 91-95.
ACCEPTED MANUSCRIPT
AC
CE
PT
ED
M
AN
US
CR
IP
T
Srijakotre, N., J. Man, L. M. Ooms, C. M. Lucato, A. M. Ellisdon and C. A. Mitchell (2017). "P-Rex1 and P-Rex2 RacGEFs and cancer." Biochem Soc Trans 45(4): 963-977. Su, H., N. Hu, H. H. Yang, C. Wang, M. Takikita, Q. H. Wang, C. Giffen, R. Clifford, S. M. Hewitt, J. Z. Shou, A. M. Goldstein, M. P. Lee and P. R. Taylor (2011). "Global gene expression profiling and validation in esophageal squamous cell carcinoma and its association with clinical phenotypes." Clin Cancer Res 17(9): 2955-2966. Tai, I. T. and M. J. Tang (2008). "SPARC in cancer biology: its role in cancer progression and potential for therapy." Drug Resist Updat 11(6): 231-246. Tanaka, M., K. Kohashi, K. Kushitani, M. Yoshida, S. Kurihara, M. Kawashima, Y. Ueda, R. Souzaki, Y. Kinoshita, Y. Oda, Y. Takeshima, E. Hiyama, T. Taguchi and Y. Tanaka (2017). "Inflammatory myofibroblastic tumors of the lung carrying a chimeric A2M-ALK gene: report of 2 infantile cases and review of the differential diagnosis of infantile pulmonary lesions." Hum Pathol 66: 177-182. Wu, Y., L. Hu, Y. Liang, J. Li, K. Wang, X. Chen, H. Meng, X. Guan, K. Yang and Y. Bai (2017). "Upregulation of lncRNA CASC9 promotes esophageal squamous cell carcinoma growth by negatively regulating PDCD4 expression through EZH2." Mol Cancer 16(1): 150. Xu, X., Y. Wang, Z. Chen, M. D. Sternlicht, M. Hidalgo and B. Steffensen (2005). "Matrix metalloproteinase-2 contributes to cancer cell migration on collagen." Cancer Res 65(1): 130-136. Yang, Y., X. Gao, M. Zhang, S. Yan, C. Sun, F. Xiao, N. Huang, X. Yang, K. Zhao, H. Zhou, S. Huang, B. Xie and N. Zhang (2018). "Novel Role of FBXW7 Circular RNA in Repressing Glioma Tumorigenesis." J Natl Cancer Inst 110(3). Yao, J., J. X. Huang, M. Lin, Z. D. Wu, H. Yu, P. C. Wang, J. Ye, P. Chen, J. Wu and G. J. Zhao (2016). "Microarray expression profile analysis of aberrant long non-coding RNAs in esophageal squamous cell carcinoma." Int J Oncol 48(6): 2543-2557.
AC
CE
PT
ED
M
AN
US
CR
IP
T
ACCEPTED MANUSCRIPT
ACCEPTED MANUSCRIPT Abbreviations: DEGs, differentially expressed genes; EC, esophageal cancer; EAC, esophageal adenocarcinoma; ESCC, esophageal squamous cell carcinoma; lncRNA, long noncoding RNA; UCSC, University of California Santa Cruz; BWA, Burrows-Wheeler Aligner; PCA, Principle Component Analysis; GO, gene ontology; RT-PCR, reverse transcription-polymerase chain reaction; GAPDH, glyceraldehyde-3-phosphate dehydrogenase; KEGG, Kyoto Encyclopedia of
AC
CE
PT
ED
M
AN
US
CR
IP
T
Genes and Genomes.
ACCEPTED MANUSCRIPT Highlights
Genome-wide screening for mRNA and lncRNA profiles in esophageal squamous cell carcinoma (ESCC) was performed.
•
3000 up-regulated/2517 down-regulated mRNAs and 402 up-regulated/741 down-regulated lncRNAs were identified
•
mRNA and lncRNA co-expression network were built and highly-connected networks were identified for the first time in ESCC.
AC
CE
PT
ED
M
AN
US
CR
IP
T
•
Figure 1
Figure 2
Figure 3
Figure 4