Genetic Alteration of MicroRNA Affecting Cancer Pathways

Genetic Alteration of MicroRNA Affecting Cancer Pathways

CHAPTER GENETIC ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS 15 Flavia Scoyni1, Vincenzo Bonnici2, Alfredo Pulvirenti3, Rosalba Giugno2,* 1Univ...

819KB Sizes 0 Downloads 62 Views

CHAPTER

GENETIC ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

15

Flavia Scoyni1, Vincenzo Bonnici2, Alfredo Pulvirenti3, Rosalba Giugno2,* 1University

of Eastern Finland, Kuopio, Finland; 2University of Verona, Verona, Italy; 3University of Catania, Catania, Italy

CHAPTER OUTLINE Introduction������������������������������������������������������������������������������������������������������������������������������������������ 270 Biological Evidence of MicroRNA Mutations and Diseases Associations��������������������������������������������������� 271 Single-Nucleotide Polymorphism Affecting MicroRNA Transcription��������������������������������������������������������� 273 Single-Nucleotide Polymorphism Affecting MicroRNA Maturation������������������������������������������������������������ 273 Single Nucleotide Polymorphism Affecting MicroRNA Targeting��������������������������������������������������������������� 275 Computational Resources to Study Phenotypes at System Level��������������������������������������������������������������� 276 Computational Resources to Study the Single-Nucleotide Polymorphisms Effects on Cancer��������������������� 277 Conclusion������������������������������������������������������������������������������������������������������������������������������������������� 282 List of Acronyms and Abbreviations�������������������������������������������������������������������������������������������������������� 282 Glossary����������������������������������������������������������������������������������������������������������������������������������������������� 282 References������������������������������������������������������������������������������������������������������������������������������������������� 283

CHAPTER OUTLINE This chapter starts with a description of microRNA (miRNA or miR) biogenesis and physiological role of miRNAs on diseases. Then, it presents the in vivo heterogeneity of miRNAs and the biological evidence of miRNA mutations and cancer associations reported in literature accurately classified according to the nature of mutations such as single-nucleotide polymorphism (SNP) affecting miRNA Transcription, SNP affecting miRNA Maturation, and SNP affecting miRNA Targeting. The chapter continues with a review of knowledge base-driven pathway analysis to study phenotypes at system level. Such methods could make use of the computational tools and online repositories to collect and study the SNPs effects on diseases such as cancer. Finally, a summary concludes the chapter.

*Senior

author.

Cancer and Noncoding RNAs. http://dx.doi.org/10.1016/B978-0-12-811022-5.00015-2 Copyright © 2018 Elsevier Inc. All rights reserved.

269

270

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

INTRODUCTION MicroRNAs (miRNAs) are endogenous noncoding and small RNA molecules 21–25 nucleotide in length [1]. The physiological role of these RNA molecules consists in regulating gene expression at posttranscriptional level through sequence-specific pairing. Silencing protein complexes are recruited by miRNAs sequence-specific interaction promoting the destabilization of the messenger RNAs (mRNAs) or interfering with translation of the target transcript [2]. In that way, miRNAs can accomplish either silencing or fine-tune regulation of the translation of specific mRNAs. miRNAs control such a great variety of biological processes, including cell proliferation, development, differentiation, and apoptosis [3]. Moreover, dysregulations in miRNAs interactions are associated with the development and progression of complex diseases such as cancer and cancer-related processes [4]. For this reason, miRNAs are identified as a functional class of tumor suppressors or oncogenes. In 2008 Morin et al. described isomiRs as the presence in vivo of different miRNAs originated by the same miRNA transcript [6]. These changes result in a final product different in length and sequence respect to the canonical miRNA. It has been observed a wide range of modification in miRNA precursor molecule (pre-miR) that leads to isomiRs generation; 5′ or 3′ gain or deletion and substitution in the sequence are the most common modification occurring in the pre-miR [7]. Nevertheless, miRNA biogenesis machinery cleavage or exonucleases degradation could produce different miRNA isoform as well [6,8]. Finally, posttranscriptional modification and RNA editing complexes as well as SNP presence in the genomic sequence of the primary miRNA are responsible of variability. When the modification affects the 5′ of the miRNA sequence, it is possible to obtain isomiR with differences in the seed region with respect to the common miRNA sequence. Changes in the seed region are involved in gain of new miRNA functions that could generate a new pool of targets, or a loss of regulation of the natural target [9]. Despite the abundance of isomiR in RNA-seq dataset, their role in cell physiology remains unknown and the biological effect is considered to be strictly related to the difference in the expression within the isomiR population. Since alteration in the 3′ of the miRNA sequence could lead to the production of an isomiR with different out-of-seed complementarity of the target, isomiRs may act in a redundant manner affecting the same pathway of the most common miRNA isoform. In 2011 Cloonan et al. hypothesized that the heterogeneity in the miRNA population could act protecting the miRNA functionality from nonspecific targeting of the pathways [8]. Nevertheless, the variability in the isomiR population represents the attempt of fine-tune regulation in different cell type or cell stages in which a particular isomiRs pattern expression acts as the best suitable one to a specific cell response. Nowadays, computational biologists rely on advanced tools to address difficulties associated with miRNAs’ discovery or target sites’ identification. The prediction of phenotypes, such as diseases, or of responses to therapies widely uses the large amount of genotypic high-dimensional data obtained through next-generation sequencing techniques. High-throughput sequencing and gene profiling techniques are radically transforming medical research, allowing the full monitoring of a biological system. The aim is to analyze the effects of mRNA target SNP on miRNAs’ binding and more generally to investigate the result of mutations in miRNA genes. The recognition operated by the miRNAs is mainly encoded in the seed region (nucleotides 2–7 of the mature miRNAs) that mediates the target identification by sequence complementarity [5]. Therefore, SNPs located in miRNA genes may

  BIOLOGICAL EVIDENCE

271

theoretically affect miRNA biogenesis and impact target association. Nevertheless, SNPs and mutations in miRNAs’ target sites may impair the maturation process or the target selection. Furthermore, alterations of target recognition due to SNPs or mutations in the seed regions create new off-target interactions. Consequently, researchers over the years are studying and discovering the associations of genetic mutations affecting miRNA-target interactions and functional pathways leading to several diseases such as chronic lymphocytic leukemia [10], papillary thyroid carcinoma [11], colorectal cancer [12], breast cancer [13], Parkinson disease [14], rheumatoid arthritis [15], systemic lupus erythematosus [16], Crohn disease [17], and psoriasis [18], among the others.

BIOLOGICAL EVIDENCE OF MICRORNA MUTATIONS AND DISEASES ASSOCIATIONS Excluding miRNA processing complexes mutation, SNPs in different miRNAs’ sites could affect the miRNA expression at different level or leading to an ineffective or unspecific target binding (Fig. 15.1). Analyzing genome-wide distribution of miRNA polymorphism in miRNA loci and their targets, Saunders et al. found that polymorphisms are present mostly in the flanking sequences with respect to occurring within the pre-miR sequence [5]. Nevertheless, evaluating the incidence of SNPs inside the pre-miR sequences, they showed a drop in genetic alteration frequency in the mature miRNA seed region. These data demonstrate that miRNAs’ loci are under selective pressure and that SNPs occurring in the seed region are negatively selected with an occurrence of ≈1.3 SNPs per kilobase (Fig. 15.2A). Since the seed region is functionally active in the specificity and efficiency of miR-mediated posttranscriptional regulation, it is not surprising that the occurrence in mutation is decreased with respect to pre-miR flanking region in which the occurrence is ≈3 SNPs per kilobase. Lu et al. confirmed the

FIGURE 15.1 Selective pressure acting in single nucleotide polymorphism (SNP) conservation. (A) Conservation of SNP in microRNA (miRNA) seed and flanking sites. (B) Conservation of SNP in miRNA-target seed and flanking sites.

272

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

FIGURE 15.2 How single-nucleotide polymorphism (SNP) may affect microRNAs (miRNAs) biogenesis and function. (A) SNP in the genomic DNA hosting the miRNAs gene. (B) Affection of the production of pre-miRNA molecule due to SNP presence. (C) Affection of the production of pri-miRNA molecule due to SNP presence. (D) SNP carrying miRNA is loaded in DICER complex impair guide strand selection or target recognition. (E–G) Target RNA SNP affects the regulation mediated by miRNAs.

presence of selective constraints in miRNA loci, associated to their functional role, showing that SNPs occurrence is reduced in disease-related miRNAs respect of other miRNAs [19]. Recently, new studies analyzed the distribution of SNPs within miRNA target sequences. Although experimental data about targets of a specific miRNA are not always available, bioinformatics tools predict miRNA targets. The confidence of bioinformatics tools for miRNA target prediction consists informally in obtaining the same results with different procedures. The occurrence of SNPs in miRNA targets are predicted by several studies by investigating the negative selective pressure in occurrence of SNPs in seed binding region of miRNA targets [5,20]. Polymorphisms in the miRNA target sites seem to be mostly deleterious and so negatively selected (Fig. 15.3). Besides these evidences, Chen et al. and Saunders et al. showed that it is rarely present, as well a recent positive selection pressure on advantageous SNPs in miRNA target binding sites (Fig. 15.2B) [5,20]. Moreover, SNPs occurring in miRNA binding sites flanking regions are present homogenously inside the population respect of SNPs occurring in the miRNA binding sites that show a high level of differentiation [21]. The presence of SNPs in the target binding regions could possibly lead to a match to the seed of new miRNAs (gain of regulation) and not only just be involved in mismatch with the physiological miRNAs (loss of regulation). However, to be biologically relevant, the mutation should be correlated to the expression of the appropriate miRNAs both in space and in time.

  SINGLE-NUCLEOTIDE POLYMORPHISM

273

FIGURE 15.3 Percentage of single-nucleotide polymorphism (SNP) affecting different microRNA (miRNA) region. Data collected from miRNASNP v.2.0.

SINGLE-NUCLEOTIDE POLYMORPHISM AFFECTING MICRORNA TRANSCRIPTION Due to the difficulty to identify and validate miRNA unique promoter, few examples of experimentally validated SNPs affecting miRNAs promoter are available. Luo et al., in 2011 were able to associate the presence of SNPs in the promoter region of miR-146a with systemic lupus erythematosus (SLE) occurrence [22]. SNP rs57095329 was able to affect the binding of Ets-1 transcription factor leading to a decrease in transcription of miR-146a. Since they identified miR-146a as a negative regulator of the interferon pathway, they linked the abnormal activation of this pathway to the downregulation of the expression of miR-146a due to the presence of SNP rs57095329 in SLE patients. However, miRNA coding genes affect also at epigenetic level. In this case, the presence of SNP is involved in changing of epigenetic regulation of the transcription of the miRNA and so in the cancer development. For example, in the case of breast cancer the silencing of particular miRNAs is a leading event of cancer insurgence [23].

SINGLE-NUCLEOTIDE POLYMORPHISM AFFECTING MICRORNA MATURATION miRNAs’ DNA regions are transcribed in the nucleus as primary transcripts (pri-miRNAs) that is capped, spliced, and polyadenylated [1]. Then the RNase III type DROSHA additionally processes primiRNA in a hairpin-shaped precursor (pre-miRNA). In 2008, Jazdzewski et al. showed that a common G to C polymorphism in the pre-miR-146a sequence leads to a reduction of the amount of the mature

274

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

and the pre-miR generated from the C allele 1.9- and 1.8-fold, respectively, compared with the G allele [24]. The in vitro processing of the C carrying SNP pri-miR146a decreases, displaying an impairment in DROSHA processing of the molecule. Nevertheless, the C allele presence inhibits the interaction of the pre-miR146a with a nuclear factor generating a deficiency of mature miR146a. Clinically, the less efficient inhibition of target genes of miR146a involved in the Toll-like receptor and cytokine signaling pathway (TRAF6, IRAK1), and PTC1 (also known as CCDC6 or H4) leads to a predisposition of the insurgence of papillary thyroid carcinoma (Fig. 15.4). Exportin-5 is responsible to the export of the precursor in the cytoplasm where RNase III type DICER produces a 22-nucleotide-long miRNA duplex. A single strand of the processed duplex (guide strand) is chosen and loaded in the RNA-induced silencing complex (RISC) to obtain a functional miRNA. Interestingly, in 2009 an American group examined SNP occurrence in X-linked miRNAs [25]. They found that an SNP could lead to a production of a completely different miRNAs respect the wild-type. MiR-934-5p T to G variant occurs at the DROSHA processing site; nevertheless, this particular site is located at the 5′-end region that is essential in strand selection into miRISC complex. Northern blot analysis confirmed an impaired length of the mature miRNA, consistent with the impairment in the Drosha maturation step. At the same time, they detected an increased presence of the antisense of the guide strand in the mature miRNA. Usually, the wild-type guide miRNA strand is most likely to be selected and load into RISC complex due to the lower thermodynamic stability of the 5′-end. The U to G transvection produces an even less stable 5′-end and leads to an altered guide strand selection. SNPs could occur at different level in miRNAs transcription and regulation, but it has also demonstrated that miRNAs’ modification at RNA-editing level leads to impairment in molecule functionality. In 2007, Kawahara et al. demonstrated that ADAR protein is able to modify pri-miR-151 editing A to I affecting DICER-mediated miRNA processing [26]. This abnormal editing leads to a complete

FIGURE 15.4 Correlation of miRNA SNP and cancer development.

  Single Nucleotide Polymorphism Affecting MicroRNA Targeting

275

blockage of miR-151 cleavage by Dicer and accumulation of edited pre-miR-151 RNAs. Although the existence of miRNA editing [27–29] is an evolutionally conserved mechanism to allow an assortment in miRNAs and their targets, and it was demonstrated to occur frequently in the cells [30]; the fate of edited pri-miRNAs is mostly unknown.

SINGLE NUCLEOTIDE POLYMORPHISM AFFECTING MICRORNA TARGETING In 2005, Abelson et al. studying the role of the gene SLITRK1 in Tourette syndrome, provided the first evidence that a mutation that impair miRNA-target interaction produced a relevant phenotype [31]. A mutation screening in patients affected by the disease revealed a G to A transition in 3′-UTR region of SLITRK1 gene. The mutation resulted in a durable interaction of the mRNAs with miR-189 and promoted a downregulation of the gene target tougher than physiologically, leading to a loss of function phenotype. SNP rs61764370 is the most studied SNP affecting miRNA binding efficiency and is located in the 3′-UTR region of KRAS gene. G to T transversion in the KRAS 3′-UTR region influences the binding with let-7 miRNA. Since both KRAS and let-7 correlate in cancer development, as oncogene and tumor suppressor miRNAs, SNP rs61764370 may be relevant in cancer. In fact, Chin et al. defined a biological role of this SNP in non–small lung cancer patients [32]. They demonstrate the effectiveness of the SNP in disrupting the miRNA binding site in KRAS 3′-UTR and the consequent overexpression of KRAS oncogene. After this pioneer study, SNP rs61764370 has been associated to the insurgence of many other types of cancer [33] (Fig. 15.5).

FIGURE 15.5 Correlation of microRNA-target SNP and cancer development.

276

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

FIGURE 15.6 Percentage of single-nucleotide polymorphism affecting microRNA target sites in different RNA target.

Recently, it has been demonstrated that miRNA molecules are able to interact with two additional classes of noncoding RNAs: circular RNAs (circRNAs) and long noncoding RNAs (lncRNAs) [34– 37]. mRNAs, lncRNAs, and circRNAs compete for the binding sites of miRNAs and create a complex network of interaction and regulation (Fig. 15.6). For these reason, noncoding RNAs are usually depleted of polymorphisms at miRNAs’ binding sites [24,38–40].

COMPUTATIONAL RESOURCES TO STUDY PHENOTYPES AT SYSTEM LEVEL Great research interest has focused on a class of methods called Knowledge base-driven pathway analysis [41]. Such methods leverage on existing databases, such as the Kyoto Encyclopedia of Gene and Genomes (KEGG) [42] or Pathway Commons [43], to identify those pathways that may be affected by the expression changes in the observed phenotype. Knowledge base-driven pathway analysis techniques can be grouped into three generations of approaches [44]: (1) Over-Representation Analysis (ORA); (2) Functional Class Scoring (FCS); (3) Pathway Topology-based (PT). Within the last class, we can identify two subclasses: the first one which allows to study the effect of a phenotype at the global level; the second, and more recent, which allows to identify subpathways that are specifically related to the phenotype under study. First-generation methods statistically evaluate the number of altered genes in a pathway with respect to the set of all analyzed genes. After filtering the resulting gene set of an expression assessment experiment, ORA strategies typically divide the list of genes according to the pathway each gene belongs to [45–50]. DIANA-miRPath [51], assesses the impact of miRNAs in biological processes by identifying the pathways in which they are significantly involved. The software package performs the functional annotation of one or more miRNAs and allows the identification of subsets of miRNAs, which significantly regulate a collection of pathways. FCS methods compute a gene-level statistic

  COMPUTATIONAL RESOURCES TO STUDY THE SINGLE-NUCLEOTIDE

277

from their expression levels, by means of a statistical approach considering all genes in a pathway [52–54]. FCS methods avoid some of the limitations of the ORA approaches by ranking all genes through their expression level and by considering the dependencies within a pathway. The third class of techniques models a pathway as a graph, considering its topology when computing scores [55]. In Draghici et al. [56], an analytical technique called impact factor (IF) was introduced. The IF is a pathway-level score that takes into account biological factors such as the magnitude of change in genes expression, the type of interactions between genes, and their location in the pathway. The method presented in [56] has been further improved by the SPIA algorithm [57]. More recently, Vaske et al. [58] presented the PARADIGM algorithm, which has been further improved by [59]. PARADIGM is a method to infer patient-specific genetic activity by incorporating information regarding interactions between genes provided in a pathway. In [60], authors developed a new approach, Micrographite, which can integrate pathways with predicted and validated miRNA-target interactions. The method, by performing a topological analysis based on expression profiles, can identify significant gene circuits specific to a phenotype. In [61], authors presented MITHrIL (miRNA enriched pathway impact analysis). The strength of MITHrIL lies in the enrichment of pathways with information regarding miRNAs, posttranscriptional regulatory elements whose consideration is clearly essential to the greater reliability of the results. The method, starting from expression values of genes and/or miRNAs, returns a list of pathways sorted according to the degree of their deregulation, together with the corresponding statistical significance (P-values). To represent the underlying biological phenomena more accurately and to identify putative contextspecific communities related to the pathology under study, the last generation of pathway analysis tools shifted the focus toward subpathways (local areas of the entire biological pathway). In [62], authors introduced Subpathway-GM, which uses a structural node similarity within pathways to identify metabolic subpathways based on information from genes and metabolites. In [63], authors introduced TEAK, a topology enrichment analysis tool based on Bayesian Networks, to extract linear and nonlinear subpathways. In [64], authors propose DEsubs to identify differentially expressed subpathways using RNA-seq data. In [65], authors propose CHRONOS which for the first time introduces temporal subpathway searching built on top of mRNA and miRNA expression data. In a near future, the above extensive work will take more and more advances of computational resources that collect and study the SNPs effects in miRNAs.

COMPUTATIONAL RESOURCES TO STUDY THE SINGLE-NUCLEOTIDE POLYMORPHISMS EFFECTS ON CANCER Large-scale in-silico data analysis [5] on the distribution of polymorphisms in the human miRNA genes have shown that miRNA genes have a lower density of SNPs compared to the flanking regions in the genome. The mature miRNA has a lower density of SNPs compared to the precursor, and finally the seed in the mature miRNA has the lowest SNP density. This reflects the functional importance of the binding site. In [66], authors conducted a global analysis of SNPs in miRNA genes by making use of dbSNP (build 137 for human) [67]. Authors identified 1899 SNPs in 961 pre-miRNAs of human genome. Few underlying reasons of the distribution of SNPs in miRNA genes were then conducted. These include (1) the degree of conservation for one miRNA family and the comparison of the SNP density between miRNA groups with different degree of conservation; (2) the average SNP densities

278

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

between clustered and individual miRNAs; (3) the genomic localization of miRNAs in fragile sites, and then investigated the enrichment of the miRNAs with multiple SNPs in fragile sites; the stability of secondary structures of pre-miRNAs with respect to the SNPs frequencies; the relationships between the SNP densities of miRNAs with the number of diseases that they were associated with, and the number of QTLs (Quantitative Trait Loci) they were overlapped with. Such results are coherent with those obtained by Duan et al. [68], which predicted that the highest effects on minimum free energy (MFE) are caused by SNPs within mature sequences, followed by SNPs in the stem regions and loop domains. Gong et al. showed that 44% of the candidate SNPs cause significant changes in the stem-loop structures and are likely to affect production of mature miRNAs [9]. Furthermore, Gong et al. identified 48 SNPs in the seed regions and used miRanda [69] and TargetScan [70] algorithms to predict the changes in miRNA targets. This study demonstrated that SNPs in seed regions lead to creation and disruption of putative binding sites, and the total numbers of putative targets could be drastically altered by these polymorphisms. For example, an SNP (rs5186) located in the binding site of miR-155 can change the expression of a target gene (AGTR1), which is associated with blood pressure. Few computational resources about the polymorphic miRNA regulation have been developed to allow the systematic study of the above effects. These integrate different features of the miRNA-related genetic variants, and facilitate the research in this field. Due to the abundance of candidate SNPs in 3′UTRs, most of the databases included the information about polymorphic target sites, and utilized different prediction methods to assess their effects on miRNA bindings. Table (Table 15.1) reports the data sources and their main features, listed in what follows. In [71], authors presented the PolymiRTS database 3.0 that establishes links between polymorphisms in miRNA target sites and their possible functional impact in biological processes by including gene pathways for human and mouse from the KEGG database. The system allows to highlight the Table 15.1  List of Currently Maintained Resources System

URL

PolymiRTS 3.0

http://compbio.uthsc. edu/miRSNP/ http://bioinfo.life. hust.edu.cn/miRNASNP2/index.php http://bioinfo.bjmu. edu.cn/mirsnp/ search/ http://gyanxet.com/ hno.html http://mirdsnp.ccr. buffalo.edu/ http://compbio.uthsc. edu/SomamiR/ http://driverdb. tms.cmu.edu.tw/ ym500v3/index.php

miRNASNP 2.0

mirSNP

HNOCDB miRdSNP SomamiR 2.0 YM500v3

Genomic Annotations

Functional Annotations

Phenotype Associations

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

  COMPUTATIONAL RESOURCES TO STUDY THE SINGLE-NUCLEOTIDE

279

genes with polymorphic miRNA target sites in the context of pathways and to study the SNPs falling in the mRNAs corresponding to the miRNA recognition elements. In [72], authors proposed miRNASNP, which characterizes all known miRNA-related SNPs. Authors also and analyzed the SNPs’ effects on target binding alteration and mature miRNA biogenesis both computationally and experimentally. In particular, the miRNASNP database contains five major modules: SNPs in human pre-miRNAs; SNPs in human pre-miRNA flanking regions; SNPs in premiRNAs of other eight species; targets gain/loss by SNPs in miRNA seeds; and targets gain/loss by SNPs in target 3′-UTRs. The marked SNPs in the pre-miRNA stem-loop are shown into the RNA secondary structure. All information is available through a web application. Recently, in [68], the database has been extended with expression data in different tissues, genome-wide association studies of miRNASNPs and experimentally validate miRNA–mRNA interaction. mirSNP, which has been proposed in [73], collects the genomic variations affecting miRNA target sites and that have been associated with complex diseases by GWAS and expression quantitative trait loci (eQTL). The system combines information from dbSNP [74] 135 and miRBase [75] 18 to extract SNP in predicted miRNA target sites. It provides a web interface for querying the database content that also allows to combine user’s additional GWAS and eQTL data to identify putative miRNA-related SNPs from traits/diseases-associated variants. HNOCDB [76] is a comprehensive database regarding genes and miRNAs relevant to head, neck, and oral cancers. It stores mutations, methylations, and polymorphisms of oncogenes/oncomiRs, as well as expression profiles and chromosomal maps for the three types of cancers that represent the sixth most widespread cancer worldwide (as of 2012) with an average 5-year survival rate of around 50% [77]. The database contains 451 genes and 109 miRNAs supported by evidence compiled from Pubmed records. miRdSNP, proposed in [78], provides a manually curated dataset, obtained from literature, of SNPs functionally related to diseases and falling on the 3′-UTRs of human genes. It is intended as a resource for investigating disease-associated SNPs and their spatial relationship with miRNA target sites. The authors provided dSNP-disease associations on several types of diseases and for each SNP provide its specificity to the pathology. For example, authors identified that Breast cancer has the highest number of dSNPs, followed by type 2 diabetes, schizophrenia, rheumatoid arthritis, obesity, and colorectal cancer. The database also incorporates reference sequence (RefSeq) genes, predicted miRNA target sites, and SNP sequence data into a single consolidated resource. The authors filtered the SNP dataset obtained from UCSC Genome browser by indexing the genomic coordinates located on 3′-UTRs of genes; then authors annotated dSNPs using an in-house developed data pipeline which searches for PubMed articles linked to SNPs. By manually reviewing the literature authors identified 630 dSNPs for 204 human diseases from 754 publications. Then for each dSNP authors captured linkage disequilibrium (LD) information using the data provided by the HapMap project [79]. miRNAs target site were obtained using TargetScan [70] and PicTar [80]. To address the low prediction specificity of the miRNA target prediction algorithms, authors incorporate data from four curation databases (TarBase [81], miRTarBase [82], miRecords [83], and miR2disease [84]). An online interactive visualization tool displays SNPs and miRNA target sites’ densities across human chromosomes. SomamiR 2.0 [85] is a database of somatic mutations altering interactions between miRNAs and competing endogenous RNAs (ceRNA) in cancer [86]. The competing endogenous effect is a posttranscriptional activity in which different RNAs (ceRNAs) compete for shared miRNAs (miRNAs), thus regulating each other. A competing effect occurs when one or more miRNA response elements (MREs),

280

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

targeted by the same pool of miRNAs, lie in an RNA transcript. The cross talk between mRNAs, lncRNA, and circRNA mediated by MREs, regulates the relative concentrations of transcripts within the cell, yielding large-scale regulatory networks. Mutations in miRNAs, especially in their seed regions, and miRNA target sites may alter miRNA-ceRNA interactions and dysregulation in ceRNA networks has shown to play important roles in cancer pathogenesis [87]. The database is divided into six components, four of them relate to mutations in miRNA sequences or miRNA target sites and the other two concern genes and pathways associated with such somatic mutations. The system stores 2423 somatic mutations regarding miRNA sequences, including pre-miRNA and mature miRNA locations, 181 of which are in miRNA seeds. Sequence mutations were collected by integrating whole-exome and whole-genome sequences from several data sources, including TCGA (The Cancer Genome Atlas, https://cancergenome.nih.gov). The system also provides a public instance of mir2GO that can be used to analyze the functional impact of sequence mutations in miRNA seed regions. Somatic mutations in experimentally identified miRNA target sites, retrieved by three newly emerging high-throughput technologies, are split into two sections. Experimental data were obtained from starBase and combined with the COSMIC database mutations. In particular, CLASH (cross-linking, ligation, and sequencing of hybrids) data were scanned to identify 4048 mature transcripts affected by somatic mutations in miRNA target regions. PAR-CLIP (photoactivatable-ribonucleoside-enhanced cross-linking and immunoprecipitation) and HITS-CLIP (high-throughput sequencing of RNA isolated by cross-linking immunoprecipitation) data were integrated to identify affected target sites in mRNAs, lncRNAs, and circRNAs, and grouped into 34 distinct experiments. Application of miRNA target site prediction tools, including TargetScan, PITA, and the six seed matches algorithm, were used to provide 23,460 potentially affected mature ceRNA transcripts. The two additional data sections regard biological pathways in KEGG and a list of genes associated with cancer risk obtained by integrating GWAS and CGAS (candidate gene association studies) from the UCSC Table Browser (https://genome.ucsc.edu/cgi-bin/hgTables), the NHGRI GWAS Catalog (https://www. ebi.ac.uk/gwas/), and the Cancer GAMAdb (http://www.hugenavigator.net/CancerGEMKB/caIntegratorStartPage.do). The two collections regard pathways and genes that are impacted by or contains somatic mutations in miRNA target sites (listed within the above data sections), respectively. YM500 [88] is a database built by analyzing next-generation sequencing data regarding small noncoding RNAs (sncRNA). The system collects from expression data analysis results for sncRNA quantification, arm switching discovery, novel sncRNA prediction, and most important for the purpose of this review, isomiR identification that can be integrated to dbSNP information to distinguish isomiRs from SNPs. MirVar [89] collects manually curated variations from public resources and published literature. By making use of dbSNP authors collected genetic variations in human genome and mapped them to premiRNA loci. Authors also collected manually curated variation in miRNA loci from published literature. Furthermore few miRNAs with posttranscriptional modification such as adenosine to inosine RNA editing [90] were also included. mirVar, is also built on top of Leiden Open (source) Variation Database (LOVD) system. The system provides also a computational pipeline to predict potential effects of variations on miRNA biogenesis and its functions. This pipeline makes use of PHDcleav [91] to predict dicer processing sites using Support Vector Machine (SVM) models and RISCbinder [92] to predict the guide strand of miRNAs. Finally, authors analyzed the allele frequencies data of SNPs from the HapMap [79] populations to evaluate the possible effect of the variations in the miRNA and its penetrance in the population.

  COMPUTATIONAL RESOURCES TO STUDY THE SINGLE-NUCLEOTIDE

281

Computational knowledge is often the result of an integration between highly reliable sources and intermediated steps of computational analyses. Pubmed articles and manually curated databases storing for example sequences and their related annotations, such as miRBase, are mined to support computational procedures, like algorithms used in predicting target binding. There exist well-established tools for modeling different aspects regarding miRNAs, here we focus on those aiming at assess functional effects of somatic mutations in miRNAs. mir2GO [93] is one of the most used tools in analyzing miRNA mutations and their functional impact. It is a modular web-based platform on top of which two different workflows, miRmut2GO and miRpair2GO, provide miRNA mutation analysis by means of GO (Gene Ontology) enrichment analysis. miRmut2GO investigates functional effects of genetic and somatic mutations in miRNA seed regions, and miRpair2GO is a functional analysis tool that compares two miRNAs by looking at their target genes. Starting from the fact that the main regulatory signal, namely target miRNA target recognition, is primarily encoded in seed regions, both workflows rely on the same core methodology that executes and enrichment functional analysis on reference and derived target genes, obtained through target prediction algorithms. Finally, a GO graph is used to visualize the functional effects of miRNA seed mutations. In [94], authors proposed miRNA SNiPer, an online tool for the detection of miRNA polymorphisms in vertebrates. The system, which focuses on variants in the miRNA seed, allows to access to a catalog of miRNA seed region polymorphisms (miR-seed-SNPs) consisting of 149 SNPs in six species. Interestingly, authors determined that miR-SNPs were frequently located within the QTL, chromosome fragile sites, and cancer susceptibility loci, indicating their potential role in the genetic control of various complex traits. The ImiRP [95] (Illegitimate miRNA Predictor) computational tool is a system aiming at study possible mutations affecting miRNA-mRNA interactions. A random generator is used to mutate 3′UTR mRNA sequences that are targets of specific miRNAs. The methodology applies a specific mutation strategy such that target site mutations do not inadvertently create new miRNA target sites. Illegitimate mutated target sites are identified by comparing target site predictions between input and mutant sequences on top of the miRBase high confidence knowledge. The creation of unwanted target sites may affect the experimental outcome, in fact, a common approach to validate and probe miRNA–mRNA interactions is to mutate predicted target sites within the mRNA and determine whether it affects miRNA-mediated activity. The software is able to investigate multiple miRNA target sites at once and it can be useful to study miRNA cooperativity in case of multiple mutagenesis. The core of the system is a Java-based application, such that it can be used in user-defined computational pipelines, and a web-based interface is provided, too. The mrSNP [96] software was designed for similar intents, however it does not have the capacity of generate random mutations, that must be provided by the user. In ImiRP, target mutation is a task performed efficiently and with ad hoc strategies. In [97] authors introduced a computational tool to predict SNP pathological effects on miRNAbased gene regulation. It can help to identify SNPs causative to diseases, such as cancer. The system focuses on SNPs that may affect miRNA targeting and thereby cause gene dysregulation. The tool predicts the effects of SNPs in miRNA target sites and uses linkage disequilibrium to map those mirSNPs to SNPs of interest in GWAS. Authors showed that the predictions correspond well to the SNP’s measured effects on miRNA regulation.

282

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

CONCLUSION miRNA genes’ variations may have important effects at different functional levels in miRNAs. These include the transcription of miRNAs, maturation, and targeting. The miRNA transcripts need to meet several prerequisites to be able to express the mature miRNA sequences. The variability in miRNA genes can therefore influence both the expression level of the corresponding mature miRNA and give rise to a differential regulation of target genes. This, may impact biological processes by affecting the posttranscriptional regulation of the target genes and more in general cause biological pathways perturbation. Consequently, SNPs in some miRNAs may lead to various diseases such as cancers. This chapter recounts the biogenesis of miRNAs and they functional roles in normal and pathological conditions. It focuses on genetic alterations and their system level consequences that determine phenotypes. Reader is introduced to both the biomedical descriptions of the phenomena and the computational tools essential for its analysis and characterization.

LIST OF ACRONYMS AND ABBREVIATIONS 3′-UTR 3′-untranslated region ceRNA  Competing endogenous RNA circRNA  circularRNA IRAK1  Interleukin 1 Receptor Associated Kinase 1 isomiR  isomiRNA KRAS  Kirsten rat sarcoma viral oncogene homolog lncRNA  Long noncoding RNA miRNA  MicroRNA mRNA  Messenger RNA pre-miR  Precursor miRNA pri-miR  Primary miRNA PTC1  Patched gene 1 RISC  RNA-induced silencing complex SLITRK1  SLIT and NTRK-like family, member 1 TRAF6  TNF Receptor Associated Factor 6

GLOSSARY Circular RNA (circRNA)  A type of RNA which forms a covalently closed continuous loop. Competing endogenous RNAs (ceRNAs)  Molecular sponges for miRNAs. This type RNAs regulate RNA transcripts by competing for shared targeted miRNAs. Flanking region  A region of a gene preceding or following the transcribed region. These regions primarily function in the regulation of gene transcription. The 5′-flanking region contains the promoter and may contain enhancers or other protein binding sites. isomiR  An RNA sequence that have variations, due to additions, deletions or substitutions, with respect to the reference miRNA sequences. Long noncoding RNA (lncRNA)  Nonprotein coding transcript longer than 200 nucleotides. MiRISC complex  A RISC (RNA-induce silencing complex) that incorporates a miRNAs.

References

283

MicroRNA (miRNA)  A small endogenous noncoding RNA molecules, which sequence is about 22 nucleotides long, that regulates gene expression at posttranscriptional level and functions in RNA silencing. MiRNA gene  A gene from which a miRNA is transcribed. MiRNAs may be transcribed from either introgenic regions, from within introns of other genes such as other noncoding RNA and also from exons of coding genes. miRNA targeting  miRNAs regulate gene expression by targeting a repressor complex to specific messenger RNAs. Pathway  A series of actions among molecules in a cell that leads to a certain product or a change in a cell. Pre-miR  A precursor miRNA is the result of cleaving the hairpin of a pri-miRNA in the nucleus by Drosha (double-strand-specific ribonuclease). It is transported to the cytoplasm via a process involving Exportin-5. A pre-miRNA is further cleaved by Dicer to generate a short RNA in which one strand is the mature miRNA. pri-miR  A primary miRNA transcript directly transcribed in the nucleus from a DNA region. It can be more than 1000 nucleotides long and contains an RNA hairpin in which one of the two strands includes the mature miRNA. Seed  A miRNA conserved heptametrical sequence within 2–7 nucleotides from 5′-end that mediates target recognition through base pairing complementary. Single-nucleotide polymorphism (SNP)  A variation in a single nucleotide of a genetic sequence such that is has an appreciable degree of presence within a given population.

REFERENCES [1] Bartel D. MiRNAs: genomics, biogenesis, mechanism, and function. Cell 2004;116(2):281–97. [2] Bartel D. MiRNAs: target recognition and regulatory functions. Cell 2009;136(2):215–33. [3] Ke X, Liu C, Liu D, Liang C. MiRNAs: key participants in gene regulatory networks. Curr Opin Chem Biol 2003;7(4):516–23. [4] Calin G, Croce C. MiRNA signatures in human cancers. Nat Rev Cancer 2006;6(11):857–66. [5] Saunders M, Liang H, Li W. Human polymorphism at miRNAs and miRNA target sites. Proc Natl Acad Sci USA 2007;104(9):3300–5. http://dx.doi.org/10.1073/pnas.0611347104. [6] Morin R, O’Connor M, Sanchez M, Kuchenbauer F, Delaney A, Prabhu A, et al. Application of massively parallel sequencing to miRNA profiling and discovery in human embryonic stem cells. Genome Res 2008;18(4):610–21. [7] Cammaerts S, Strazisar M, De Rijk P, Del Favero J. Genetic variants in miRNA genes: impact on miRNA expression, function. Disease Front Genet 2015:6. [8] Cloonan N, Wani S, Xu Q, Gu J, Lea K, Heater S, et al. MiRNAs and their isomiRs function cooperatively to target common biological pathways. Genome Biol 2011;12(12):R126. [9] Gong J, Tong Y, Zhang HM, Wang K, Hu T, Shan G, et al. Genome-wide identification of SNPs in miRNA genes and the SNP effects on miRNA target binding and biogenesis. Hum Mutat 2012;33:254–63. http:// dx.doi.org/10.1002/humu.21641. [10] Raveche ES, Salerno E, Scaglione BJ, Manohar V, Abbasi F, Lin Y-C, et al. Abnormal miRNA-16 locus with synteny to human 13q14 linked to CLL in NZB mice. Blood 2007;109:5079–86. http://dx.doi.org/10.1182/ blood-2007-02-071225. [11] He H, Jazdzewski K, Li W, Liyanarachchi S, Nagy R, Volinia S, et al. The role of miRNA genes in papillary thyroid carcinoma. Proc Natl Acad Sci USA 2005;102:19075–80. http://dx.doi.org/10.1073/pnas.0509603102. [12] Landi D, Gemignani F, Naccarati A, Pardini B, Vodicka P, Vodickova L, et al. Polymorphisms within microRNA-binding sites and risk of sporadic colorectal cancer. Carcinogenesis 2008;29:579–84. http://dx.doi. org/10.1093/carcin/bgm304. [13] Song F, Zheng H, Liu B, Wei S, Dai H, Zhang L, et al. An mir-502–binding site single-nucleotide polymorphism in the 3′-untranslated region of the SET8 gene is associated with early age of breast cancer onset. Clin Cancer Res 2009;15:6292–300. http://dx.doi.org/10.1158/1078-0432.CCR-09-0826.

284

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

[14] Wang G, van der Walt JM, Mayhew G, Li Y-J, Züchner S, Scott WK, et al. Variation in the miRNA-433 binding site of FGF20 confers risk for Parkinson disease by overexpression of α-synuclein. Am J Hum Genet 2008;82:283–9. http://dx.doi.org/10.1016/j.ajhg.2007.09.021. [15] Chatzikyriakidou A, Voulgari PV, Georgiou I, Drosos AA. A polymorphism in the 3′-UTR of interleukin-1 receptor-associated kinase (IRAK1), a target gene of miR-146a, is associated with rheumatoid arthritis susceptibility. Joint Bone Spine 2010;77:411–3. http://dx.doi.org/10.1016/j.jbspin.2010.05.013. [16] Consiglio CR, Veit TD, Monticielo OA, Mucenic T, Xavier RM, Brenol JCT, et al. Association of the HLA-G gene +3142C>G polymorphism with systemic lupus erythematosus. Hla 2011;77:540–5. http:// dx.doi.org/10.1111/j.1399-0039.2011.01635.x. [17] Brest P, Lapaquette P, Souidi M, Lebrigand K. A synonymous variant in IRGM alters a binding site for miR196 and causes deregulation of IRGM-dependent xenophagy in Crohn’s disease. Nature 2011;43. [18] Wu L-S, Li F-F, Sun L-D, Li D, Su J, Kuang Y-H, et al. A miRNA-492 binding-site polymorphism in BSG (basigin) confers risk to psoriasis in Central South Chinese population. Hum Genet 2011;130:749–57. http:// dx.doi.org/10.1007/s00439-011-1026-5. [19] Lu M, Zhang Q, Deng M, Miao J, Guo Y, Gao W, et al. An analysis of human MiRNA and disease associations. PLoS One 2008;3(10):e3420. [20] Chen K, Rajewsky N. Natural selection on human miRNA binding sites inferred from SNP data. Nat Genet 2006;38(12):1452–6. [21] Jin Y, Lee C. Single nucleotide polymorphisms associated with MiRNA regulation. Biomolecules 2013;3(2):287–302. [22] Luo X, Yang W, Ye D, Cui H, Zhang Y, Hirankarn N, et al. A functional variant in MiRNA-146a promoter modulates its expression and confers disease risk for systemic lupus erythematosus. PLoS Genet 2011;7(6):e1002128. [23] Lehmann U, Hasemeier B, Christgen M, Müller M, Römermann D, Länger F, et al. Epigenetic inactivation of miRNA genehsa-mir-9-1in human breast cancer. J Pathol 2008;214(1):17–24. [24] Jazdzewski K, Murray E, Franssila K, Jarzab B, Schoenberg D, de la Chapelle A. Common SNP in pre-miR146a decreases mature miR expression and predisposes to papillary thyroid carcinoma. Proc Natl Acad Sci USA 2008;105(20):7269–74. [25] Sun G, Yan J, Noltner K, Feng J, Li H, Sarkis D, et al. SNPs in human miRNA genes affect biogenesis and function. RNA 2009;15(9):1640–51. [26] Kawahara Y, Zinshteyn B, Chendrimada T, Shiekhattar R, Nishikura K. RNA editing of the miRNA-151 precursor blocks cleavage by the Dicer–TRBP complex. EMBO Rep 2007;8(8):763–9. [27] Nigita G, Alaimo S, Ferro A, Giugno R, Pulvirenti A. Knowledge in the investigation of A-to-I RNA editing signals. Front Bioeng Biotechnol 2015:3. [28] Distefano R, Nigita G, Macca V, Laganà A, Giugno R, Pulvirenti A, Ferro A. VIRGO: visualization of A-to-I RNA editing sites in genomic sequences. BMC Bioinform April 22, 2013;14(7):1. [29] Laganà A, Paone A, Veneziano D, Cascione L, Gasparini P, Carasi S, et al. miR-EdiTar: a database of predicted A-to-I edited miRNA target sites. Bioinformatics December 1, 2012;28(23):3166–8. [30] Blow MJ, Grocock RJ, van Dongen S, Enright AJ, Dicks E, Futreal PA, et al. RNA editing of human microRNAs. Genome Biol 2006;7(4):R27. [31] Abelson J. Sequence variants in SLITRK1 are associated with Tourette’s syndrome. Science 2005;310(5746):317–20. [32] Chin L, Ratner E, Leng S, Zhai R, Nallur S, Babar I, et al. A SNP in a let-7 miRNA complementary site in the KRAS 3′ untranslated region increases non-small cell lung cancer risk. Cancer Res 2008;68(20):8535–40. [33] Kim M, Slack F. MiRNA-mediated regulation of KRAS in cancer. J Hematol Oncol 2014;7(1). [34] Yoon J, Abdelmohsen K, Gorospe M. Functional interactions among miRNAs and long noncoding RNAs. Semin Cell Dev Biol 2014;34:9–14. [35] Hansen T, Jensen T, Clausen B, Bramsen J, Finsen B, Damgaard C, et al. Natural RNA circles function as efficient miRNA sponges. Nature 2013;495(7441):384–8.

References

285

[36] Alaimo S, Giugno R, Pulvirenti A. ncPred: ncRNA-disease association prediction through tripartite networkbased inference. Front Bioeng Biotechnol 2014:2. [37] Bonnici V, Russo F, Bombieri N, Pulvirenti A, Giugno R. Comprehensive reconstruction and visualization of non-coding regulatory networks in human. Front Bioeng Biotechnol 2014;2:69. [38] Russo F, Di Bella S, Nigita G, Macca V, Lagana A, Giugno R, Pulvirenti A, Ferro A. miRandola: extracellular circulating miRNAs database. PLoS One 2012;7(10):e47786. [39] Russo F, Di Bella S, Bonnici V, Laganà A, Rainaldi G, Pellegrini M, Pulvirenti A, Giugno R, Ferro A. A knowledge base for the discovery of function, diagnostic potential and drug effects on cellular and extracellular miRNAs. BMC Genomics 2014;15(3):1. [40] Laganà A, Russo F, Veneziano D, Di Bella S, Pulvirenti A, Giugno R, Croce CM, Ferro A. Extracellular circulating viral miRNAs: current knowledge and perspectives. Front Genetics 2013;4:120. [41] Green ML, Karp PD. The outcomes of pathway database computations depend on pathway ontology. Nucleic Acids Res 2006;34:3687–97. http://dx.doi.org/10.1093/nar/gkl438. [42] Kanehisa M, Goto S, Sato Y, Kawashima M, Furumichi M, Tanabe M. Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res 2014;42:D199–205. http://dx.doi.org/10.1093/nar/gkt1076. [43] Cerami EG, Gross BE, Demir E, Rodchenkov I, Babur Ö, Anwar N, et al. Pathway Commons, a web resource for biological pathway data. Nucleic Acids Res 2011;39:D685–90. http://dx.doi.org/10.1093/nar/gkq1039. [44] Khatri P, Sirota M, Butte AJ. Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 2012;8:e1002375. http://dx.doi.org/10.1371/journal.pcbi.1002375. [45] Beißbarth T, Speed TP. GOstat: find statistically overrepresented gene ontologies within a group of genes. Bioinformatics 2004;20:1464–5. http://dx.doi.org/10.1093/bioinformatics/bth088. [46] Berriz GF, King OD, Bryant B, Sander C, Roth FP. Characterizing gene sets with FuncAssociate. Bioinformatics 2003;19:2502–4. http://dx.doi.org/10.1093/bioinformatics/btg363. [47] Castillo-Davis CI, Hartl DL. GeneMerge—post-genomic analysis, data mining, and hypothesis testing. Bioinformatics 2003;19:891–2. http://dx.doi.org/10.1093/bioinformatics/btg114. [48] Doniger SW, Salomonis N, Dahlquist KD, Vranizan K, Lawlor SC, Conklin BR. MAPPFinder: using gene ontology and GenMAPP to create a global gene-expression profile from microarray data. Genome Biol 2003;4:R7. http://dx.doi.org/10.1186/gb-2003-4-1-r7. [49] Draghici S, Khatri P, Martins RP, Ostermeier GC, Krawetz SA. Global functional profiling of gene expression. Genomics 2003;81:98–104. http://dx.doi.org/10.1016/S0888-7543(02)00021-6. [50] Martin D, Brun C, Remy E, Mouren P, Thieffry D, Jacq B. GOToolBox: functional analysis of gene datasets based on Gene Ontology. Genome Biol 2004;5:R101. http://dx.doi.org/10.1186/gb-2004-5-12-r101. [51] Vlachos IS, Zagganas K, Paraskevopoulou MD, Georgakilas G, Karagkouni D, Vergoulis T, et al. DIANAmiRPath v3.0: deciphering miRNA function with experimental support. Nucleic Acids Res 2015;43:gkv403– W466. http://dx.doi.org/10.1093/nar/gkv403. [52] Ackermann M, Strimmer K. A general modular framework for gene set enrichment analysis. BMC Bioinform 2009;10:47. http://dx.doi.org/10.1186/1471-2105-10-47. [53] Tian L, Greenberg SA, Kong SW, Altschuler J, Kohane IS, Park PJ. Discovering statistically significant pathways in expression profiling studies. Proc Natl Acad Sci USA 2005;102:13544–9. http://dx.doi.org/10.1073/ pnas.0506577102. [54] Xiong H. Non-linear tests for identifying differentially expressed genes or genetic networks. Bioinformatics 2006;22:919–23. http://dx.doi.org/10.1093/bioinformatics/btl034. [55] Mitrea C, Taghavi Z, Bokanizad B, Hanoudi S, Tagett R, Donato M, et al. Methods and approaches in the topology-based analysis of biological pathways. Front Physiol 2013:4. http://dx.doi.org/10.3389/ fphys.2013.00278. [56] Draghici S, Khatri P, Tarca AL, Amin K, Done A, Voichita C, et al. A systems biology approach for pathway level analysis. Genome Res 2007;17:1537–45. http://dx.doi.org/10.1101/gr.6202607. [57] Tarca AL, Draghici S, Khatri P, Hassan SS, Mittal P, Kim J-S, et al. A novel signaling pathway impact analysis. Bioinformatics 2009;25:75–82. http://dx.doi.org/10.1093/bioinformatics/btn577.

286

CHAPTER 15  ALTERATION OF MICRORNA AFFECTING CANCER PATHWAYS

[58] Vaske CJ, Benz SC, Sanborn JZ, Earl D, Szeto C, Zhu J, et al. Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics 2010;26:i237–45. http:// dx.doi.org/10.1093/bioinformatics/btq182. [59] Sedgewick AJ, Benz SC, Rabizadeh S, Soon-Shiong P, Vaske CJ. Learning subgroup-specific regulatory interactions and regulator independence with PARADIGM. Bioinformatics 2013;29:i62–70. http://dx.doi. org/10.1093/bioinformatics/btt229. [60] Calura E, Martini P, Sales G, Beltrame L, Chiorino G, D’Incalci M, et al. Wiring miRNAs to pathways: a topological approach to integrate miRNA and mRNA expression profiles. Nucleic Acids Res 2014;42:e96. http://dx.doi.org/10.1093/nar/gku354. [61] Alaimo S, Giugno R, Acunzo M, Veneziano D, Ferro A, Pulvirenti A. Post-transcriptional knowledge in pathway analysis increases the accuracy of phenotypes classification. Oncotarget 2016;7:54572–82. http:// dx.doi.org/10.18632/oncotarget.9788. [62] Li C, Han J, Yao Q, Zou C, Xu Y, Zhang C, et al. Subpathway-GM: identification of metabolic subpathways via joint power of interesting genes and metabolites and their topologies within pathways. Nucleic Acids Res 2013;41:e101. http://dx.doi.org/10.1093/nar/gkt161. [63] Judeh T, Johnson C, Kumar A, Zhu D. TEAK: Topology enrichment analysis framework for detecting activated biological subpathways. Nucleic Acids Res 2013;41:1425–37. http://dx.doi.org/10.1093/nar/gks1299. [64] Vrahatis AG, Balomenos P, Tsakalidis AK, Bezerianos A. DEsubs: an R package for flexible identification of differentially expressed subpathways using RNA-seq experiments. Bioinformatics 2016;32:btw544–3846. http://dx.doi.org/10.1093/bioinformatics/btw544. [65] Vrahatis AG, Dimitrakopoulou K, Balomenos P, Tsakalidis AK, Bezerianos A. CHRONOS: a time-varying method for miRNA-mediated subpathway enrichment analysis. Bioinformatics 2016;32:884–92. http:// dx.doi.org/10.1093/bioinformatics/btv673. [66] Han M, Zheng Y. Comprehensive analysis of single nucleotide polymorphisms in human MiRNAs. PLoS One 2013;8:e78028. http://dx.doi.org/10.1371/journal.pone.0078028. [67] Smigielski EM, Sirotkin K, Ward M, Sherry ST. dbSNP: a database of single nucleotide polymorphisms. Nucleic Acids Res 2000;28:352–5. http://dx.doi.org/10.1093/nar/28.1.352. [68] Duan R, Pak C, Jin P. Single nucleotide polymorphism associated with mature miR-125a alters the processing of pri-miRNA. Hum Mol Genet 2007;16:1124–31. http://dx.doi.org/10.1093/hmg/ddm062. [69] Betel D, Wilson M, Gabow A, Marks DS, Sander C. The miRNA.org resource: targets and expression. Nucleic Acids Res 2008;36:D149–53. http://dx.doi.org/10.1093/nar/gkm995. [70] Agarwal V, Bell GW, Nam J-W, Bartel DP, Izaurralde E. Predicting effective miRNA target sites in mammalian mRNAs. eLife Sci 2015;4:e05005. http://dx.doi.org/10.7554/eLife.05005. [71] Bhattacharya A, Ziebarth JD, Cui Y. PolymiRTS Database 3.0: linking polymorphisms in miRNAs and their target sites with human diseases and biological pathways. Nucleic Acids Res 2014;42:D86–91. http://dx.doi. org/10.1093/nar/gkt1028. [72] Gong J, Liu C, Liu W, Wu Y, Ma Z, Chen H, Guo AY. An update of miRNASNP database for better SNP selection by GWAS data, miRNA expression and online tools. Database 2015;2015:bav029. [73] Liu C, Zhang F, Li T, Lu M, Wang L, Yue W, Zhang D. MirSNP, a database of polymorphisms altering miRNA target sites, identifies miRNA-related SNPs in GWAS SNPs and eQTLs. BMC Genomics 2012;13(1):1. [74] Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 2001;29(1):308–11. [75] Griffiths-Jones S, Grocock RJ, Van Dongen S, Bateman A, Enright AJ. miRBase: miRNA sequences, targets and gene nomenclature. Nucleic Acids Res 2006;34(Suppl. 1):D140–4. [76] Mitra S, Das S, Das S, Ghosal S, Chakrabarti J. HNOCDB: a comprehensive database of genes and miRNAs relevant to head and neck and oral cancer. Oral Oncol 2012;48(2):117–9. [77] Babu JM, Prathibha R, Jijith VS, Hariharan R, Pillai MR. A miR-centric view of head and neck cancers. Biochim Biophys Acta 2011;1816(1):67–72.

References

287

[78] Bruno AE, Li L, Kalabus JL, Pan Y, Yu A, Hu Z. miRdSNP: a database of disease-associated SNPs and miRNA target sites on 3’UTRs of human genes. BMC Genomics 2012;13(1):1. [79] Gibbs RA, Belmont JW, Hardenbol P, Willis TD, Yu F, Yang H, et al. The International HapMap project. Nature 2003;426:789–96. http://dx.doi.org/10.1038/nature02168. [80] Rajewsky N. miRNA target predictions in animals. Nat Genet 2006;38. [81] Sethupathy P, Corda B, Hatzigergiou AG. TarBase: a comprehensive database of experimentally supported animal miRNA targets. RNA 2006;12:192–7. http://dx.doi.org/10.1261/rna.2239606. [82] Hsu S-D, Tseng Y-T, Shrestha S, Lin Y-L, Khaleel A, Chou C-H, et al. miRTarBase update 2014: an information resource for experimentally validated miRNA-target interactions. Nucleic Acids Res 2014;42:D78–85. http://dx.doi.org/10.1093/nar/gkt1266. [83] Xiao F, Zuo Z, Cai G, Kang S, Gao X, Li T. miRecords: an integrated resource for miRNA-target interactions. Nucleic Acids Res 2009;37:D105–10. http://dx.doi.org/10.1093/nar/gkn851. [84] Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, et al. miR2Disease: a manually curated database for miRNA deregulation in human disease. Nucleic Acids Res 2009;37:D98–104. http://dx.doi.org/10.1093/nar/ gkn714. [85] Bhattacharya A, Cui Y. SomamiR 2.0: a database of cancer somatic mutations altering miRNA–ceRNA interactions. Nucleic Acids Res 2015:gkv1220. [86] Sardina DS, Alaimo S, Ferro A, Pulvirenti A, Giugno R. A novel computational method for inferring competing endogenous interactions. Brief Bioinform 2016:bbw084. [87] Sanchez-Mejias A, Tay Y. Competing endogenous RNA networks: tying the essential knots for cancer biology and therapeutics. J Hematol Oncol 2015;8(1):1. [88] Cheng WC, Chung IF, Huang TS, Chang ST, Sun HJ, Tsai CF, et al. YM500: a small RNA sequencing (smRNA-seq) database for miRNA research. Nucleic Acids Res 2013;41(D1):D285–94. [89] Bhartiya D, Laddha SV, Mukhopadhyay A, Scaria V. miRvar: a comprehensive database for genomic variations in miRNAs. Hum Mutat 2011;32:E2226–45. http://dx.doi.org/10.1002/humu.21482. [90] Nishikura K. Functions and regulation of RNA editing by ADAR deaminases. Annu Rev Biochem 2010;79:321–49. http://dx.doi.org/10.1146/annurev-biochem-060208-105251. [91] Ahmed F, Kaundal R, Raghava GP. PHDcleav: a SVM based method for predicting human dicer cleavage sites using sequence and secondary structure of miRNA precursors. BMC Bioinform 2013;14:S9. http:// dx.doi.org/10.1186/1471-2105-14-S14-S9. [92] Ahmed F, Ansari HR, Raghava GP. Prediction of guide strand of miRNAs from its sequence and secondary structure. BMC Bioinform 2009;10:105. http://dx.doi.org/10.1186/1471-2105-10-105. [93] Bhattacharya A, Cui Y. miR2GO: comparative functional analysis for miRNAs. Bioinformatics 2015:btv140. [94] Zorc M, Skok DJ, Godnic I, Calin GA, Horvat S, Jiang Z, et al. Catalog of miRNA seed polymorphisms in vertebrates. PLoS One 2012;7(1):e30737. [95] Ryan BC, Werner TS, Howard PL, Chow RL. ImiRP: a computational approach to miRNA target site mutation. BMC Bioinform 2016;17(1):1. [96] Deveci M, Çatalyürek ÜV, Toland AE. mrSNP: software to detect SNP effects on miRNA binding. BMC Bioinform March 15, 2014;15(1):1. [97] Thomas LF, Saito T, Sætrom P. Inferring causative variants in miRNA target sites. Nucleic Acids Res 2011;39:e109. http://dx.doi.org/10.1093/nar/gkr414.