Cancer genome sequencing: Understanding malignancy as a disease of the genome, its conformation, and its evolution

Cancer genome sequencing: Understanding malignancy as a disease of the genome, its conformation, and its evolution

Cancer Letters 340 (2013) 152–160 Contents lists available at SciVerse ScienceDirect Cancer Letters journal homepage: www.elsevier.com/locate/canlet...

1MB Sizes 0 Downloads 89 Views

Cancer Letters 340 (2013) 152–160

Contents lists available at SciVerse ScienceDirect

Cancer Letters journal homepage: www.elsevier.com/locate/canlet

Mini-review

Cancer genome sequencing: Understanding malignancy as a disease of the genome, its conformation, and its evolution Lalit R. Patel a,b, Matti Nykter c, Kexin Chen d, Wei Zhang b,⇑ a

The University of Texas Health Science Center, Houston, TX, USA Department of Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA c Department of Signal Processing, Tampere University of Technology, Tampere 33101, Finland d Department of Epidemiology and Biostatistics, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China b

a r t i c l e

i n f o

Keywords: Next-generation sequencing Cancer genomics Transcriptomics Chromosomal conformation sequencing Bioinformatics Tumor heterogeneity

a b s t r a c t Advances in cancer genomics have been propelled by the steady evolution of molecular profiling technologies. Over the past decade, high-throughput sequencing technologies have matured to the point necessary to support disease-specific shotgun sequencing. This has compelled whole-genome sequencing studies across a broad panel of malignancies. The emergence of high-throughput sequencing technologies has inspired new chemical and computational techniques enabling interrogation of cancer-specific genomic and transcriptomic variants, previously unannotated genes, and chromatin structure. Finally, recent progress in single-cell sequencing holds great promise for studies interrogating the consequences of tumor evolution in cancers presenting with genomic heterogeneity. Ó 2012 Elsevier Ireland Ltd. All rights reserved.

1. Introduction Cancer is often described as a population of cells that have either lost genomic integrity or have altered the way they manifest their genome. Another way to say this is that cancer is a genomic and epigenomic disease. Advances in genomic profiling have accordingly benefitted our understanding of cancer [1,2]. Molecular aberrations identified in cancer using genomic profiling predict therapeutic response, providing a suite of diagnostic features that may help inform treatment selection and patient care [3,4]. Cancer genomics is thus at the heart of both basic and translational research aiming to eradicate this deadly disease. A major limitation of the previous generation of genomics technologies was its reliance on hybridization and probe-based techniques. While highly informative with respect to copy number variations (CNVs) [5], single-nucleotide polymorphisms (SNPs) [6], and differential expression of known transcripts [7], microarrays by design capture only information about genomic variation that can be assessed by probes with predetermined sequences. Reviewed here are advances reflected in the current generation of cancer genomics, which are being propelled by the rapid evolution of high-throughput sequencing technology and concomitant ⇑ Corresponding author. Address: Cancer Genomics Core Laboratory, Department of Pathology, Unit 85, The University of Texas M.D. Anderson Cancer Center, 1515 Holcombe Blvd., Houston, TX 77030, USA. Tel.: +1 713 745 1103; fax: +1 713 792 5549. E-mail address: [email protected] (W. Zhang). 0304-3835/$ - see front matter Ó 2012 Elsevier Ireland Ltd. All rights reserved. http://dx.doi.org/10.1016/j.canlet.2012.10.018

bioinformatics [8]. This generation of genomic technologies is furthering our understanding of the cancer genome by providing insight into cancer-associated somatic sequence variants, genomic conformations, and expression of previously unannotated transcripts (Fig. 1). Several cancers have been studied extensively using highthroughput sequencing. The Cancer Genome Atlas (TCGA) is an ongoing publicly funded endeavor invested in comprehensively studying the cancer genomes across a broad panel of cancer types [9]. Several whole-exome and a few whole-genome sequencing studies independent of the TCGA have also been reported [10–14].

2. Highly parallel shotgun sequencing Shotgun sequencing is defined by the practice of sequencing many overlapping genomic fragments while relying on computational algorithms exploiting regions of sequence overlap to assemble fragments into a whole genome. This differs from clone-based sequencing where defined segments of the genome are cloned into and sequenced from bacterial or yeast artificial chromosomes [15]. A significant debate in the late 1990s over the feasibility of applying shotgun-sequencing to the genomes of higher order organisms focused on major concerns such as cost, logistics, and the computational challenges inherent to assembling large genomes from relatively short reads [16,17]. Nevertheless, a first draft of the human genome developed by using shotgun sequencing was published in 2001 [18].

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

153

Fig. 1. Shotgun sequencing. Genomic DNA and cDNA synthesized from RNA are fragmented into a size range compatible with the specifications of massively parallel highthroughput sequencing machines. During repair of fragmented ends, distinct adapters are ligated to each end, permitting paired-end sequencing of DNA fragments. Read pairs are then aligned computationally against a reference genome. Read depth at specific loci can be used to assess genomic copy number or RNA expression of genes. Bioinformatic comparisons of normal, tumor, and reference genomes identify sequence variants associated with disease. Discordant read pairs where the ends of a pair align to separate genomic loci nominate structural genomic rearrangements and expressed chimeras. Evidence of junction spanning read pairs supports ab initio annotation of novel genes in previously unannotated loci.

While this draft was a critical step toward enabling cancer genome sequencing, the National Human Genome Research Institute estimated the cost of shotgun sequencing in 2001 at nearly $100 million per genome. During the decade since, sequencing technologies have significantly improved their throughput, reducing this cost into the range of thousands of dollars per genome [19]. These advances have made cancer genome sequencing studies economically feasible.

3. Genomic sequencing Genomic sequencing can be divided into two major categories: whole-genome sequencing and targeted sequencing. Whole-genome sequencing refers to sequencing the full length of the genome, whereas targeted sequencing enables greater depth of coverage for higher resolution analysis of sequence variation within specific genomic elements. The most frequent case of targeted sequencing is whole-exome sequencing, where known exons are enriched and analyzed for mutations. In contrast, whole-genome sequencing identifies structural changes in the cancer genome, such as inversions, deletions, and other genomic rearrangements, in addition to mutations [8]. Targeted exome sequencing efforts have identified recurrent somatic mutations in diseases where oncogenic drivers were previously not well known. For instance, it was recently reported that the most frequent class of genomic aberrations in malignant melanomas is activating events in the MAPK pathway [20]. Interestingly, the same study showed differences between cutaneous,

acral, uveal, and mucosal melanomas with respect to their proclivities for accumulating specific aberrations within this pathway. This suggests exploitation of a common pathway across known anatomical subtypes of disease while also laying a foundation for investigating differences in the molecular etiologies of each type. Similarly, the mutational landscape as determined through exome sequencing of high-grade serous ovarian cancer reveals that, while more than 95% of sequenced tumors demonstrate inactivating mutations in participants of the tumor-suppressive TP53-pathway, activating alterations (mutational and amplification) of oncogenes were diverse and rarely recurrent in more than 10% of samples [21]. In a study of colorectal carcinomas, more than 90% of tumor samples presented with activating events in the WNT signaling pathway, while the specific events responsible occurred at low frequencies across a broad panel of genes involved in the WNT pathway [22]. An added layer of complexity is emerging from early wholegenome studies. While the existence of recurrent copy-number alterations in cancer has been well documented using arraycomparative genomic hybridization (CGH) studies [23], wholegenome sequencing studies have also revealed a breadth of genomic rearrangements previously underappreciated in many solid tumors [14,21,24]. While accepted as drivers of oncogenesis in hematologic malignancy, recurrent genomic rearrangements have recently also come to the fore as oncogenic events in solid tumors where recurrent functional gene fusions are being identified by sequencing [25]. The emerging theme of complex genetic backgrounds within cancers and differing mechanisms of oncogenesis across cancer

154

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

subtypes has prompted creative computational biology efforts to understand the structural genomic rationale behind the tissue specificity of oncogenic events (for example, [26]). This study in particular identified a significant association between the breakpoints of recurrent somatic copy-number alterations and guanine-rich DNA stretches capable of adopting G-quadraplex structure. A separate study identified a strong association between the genomic endpoints involved in somatic copy-number changes in cancer and genomic loci involved in chromosomal contact, thus lending further support to the hypothesis that recurrent rearrangements are structurally predisposed by genomic conformation [27].

4. Transcriptome sequencing and genome annotation The use of expressed sequence tags as a method for identifying genes has a long history in the molecular biology of gene discovery and the annotation of the human genome [28]. The technique involves cloning-based sequencing of cDNA fragments produced from RNA extracts and cloned into bacterial artificial chromosomes. Much like shotgun genome sequencing accelerated genomics, shotgun transcriptome sequencing (RNAseq) also overcomes the throughput limitations of cloning-based methods in transcriptomics. The result has been RNAseq-driven discovery of cancerassociated functional gene-fusion transcripts [29] and in some cases cancer-associated transcripts from genes annotated for the first time [30]. Inspired by advances in transcriptome sequencing, bioinformatics techniques for ab initio transcriptome assembly were developed with the intent of resolving alternatively spliced isoforms and were applied to questions in developmental biology [31]. However, this creative suite of assembly tools has also led to discovery of transcripts derived from genomic loci previously considered gene deserts. Specifically, ab initio assembly of RNAseq reads generated from libraries developed from 16 normal human tissues as part of the Illumina BodyMap Project reported as many as 8000 long non-coding RNA (lncRNA) genes. These RNAs are >200 bp in length and are transcribed from intergenic loci previously underappreciated as possible gene sites because of their lack of protein-coding potential [32]. Similar ab initio assembly also identified functional lncRNAs associated with prostate cancer progression [30]. It will be interesting to see how annotation of the genome with new genes discovered by RNAseq will lead to reinterpretation of prior genome-wide association studies and CGH studies reporting recurrent but overlooked cancer-associated SNPs and CNVs in genomic regions dismissed as gene deserts. In addition to finding new genes, RNAseq also permits sequence analysis of transcript variants of known genes arising from alterations in posttranscriptional RNA processing. Such studies have provided insight into mechanisms leading to oncogene dysregulation in the absence of genomic aberrations. For instance, it was recently reported that a chimeric transcript generated by cis-splicing of the adjacent genes SLC45A3 and ELK4 results in expression of a proliferation-promoting fused gene product despite there being no chromosomal rearrangement at this locus [33]. Likewise, alternative polyadenylation leading to truncation of the 30 -untranslated region and concomitant escape of microRNA-mediated transcript decay was observed as a posttranscriptional mechanism for elevated expression of known oncogenes [34]. Alternative splice variants of the androgen receptor identified by RNAseq analysis are also thought to contribute to prostate cancer progression from a hormone therapy-responsive to a treatment-resistant state [35]. Together these results suggest that posttranscriptional sequence variants in the transcriptome may constitute an underappreciated class of molecular events contributing to oncogenesis and tumor progression.

The ability to call sequence variants from RNAseq data also presents the potential of using transcriptome sequencing as a method of identifying genomic events. For instance RNAseq calls identifying tumor-specific single-nucleotide variants and chimeras could, respectively, nominate somatic mutations and putative genefusions. However, because RNA processing can alter transcripts independent of genomic events, an orthogonal method of validating calls using genomic DNA would be required to confirm the genomic origins of nominated alterations. Recent work comparing sample-matched genomic and transcriptomic mutation calls demonstrated the value of calls performed on transcriptome sequencing data in interpreting somatic mutations identified by genomic sequencing [36]. Specifically, the study demonstrated the value of integrating RNAseq data with somatic mutation calls from whole-genome sequencing to categorize mutations into four expressional categories: silent, expressed-variant, wild-type biased, and mutant-biased. Such categorization helps separate mutations of functional consequence from bystander events that although present in the cancer genome are not expressed or are biased against at the RNA level.

5. Sequencing malignant epigenomes While tumor genome and cancer transcriptome sequencing have helped identify molecular aberrations leading to altered gene function in malignant disease, the field of epigenetics has focused on understanding how heritable patterns of genomewide expression are encoded and passed on between generations of cells. This ‘‘code above the code’’ consists of chemical changes to the DNA and the proteins it interacts with to form transcriptionally-accessible and transcriptionally-repressed chromatin states. Epigenetic alterations therefore do not require alterations to the coding sequence of the genome to promote pathogenesis. Instead changes in the epigenome alters the selective accessibility of genomic elements to transcriptional machinery, a process thought to be responsible for allowing the genome of a single zygote to manifest the variety of stable tissue specific cell-lineages seen in mammalian development [37,38]. Advances in high-throughput sequencing enable analysis of genomewide landscapes for histone modifications by ChIP-seq [39], DNA-methylation by high-throughput sequencing of bisulfite-converted [40] or methyl-enriched genomic libraries [41], and transcriptionally-accessible chromatin regions by the DNAse-seq method of identifying DNAseI digestion hypersensitive sites (DHSs) in the genome [42]. Cancer epigenome studies have exploited these technologies to expand our understanding of epigenetic mechanisms of tumor development and progression. Recent work in colorectal carcinoma integrated H3K4me1 ChIPseq with DNAse-seq to identify variant enhancer loci associated with known genetic risk alleles for colorectal cancer, implying a role for epigenetically active loci in the pathogenesis of this disease. [43] A study of prostate cancer DNA-methylation using MethylPlex-NGS revealed regions of cancer specific DNA methylation despite no significant difference with respect to the total number of methylated regions between benign and malignant tissue, suggesting a role for the distribution of this epigenetic mark in prostatic malignancy. [41] Additionally DNA-methylation has established roles in spontaneous deamination of cytosine causing C to T mutations and the development of microsatellite instability, both of which provide a causal mechanism linking changes in the epigenome to potential downstream genetic events. [44] As these technologies mature and become more widely implemented, we are likely to learn much about the etiology of cancers driven by specific molecular lesions from the integration of epigenomic and genomic sequencing findings.

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

6. HiC: understanding genomic conformations Although genomic sequences are frequently represented as linear character strings, it would not be physically possible for the human genome to be contained within the nuclei of cells in such form. Instead the genome is densely packaged into a structurally complex conformation allowing it to fit within the nucleus of somatic cells [45]. The mechanics of such packaging necessarily induces proximity of linearly distant elements of the genome causing both interchromosomal and long-range intrachromosomal contacts [46]. Emerging epigenetic studies suggest that the structural conformation of the genome is carefully regulated as a mechanism for defining transcriptional assemblies responsible for coordinated regulation of gene-networks [47]. HiC is a high-throughput sequencing method that utilizes matepair sequencing of crosslinked and fragmented chromatin extracts processed by restriction enzymes such that each end represents a different member of two DNA segments engaged in structural interaction. The result of this unique sequencing modality is an unbiased map of pairwise interactions between any two points in the genome [48] (Fig. 2). Recently, HiC delineation of ERG-induced chromosomal conformations associated with coordinated gene expression in prostate cancer provided proof-of-principle that the technique may prove valuable in discovering conformational genomic interactions underlying transcription factor-driven oncogenic activity [49]. In addition to HiC, 5C [50] and ChIA-PET [51] assays have also been developed to map structural interactions between genomic loci. The advantage of ChIA-PET is its inclusion of a chromatin immunoprecipitation step allowing for identification of conformational genomic interactions specifically associated with a defined protein [51]. While there is no chromatin immunoprecipitation step in 5C, the method has the advantage of flexibility due the abil-

155

ity to use either high-throughput sequencing or genomic microarrays to generate readouts [50]. Despite methodological differences between the two techniques a comparison of interaction maps generated by 5C and ChIA-PET as part of the ENCODE project demonstrates high concordance between interactions identified by the two assays [52]. 7. Integrative analysis: inferring disease-relevant events and interactions across profiles Integrative analysis has proven to be a powerful approach to ascertaining information across multiple platforms of genomic profiles. Such analysis aims to infer associations that are not apparent when analyzing a single form of genomic profile (e.g., copy number only) but become apparent when reading one genomic profile in light of another (e.g., copy number alterations cooperating with a defined mutation status). The assumption inherent to such analysis is that some genomic aberrations are stochastic events, resulting from genomic instability, that do not contribute to disease biology. Integrative analysis attempts to enrich for clinically relevant functional events by identifying aberrations supported by data across multiple genomic analysis platforms (Fig. 3). An early opportunity for integrative bioinformatics arose from the evolution of datasets generated by screening the NCI60 panel of cell lines for both drug sensitivity and genomic attributes [53]. While many bioinformatics toolkits were initially developed to address profile queries within each screening modality (reviewed in [53]), over time the desire for analysis across the modalities compelled the development of CellMiner [54]. The tool is structured as a relational database of the individual NCI60 molecular profiles, permitting extraction of complex associations across the compendia of genomic studies performed on this cell-line panel.

Fig. 2. HiC method for chromosomal conformation sequencing. Long-range interchromosomal and intrachromosomal interactions implicated in regulation of epigenetically coordinated gene networks can be identified by HiC. Briefly, chromosomal interactions are preserved by formalin fixation, while restriction enzyme digestion cuts the genome into fragments containing sites of formalin fixation between genomic segments. Sticky ends are then repaired using biotinylated nucleotides and the repaired ends are ligated as blunt-ends introducing biotin at the site of ligation before crosslinks are reversed. DNA is then sheared to a fragment size compatible with high-throughput sequencing, biotin bearing fragments are isolated, and a paired-end sequencing library is made where each end represents a partner in a pairwise interaction. Pairs are then mapped by sequence alignment to a reference genome, creating a genome-wide atlas of structural interactions.

156

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

Fig. 3. Integrative analysis. Emerging bioinformatics pipelines utilize a combination of graph theory, statistical enrichment, and correlation algorithms to identify associations between distinct categories of genomic aberrations and clinical outcome. The goal of these pipelines is to push past the noise of single-platform analysis by nominating potentially functional genomic attributes supported across a panel of analyses.

Mutually Exclusive Modules in Cancer (MEMo) is a tool recently developed to use a combination of correlation analysis and graph theory to integrate multiplatform data with the intention of identifying gene sets nominating dysregulation of a pathway. When applied to data from the TCGA characterizing glioblastoma multiforme, the method homed in on two regulatory networks identified by a total of six genes [55]. This represents a significant reduction in the number of genes nominated as putative functional drivers for interrogation in follow-up studies. The bioinformatics package integrOmics is another example. This tool was written in R to facilitate integration between any pair of – omics datasets using correlation analysis [56]. Given the creative approach taken by existing and admittedly early algorithms, it will be interesting to see what new strategies for multiplatform genomic data integration emerge as this field matures. Advances in the analysis of data-rich genomic profiles have spawned the emerging field of cancer systems biology. While integrative genomics emphasizes the identification of distinct genomic aberrations, cancer systems biology takes analysis a step further by examining regulatory interactions responsible for sustaining malignant pathobiology [57]. Many methods have been built on top of networks modeled from known biological interactions detailed in pathway databases. For example, the ARACNe algorithm was recently used to uncover regulatory interactions driving epithelial-to-mesenchymal transformation in glioblastoma, a critical step in cancer progression [58]. Similarly the recently developed tool PARADIGM integrates genomic profiles into a model of

transcriptional, posttranscriptional, and cell-signaling interactions to infer activity profiles for interacting molecules [59]. The algorithm recently nominated activation of the FOXM1 signaling network as a critical contributor to ovarian cancer [21]. High-throughput sequencing datasets present several challenges for integrative and systems analysis not faced by early algorithms developed for use with microarrays. Two technical challenges are the computational power and algorithmic efficiency needed when processing datasets of the scale produced by sequencing. This challenge in particular will be important to overcome for integrative analysis to play a role in molecular diagnostics as clinical decisions will require a more rapid turnaround time than many current algorithms provide [60]. Analytically, sequencing provides information about changes in both relative abundance (copy number, expression) and sequence (alternative adenylation, mutations, rearrangements) between tumors and normal specimens. Thus, computational methodologies systematically accounting for changes in profile as well as changes in sequence are needed to fully capitalize on cancer genome sequencing data.

8. Single-cell sequencing: interrogating tumor heterogeneity and evolution While genomic sequencing is helping to identify somatic aberrations unique to cancer that may serve as drug targets, clinical observations from targeted therapy trials have raised critical

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

157

Fig. 4. Single-cell sequencing. Emerging techniques for genome and transcriptome sequencing of single cells permit identification of cancer clones, their lineages, and clonal gene-expression programs in heterogeneous tumors. These technologies have ramifications for future studies of acquired therapeutic resistance and disease progression through tumor evolution.

questions. Specifically, innate and acquired resistance to targeted therapy is a challenge shared across cancer types [61–64]. This has led some to develop dynamic models of treatment response, applying basic principles from evolutionary biology [65]. This view holds that tumors likely possess a plurality of clones that initiate, compete with each other to sustain, and regenerate overt lesions despite the selective pressure presented by various microenvironmental and therapeutic stresses. From this perspective, tumors are populations that evolve, and understanding this process requires an appreciation for both molecular aberrations and clonality. The sequencing modalities described so far do not directly address the heterogeneity of human cancer. However, the next generation of cancer genomics is emerging with sequencing technology permitting – omic scale analysis of single cells (Fig. 4). Navin et al. presented single-nucleus sequencing as a powerful technique to assess CNVs between individual cells with sufficient resolution to enable evolutionary inferences with respect to the metastatic process [66]. Inspired by questions in developmental biology, Tang and associates developed methods for single-cell RNAseq [67], which they later applied to investigate transcriptional processes regulating the earliest stages of differentiation occurring in embryonic inner mass cells [68]. Sandberg and associates recently furthered single-cell RNAseq technology by developing Smart-Seq, which prepares sequencing libraries representing full-length RNAs through template switching during cDNA preparation prior to library amplification [69]. In this study, the group provided proof-of-principle by comparing the transcriptomes of circulating tumor cells to patient-matched melanoma tumors, indicating that the technique could be applied to study small

quantities of cancer cells. Although very nascent, these techniques may help identify, characterize, and quantify the relative abundance of genomic and transcriptomic clones composing heterogeneous tumor cell populations. The teleological thread held in common by cancer genomics efforts deploying single-cell sequencing is the belief that cancer is more than an altered normal genome. On the contrary, cancer from this perspective is viewed as a collection of independent genomes that evolve separately from and more quickly than the heritable germline of the host in which the tumor forms. Thus, while the past and present of cancer genomics has focused on identifying functional genomic aberrations in cancer cells, the next wave of cancer genomics is primed to study the evolution of whole genomes and the role of intratumoral evolutionary forces in driving the biology of heterogeneous tumors. It is an exciting time to be involved in cancer genome sequencing. References [1] S. Ramaswamy, K.N. Ross, E.S. Lander, T.R. Golub, A molecular signature of metastasis in primary solid tumors, Nat. Genet. 33 (2003) 49–54. [2] D.T. Ross, U. Scherf, M.B. Eisen, C.M. Perou, C. Rees, P. Spellman, V. Iyer, S.S. Jeffrey, M. Van de Rijn, M. Waltham, A. Pergamenschikov, J.C. Lee, D. Lashkari, D. Shalon, T.G. Myers, J.N. Weinstein, D. Botstein, P.O. Brown, Systematic variation in gene expression patterns in human cancer cell lines, Nat. Genet. 24 (2000) 227–235. [3] M.J. Garnett, E.J. Edelman, S.J. Heidorn, C.D. Greenman, A. Dastur, K.W. Lau, P. Greninger, I.R. Thompson, X. Luo, J. Soares, Q. Liu, F. Iorio, D. Surdez, L. Chen, R.J. Milano, G.R. Bignell, A.T. Tam, H. Davies, J.A. Stevenson, S. Barthorpe, S.R. Lutz, F. Kogera, K. Lawrence, A. McLaren-Douglas, X. Mitropoulos, T. Mironenko, H. Thi, L. Richardson, W. Zhou, F. Jewitt, T. Zhang, P. O’Brien, J.L. Boisvert, S. Price, W. Hur, W. Yang, X. Deng, A. Butler, H.G. Choi, J.W. Chang, J. Baselga, I.

158

[4]

[5] [6] [7] [8] [9]

[10]

[11]

[12]

[13]

[14]

[15] [16] [17] [18]

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160 Stamenkovic, J.A. Engelman, S.V. Sharma, O. Delattre, J. Saez-Rodriguez, N.S. Gray, J. Settleman, P.A. Futreal, D.A. Haber, M.R. Stratton, S. Ramaswamy, U. McDermott, C.H. Benes, Systematic identification of genomic markers of drug sensitivity in cancer cells, Nature 483 (2012) 570–575. J. Barretina, G. Caponigro, N. Stransky, K. Venkatesan, A.A. Margolin, S. Kim, C.J. Wilson, J. Lehar, G.V. Kryukov, D. Sonkin, A. Reddy, M. Liu, L. Murray, M.F. Berger, J.E. Monahan, P. Morais, J. Meltzer, A. Korejwa, J. Jane-Valbuena, F.A. Mapa, J. Thibault, E. Bric-Furlong, P. Raman, A. Shipway, I.H. Engels, J. Cheng, G.K. Yu, J. Yu, P. Aspesi Jr., M. de Silva, K. Jagtap, M.D. Jones, L. Wang, C. Hatton, E. Palescandolo, S. Gupta, S. Mahan, C. Sougnez, R.C. Onofrio, T. Liefeld, L. MacConaill, W. Winckler, M. Reich, N. Li, J.P. Mesirov, S.B. Gabriel, G. Getz, K. Ardlie, V. Chan, V.E. Myer, B.L. Weber, J. Porter, M. Warmuth, P. Finan, J.L. Harris, M. Meyerson, T.R. Golub, M.P. Morrissey, W.R. Sellers, R. Schlegel, L.A. Garraway, The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity, Nature 483 (2012) 603–607. J.J. Davies, I.M. Wilson, W.L. Lam, Array CGH technologies and their applications to cancer genomes, Chromosome Res. 13 (2005) 237–248. A. Dutt, R. Beroukhim, Single nucleotide polymorphism array analysis of cancer, Curr. Opin. Oncol. 19 (2007) 43–49. A. Schulze, J. Downward, Navigating gene expression using microarrays – a technology review, Nat. Cell Biol. 3 (2001) E190–E195. M.L. Metzker, Applications of next-generation sequencing: sequencing technologies – the next generation, Nat. Rev. Genet. 11 (2010) 31–46. F.S. Collins, A.D. Barker, Mapping the cancer genome. Pinpointing the genes involved in cancer will help chart a new course across the complex landscape of human malignancies, Sci. Am. 296 (2007) 50–57. P.J. Stephens, P.S. Tarpey, H. Davies, P. Van Loo, C. Greenman, D.C. Wedge, S. Nik-Zainal, S. Martin, I. Varela, G.R. Bignell, L.R. Yates, E. Papaemmanuil, D. Beare, A. Butler, A. Cheverton, J. Gamble, J. Hinton, M. Jia, A. Jayakumar, D. Jones, C. Latimer, K.W. Lau, S. McLaren, D.J. McBride, A. Menzies, L. Mudie, K. Raine, R. Rad, M.S. Chapman, J. Teague, D. Easton, A. Langerod, M.T. Lee, C.Y. Shen, B.T. Tee, B.W. Huimin, A. Broeks, A.C. Vargas, G. Turashvili, J. Martens, A. Fatima, P. Miron, S.F. Chin, G. Thomas, S. Boyault, O. Mariani, S.R. Lakhani, M. van de Vijver, L. van ‘t Veer, J. Foekens, C. Desmedt, C. Sotiriou, A. Tutt, C. Caldas, J.S. Reis-Filho, S.A. Aparicio, A.V. Salomon, A.L. Borresen-Dale, A.L. Richardson, P.J. Campbell, P.A. Futreal, M.R. Stratton, The landscape of cancer genes and mutational processes in breast cancer, Nature 486 (2012) 400–404. C.E. Barbieri, S.C. Baca, M.S. Lawrence, F. Demichelis, M. Blattner, J.P. Theurillat, T.A. White, P. Stojanov, E. Van Allen, N. Stransky, E. Nickerson, S.S. Chae, G. Boysen, D. Auclair, R.C. Onofrio, K. Park, N. Kitabayashi, T.Y. Macdonald, K. Sheikh, T. Vuong, C. Guiducci, K. Cibulskis, A. Sivachenko, S.L. Carter, G. Saksena, D. Voet, W.M. Hussain, A.H. Ramos, W. Winckler, M.C. Redman, K. Ardlie, A.K. Tewari, J.M. Mosquera, N. Rupp, P.J. Wild, H. Moch, C. Morrissey, P.S. Nelson, P.W. Kantoff, S.B. Gabriel, T.R. Golub, M. Meyerson, E.S. Lander, G. Getz, M.A. Rubin, L.A. Garraway, Exome sequencing identifies recurrent SPOP, FOXA1 and MED12 mutations in prostate cancer, Nat. Genet. 44 (2012) 685– 689. X.M. Wei, V. Walia, J.C. Lin, J.K. Teer, T.D. Prickett, J. Gartner, S. Davis, K. Stemke-Hale, M.A. Davies, J.E. Gershenwald, W. Robinson, S. Robinson, S.A. Rosenberg, Y. Samuels, N.C.S. Progra, Exome sequencing identifies GRIN2A as frequently mutated in melanoma, Nat. Genet. 43 (2011) 442–446. M.J. Ellis, L. Ding, D. Shen, J.Q. Luo, V.J. Suman, J.W. Wallis, B.A. Van Tine, J. Hoog, R.J. Goiffon, T.C. Goldstein, S. Ng, L. Lin, R. Crowder, J. Snider, K. Ballman, J. Weber, K. Chen, D.C. Koboldt, C. Kandoth, W.S. Schierding, J.F. McMichael, C.A. Miller, C. Lu, C.C. Harris, M.D. McLellan, M.C. Wendl, K. DeSchryver, D.C. Allred, L. Esserman, G. Unzeitig, J. Margenthaler, G.V. Babiera, P.K. Marcom, J.M. Guenther, M. Leitch, K. Hunt, J. Olson, Y. Tao, C.A. Maher, L.L. Fulton, R.S. Fulton, M. Harrison, B. Oberkfell, F.Y. Du, R. Demeter, T.L. Vickery, A. Elhammali, H. Piwnica-Worms, S. McDonald, M. Watson, D.J. Dooling, D. Ota, L.W. Chang, R. Bose, T.J. Ley, D. Piwnica-Worms, J.M. Stuart, R.K. Wilson, E.R. Mardis, Whole-genome analysis informs breast cancer response to aromatase inhibition, Nature 486 (2012) 353–360. M.F. Berger, M.S. Lawrence, F. Demichelis, Y. Drier, K. Cibulskis, A.Y. Sivachenko, A. Sboner, R. Esgueva, D. Pflueger, C. Sougnez, R. Onofrio, S.L. Carter, K. Park, L. Habegger, L. Ambrogio, T. Fennell, M. Parkin, G. Saksena, D. Voet, A.H. Ramos, T.J. Pugh, J. Wilkinson, S. Fisher, W. Winckler, S. Mahan, K. Ardlie, J. Baldwin, J.W. Simons, N. Kitabayashi, T.Y. MacDonald, P.W. Kantoff, L. Chin, S.B. Gabriel, M.B. Gerstein, T.R. Golub, M. Meyerson, A. Tewari, E.S. Lander, G. Getz, M.A. Rubin, L.A. Garraway, The genomic complexity of primary human prostate cancer, Nature 470 (2011) 214–220. J. Adams, Complex genomes: shotgun sequencing, Nat. Educ. (2008) 1. J.L. Weber, E.W. Myers, Human whole-genome shotgun sequencing, Genome Res. 7 (1997) 401–409. P. Green, Against a whole-genome shotgun, Genome Res. 7 (1997) 410–417. J.C. Venter, M.D. Adams, E.W. Myers, P.W. Li, R.J. Mural, G.G. Sutton, H.O. Smith, M. Yandell, C.A. Evans, R.A. Holt, J.D. Gocayne, P. Amanatides, R.M. Ballew, D.H. Huson, J.R. Wortman, Q. Zhang, C.D. Kodira, X.H. Zheng, L. Chen, M. Skupski, G. Subramanian, P.D. Thomas, J. Zhang, G.L. Gabor Miklos, C. Nelson, S. Broder, A.G. Clark, J. Nadeau, V.A. McKusick, N. Zinder, A.J. Levine, R.J. Roberts, M. Simon, C. Slayman, M. Hunkapiller, R. Bolanos, A. Delcher, I. Dew, D. Fasulo, M. Flanigan, L. Florea, A. Halpern, S. Hannenhalli, S. Kravitz, S. Levy, C. Mobarry, K. Reinert, K. Remington, J. Abu-Threideh, E. Beasley, K. Biddick, V. Bonazzi, R. Brandon, M. Cargill, I. Chandramouliswaran, R. Charlab, K. Chaturvedi, Z. Deng, V. Di Francesco, P. Dunn, K. Eilbeck, C. Evangelista, A.E. Gabrielian, W. Gan, W.

[19]

[20]

[21]

[22]

Ge, F. Gong, Z. Gu, P. Guan, T.J. Heiman, M.E. Higgins, R.R. Ji, Z. Ke, K.A. Ketchum, Z. Lai, Y. Lei, Z. Li, J. Li, Y. Liang, X. Lin, F. Lu, G.V. Merkulov, N. Milshina, H.M. Moore, A.K. Naik, V.A. Narayan, B. Neelam, D. Nusskern, D.B. Rusch, S. Salzberg, W. Shao, B. Shue, J. Sun, Z. Wang, A. Wang, X. Wang, J. Wang, M. Wei, R. Wides, C. Xiao, C. Yan, A. Yao, J. Ye, M. Zhan, W. Zhang, H. Zhang, Q. Zhao, L. Zheng, F. Zhong, W. Zhong, S. Zhu, S. Zhao, D. Gilbert, S. Baumhueter, G. Spier, C. Carter, A. Cravchik, T. Woodage, F. Ali, H. An, A. Awe, D. Baldwin, H. Baden, M. Barnstead, I. Barrow, K. Beeson, D. Busam, A. Carver, A. Center, M.L. Cheng, L. Curry, S. Danaher, L. Davenport, R. Desilets, S. Dietz, K. Dodson, L. Doup, S. Ferriera, N. Garg, A. Gluecksmann, B. Hart, J. Haynes, C. Haynes, C. Heiner, S. Hladun, D. Hostin, J. Houck, T. Howland, C. Ibegwam, J. Johnson, F. Kalush, L. Kline, S. Koduru, A. Love, F. Mann, D. May, S. McCawley, T. McIntosh, I. McMullen, M. Moy, L. Moy, B. Murphy, K. Nelson, C. Pfannkoch, E. Pratts, V. Puri, H. Qureshi, M. Reardon, R. Rodriguez, Y.H. Rogers, D. Romblad, B. Ruhfel, R. Scott, C. Sitter, M. Smallwood, E. Stewart, R. Strong, E. Suh, R. Thomas, N.N. Tint, S. Tse, C. Vech, G. Wang, J. Wetter, S. Williams, M. Williams, S. Windsor, E. Winn-Deen, K. Wolfe, J. Zaveri, K. Zaveri, J.F. Abril, R. Guigo, M.J. Campbell, K.V. Sjolander, B. Karlak, A. Kejariwal, H. Mi, B. Lazareva, T. Hatton, A. Narechania, K. Diemer, A. Muruganujan, N. Guo, S. Sato, V. Bafna, S. Istrail, R. Lippert, R. Schwartz, B. Walenz, S. Yooseph, D. Allen, A. Basu, J. Baxendale, L. Blick, M. Caminha, J. Carnes-Stine, P. Caulk, Y.H. Chiang, M. Coyne, C. Dahlke, A. Mays, M. Dombroski, M. Donnelly, D. Ely, S. Esparham, C. Fosler, H. Gire, S. Glanowski, K. Glasser, A. Glodek, M. Gorokhov, K. Graham, B. Gropman, M. Harris, J. Heil, S. Henderson, J. Hoover, D. Jennings, C. Jordan, J. Jordan, J. Kasha, L. Kagan, C. Kraft, A. Levitsky, M. Lewis, X. Liu, J. Lopez, D. Ma, W. Majoros, J. McDaniel, S. Murphy, M. Newman, T. Nguyen, N. Nguyen, M. Nodell, S. Pan, J. Peck, M. Peterson, W. Rowe, R. Sanders, J. Scott, M. Simpson, T. Smith, A. Sprague, T. Stockwell, R. Turner, E. Venter, M. Wang, M. Wen, D. Wu, M. Wu, A. Xia, A. Zandieh, X. Zhu, The sequence of the human genome, Science 291 (2001) 1304–1351. K.A. Wetterstrand, DNA sequencing costs: data from the NHGRI Large-Scale Genome Sequencing Program www.genome.gov/sequencingcosts. (accessed August 2012). J.A. Curtin, J. Fridlyand, T. Kageshita, H.N. Patel, K.J. Busam, H. Kutzner, K.H. Cho, S. Aiba, E.B. Brocker, P.E. LeBoit, D. Pinkel, B.C. Bastian, Distinct sets of genetic alterations in melanoma, N. Engl. J. Med. 353 (2005) 2135–2147. D. Bell, A. Berchuck, M. Birrer, J. Chien, D.W. Cramer, F. Dao, R. Dhir, P. DiSaia, H. Gabra, P. Glenn, A.K. Godwin, J. Gross, L. Hartmann, M. Huang, D.G. Huntsman, M. Iacocca, M. Imielinski, S. Kalloger, B.Y. Karlan, D.A. Levine, G.B. Mills, C. Morrison, D. Mutch, N. Olvera, S. Orsulic, K. Park, N. Petrelli, B. Rabeno, J.S. Rader, B.I. Sikic, K. Smith-McCune, A.K. Sood, D. Bowtell, R. Penny, J.R. Testa, K. Chang, H.H. Dinh, J.A. Drummond, G. Fowler, P. Gunaratne, A.C. Hawes, C.L. Kovar, L.R. Lewis, M.B. Morgan, I.F. Newsham, J. Santibanez, J.G. Reid, L.R. Trevino, Y.Q. Wu, M. Wang, D.M. Muzny, D.A. Wheeler, R.A. Gibbs, G. Getz, M.S. Lawrence, K. Cibulskis, A.Y. Sivachenko, C. Sougnez, D. Voet, J. Wilkinson, T. Bloom, K. Ardlie, T. Fennell, J. Baldwin, S. Gabriel, E.S. Lander, L. Ding, R.S. Fulton, D.C. Koboldt, M.D. McLellan, T. Wylie, J. Walker, M. O’Laughlin, D.J. Dooling, L. Fulton, R. Abbott, N.D. Dees, Q. Zhang, C. Kandoth, M. Wendl, W. Schierding, D. Shen, C.C. Harris, H. Schmidt, J. Kalicki, K.D. Delehaunty, C.C. Fronick, R. Demeter, L. Cook, J.W. Wallis, L. Lin, V.J. Magrini, J.S. Hodges, J.M. Eldred, S.M. Smith, C.S. Pohl, F. Vandin, B.J. Raphael, G.M. Weinstock, R. Mardis, R.K. Wilson, M. Meyerson, W. Winckler, G. Getz, R.G.W. Verhaak, S.L. Carter, C.H. Mermel, G. Saksena, H. Nguyen, R.C. Onofrio, M.S. Lawrence, D. Hubbard, S. Gupta, A. Crenshaw, A.H. Ramos, K. Ardlie, L. Chin, A. Protopopov, J.H. Zhang, T.M. Kim, I. Perna, Y. Xiao, H. Zhang, G. Ren, N. Sathiamoorthy, R.W. Park, E. Lee, P.J. Park, R. Kucherlapati, D.M. Absher, L. Waite, G. Sherlock, J.D. Brooks, J.Z. Li, J. Xu, R.M. Myers, P.W. Laird, L. Cope, J.G. Herman, H. Shen, D.J. Weisenberger, H. Noushmehr, F. Pan, T. Triche, B.P. Berman, D.J. Van den Berg, J. Buckley, S.B. Baylin, P.T. Spellman, E. Purdom, P. Neuvial, H. Bengtsson, L.R. Jakkula, S. Durinck, J. Han, S. Dorton, H. Marr, Y.G. Choi, V. Wang, N.J. Wang, J. Ngai, J.G. Conboy, B. Parvin, H.S. Feiler, T.P. Speed, J.W. Gray, D.A. Levine, N.D. Socci, Y. Liang, B.S. Taylor, N. Schultz, L. Borsu, A.E. Lash, C. Brennan, A. Viale, C. Sander, M. Ladanyi, K.A. Hoadley, S. Meng, Y. Du, Y. Shi, L. Li, Y.J. Turman, D. Zang, E.B. Helms, S. Balu, X. Zhou, J. Wu, M.D. Topal, D.N. Hayes, C.M. Perou, G. Getz, D. Voet, G. Saksena, J.N.H. Zhang, H. Zhang, C.J. Wu, S. Shukla, K. Cibulskis, M.S. Lawrence, A. Sivachenko, R. Jing, R.W. Park, Y. Liu, P.J. Park, M. Noble, L. Chin, H. Carter, D. Kim, R. Karchin, P.T. Spellman, E. Purdom, P. Neuvial, H. Bengtsson, S. Durinck, J. Han, J.E. Korkola, L.M. Heiser, R.J. Cho, Z. Hu, B. Parvin, T.P. Speed, J.W. Gray, N. Schultz, E. Cerami, B.S. Taylor, A. Olshen, B. Reva, Y. Antipin, R. Shen, P. Mankoo, R. Sheridan, G. Ciriello, W.K. Chang, J.A. Bernanke, L. Borsu, D.A. Levine, M. Ladanyi, C. Sander, D. Haussler, C.C. Benz, J.M. Stuart, S.C. Benz, J.Z. Sanborn, C.J. Vaske, J. Zhu, C. Szeto, G.K. Scott, C. Yau, K.A. Hoadley, Y. Du, S. Balu, D.N. Hayes, C.M. Perou, M.D. Wilkerson, N. Zhang, R. Akbani, K.A. Baggerly, W.K. Yung, G.B. Mills, J.N. Weinstein, R. Penny, T. Shelton, D. Grimm, M. Hatfield, S. Morris, P. Yena, P. Rhodes, M. Sherman, J. Paulauskis, S. Millis, A. Kahn, J.M. Greene, R. Sfeir, M.A. Jensen, J. Chen, J. Whitmore, S. Alonso, J. Jordan, A. Chu, J.H. Zhang, A. Barker, C. Compton, G. Eley, M. Ferguson, P. Fielding, D.S. Gerhard, R. Myles, C. Schaefer, K.R.M. Shaw, J. Vaught, J.B. Vockley, P.J. Good, M.S. Guyer, B. Ozenberger, J. Peterson, E. Thomson, C.G.A.R. Network, Integrated genomic analyses of ovarian carcinoma, Nature 474 (2011) 609–615. D.M. Muzny, M.N. Bainbridge, K. Chang, H.H. Dinh, J.A. Drummond, G. Fowler, C.L. Kovar, L.R. Lewis, M.B. Morgan, I.F. Newsham, J.G. Reid, J. Santibanez, E. Shinbrot, L.R. Trevino, Y.Q. Wu, M. Wang, P. Gunaratne, L.A. Donehower, C.J.

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

[23]

[24]

[25] [26] [27]

[28]

[29]

[30]

Creighton, D.A. Wheeler, R.A. Gibbs, M.S. Lawrence, D. Voet, R. Jing, K. Cibulskis, A. Sivachenko, P. Stojanov, A. McKenna, E.S. Lander, S. Gabriel, G. Getz, L. Ding, R.S. Fulton, D.C. Koboldt, T. Wylie, J. Walker, D.J. Dooling, L. Fulton, K.D. Delehaunty, C.C. Fronick, R. Demeter, E.R. Mardis, R.K. Wilson, A. Chu, H.J.E. Chun, A.J. Mungall, E. Pleasance, A.G. Robertson, D. Stoll, M. Balasundaram, I. Birol, Y.S.N. Butterfield, E. Chuah, R.J.N. Coope, N. Dhalla, R. Guin, C. Hirst, M. Hirst, R.A. Holt, D. Lee, H.I. Li, M. Mayo, R.A. Moore, J.E. Schein, J.R. Slobodan, A. Tam, N. Thiessen, R. Varhol, T. Zeng, Y. Zhao, S.J.M. Jones, M.A. Marra, A.J. Bass, A.H. Ramos, G. Saksena, A.D. Cherniack, S.E. Schumacher, B. Tabak, S.L. Carter, N.H. Pho, H. Nguyen, R.C. Onofrio, A. Crenshaw, K. Ardlie, R. Beroukhim, W. Winckler, G. Getz, M. Meyerson, A. Protopopov, J. Zhang, A. Hadjipanayis, E. Lee, R. Xi, L. Yang, X. Ren, H. Zhang, N. Sathiamoorthy, S. Shukla, P.C. Chen, P. Haseley, Y. Xiao, S. Lee, J. Seidman, L. Chin, P.J. Park, R. Kucherlapati, J.T. Auman, K.A. Hoadley, Y. Du, M.D. Wilkerson, Y. Shi, C. Liquori, S. Meng, L. Li, Y.J. Turman, M.D. Topal, D. Tan, S. Waring, E. Buda, J. Walsh, C.D. Jones, P.A. Mieczkowski, D. Singh, J. Wu, A. Gulabani, P. Dolina, T. Bodenheimer, A.P. Hoyle, J.V. Simons, M. Soloway, L.E. Mose, S.R. Jefferys, S. Balu, B.D. O’Connor, J.F. Prins, D.Y. Chiang, D.N. Hayes, C.M. Perou, T. Hinoue, D.J. Weisenberger, D.T. Maglinte, F. Pan, B.P. Berman, D.J. Van den Berg, H. Shen, T.T. Jr, S.B. Baylin, P.W. Laird, G. Getz, M. Noble, D. Voet, G. Saksena, N. Gehlenborg, D. DiCara, J. Zhang, H. Zhang, C.J. Wu, S.Y. Liu, S. Shukla, M.S. Lawrence, L. Zhou, A. Sivachenko, P. Lin, P. Stojanov, R. Jing, R.W. Park, M.D. Nazaire, J. Robinson, H. Thorvaldsdottir, J. Mesirov, P.J. Park, L. Chin, V. Thorsson, S.M. Reynolds, B. Bernard, R. Kreisberg, J. Lin, L. Iype, R. Bressler, T. Erkkila, M. Gundapuneni, Y. Liu, A. Norberg, T. Robinson, D. Yang, W. Zhang, I. Shmulevich, J.J. De Ronde, N. Schultz, E. Cerami, G. Ciriello, A.P. Goldberg, B. Gross, A. Jacobsen, J. Gao, B. Kaczkowski, R. Sinha, B.A. Aksoy, Y. Antipin, B. Reva, R. Shen, B.S. Taylor, T.A. Chan, M. Ladanyi, C. Sander, R. Akbani, N. Zhang, B.M. Broom, T. Casasent, A. Unruh, C. Wakefield, S.R. Hamilton, R.C. Cason, K.A. Baggerly, J.N. Weinstein, D. Haussler, C.C. Benz, J.M. Stuart, S.C. Benz, J.Z. Sanborn, C.J. Vaske, J. Zhu, C. Szeto, G.K. Scott, C. Yau, S. Ng, T. Goldstein, K. Ellrott, E. Collisson, A.E. Cozen, D. Zerbino, C. Wilks, B. Craft, P. Spellman, R. Penny, T. Shelton, M. Hatfield, S. Morris, P. Yena, C. Shelton, M. Sherman, J. Paulauskis, J.M. Gastier-Foster, J. Bowen, N.C. Ramirez, A. Black, R. Pyatt, L. Wise, P. White, M. Bertagnolli, J. Brown, T.A. Chan, G.C. Chu, C. Czerwinski, F. Denstman, R. Dhir, A. Doerner, C.S. Fuchs, J.G. Guillem, M. Iacocca, H. Juhl, A. Kaufman, B. Kohl, X. Van Le, M.C. Mariano, E.N. Medina, M. Meyers, G.M. Nash, P.B. Paty, N. Petrelli, B. Rabeno, W.G. Richards, D. Solit, P. Swanson, L. Temple, J.E. Tepper, R. Thorp, E. Vakiani, M.R. Weiser, J.E. Willis, G. Witkin, Z. Zeng, M.J. Zinner, C. Zornig, M.A. Jensen, R. Sfeir, A.B. Kahn, A.L. Chu, P. Kothiyal, Z. Wang, E.E. Snyder, J. Pontius, T.D. Pihl, B. Ayala, M. Backus, J. Walton, J. Whitmore, J. Baboud, D.L. Berton, M.C. Nicholls, D. Srinivasan, R. Raman, S. Girshik, P.A. Kigonya, S. Alonso, R.N. Sanbhadti, S.P. Barletta, J.M. Greene, D.A. Pot, K.R.M. Shaw, L.A.L. Dillon, K. Buetow, T. Davidsen, J.A. Demchok, G. Eley, M. Ferguson, P. Fielding, C. Schaefer, M. Sheth, L. Yang, M.S. Guyer, B.A. Ozenberger, J.D. Palchik, J. Peterson, H.J. Sofia, E. Thomson, Comprehensive molecular characterization of human colon and rectal cancer, Nature 487 (2012) 330– 337. R. Beroukhim, C.H. Mermel, D. Porter, G. Wei, S. Raychaudhuri, J. Donovan, J. Barretina, J.S. Boehm, J. Dobson, M. Urashima, K.T. Mc Henry, R.M. Pinchback, A.H. Ligon, Y.J. Cho, L. Haery, H. Greulich, M. Reich, W. Winckler, M.S. Lawrence, B.A. Weir, K.E. Tanaka, D.Y. Chiang, A.J. Bass, A. Loo, C. Hoffman, J. Prensner, T. Liefeld, Q. Gao, D. Yecies, S. Signoretti, E. Maher, F.J. Kaye, H. Sasaki, J.E. Tepper, J.A. Fletcher, J. Tabernero, J. Baselga, M.S. Tsao, F. Demichelis, M.A. Rubin, P.A. Janne, M.J. Daly, C. Nucera, R.L. Levine, B.L. Ebert, S. Gabriel, A.K. Rustgi, C.R. Antonescu, M. Ladanyi, A. Letai, L.A. Garraway, M. Loda, D.G. Beer, L.D. True, A. Okamoto, S.L. Pomeroy, S. Singer, T.R. Golub, E.S. Lander, G. Getz, W.R. Sellers, M. Meyerson, The landscape of somatic copy-number alteration across human cancers, Nature 463 (2010) 899–905. P.J. Campbell, S. Yachida, L.J. Mudie, P.J. Stephens, E.D. Pleasance, L.A. Stebbings, L.A. Morsberger, C. Latimer, S. McLaren, M.L. Lin, D.J. McBride, I. Varela, S.A. Nik-Zainal, C. Leroy, M. Jia, A. Menzies, A.P. Butler, J.W. Teague, C.A. Griffin, J. Burton, H. Swerdlow, M.A. Quail, M.R. Stratton, C. IacobuzioDonahue, P.A. Futreal, The patterns and dynamics of genomic instability in metastatic pancreatic cancer, Nature 467 (2010) 1109–1113. F. Mitelman, B. Johansson, F. Mertens, The impact of translocations and gene fusions on cancer causation, Nat. Rev. Cancer 7 (2007) 233–245. S. De, F. Michor, DNA secondary structures and epigenetic determinants of cancer genome evolution, Nat. Struct. Mol. Biol. 18 (2011) 950–955. G. Fudenberg, G. Getz, M. Meyerson, L.A. Mirny, High order chromatin architecture shapes the landscape of chromosomal alterations in cancer, Nat. Biotechnol. 29 (2011) 1109–1113. M.D. Adams, J.M. Kelley, J.D. Gocayne, M. Dubnick, M.H. Polymeropoulos, H. Xiao, C.R. Merril, A. Wu, B. Olde, R.F. Moreno, et al., Complementary DNA sequencing: expressed sequence tags and human genome project, Science 252 (1991) 1651–1656. C.A. Maher, C. Kumar-Sinha, X. Cao, S. Kalyana-Sundaram, B. Han, X. Jing, L. Sam, T. Barrette, N. Palanisamy, A.M. Chinnaiyan, Transcriptome sequencing to detect gene fusions in cancer, Nature 458 (2009) 97–101. J.R. Prensner, M.K. Iyer, O.A. Balbin, S.M. Dhanasekaran, Q. Cao, J.C. Brenner, B. Laxman, I.A. Asangani, C.S. Grasso, H.D. Kominsky, X. Cao, X. Jing, X. Wang, J. Siddiqui, J.T. Wei, D. Robinson, H.K. Iyer, N. Palanisamy, C.A. Maher, A.M. Chinnaiyan, Transcriptome sequencing across a prostate cancer cohort identifies PCAT-1, an unannotated lincRNA implicated in disease progression, Nat. Biotechnol. 29 (2011) 742–749.

159

[31] C. Trapnell, B.A. Williams, G. Pertea, A. Mortazavi, G. Kwan, M.J. van Baren, S.L. Salzberg, B.J. Wold, L. Pachter, Transcript assembly and quantification by RNASeq reveals unannotated transcripts and isoform switching during cell differentiation, Nat. Biotechnol. 28 (2010) 511–515. [32] M.N. Cabili, C. Trapnell, L. Goff, M. Koziol, B. Tazon-Vega, A. Regev, J.L. Rinn, Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses, Genes Dev. 25 (2011) 1915– 1927. [33] Y. Zhang, M. Gong, H. Yuan, H.G. Park, H.F. Frierson, H. Li, Chimeric transcript generated by cis-splicing of adjacent genes regulates prostate cancer cell proliferation, Cancer Discov. 2 (2012) 598–607. [34] C. Mayr, D.P. Bartel, Widespread shortening of 3’UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells, Cell 138 (2009) 673– 684. [35] P.A. Watson, Y.F. Chen, M.D. Balbas, J. Wongvipat, N.D. Socci, A. Viale, K. Kim, C.L. Sawyers, Constitutively active androgen receptor splice variants expressed in castration-resistant prostate cancer require full-length androgen receptor, Proc. Natl. Acad. Sci. USA 107 (2010) 16759–16765. [36] R. Govindan, L. Ding, M. Griffith, J. Subramanian, N.D. Dees, K.L. Kanchi, C.A. Maher, R. Fulton, L. Fulton, J. Wallis, K. Chen, J. Walker, S. McDonald, R. Bose, D. Ornitz, D. Xiong, M. You, D.J. Dooling, M. Watson, E.R. Mardis, R.K. Wilson, Genomic landscape of non-small cell lung cancer in smokers and neversmokers, Cell 150 (2012) 1121–1134. [37] E. Li, Chromatin modification and epigenetic reprogramming in mammalian development, Nat. Rev. Genet. 3 (2002) 662–673. [38] B.M. Turner, Cellular memory and the histone code, Cell 111 (2002) 285–291. [39] P.J. Park, ChIP-seq: advantages and challenges of a maturing technology, Nat. Rev. Genet. 10 (2009) 669–680. [40] F. Krueger, B. Kreck, A. Franke, S.R. Andrews, DNA methylome analysis using short bisulfite sequencing data, Nat. Methods 9 (2012) 145–151. [41] J.H. Kim, S.M. Dhanasekaran, J.R. Prensner, X. Cao, D. Robinson, S. KalyanaSundaram, C. Huang, S. Shankar, X. Jing, M. Iyer, M. Hu, L. Sam, C. Grasso, C.A. Maher, N. Palanisamy, R. Mehra, H.D. Kominsky, J. Siddiqui, J. Yu, Z.S. Qin, A.M. Chinnaiyan, Deep sequencing reveals distinct patterns of DNA methylation in prostate cancer, Genome Res. 21 (2011) 1028–1041. [42] L. Song, G.E. Crawford, DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. Cold Spring Harbor protocols 2010 (2010) pdb prot5384. [43] B. Akhtar-Zaidi, R. Cowper-Sal-lari, O. Corradin, A. Saiakhova, C.F. Bartels, D. Balasubramanian, L. Myeroff, J. Lutterbaugh, A. Jarrar, M.F. Kalady, J. Willis, J.H. Moore, P.J. Tesar, T. Laframboise, S. Markowitz, M. Lupien, P.C. Scacheri, Epigenomic enhancer profiling defines a signature of colon cancer, Science 336 (2012) 736–739. [44] P.A. Jones, S.B. Baylin, The fundamental role of epigenetic events in cancer, Nat. Rev. Genet. 3 (2002) 415–428. [45] T. Misteli, Beyond the sequence. cellular organization of genome function, Cell 128 (2007) 787–800. [46] J. Dekker, Gene regulation in the third dimension, Science 319 (2008) 1793– 1794. [47] T. Sexton, H. Schober, P. Fraser, S.M. Gasser, Gene regulation through nuclear organization, Nat. Struct. Mol. Biol. 14 (2007) 1049–1055. [48] E. Lieberman-Aiden, N.L. van Berkum, L. Williams, M. Imakaev, T. Ragoczy, A. Telling, I. Amit, B.R. Lajoie, P.J. Sabo, M.O. Dorschner, R. Sandstrom, B. Bernstein, M.A. Bender, M. Groudine, A. Gnirke, J. Stamatoyannopoulos, L.A. Mirny, E.S. Lander, J. Dekker, Comprehensive mapping of long-range interactions reveals folding principles of the human genome, Science 326 (2009) 289–293. [49] D.S. Rickman, T.D. Soong, B. Moss, J.M. Mosquera, J. Dlabal, S. Terry, T.Y. MacDonald, J. Tripodi, K. Bunting, V. Najfeld, F. Demichelis, A.M. Melnick, O. Elemento, M.A. Rubin, Oncogene-mediated alterations in chromatin conformation, Proc. Natl. Acad. Sci. USA 109 (2012) 9083–9088. [50] G. Li, M.J. Fullwood, H. Xu, F.H. Mulawadi, S. Velkov, V. Vega, P.N. Ariyaratne, Y.B. Mohamed, H.S. Ooi, C. Tennakoon, C.L. Wei, Y. Ruan, W.K. Sung, ChIA-PET tool for comprehensive chromatin interaction analysis with paired-end tag sequencing, Genome Biol. 11 (2010) R22. [51] J. Dostie, T.A. Richmond, R.A. Arnaout, R.R. Selzer, W.L. Lee, T.A. Honan, E.D. Rubio, A. Krumm, J. Lamb, C. Nusbaum, R.D. Green, J. Dekker, Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements, Genome Res. 16 (2006) 1299–1309. [52] B.E. Bernstein, E. Birney, I. Dunham, E.D. Green, C. Gunter, M. Snyder, An integrated encyclopedia of DNA elements in the human genome, Nature 489 (2012) 57–74. [53] J.N. Weinstein, Integromic analysis of the NCI-60 cancer cell lines, Breast Dis. 19 (2004) 11–22. [54] U.T. Shankavaram, S. Varma, D. Kane, M. Sunshine, K.K. Chary, W.C. Reinhold, Y. Pommier, J.N. Weinstein, Cell miner: a relational database and query tool for the NCI-60 cancer cell lines, BMC Genomics 10 (2009) 277. [55] G. Ciriello, E. Cerami, C. Sander, N. Schultz, Mutual exclusivity analysis identifies oncogenic network modules, Genome Res. 22 (2012) 398–406. [56] K.A. Le Cao, I. Gonzalez, S. Dejean, IntegrOmics: an R package to unravel relationships between two omics datasets, Bioinformatics 25 (2009) 2855– 2856. [57] O. Yli-Harja, A. Ylipaa, M. Nykter, W. Zhang, Cancer systems biology: signal processing for cancer research, Chin. J. Cancer 30 (2011) 221–225.

160

L.R. Patel et al. / Cancer Letters 340 (2013) 152–160

[58] M.S. Carro, W.K. Lim, M.J. Alvarez, R.J. Bollo, X. Zhao, E.Y. Snyder, E.P. Sulman, S.L. Anne, F. Doetsch, H. Colman, A. Lasorella, K. Aldape, A. Califano, A. Iavarone, The transcriptional network for mesenchymal transformation of brain tumours, Nature 463 (2010) 318–325. [59] C.J. Vaske, S.C. Benz, J.Z. Sanborn, D. Earl, C. Szeto, J. Zhu, D. Haussler, J.M. Stuart, Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM, Bioinformatics 26 (2010) i237–245. [60] Y.F. Guan, G.R. Li, R.J. Wang, Y.T. Yi, L. Yang, D. Jiang, X.P. Zhang, Y. Peng, Application of next-generation sequencing in clinical oncology to advance personalized treatment of cancer, Chin. J. cancer 31 (2012) 463–470. [61] N.P. Shah, C. Tran, F.Y. Lee, P. Chen, D. Norris, C.L. Sawyers, Overriding imatinib resistance with a novel ABL kinase inhibitor, Science 305 (2004) 399–401. [62] R. Nahta, D. Yu, M.C. Hung, G.N. Hortobagyi, F.J. Esteva, Mechanisms of disease: understanding resistance to HER2-targeted therapy in human breast cancer, Nat. Clin. Pract. Oncol. 3 (2006) 269–280. [63] J.A. Engelman, P.A. Janne, Mechanisms of acquired resistance to epidermal growth factor receptor tyrosine kinase inhibitors in non-small cell lung cancer, Clin. Cancer Res. 14 (2008) 2895–2899. [64] J. Villanueva, A. Vultur, J.T. Lee, R. Somasundaram, M. Fukunaga-Kalabis, A.K. Cipolla, B. Wubbenhorst, X. Xu, P.A. Gimotty, D. Kee, A.E. Santiago-Walker, R.

[65] [66]

[67]

[68]

[69]

Letrero, K. D’Andrea, A. Pushparajan, J.E. Hayden, K.D. Brown, S. Laquerre, G.A. McArthur, J.A. Sosman, K.L. Nathanson, M. Herlyn, Acquired resistance to BRAF inhibitors mediated by a RAF kinase switch in melanoma can be overcome by cotargeting MEK and IGF-1R/PI3K, Cancer Cell 18 (2010) 683–695. F. Michor, Y. Iwasa, M.A. Nowak, Dynamics of cancer progression, Nat. Rev. Cancer 4 (2004) 197–205. N. Navin, J. Kendall, J. Troge, P. Andrews, L. Rodgers, J. McIndoo, K. Cook, A. Stepansky, D. Levy, D. Esposito, L. Muthuswamy, A. Krasnitz, W.R. McCombie, J. Hicks, M. Wigler, Tumour evolution inferred by single-cell sequencing, Nature 472 (2011) 90–94. F. Tang, C. Barbacioru, Y. Wang, E. Nordman, C. Lee, N. Xu, X. Wang, J. Bodeau, B.B. Tuch, A. Siddiqui, K. Lao, M.A. Surani, MRNA-Seq whole-transcriptome analysis of a single cell, Nat. Methods 6 (2009) 377–382. F. Tang, C. Barbacioru, S. Bao, C. Lee, E. Nordman, X. Wang, K. Lao, M.A. Surani, Tracing the derivation of embryonic stem cells from the inner cell mass by single-cell RNA-Seq analysis, Cell Stem Cell 6 (2010) 468–478. D. Ramskold, S. Luo, Y.C. Wang, R. Li, Q. Deng, O.R. Faridani, G.A. Daniels, I. Khrebtukova, J.F. Loring, L.C. Laurent, G.P. Schroth, R. Sandberg, Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells, Nat. Biotechnol. 30 (2012) 777–782.