Endogenized viral sequences in mammals

Endogenized viral sequences in mammals

Available online at www.sciencedirect.com ScienceDirect Endogenized viral sequences in mammals Nicholas F Parrish1 and Keizo Tomonaga2,3,4 Reverse-tr...

626KB Sizes 0 Downloads 77 Views

Available online at www.sciencedirect.com

ScienceDirect Endogenized viral sequences in mammals Nicholas F Parrish1 and Keizo Tomonaga2,3,4 Reverse-transcribed RNA molecules compose a significant portion of the human genome. Many of these RNA molecules were retrovirus genomes either infecting germline cells or having done so in a previous generation but retaining transcriptional activity. This mechanism itself accounts for a quarter of the genomic sequence information of mammals for which there is data. We understand relatively little about the causes and consequences of retroviral endogenization. This review highlights functions ascribed to sequences of viral origin endogenized into mammalian genomes and suggests some of the most pressing questions raised by these observations. Addresses 1 Section of Surgical Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, United States 2 Department of Viral Oncology, Institute for Virus Research, Kyoto University, Kyoto 606-8507, Japan 3 Department of Tumor Viruses, Graduate School of Medicine, Kyoto University, Kyoto 606-8507, Japan 4 Department of Mammalian Regulatory Network, Graduate School of Biostudies, Kyoto University, Kyoto 606-8507, Japan Corresponding authors: Parrish, Nicholas F. ([email protected]) and Tomonaga, Keizo ([email protected])

Current Opinion in Microbiology 2016, 31:176–183 This review comes from a themed issue on Megaviromes Edited by Didier Raoult and Jonatas Abrahao

http://dx.doi.org/10.1016/j.mib.2016.03.002 1369-5274/# 2016 Elsevier Ltd. All rights reserved.

Introduction Taxonomists distinguish between exogenous retroviruses (Figure 1a) [1], retrovirus-like endogenous retroviruses [2], retrotransposons that contain long-terminal repeats (LTRs) as do retroviruses [3], and autonomously-mobile retrotransposons lacking LTRs [4–6] all of which are mobile nucleic acid elements encoding reverse transcriptase and inferred to have a common origin on the basis of phylogenetic analysis of this protein. It has been argued that an infectious retrovirus was ancestral to the elements now comprising each of these groups [7]. Some members of each group move via an RNA intermediate in mammalian germlines, that is to say, vertically. Cross-individual or cross-species horizontal mobility is a defining feature of exogenous retroviruses, but has also been observed for endogenous Current Opinion in Microbiology 2016, 31:176–183

retroviruses [8–11], LTR retrotransposons [12–14], and autonomous non-LTR retrotransposons [15,16]. Recent large-scale genome analyses provide evidence for considerable host-switching by endogenous retroviruses, an observation consistent with extensive yet-to-be discovered exogenous retroviral diversity [17,18,19]. The boundaries between taxonomically distinct groups of retroelements are predicated on our cross-sectional perspective with respect to time and the current limited depth of sampling; nevertheless, we have four examples in which a host species is infected by a virus straddling the exogenous/endogenous retrovirus divide: clearly koala [20] and likely sheep [21], opossum [22], and polar bear [23]. It can be expected that the great cloud of mobile retroelement diversity will continue to expand and merge as more data is collected and analyzed. The important realization that these elements have contributed vast swaths of mammalian genomes is ongoing. Conservative estimates that 40% of the human genome is reverse transcript in origin [24] balloon when considering that nearly 80% of the genome can be identified as repeat-derived using algorithms with increased sensitivity for short and degraded sequences [25]. Given their massive contribution to the overall content of the genome, in comparison to the exome there disproportionately little evidence that sequences derived via reverse-transcription of non-self RNA contribute to mammalian organismal physiology. This reflects the nature of scientific change from a historically entrenched paradigm. Prior to the genomic era, selective breeding experiments allowed generation of chicken and mice lacking specific endogenous viral elements [26–28]. At this time, the concept of even one endogenous provirus was very recently a controversial tenet [29]. Perhaps thusly, the possibility that the endogenous viral sequences in question represented the tip of a deep iceberg was neglected; these experiments were interpreted to suggest that endogenous viral elements were superfluous to normal development [26]. Despite the recognition of Barbara McClintock’s foresight into the importance of transposons as ‘controlling elements’ [30], it has taken a long time for comprehensive alternative paradigms to be presented in rigorous forums [31].

Endogenous retroelements in health and disease Perhaps the most frequently cited example of retro-endogenization playing an important role in mammalian physiology comes from the exaptation of retroviral envelope genes encoding syncytin proteins, which play a role in cell– cell fusion during placentation (Figure 1b) [32,33]. In humans, two endogenous retroviral envelope genes are www.sciencedirect.com

Retroviruses and endogenization in mammals Parrish and Tomonaga 177

Figure 1

(a)

Env

receptor

Gag

RNA

RT

DNA

AR

host

PRR

viral

(b)

i

endogenous viral

ii i

i viii

placental functions

ii

vi

ii v

iii

iii

vii iv

receptor interference

retroviral lifecycle

i

iii

(c)

dominant-negative effects

i

iv

iv

(d)

ii

pluripotency iv

iii

ii

iii

v

i

i ii

vi

(g)

adaptive immunity

iv

iv iii

ii

(f)

(e)

innate immunity Current Opinion in Microbiology

(a) Retroviral to endogenous retrovirus lifecycle. Upon engagement of host receptor (i), the viral payload enters the cell. The viral RNA genome is uncovered by the structural proteins protecting it (ii) and, at some point, reverse transcribed into viral cDNA (iii). Host pattern recognition receptors (PRRs) may recognize viral RNA or cDNA and activate signaling cascades that lead to abortive infection (iv). Alternatively, the viral cDNA may be integrated as a proviral copy with the host chromosome (v) from which progeny virus are made (vi). Upon integration of a retrovirus into a germ cell (vii), a provirus may be horizontally transmitted via the germline as an endogenous retrovirus. As an endogenous retrovirus or retroelement, further propagation within the germline is possible in the absence of infectious virion production (viii). (b) Syncytin-mediated fusion and immune suppression. Expression of fusion-competent endogenous retroviral Envelope glycoproteins (Env) on the surface of trophoblast cells of the placenta enables production of multinucleated syncitiatrophoblasts (i). Via an unknown mechanism, syncytins have been proposed to prevent recognition of fetal cells as foreign by the allogeneic maternal immune system (ii). (c) Receptor interference. Expression of endogenous Env interacts with the receptor of closely related exogenous viruses, either in a cell-associated (i) or cell-free (ii) form, leading to either degradation (iii) of the receptor or competitive inhibition preventing efficient interaction of exogenous virus with the host cell receptor. (d) Dominant-negative protein production. After infection of a cell with an exogenous virus (i) which is capable of producing an endogenous viral structural protein (ii), formation of a functional group antigen (Gag) lattice is interrupted (iii) leading to production of non-infectious virions (iv). (e) Innate immune modulation. Environmentally triggered or stochastic expression of endogenous retroelement RNA or reverse transcription of this RNA in to cDNA (i) generates a substrate recognized by PRRs (ii) which transduce a signal to the nucleus (iii) activating antiviral gene transcription programs and leading to a multi-effector antiviral state (iv). (f) Adaptive immune modulation. Activation of PRRs by endogenous triggers (i) is required for expression of anticipatory receptors, antibodies (ii), in T-independent B cell responses. Endogenous retroelement reverse transcriptase (RT) is required for reverse transcription (iii) and integration (iv) of sequences from viruses that do not encode reverse transcriptase activity. These non-retroviral endogenous viral elements may then produce dominant negative viral proteins (see d) or small RNA (v) capable of base-pairing with exogenous viral RNA, leading to viral gene silencing (vi). (g) Pluripotency. Endogenous retrovirus transcripts may act as nuclear non-coding RNA (i) or may encode proteins that bind to host RNAs involved in pluripotency (ii). Germ cells with overactive endogenous retroelements (iii) may die prior to zygote formation (iv), allowing propagation of only those gametes with relative genomic integrity.

www.sciencedirect.com

Current Opinion in Microbiology 2016, 31:176–183

178 Megaviromes

expressed specifically on trophoblasts, the cells that fuse to form syncytia critical to establishing the materno-fetal interface. Interestingly, in mice two endogenous retroviruses integrated into different genomic positions relative to those encoding human syncytins (i.e. the viral integrations are not orthologous, not related by descent) have been implicated in this process and are activated by a transcription factor shared in common by humans and mice [34]; the genetic absence of these syncytins in mice results in abnormal placental syncycia formation [35,36]. Across the Therian subclass, there have been at least ten independent retroviral integration events from which the envelope gene has been convergently exapted to function as a syncytin [37]. In addition to their proposed role in cell–cell fusion, transduction of some syncytins allows engraftment of otherwise rejected allogeneic tumor cells, presumably by suppressing immune recognition [38]. Viral modulation of cell surface proteins involved in immunity, including MHC molecules, is common [39,40], raising the question of whether or not this is the case for syncytins. In any case, expression of endogenous viral proteins in the placenta may have both anatomic and immunological consequences. Prior to their implication in placental immunomodulation, endogenous viral sequences were shown to mediate immunity to closely related exogenous viruses. The bestdescribed mechanism has been termed receptor interference (Figure 1c), whereby an endogenous retroviral envelope glycoprotein (Env) binds to and sequesters the receptor of the very closely related exogenous virus. This has been shown for endogenous Jaagsiekte sheep retrovirus (enJSRV) [41], a provirus related to murine leukemia virus (MLV) [42,43], and endogenous feline leukemia virus [44]. Mtv-3, a provirus related to murine mammary tumor virus (MMTV), was shown to correlate with resistance to MMTV through an as yet unknown mechanism [45]. Not only Env, but other endogenous viral proteins can exert a dominant-negative role on infection by related viruses. This has been shown for enJSRV group antigen [46] proteins [47] as well as for the Gag encoded by Fv1, identified as a murine leukemia virus restriction factor (Figure 1d) [48]. Interestingly Fv1 is itself not closely related to MLV at the nucleotide level, and while the first identified phenotype was resistance to this betaretrovirus, alleles of this gene in various mouse species have recently been shown to restrict infection by a lentivirus and a spumavirus [49]. Thus while initially considered to confer type-specific immunity to very closely related exogenous viruses, in some cases endogenous retroviral protein expression can have a broader antiviral effect. Sequences derived from endogenous retroelements may play a physiological role in immunity via activating innate immune pathways when transcribed to RNA or upon reverse transcription to cDNA. Cells treated with the Current Opinion in Microbiology 2016, 31:176–183

DNA demethylating agent 5-aza-20 -deoxycytidine massively increase expression of endogenous retrovirus and other retroelement RNA and this correlates with interferon-stimulated gene response, however only in cells lacking functional alleles of p53, one of the most frequently mutated tumor suppressor genes in human neoplasms [50]. Several recent papers have extended [51] and clarified these observations in the case of colorectal cancer [52] and ovarian cancer [53]. Retroelement RNA or cDNA has also been suggested to act in a similar mechanism in non-transformed cells (reviewed in [54,55]). Additionally, in germ cells in the mouse testis, expression of the autonomous non-LTR retrotransposon LINE-1 induces expression of interferon beta, especially when the endonuclease domain of the polymerase gene is functional [56]. Moreover, release of retroelement RNA bound to the Ro60 autoantigen increased interferon-regulated gene stimulation [57]. Together these data support a role for endogenous retroelement sequences, generally acting at the RNA or cDNA level to trigger pattern recognition receptors which have previously been implicated in recognizing exogenous viruses, in modulating innate immunity (Figure 1e). For each physiologic function with a hypothetical survival benefit that a retroelement might play, it follows that a corresponding pathology results from improper control of the retroelement. The full literature associating human endogenous retroviruses with disease is beyond the scope of this review (see [58]), but there is growing evidence that a subset of autoimmune diseases with increased type 1 interferon signatures may, etiologically, represent syndromes of retroelement dysregulation [54,59,60]. In addition to innate immunity, endogenous retroviruses have been proposed to play a role in adaptive immunity. Endogenous retrovirus transcript/reverse-transcript sensing as described above is required for generation of some T cell-independent B cell immune responses; preventing cDNA formation by these retroelements with reverse transcriptase inhibitors has been shown to block antibody formation (Figure 1f) [61]. This data is a striking realization of a hypothesis presented over 35 years ago [62]. In addition to a direct involvement, several investigators have proposed that endogenous retrovirus sequences present in major histocompatibility gene families are important for generation of diversity in these genes, which are critical for adaptive immunity [63–65]. A role for endogenous retroviruses in generation of diversity at antibody variable segment gene loci has also been proposed [66]. Finally, endogenous retroelements are required for endogenization of viruses which do not encode a reverse transcriptase, a recently described phenomenon [67] with a surprising multitude of examples [68]. Endogenization of such sequences may play a role in typespecific immunity via dominant negative protein production as described above [69] and/or via an RNA silencing mechanism analogous to CRISPR/Cas [70]. www.sciencedirect.com

Retroviruses and endogenization in mammals Parrish and Tomonaga 179

Endogenous retrovirus expression is high in germ cells, preimplantation embryos and the placenta [26,71,72]. This led to the hypothesis that they play a role in early human development [73–75], which has been born out at least in part with the evidence relating to syncytins. However, as gene expression in some of these tissues is pervasive [76], it was not clear that increased expression of endogenous retroviral sequences in pluripotent tissues implied function. Surprisingly, recent data suggests that endogenous retroviral expression is not only associated with pluripotent tissues but in fact appears to play a role in establishing or maintaining the pluripotency phenotype itself [77–79]. One model suggests that the endogenous retroviral RNA acts as a nuclear non-coding RNA scaffold to activate transcription of specific genes (Figure 1g) [80,81]. Another non-mutually exclusive model proposes that proteins expressed from endogenous retroviral transcripts could induce antiviral proteins and influence translation of subsets of cellular RNAs [82]. Mirroring the multiple independent syncytin exaptations across placental mammals [37], it is remarkable that a physiologic function as basal as maintenance of pluripotency is critically influenced by endogenous sequences that entered primate genomes by some estimates as recently as 40 000 years ago [83]; if endogenous retroviruses are responsible for this function in other mammals, as seems the case at least in mice [84], they are different viruses. The maintenance of totipotency across generations, the role of the germline, is also influenced by retroelements. Most mouse oocytes die prior to ovulation and recent work has suggested that variable levels of LINE-1 activity underlie this attrition, supporting a model in which the eggs with optimal control of this potential mutagen are selected for potential fertilization [85]. As similar mechanisms may be in play in spermatogenesis [56,86] and even Caenorhabditis elegans germ cell development [87], this may also be broadly conserved in principle yet with potentially different retroelement actors in each setting. As expected for nuclear elements with a role maintaining a state of unlimited cell division, dysregulation of retroelements has been proposed to be involved in carcinogenesis [88–92]. The recent data relating to hepatocellular carcinoma (HCC) demonstrates several of the diverse mechanisms involved: somatic integrations of LINE-1 in some cancers drives oncogene expression [93], integration of a gene from the non-retroviral hepatitis B virus (HBV) gene into the human genome creates a chimeric transcript with LINE-1 which functions as a long non-coding RNA in as many as 20% of HBV-associated tumors [94], and activation of endogenous retrovirus LTRs, similarly to pluripotent cells, characterizes the HCC cell transcriptome [95].

Questions for the future The past decade has seen the rapid development of tools allowing us to rationally engineer mammalian www.sciencedirect.com

genomes; doing so wisely will be challenging. The examples above provide compelling evidence to reject the hypothesis that endogenous viral sequences are selfish DNA with no specific contribution to the reproductive success of its host organism [96]. There is thus a new set of practical questions facing this field, in addition to the need for new conceptual frameworks to better explain these observations [97,98]. Can we harness the existing systems controlling endogenous retroviruses to fight pathogenic exogenous ones [99]? One lead in this direction is the observation that genes endogenized from non-retroviral viruses are used to generate piRNA [70], a class of small RNA known to silence endogenous retroviruses and retrotransposons in mammals and, in insects, exogenous viruses [100]. Thus engineering animals with ‘artificially endogenized’ viral sequences in the proper genomic context may lead to heritable immunity. Modification of the porcine genome to remove endogenous retroviruses, a theorized impediment to xenotransplantation, has already been described [101]. Studying the organismlevel consequences of such modification, and in converse the implications of engineering hyperactive retroelements into the genome [102], will allow us to prospectively test ideas previously only amenable to teleological speculation. Similarly modified organisms may make ideal systems to test intriguing hypotheses about retroelement involvement in neural processes [103]. Additionally, studying mammals in which classes of retroelements have become extinct [104] or new types of retroelements have been added [105] may also provide insights (and help qualify overexuberance) about the physiological importance of retroelements. More proximally, more work to subdivide autoimmune disorders on the basis of genetic [106,107] and biochemical [108] evidence suggesting retroelement dysregulation is needed to allow therapeutic trials of reverse transcriptase inhibitors [59] and cGAS pathway inhibitors as these become available [60]. A better mechanistic understanding of the interplay between p53 mutation, retroelement transcription/reverse transcription, and interferon production may allow us to define cancer subtypes which may respond to available immunotherapeutics [109] or hold clues for new treatment approaches [110]. In summary, we propose that there is as much to learn about mammalian physiology from the forty percent of the genome contributed by reverse-transcribed RNA as we have from the two percent encoding protein.

Acknowledgements Space limitations prevented us from citing all of the recent advances in this field and we apologize to those whose work has been excluded. Research in Prof. Tomonaga’s laboratory is funded in part by Funding Program for Next Generation World-Leading Researchers (NEXT program), KAKENHI grant numbers 26253027 and 26670225, and the Core-to-Core Program A, Advanced Research Networks from the Japan Society for the Promotion of Science (JSPS); grants from Takeda Science Foundation. Current Opinion in Microbiology 2016, 31:176–183

180 Megaviromes

References and recommended reading Papers of particular interest, published within the period of review, have been highlighted as:  of special interest  of outstanding interest 1.

King AM, Adams MJ, Lefkowitz EJ: Virus Taxonomy: Classification and Nomenclature of Viruses: Ninth Report of the International Committee on Taxonomy of Viruses. Elsevier; 2011.

2.

Blomberg J, Benachenhou F, Blikstad V, Sperber G, Mayer J: Classification and nomenclature of endogenous retroviral sequences (ERVs): problems and recommendations. Gene 2009, 448:115-123.

3.

Llorens C, Munoz-Pomer A, Bernad L, Botella H, Moya A: Network dynamics of eukaryotic LTR retroelements beyond phylogenetic trees. Biol Direct 2009, 4:41.

4.

Malik HS, Burke WD, Eickbush TH: The age and evolution of nonLTR retrotransposable elements. Mol Biol Evol 1999, 16:793-805.

5.

Eickbush TH, Jamburuthugoda VK: The diversity of retrotransposons and the properties of their reverse transcriptases. Virus Res 2008, 134:221-234.

6.

Rho M, Tang H: MGEScan-non-LTR: computational identification and classification of autonomous non-LTR retrotransposons in eukaryotic genomes. Nucleic Acids Res 2009, 37:e143.

7.

Coffin J: Reverse transcriptase and evolution. In Reverse Transcriptase. Edited by Skalka AM, Goff S. Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press; 1993:445-479.

8.

9.

van der Laan LJ, Lockey C, Griffeth BC, Frasier FS, Wilson CA, Onions DE, Hering BJ, Long Z, Otto E, Torbett BE, Salomon DR: Infection by porcine endogenous retrovirus after islet xenotransplantation in SCID mice. Nature 2000, 407: 90-94. Paprotka T, Delviks-Frankenberry KA, Cingoz O, Martinez A, Kung HJ, Tepper CG, Hu WS, Fivash MJ Jr, Coffin JM, Pathak VK: Recombinant origin of the retrovirus XMRV. Science 2011, 333:97-101.

10. Young GR, Eksmond U, Salcedo R, Alexopoulou L, Stoye JP, Kassiotis G: Resurrection of endogenous retroviruses in antibody-deficient mice. Nature 2012, 491:774-778. 11. Yoshikawa R, Shimode S, Sakaguchi S, Miyazawa T: Contamination of live attenuated vaccines with an infectious feline endogenous retrovirus (RD-114 virus). Arch Virol 2014, 159:399-404. 12. Song SU, Gerasimova T, Kurkulos M, Boeke JD, Corces VG: An env-like protein encoded by a Drosophila retroelement: evidence that gypsy is an infectious retrovirus. Genes Dev 1994, 8:2046-2057. 13. Bartolome C, Bello X, Maside X: Widespread evidence for horizontal transfer of transposable elements across Drosophila genomes. Genome Biol 2009, 10:R22. 14. El Baidouri M, Carpentier MC, Cooke R, Gao D, Lasserre E, Llauro C, Mirouze M, Picault N, Jackson SA, Panaud O: Widespread and frequent horizontal transfers of transposable elements in plants. Genome Res 2014, 24:831-838. 15. Schaack S, Gilbert C, Feschotte C: Promiscuous DNA: horizontal transfer of transposable elements and why it matters for eukaryotic evolution. Trends Ecol Evol 2010, 25:537-546. 16. Walsh AM, Kortschak RD, Gardner MG, Bertozzi T, Adelson DL: Widespread horizontal transfer of retrotransposons. Proc Natl Acad Sci U S A 2013, 110:1012-1016. 17. Hayward A, Grabherr M, Jern P: Broad-scale phylogenomics provides insights into retrovirus-host evolution. Proc Natl Acad Sci U S A 2013, 110:20146-20151. Current Opinion in Microbiology 2016, 31:176–183

18. Hayward A, Cornwallis CK, Jern P: Pan-vertebrate comparative genomics unmasks retrovirus macroevolution. Proc Natl Acad  Sci U S A 2015, 112:464-469. As genome-scale data continues to increase, unbiased phylogenomic approaches such as the ones used in this manuscript are needed to recalibrate and redefine retrovirus and retroelement taxonomy. There are independently-reportable discoveries sprinkled throughout this manuscript, which focuses on the big-picture. 19. Zhuo X, Feschotte C: Cross-species transmission and differential fate of an endogenous retrovirus in three mammal  lineages. PLoS Pathog 2015, 11:e1005279. An interesting case study demonstrating surprising fluidity in the horizontal transmission of retroviruses across mammalian orders. 20. Ishida Y, Zhao K, Greenwood AD, Roca AL: Proliferation of endogenous retroviruses in the early stages of a host germ line invasion. Mol Biol Evol 2015, 32:109-120. 21. Armezzani A, Varela M, Spencer TE, Palmarini M, Arnaud F: ‘‘Menage a Trois’’: the evolutionary interplay between JSRV,  enJSRVs and domestic sheep. Viruses 2014, 6:4926-4945. A review of one of the best-studied examples of exogenous/endogenous retrovirus coevolution: JRSV and enJRSV. 22. Cornelis G, Vernochet C, Carradec Q, Souquere S, Mulot B,  Catzeflis F, Nilsson MA, Menzies BR, Renfree MB, Pierron G, Zeller U, Heidmann O, Dupressoir A, Heidmann T: Retroviral envelope gene captures and syncytin exaptation for placentation in marsupials. Proc Natl Acad Sci U S A 2015, 112:E487-E496. This work demonstrates that marsupials have exapted syncytins from different retroviral Envs than Eutherian lineages and demonstrates that an endogenous retrovirus integration in Monodelphis domestica is polymorphic among modern individuals, suggesting very recent integration. 23. Tsangaras K, Mayer J, Alquezar-Planas DE, Greenwood AD: An evolutionarily young polar bear (Ursus maritimus) endogenous retrovirus identified from next generation sequence data. Viruses 2015, 7:6089-6107. 24. Smit A, Hubley R, Green P: RepeatMasker Open-4.0. 2013–2015 http://www.repeatmasker.org. 25. de Koning AP, Gu W, Castoe TA, Batzer MA, Pollock DD: Repetitive elements may comprise over two-thirds of the human genome. PLoS Genet 2011, 7:e1002384. 26. Astrin SM, Buss EG, Haywards WS: Endogenous viral genes are non-essential in the chicken. Nature 1979, 282:339-341. 27. Cohen JC, Varmus HE: Endogenous mammary tumour virus DNA varies among wild mice and segregates during inbreeding. Nature 1979, 278:418-423. 28. Frisby DP, Weiss RA, Roussel M, Stehelin D: The distribution of endogenous chicken retrovirus sequences in the DNA of galliform birds does not coincide with avian phylogenetic relationships. Cell 1979, 17:623-634. 29. Weiss RA: The discovery of endogenous retroviruses. Retrovirology 2006, 3:67.  Without perspective on the paradigms at work at the time of the discovery of endogenous retroviruses and earliest questions surrounding endogenous retroviruses it is difficult to make sense of the direction of this field. 30. McClintock B: The significance of responses of the genome to challenge. Science 1984, 226:792-801.  Required reading. A concise summary from a pioneer worker in the field. The concepts remain very fresh. 31. Trono D: Transposable elements, polydactyl proteins, and the  genesis of human-specific transcription networks. Cold Spring Harb Symp Quant Biol 2016 http://dx.doi.org/10.1101/ sqb.2015.80.027573. This paper represents a recent endeavor to bring together current data on the functions of endogenous retroviruses into a cohesive paradigm. 32. Mi S, Lee X, Li X, Veldman GM, Finnerty H, Racie L, LaVallie E, Tang XY, Edouard P, Howes S, Keith JC Jr, McCoy JM: Syncytin is a captive retroviral envelope protein involved in human placental morphogenesis. Nature 2000, 403:785-789. 33. Blaise S, de Parseval N, Benit L, Heidmann T: Genomewide screening for fusogenic human endogenous retrovirus www.sciencedirect.com

Retroviruses and endogenization in mammals Parrish and Tomonaga 181

envelopes identifies syncytin 2, a gene conserved on primate evolution. Proc Natl Acad Sci U S A 2003, 100:13013-13018. 34. Simmons DG, Natale DR, Begay V, Hughes M, Leutz A, Cross JC: Early patterning of the chorion leads to the trilaminar trophoblast cell structure in the placental labyrinth. Development 2008, 135:2083-2091. 35. Dupressoir A, Vernochet C, Bawa O, Harper F, Pierron G, Opolon P, Heidmann T: Syncytin-A knockout mice demonstrate the critical role in placentation of a fusogenic, endogenous retrovirus-derived, envelope gene. Proc Natl Acad Sci U S A 2009, 106:12127-12132. 36. Dupressoir A, Vernochet C, Harper F, Guegan J, Dessen P, Pierron G, Heidmann T: A pair of co-opted retroviral envelope syncytin genes is required for formation of the two-layered murine placental syncytiotrophoblast. Proc Natl Acad Sci U S A 2011, 108:E1164-E1173. 37. Imakawa K, Nakagawa S, Miyazawa T: Baton pass hypothesis:  successive incorporation of unconserved endogenous retroviral genes for placentation during mammalian evolution. Genes Cells 2015, 20:771-788. This work reviews the progress on defining which endogenous retrovirus Envs function as syncytins in different placental mammal lineages and proposes an evolutionary scheme to explain the observed abundance of convergent but not homologous routes. 38. Mangeney M, Renard M, Schlecht-Louf G, Bouallaga I, Heidmann O, Letzelter C, Richaud A, Ducos B, Heidmann T: Placental syncytins: genetic disjunction between the fusogenic and immunosuppressive activity of retroviral envelope proteins. Proc Natl Acad Sci U S A 2007, 104:20534-20539. 39. Matheson NJ, Sumner J, Wals K, Rapiteanu R, Weekes MP, Vigan R, Weinelt J, Schindler M, Antrobus R, Costa AS, Frezza C, Clish CB, Neil SJ, Lehner PJ: Cell surface proteomic map of HIV infection reveals antagonism of amino acid metabolism by Vpu and Nef. Cell Host Microbe 2015, 18:409-423. 40. Hansen TH, Bouvier M: MHC class I antigen presentation: learning from viral evasion strategies. Nat Rev Immunol 2009, 9:503-513. 41. Spencer TE, Mura M, Gray CA, Griebel PJ, Palmarini M: Receptor usage and fetal expression of ovine endogenous betaretroviruses: implications for coevolution of endogenous and exogenous retroviruses. J Virol 2003, 77:749-753. 42. Ikeda H, Sugimura H: Fv-4 resistance gene: a truncated endogenous murine leukemia virus with ecotropic interference properties. J Virol 1989, 63:5405-5412. 43. Taylor GM, Gao Y, Sanders DA: Fv-4: identification of the defect in Env and the mechanism of resistance to ecotropic murine leukemia virus. J Virol 2001, 75:11244-11248. 44. McDougall AS, Terry A, Tzavaras T, Cheney C, Rojko J, Neil JC: Defective endogenous proviruses are expressed in feline lymphoid cells: evidence for a role in natural resistance to subgroup B feline leukemia viruses. J Virol 1994, 68:2151-2160. 45. Hainaut P, Castellazzi M, Gonzales D, Clausse N, Hilgers J, Crepin M: A congenic line of the BALB/c mouse strain with the endogenous mouse mammary tumor virus proviral gene Mtv3: tissue-specific expression and correlation with resistance to mouse mammary tumor virus infection and tumorigenesis. Cancer Res 1990, 50:3754-3760. 46. Marchetto MC, Narvaiza I, Denli AM, Benner C, Lazzarini TA, Nathanson JL, Paquola AC, Desai KN, Herai RH, Weitzman MD, Yeo GW, Muotri AR, Gage FH: Differential L1 regulation in pluripotent stem cells of humans and apes. Nature 2013, 503:525-529. 47. Mura M, Murcia P, Caporale M, Spencer TE, Nagashima K, Rein A, Palmarini M: Late viral interference induced by transdominant Gag of an endogenous retrovirus. Proc Natl Acad Sci U S A 2004, 101:11117-11122. 48. Best S, Le Tissier P, Towers G, Stoye JP: Positional cloning of the mouse retrovirus restriction gene Fv1. Nature 1996, 382:826-829. www.sciencedirect.com

49. Yap MW, Colbeck E, Ellis SA, Stoye JP: Evolution of the retroviral restriction gene Fv1: inhibition of non-MLV retroviruses. PLoS  Pathog 2014, 10:e1003968. This work demonstrates that the Fv1 retriction activity is much broader than previously imagined. 50. Leonova KI, Brodsky L, Lipchick B, Pal M, Novototskaya L, Chenchik AA, Sen GC, Komarova EA, Gudkov AV: p53 cooperates with DNA methylation and a suicidal interferon response to maintain epigenetic silencing of repeats and noncoding RNAs. Proc Natl Acad Sci U S A 2013, 110:E89-E98. 51. Wylie A, Jones AE, D’Brot A, Lu WJ, Kurtz P, Moran JV, Rakheja D, Chen KS, Hammer RE, Comerford SA, Amatruda JF, Abrams JM: p53 genes function to restrain mobile elements. Genes Dev 2016, 30:64-77. 52. Roulois D, Loo Yau H, Singhania R, Wang Y, Danesh A, Shen SY,  Han H, Liang G, Jones PA, Pugh TJ, O’Brien C, De Carvalho DD: DNA-demethylating agents target colorectal cancer cells by inducing viral mimicry by endogenous transcripts. Cell 2015, 162:961-973. Along with Ref. [53] demonstrates the complex interplay between p53, endogenous viral transcripts, innate immunity, and cancer. 53. Chiappinelli KB, Strissel PL, Desrichard A, Li H, Henke C, Akman B,  Hein A, Rote NS, Cope LM, Snyder A, Makarov V, Buhu S, Slamon DJ, Wolchok JD, Pardoll DM, Beckmann MW, Zahnow CA, Mergoub T, Chan TA, Baylin SB, Strick R: Inhibiting DNA methylation causes an interferon response in cancer via dsRNA including endogenous retroviruses. Cell 2015, 162:974-986. See annotation to Ref. [52]. 54. Volkman HE, Stetson DB: The enemy within: endogenous retroelements and autoimmune disease. Nat Immunol 2014, 15:415-422. 55. Hurst TP, Magiorkinis G: Activation of the innate immune response by endogenous retroviruses. J Gen Virol 2015, 96:1207-1218. 56. Yu Q, Carbone CJ, Katlinskaya YV, Zheng H, Zheng K, Luo M,  Wang PJ, Greenberg RA, Fuchs SY: Type I interferon controls propagation of long interspersed element-1. J Biol Chem 2015, 290:10191-10199. This manuscript is the first to show that LINE-1 activity per se can induce type I interferon and to formally demonstrate the type I interferon restricts LINE-1 transposition. 57. Hung T, Pratt GA, Sundararaman B, Townsend MJ, Chaivorapol C, Bhangale T, Graham RR, Ortmann W, Criswell LA, Yeo GW, Behrens TW: The Ro60 autoantigen binds endogenous retroelements and regulates inflammatory gene expression. Science 2015, 350:455-459. 58. Voisset C, Weiss RA, Griffiths DJ: Human RNA ‘‘rumor’’ viruses: the search for novel human retroviruses in chronic disease. Microbiol Mol Biol Rev 2008, 72:157-196 table of contents. 59. Beck-Engeser GB, Eilat D, Wabl M: An autoimmune disease prevented by anti-retroviral drugs. Retrovirology 2011, 8:91. 60. Gray EE, Treuting PM, Woodward JJ, Stetson DB: Cutting edge: cGAS is required for lethal autoimmune disease in the Trex1deficient mouse model of Aicardi-Goutieres syndrome. J Immunol 2015, 195:1939-1943. 61. Zeng M, Hu Z, Shi X, Li X, Zhan X, Li XD, Wang J, Choi JH, Wang KW, Purrington T, Tang M, Fina M, DeBerardinis RJ, Moresco EM, Pedersen G, McInerney GM, Karlsson Hedestam GB, Chen ZJ, Beutler B: MAVS, cGAS, and endogenous retroviruses in T-independent B cell responses. Science 2014, 346:1486-1492. 62. DeLamarter JF, Stoye J, Schumann G, Moroni C: Evidence supporting a physiological role for endogenous C-type virus in the immune system. Haematol Blood Transfus 1979, 23:413-415. 63. Kulski JK, Gaudieri S, Bellgard M, Balmer L, Giles K, Inoko H, Dawkins RL: The evolution of MHC diversity by segmental duplication and transposition of retroelements. J Mol Evol 1997, 45:599-609. 64. Kulski JK, Gaudieri S, Inoko H, Dawkins RL: Comparison between two human endogenous retrovirus (HERV)-rich Current Opinion in Microbiology 2016, 31:176–183

182 Megaviromes

regions within the major histocompatibility complex. J Mol Evol 1999, 48:675-683. 65. Doxiadis GG, de Groot N, Bontrop RE: Impact of endogenous intronic retroviruses on major histocompatibility complex class II diversity and stability. J Virol 2008, 82:6667-6677. 66. Rothenfluh HS: Hypothesis: a memory lymphocyte-specific  soma-to-germline genetic feedback loop. Immunol Cell Biol 1995, 73:174-180. This is a controversial hypothesis that could be reappraised relatively trivially (compared to when first presented) using modern sequencing technology. 67. Horie M, Honda T, Suzuki Y, Kobayashi Y, Daito T, Oshida T, Ikuta K, Jern P, Gojobori T, Coffin JM, Tomonaga K: Endogenous non-retroviral RNA virus elements in mammalian genomes. Nature 2010, 463:84-87. 68. Katzourakis A, Gifford RJ: Endogenous viral elements in animal genomes. PLoS Genet 2010, 6:e1001191. 69. Fujino K, Horie M, Honda T, Merriman DK, Tomonaga K: Inhibition  of Borna disease virus replication by an endogenous bornavirus-like element in the ground squirrel genome. Proc Natl Acad Sci U S A 2014, 111:13175-13180. Shows that endogenous viral sequences from non-retroviral viruses can be involved in antiviral immunity in a manner similar to endogenous retroviral elements. 70. Parrish NF, Fujino K, Shiromoto Y, Iwasaki YW, Ha H, Xing J, Makino A, Kuramochi-Miyagawa S, Nakano T, Siomi H, Honda T, Tomonaga K: piRNAs derived from ancient viral processed pseudogenes as transgenerational sequence-specific immune memory in mammals. RNA 2015, 21:1691-1703. 71. Seifarth W, Frank O, Zeilfelder U, Spiess B, Greenwood AD, Hehlmann R, Leib-Mosch C: Comprehensive analysis of human endogenous retrovirus transcriptional activity in human tissues with a retrovirus-specific microarray. J Virol 2005, 79:341-352. 72. Lerner RA, Wilson CB, Villano BC, McConahey PJ, Dixon FJ: Endogenous oncornaviral gene expression in adult and fetal mice: quantitative, histologic, and physiologic studies of the major viral glycorprotein, gp70. J Exp Med 1976, 143:151-166. 73. Harris JR: The evolution of placental mammals. FEBS Lett 1991, 295:3-4. 74. Harris JR: Placental endogenous retrovirus (ERV): structural, functional, and evolutionary significance. Bioessays 1998, 20:307-316. 75. Larsson E, Andersson G: Beneficial role of human endogenous retroviruses: facts and hypotheses. Scand J Immunol 1998, 48:329-338. 76. Ramskold D, Wang ET, Burge CB, Sandberg R: An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data. PLoS Comput Biol 2009, 5:e1000598. 77. Macfarlan TS, Gifford WD, Driscoll S, Lettieri K, Rowe HM, Bonanomi D, Firth A, Singer O, Trono D, Pfaff SL: Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature 2012, 487:57-63. 78. Santoni FA, Guerra J, Luban J: HERV-H RNA is abundant in human embryonic stem cells and a precise marker for pluripotency. Retrovirology 2012, 9:111. 79. Wang J, Xie G, Singh M, Ghanbarian AT, Rasko T, Szvetnik A, Cai H, Besser D, Prigione A, Fuchs NV, Schumann GG, Chen W, Lorincz MC, Ivics Z, Hurst LD, Izsvak Z: Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells. Nature 2014, 516:405-409. 80. Lu X, Sachs F, Ramsay L, Jacques PE, Goke J, Bourque G, Ng HH: The retrovirus HERVH is a long noncoding RNA required for human embryonic stem cell identity. Nat Struct Mol Biol 2014, 21:423-425. 81. Ohnuki M, Tanabe K, Sutou K, Teramoto I, Sawamura Y, Narita M, Nakamura M, Tokunaga Y, Nakamura M, Watanabe A, Yamanaka S, Takahashi K: Dynamic regulation of human Current Opinion in Microbiology 2016, 31:176–183

endogenous retroviruses mediates factor-induced reprogramming and differentiation potential. Proc Natl Acad Sci U S A 2014, 111:12426-12431. 82. Grow EJ, Flynn RA, Chavez SL, Bayless NL, Wossidlo M,  Wesche DJ, Martin L, Ware CB, Blish CA, Chang HY, Pera RA, Wysocka J: Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature 2015, 522:221-225. As all of the works cited in this section are of outstanding quality, I will highlight this most recent and most ambitious of the works connecting endogenous retroelements with pluripotency. 83. Glinsky GV: Transposable elements and DNA methylation create in embryonic stem cells human-specific regulatory sequences associated with distal enhancers and noncoding RNAs. Genome Biol Evol 2015, 7:1432-1454. 84. Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, Ng HH, Bourque G: Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet 2010, 42:631-634. 85. Malki S, van der Heijden GW, O’Donnell KA, Martin SL, Bortvin A: A role for retrotransposon LINE-1 in fetal oocyte attrition in mice. Dev Cell 2014, 29:521-533. 86. Hadziselimovic F, Hadziselimovic NO, Demougin P, Krey G, Oakeley E: Piwi-pathway alteration induces LINE-1 transposon derepression and infertility development in cryptorchidism. Sex Dev 2015, 9:98-104. 87. Dennis S, Sheth U, Feldman JL, English KA, Priess JR: C. elegans germ cells show temperature and age-dependent expression of Cer1, a Gypsy/Ty3-related retrotransposon. PLoS Pathog 2012, 8:e1002591. 88. Rodic N, Sharma R, Sharma R, Zampella J, Dai L, Taylor MS, Hruban RH, Iacobuzio-Donahue CA, Maitra A, Torbenson MS, Goggins M, Shih Ie M, Duffield AS, Montgomery EA, Gabrielson E, Netto GJ, Lotan TL, De Marzo AM, Westra W, Binder ZA, Orr BA, Gallia GL, Eberhart CG, Boeke JD, Harris CR, Burns KH: Long interspersed element-1 protein expression is a hallmark of many human cancers. Am J Pathol 2014, 184:1280-1286. 89. Rodic N, Steranka JP, Makohon-Moore A, Moyer A, Shen P, Sharma R, Kohutek ZA, Huang CR, Ahn D, Mita P, Taylor MS, Barker NJ, Hruban RH, Iacobuzio-Donahue CA, Boeke JD, Burns KH: Retrotransposon insertions in the clonal evolution of pancreatic ductal adenocarcinoma. Nat Med 2015, 21:1060-1064. 90. Ewing AD, Gacita A, Wood LD, Ma F, Xing D, Kim MS, Manda SS, Abril G, Pereira G, Makohon-Moore A, Looijenga LH, Gillis AJ, Hruban RH, Anders RA, Romans KE, Pandey A, IacobuzioDonahue CA, Vogelstein B, Kinzler KW, Kazazian HH Jr, Solyom S: Widespread somatic L1 retrotransposition occurs early during gastrointestinal cancer evolution. Genome Res 2015, 25:1536-1545. 91. Tubio JM, Li Y, Ju YS, Martincorena I, Cooke SL, Tojo M, Gundem G, Pipinikas CP, Zamora J, Raine K, Menzies A, RomanGarcia P, Fullam A, Gerstung M, Shlien A, Tarpey PS, Papaemmanuil E, Knappskog S, Van Loo P, Ramakrishna M, Davies HR, Marshall J, Wedge DC, Teague JW, Butler AP, NikZainal S, Alexandrov L, Behjati S, Yates LR, Bolli N, Mudie L, Hardy C, Martin S, McLaren S, O’Meara S, Anderson E, Maddison M, Gamble S, Group IBC, Group IBC, Group IPC, Foster C, Warren AY, Whitaker H, Brewer D, Eeles R, Cooper C, Neal D, Lynch AG, Visakorpi T: Mobile DNA in cancer. Extensive transduction of nonrepetitive DNA mediated by L1 retrotransposition in cancer genomes. Science 2014, 345:1251343. 92. Ruprecht K, Mayer J, Sauter M, Roemer K, Mueller-Lantzsch N: Endogenous retroviruses and cancer. Cell Mol Life Sci 2008, 65:3366-3382. 93. Shukla R, Upton KR, Munoz-Lopez M, Gerhardt DJ, Fisher ME, Nguyen T, Brennan PM, Baillie JK, Collino A, Ghisletti S, Sinha S, Iannelli F, Radaelli E, Dos Santos A, Rapoud D, Guettier C, Samuel D, Natoli G, Carninci P, Ciccarelli FD, Garcia-Perez JL, Faivre J, Faulkner GJ: Endogenous retrotransposition activates oncogenic pathways in hepatocellular carcinoma. Cell 2013, 153:101-111. www.sciencedirect.com

Retroviruses and endogenization in mammals Parrish and Tomonaga 183

94. Lau CC, Sun T, Ching AK, He M, Li JW, Wong AM, Co NN, Chan AW, Li PS, Lung RW, Tong JH, Lai PB, Chan HL, To KF, Chan TF, Wong N: Viral-human chimeric transcript predisposes risk to liver cancer development and progression. Cancer Cell 2014, 25:335-349. 95. Hashimoto K, Suzuki AM, Dos Santos A, Desterke C, Collino A, Ghisletti S, Braun E, Bonetti A, Fort A, Qin XY, Radaelli E, Kaczkowski B, Forrest AR, Kojima S, Samuel D, Natoli G, Buendia MA, Faivre J, Carninci P: CAGE profiling of ncRNAs in hepatocellular carcinoma reveals widespread activation of retroviral LTR promoters in virus-induced tumors. Genome Res 2015, 25:1812-1824. 96. Orgel LE, Crick FH: Selfish DNA: the ultimate parasite. Nature 1980, 284:604-607. 97. Raoult D: The post-Darwinist rhizome of life. Lancet 2010, 375:104-105. 98. Bordenstein SR, Theis KR: Host biology in light of the microbiome: ten principles of holobionts and hologenomes. PLoS Biol 2015, 13:e1002226. 99. Colson P, Ravaux I, Tamalet C, Glazunova O, Baptiste E, Chabriere E, Wiedemann A, Lacabaratz C, Chefrour M, Picard C, Stein A, Levy Y, Raoult D: HIV infection en route to endogenization: two cases. Clin Microbiol Infect 2014, 20:1280-1288. 100. Morazzani EM, Wiley MR, Murreddu MG, Adelman ZN, Myles KM: Production of virus-derived ping-pong-dependent piRNA-like small RNAs in the mosquito soma. PLoS Pathog 2012, 8:e1002470. 101. Yang L, Guell M, Niu D, George H, Lesha E, Grishin D, Aach J,  Shrock E, Xu W, Poci J, Cortazio R, Wilkinson RA, Fishman JA, Church G: Genome-wide inactivation of porcine endogenous retroviruses (PERVs). Science 2015, 350:1101-1104. Demonstrates that CRISPR/Cas systems can be used to knockout entire classes of retroelements, in this case, one with 62 individual copies, within the genome of a cell line.

www.sciencedirect.com

102. O’Donnell KA, An W, Schrum CT, Wheelan SJ, Boeke JD: Controlled insertional mutagenesis using a LINE-1 (ORFeus)  gene-trap mouse model. Proc Natl Acad Sci U S A 2013, 110:E2706-E2713. This manuscript describes some phenotypic consequences of introducing an codon-optimized and transpositionally hyperactive LINE-1 element into the mouse genome. 103. Richardson SR, Morell S, Faulkner GJ: L1 retrotransposons and somatic mosaicism in the brain. Annu Rev Genet 2014, 48:1-27. 104. Erickson IK, Cantrell MA, Scott L, Wichman HA: Retrofitting the genome: L1 extinction follows endogenous retroviral expansion in a group of muroid rodents. J Virol 2011, 85:12315-12323. 105. Horie M, Kobayashi Y, Suzuki Y, Tomonaga K: Comprehensive analysis of endogenous bornavirus-like elements in eukaryote genomes. Philos Trans R Soc Lond B Biol Sci 2013, 368:20120499. 106. Upton KR, Gerhardt DJ, Jesuadian JS, Richardson SR, SanchezLuque FJ, Bodea GO, Ewing AD, Salvador-Palomeque C, van der Knaap MS, Brennan PM, Vanderver A, Faulkner GJ: Ubiquitous l1 mosaicism in hippocampal neurons. Cell 2015, 161:228-239. 107. Crow YJ: Aicardi-Goutieres syndrome. Handb Clin Neurol 2013, 113:1629-1635. 108. Crow MK, Olferiev M, Kirou KA: Targeting of type I interferon in systemic autoimmune diseases. Transl Res 2015, 165:296-305. 109. Dougan M, Dranoff G: Immune therapy for cancer. Annu Rev Immunol 2009, 27:83-117. 110. Tanne A, Muniz LR, Puzio-Kuter A, Leonova KI, Gudkov AV, Ting DT, Monasson R, Cocco S, Levine AJ, Bhardwaj N, Greenbaum BD: Distinguishing the immunostimulatory properties of noncoding RNAs expressed in cancer cells. Proc Natl Acad Sci U S A 2015, 112:15154-15159.

Current Opinion in Microbiology 2016, 31:176–183