Accepted Manuscript Title: Phylogenomic Proximity and Metabolic Discrepancy of Methanosarcina mazei Go1 across Methanosarcinal Genomes Authors: M. Bharathi, P. Chellapandi PII: DOI: Reference:
S0303-2647(16)30231-3 http://dx.doi.org/doi:10.1016/j.biosystems.2017.03.002 BIO 3733
To appear in:
BioSystems
Received date: Revised date: Accepted date:
17-9-2016 15-3-2017 20-3-2017
Please cite this article as: Bharathi, M., Chellapandi, P., Phylogenomic Proximity and Metabolic Discrepancy of Methanosarcina mazei Go1 across Methanosarcinal Genomes.BioSystems http://dx.doi.org/10.1016/j.biosystems.2017.03.002 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Phylogenomic Proximity and Metabolic Discrepancy of Methanosarcina mazei Go1 across Methanosarcinal Genomes
Running title: Phylogenomic proximity...............methanosarcinal genomes
Bharathi M and Chellapandi P*
Molecular Systems Engineering Lab, Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli-620 024, Tamil Nadu, India Tel: +91-431-2407071 Fax:+91-431-2407045 Email:
[email protected] *Corresponding author Highlights
Current genomic data of Methanosarcina mazei Go1 provide an important of its phylogenomic origin for the evolution of metabolic core. This study speculates its metabolic similarity and dissimilarity across the methanosarcinal genomes. Comparative metabolic data discovered of four novel putative metabolic pathways in this genome.
Abstract Methanosarcina mazei Go1 is a heterotrophic methanogenic archaean contributing a significant role in global methane cycling and biomethanation process. Phylogenomic relatedness and metabolic discrepancy of this genome were described herein by comparing its whole genome sequence, intergenomic distance, genome function, synteny homologs and origin of replication, and marker genes with very closely related genomes, Methanosarcina acetivorans and Methanosarcina barkeri. Phylogenomic analysis of this study revealed that genome functional feature and metabolic core of M. mazei and M. barkeri could be originated from M. acetivorans. The metabolic core of these genomes shares a common evolutionary origin to perform the metabolic activity at different environmental niches. Genome expansion, dynamics and gene collinearity were constrained and restrained the conservation of the metabolic core genes by duplication events occurring across methanosarcinal genomes. The Darwinian positive selection was an evolutionary constraint to purify the function of core metabolic genes. Using genome-wide metabolic survey, we found the existence of four novel putative metabolic pathways such as complete methanogenesis from acetate, indole-3-acetate biosynthesis V, 4-aminobutyrate degradation III, galactosamine biosynthesis I and siroheme biosynthesis. Overall, the present study would provide a stand point to revisit its phylogenomic status in order to understand the origin and evolution history of this organism. 1
Keywords: Methanosarcina; Methane; Genome evolution; Methanogenesis; Biogas; Metabolomics
1. Introduction Methanogenic archaea play an energetic role in the global carbon cycle by producing methane with the measured amount of one billion metric tons (Thauer et al. 2008; McInerney et al. 2009). Methanosarcina mazei Go1 (MMA) is a marine methanogenic archaean belonging to the Methanosarcina genus. It is able to use all methanogenic substrates in an anaerobic environment. It has diverse physiological functions capable of converting methanogenic substrates or reducing CO2 to methane, as revealed by global proteomic and DNA microarray experiments (Hovey et al. 2005; Li et al. 2005; Lie et al. 2005). This organism creates a favorable thermodynamic condition by scavenging H2 and keeping its partial pressure low, leading to allow the growth of sulfate reducing bacteria (Li et al. 2005; Stolyar et al. 2007). Genetic, metabolic process and biochemistry of many Methanosarcina species have been studied extensively in recent years (Chellapandi 2011a; 2011b; 2013). A huge quantity of available high-throughput data and genome sequences of methanogenic archaea has been provided the systems-level understanding of its growth physiology and metabolism (Chellapandi 2010). The genome sequence of MMA was completed and revealed the lateral gene transfer event acted as a major evolutionary constraint for its genomic expansion (Deppenmeier et al. 2002). Extensive gene rearrangements and a high degree of conservation were compared to describe its mechanism of the structural modification and functional organization across methanosarcinal genomes (Maeder et al. 2006). Intergenomic evolution across methanogenic archaea was influenced on the genomic distribution of haloarchaeal rRNA genes, tryptophan synthase and archaeal conjugative plasmids (Boucher et al. 2004; Basta et al. 2009; Lane 2014). Salt adaptation, osmoregulatory network and glycine betaine transporters of MMA were genetically identified by genome-wide gene expression profiling (Pflüger et al. 2007; Spanheimer and Müller, 2008; Saum et al. 2009a; Saum et al. 2009b). Metabolic constituents of the cell and methanogenic growth capabilities were computationally reconciled from genome-scale metabolic models of M. barkeri (MBA) iAF692 (Feist et al. 2006), iMG746 (Gonnerman et al. 2013), M. acetivorans (MAC) iVS941 (Kumar et al. 2011), iMB745 (Benedict et al. 2012), iMAC868 (Nazem‑Bokaee et al. 2016) and Methanococcus maripaludis iMM518 (Goyal et al. 2014). Inference of intergenomic distance, and whole genome phylogeny are used as general computational parameters for studying the gene duplication event and the gene distribution of closely related genomes (Maeder et al. 2006, Gao 2014). Phylogenomic studies are also employed to estimate the overall similarity and dissimilarity between the genomes (Wei et al. 2002; Auch et al. 2010). Phylogenetic markers such as 5s rRNA, 16s rRNA, 23s rRNA and SRP (Signal Recognition Particle) are used to deduce the genetic and biochemical systematics of methanogenic archaea in order to correct many major clades in the tree (Lai et al. 1999; Stantscheff et al. 2014; Petitjean et al. 2015). Secondary structure-based on rRNA such as ykok, group II intron and RNaseP sequences were also used for accurate prediction of 2
the phylogenetic status of prokaryotic genomes (Juana and Wilsona 1999). Methyl-CoM reductase A (mcrA) gene was served as a functional marker for the taxonomic classification of all methanogenic archaea (Ufnar et al. 2007; Wrede et al. 2013). Comparison of genomic features is a secondary layer of biological description, which affords a significant and effective flowchart for the metabolic pathways among methanogenic archaea (Maeder et al. 2006; Zhang et al. 2009). We have undertaken such approach to explain or to resolve the genomic proximity and metabolic discrepancy of MMA from its phylogenetic neighbors such as MAC and MBA. We inferred how intergenomic evolutionary constraints attribute its genome dynamics and metabolic adjustment across methanosarcinal genomes. Using this approach, genes are analyzed case-by-case to make genome annotation in a more complete and significant way. 2. Materials and Methods 2.1. Genome-scale tree construction The complete genome sequence of MMA was retrieved from the National Center for Biotechnology Information (NCBI) -FTP site for finding its current phylogenomic status and evolutionary relatedness among all methanogenic archaea (http://www.ncbi.nlm.nih.gov/). A whole-genome based tree was constructed by a composition vector algorithm (K-mer length = 6) using CVtree3 server (Xu and Hao 2009). Each organism is represented by a composition vector made of K-mer length obtained from a given genome. To suppress the effect of neutral mutations and to highlight the shaping role of natural selection, a subtraction procedure based on (K−6)-th order Markov prediction is introduced in the CVtree method. The collection of distances for all species pairs comprises a dissimilarity matrix, which has been calculated by using the neighbor-joining (NJ) algorithm (Saitou and Nei 1987) to construct phylogenomic trees. In the CVtree, time-consuming statistical re-sampling tests such as Jackknife and bootstrap tests can be neglected as it is an alignment-free and parameter-free method. 2.2. Analysis of genome synteny Syntenic regions in the genome of MMA were identified by comparing with those found in MAC and MBA using CoGe:SynMap (Lyons et al. 2008). Nucleotide-nucleotide search between two genomes was performed with a LastZ algorithm (http://last.cbrc.jp). A collinear set of genes was identified and merged by using interactive DAGChainer (Haas et al. 2004). False positive synteny was removed by Quota Align algorithm and then merged all refined syntenic blocks (Tang et al. 2011). Synonymous and non-synonymous substitutions between syntenic gene pairs were computed by PAML package using CodeML algorithm. Synteny blocks and evolutionary events resulted due to synonymous substitution were represented as syntenic histogram (Yang 2007). 2.3. Analysis of intergenomic distance Genomic variations in the functional genes in the MMA were detected by measuring Pearson correlation coefficients for a cluster of orthologs profiling of its closely related genomes (Benesty et al. 2009). Pearson correlation coefficients determine the intergenomic relatedness of selected genomes. Intergenomic distances were inferred between MMA and its phylogenetic neighbors using Genome-to-Genome Distance Calculator (Meier-Kolthoff et al. 2013). The genomic compositional dissimilarity between MMA and its phylogenetic 3
neighbors was measured by Bray-Curtis dissimilarity coefficient at a non-metric multidimensional scale (Bray and Curtis 1957). 2.4. Analysis of origin of replication (oriC) We detected oriCs and genes encoding for DNA replication proteins from the genomes of MMA, MAC and MBA by Ori Finder 2.0 (Luo et al., 2014). The common ORB (the origin recognition boxes), a specific motif within oriC in archaeal genome, was used to predict the sequence of oriCs across these genomes. The predicted oriCs and DNA replication protein sequences were aligned separately by ClustalX 2.0 software (Thompson et al. 1997) for detecting a sequence-specific conservation pattern across these genomes. 2.5. Analysis of molecular markers We retrieved nucleotide sequences of phylogenetic markers (5s, 16s and 23s rRNAs) and amino acid sequences of functional markers (mcrA and SRP) from Kyoto Encyclopedia of Genes and Genomes (KEGG) database (http://www.genome.jp/kegg/). Sequence similarity hits for these sequences were obtained by the BLAST tool by searching against KEGG database (Altschul et al. 1997). Multiple sequence alignments were carried out separately by ClustalX 2.0 software (Thompson et al. 1997). After complete alignment, the aligned sequences were edited manually to remove improperly aligned sequences and then phylogenetic trees constructed with MEGA 5.0 software (Tamura et al. 2011). Kimura-2parameter was chosen as an evolutionary model with 1000 bootstrap replicates and 5 Gamma distribution rates (Lukacs, 1955). Segregation sites, recombination/scalable-mutation rate, nucleotide diversity were estimated for each marker across different species of methanogenic archaea. Tajima's D test performed to determine genetic drift and genetic diversity among methanogenic archaea (Tajima 1989). All of these evolutionary genetic analyses were conducted by MEGA software. The free energy of rRNA secondary structure was calculated by MFold server at the folding temperature of 37oC with 1M NaCl ionic condition (Zuker et al. 2003). Homology models for mcrA were generated by SWISS-MODEL based a structural template (pdb id: 1E6Y: A) (Biasini et al. 2014). The reliability of modelled structures was determined with template by means of QMEAN Z-Score and sequence identity. Structural alignment and superimposition of each model were performed with 3D-SS server to calculate the disparity in the structural motifs (Sumathi et al. 2006). Superimposed structure was visualized in the UCSF Chimera 1.10 software (Pettersen et al. 2004). 2.6. Analysis of metabolic features We have carried out comparative metabolomic analysis to discover unique genes and to novel putative metabolic pathways based on the metabolic relatedness with MBA and MAC. Metabolic information for each genome was collected from the MetaCyc database (Caspi et al. 2014) and individual metabolic features, genes, enzymes and pathways compared manually. Enzymes involved in the central metabolism and methanogenesis were searched for MMA and then compared its metabolic feature with MBA and MAC. Novel putative metabolic pathways were identified by merging common genes, and enzyme shared across methanosarcinal genomes. 3. Results 3.1. Whole genome phylogeny 4
A whole genome phylogenetic tree was constructed with 39 completed genome sequences as represented in Fig 1. Methanobacterium SWAN1 and AL21 are found as out-group organisms for the taxonomic classification of the MMA genome in the tree. There are two major clusters formed separately with monophyletic groups of different physiological species. In the first cluster, many Methanococcus genomes are grouped with Methanothermococcus Methanobacterium (Methanobrevibacter smithi and Methanobrevibacter ruminantium). The genera of Methanosarcina, Methanothermobacter and Methanosaeta are placed in the second cluster. A phylogenetic proximity is observed between MMA and MAC and then with MBA, which are further grouped with Methanoregula formicicum, Methanothermobacter marburgensis and Methanothermobacter thermoautotrophicus. 3.2. Intergenomic distance The results of the genome correlation coefficient analysis indicate that MMA genome is closely related with MAC (0.80) and MBA (0.83) and all three of them are shared their genomic function with distantly related methanogenic archaea (0.67-0.56) (Table 1). Similarly, intergenomic distance analysis demonstrates that MMA genome is very close to the MAC genome (0.6), but it was slightly far away from MBA (0.7), and distantly related to other methanogenic archaea. The rank order correlation of a dissimilarity matrix describes that MMA is highly correlated with MAC and followed by MBA. Non-methanogenic archaea have shown high rank order dissimilarities to the MMA while comparing with other methanogenic archaea (Supplementary Fig. S1). 3.3. Genome synteny of homologs Genome synteny analysis shows that whole genome duplication events occurred between the MMA and MBA is higher than those events occurred between MMA and MAC (Fig 2). Syntenic orthologs in the MMA are originated from MBA, which was relatively more than those originated from MAC. It may be resulted from low rate of synonymous substitutions. A collinearity of either gene orders or gene families may be adjusted by syntenic blocks at the location of syntenic out-paralogs. 3.4. Origin of replication In the genome of MMA, we noted two different oriC sites in the MMA genome as represented in Fig 3. The first oriC region is located in upstream of cdc6 gene at the genomic position of 1564657-1566241 bp (GC cont. 26.50%). The second oriC region is located in upstream of AAA+ ATPase gene that is required for initiation of DNA replication. Both oriC regions identified in the MMA are similar to those found in the MAC, but oriC regions in the MBA are located at different positions. The ORB identified from MMA has shown 95% similarity with MAC and MBA, indicative of evolutionary conservation of the DNA boxes across methanosarcinal genomes. 3.5. Genetic diversity of marker genes Phylogenetic and metabolic markers of this genome compared with those markers similarly found in the MAC and MBA as shown in Fig 4. Phylogenetic diversity of methanogenic archaea is associated with segregation sites in the 16s rRNA and mcrA (Supplementary Fig S2 & S3). At the low evolutionary frequency rate, segregation sites determine the conservation of marker genes across methanosarcianal genomes. SRP and fdhD of 5
Methanosarciana are highly mutable during evolutionary events. Estimates of nucleotide diveristy demonstrate its genetic polymorphism explicating the moderate diversity in patterns of metabolic markers (fdhD, hemC, ogt, glyS, coaE and ahaD). There are fewer discrepancy observed in metabolic markers (fdhD, coaE and hemC) among methanogenic archaeal lineage. Apart from phylogenetic markers, all of the markers have shown to diverge functionally by imposing Darwinian positive selection. 3.6. Metabolic diversity Genome-wide metabolic mining of our study points out that MMA genome covers 123 pathways consisting of 670 metabolic reactions and 134 pathway holes. Totally, 243 genes and 137 enzymes are shared across methanosarcinal genomes and 97 genes are unique to the MMA. A total of 123 genes is shared between MMA and MAC compared to MBA as shown in Fig 5. Similarly, an increased number of enzymes are shared between MMA and MAC. Genes involved in central metabolism are common in all methanosarcinal genomes. Nevertheless, MMA consists of 3 unique genes involved in methanogenesis, suggesting the existence of a complete system for methane biosynthesis. 3.7. Discovery of novel putative metabolic pathways Genes coding for carbon monoxide dehydrogenase (EC 1.2.7.4), corrinoid/iron-sulfur protein Co-methyltransferase (EC 2.1.1.245) and co-methylating acetyl-CoA synthase (EC 2.3.1.169; α-subunit: MM_0358; β-subunit: MM_0493; acetyl CoA synthase: MM_3180) are exclusively found in the MMA, which are not present in MAC and MBA. All of these enzymes are involved in hydrogenotrophic methanogenesis. Indole-3-acetate biosynthesis V pathway is a novel putative metabolic pathway identified from MMA that can be mediated by gene MM_1229. Similarly, MM_0047, MM_0838 and MM_2341 are unique genes involved in the 4-aminobutyrate degradation. MMA also exclusively consists of unique genes MM_1162 and MM_1740 to be involved in galactosamine biosynthesis I and siroheme biosynthesis, respectively (Fig 6). 4. Discussion Our comparative genomic study was aimed to address its phylogenomic relationship and metabolic discrepancy from closely related methanosarcinal genomes (MAC and MBA). Similarly, comparative genomic analysis was used to infer its evolutionary dynamics of nickel, cobalt and vitamin B12 assimilation systems (Wei et al. 2002; Zhang et al. 2009). The results of phylogenetic analysis indicated that methanosarcinal genomes are closely related together and their genomic features and some of the metabolic core genes are shared with the genome of halophilic archaea in accordance to the previous work (Chellapandi et al. 2009). It suggested that MMA has a typical molecular system for salt adaptation and osmotolerance was found in the MMA, as evidenced earlier by gene expression profiling (Pflüger et al. 2007; Spanheimer and Müller, 2008; Spanheimer et al. 2008; Saum et al. 2009a; 2009b). MMA strain N2M9705 was isolated from aquaculture fishpond, which has shown the methylotrophic growth capability (Lai et al. 1999). We found three unique genes coding for enzymes (EC 1.2.7.4; 2.1.1.245; 2.3.1.169) involved in the hydrogenotrophic methanogenesis. Therefore, our study provides additional support for the existence of a typical system for methylotrophic capability. 6
Analysis of intergenomic distances stated that some of its metabolic features may be originated from distantly related genomes rather than methanosarcinal genomes, as suggested by the earlier hypothesis of molecular evolution of methanogens (Chellapandi 2010). Synonymous substitution is a major evolutionary event for the duplication of the syntenic orthologs and syntenic out-paralogs, which might be occurred by either -event or -event. The majority of duplicated genes in MMA was arisen from MBA through MAC. Such duplicated genes were major cause-effect in its genome complexity and species diversity through mutation and genomic drift (Zhu et al. 2013). Consequently, its metabolic function may be restrained to the new environmental niche. Gene products in the chromosomal oriC region of MMA are approximately 93-95% identical across the methanosarcinal species (Deppenmeier et al. 2002; Maeder et al. 2006). Several origin recognition boxes are flanked by oriC region, which have shown homology towards hypothetical protein MA_4646 from MAC and hypothetical protein Mbar_A2231 from MBA. It implied that replication initiation gene products (cdc6 and AAA+ ATPase) in the oriC regions share a common archaeal ancestor for all methanosarcinal species, but the genomic location of oriC and sequences are evolutionarily conserved within species. Very often, its genome seems to be expanded due to the copy number of rRNA genes and exclusive existence of ykok and group II intron. Ribosomal RNA and mcrA genes are well known molecular markers for the phylogenetic classification of methanogenic archaea and to study the evolutionary relationships (Zhang et al. 2009; Chellapandi et al. 2009; Stantscheff et al. 2014; Petitjean et al. 2015). Many metabolic marker genes are highly conserved across methanogenic archaea, but at the structural level, each marker was faintly varied within species. It may be resulted due to the Darwinian positive selection acting on metabolic markers or gene shuffling/gene duplication events. Horizontal gene transfer (HGT) plays a major role in the evolution of Methanosarcina genomes. The largest set of metabolic pathways and the large genome size in Methanosarcina genus are caused by massive HGT from bacteria (Maeder et al. 2006; Chellapandi, 2011a, 2011b, 2013). About 5% of genes in Methanosarcina are horizontally transferred from bacterial origin. About 9.4% of metabolic enzymes and 11.5% of transporters in MMA are horizontally transferred from bacteria, but the regulation of these genes is unknown (Garushyants et al. 2015). The distribution of enzymes and transporters among horizontally transferred genes was slightly varied from MMA to MBA; MMA to MAC. Hence, intergenomic evolution is a major constraint to determine its metabolic function from ancestral species by massive HGT from bacterial origins. Complete methanogenesis from acetate, indole-3-acetate biosynthesis V, 4aminobutyrate degradation III, galactosamine biosynthesis I and siroheme biosynthesis are novel putative metabolic pathways disclosed from this genome (Supplementary Fig. S4). This organism can adapt to utilize different methanogenic substrates using substrate-dependent gene expression system (Hovey et al. 2005). It is capable of utilizing acetate as a sole carbon source for its growth and methanogenesis. As MMA contains acetyl-CoA decarbonylase/synthase with all five subunits, it may able to grow on carbon monoxide as similar to those found in MMA (Matschiavelli et al. 2012), MBA (Hutten et al. 1980) and autotrophic methanogenic archaea (Chellapandi, 2011a, 2011b, 2013; Cheng et al. 2011; 7
Goyal et al. 2014). Qualitative analysis of CO dehydrogenase/acetyl coenzyme A synthase multienzyme complex (CODH/ACS)-encoding transcripts of MMA supported that CODH/ACS isoform catalyzed both CO oxidation/CO2 reduction and cleavage/synthesis of acetyl-CoA (Eggen et al. 1996). Moreover, indole-3-acetate produced from MMA via indole3-acetate biosynthesis V may indirectly support the growth of plants as a phyto-regulator. Similar pathway was reported in the plant-associated fungi such as Taphrina wiesneri, T. deformans and T. pruni (Yamada et al. 1990) and an opportunistic vertebrate pathogen, Alcaligenes faecalis (Kobayashi et al. 1993). 4-Aminobutyrate is a four-carbon non-protein amino acid that may be derived as a product of plant, animal tissue decay and bacteria. It is produced as an intermediate during bacterial degradation of putrescin. MMA consists of three unique gene coding enzymes involved in the 4-aminobutyrate degradation III as similar to Saccharomyces cerevisiae (Ramos et al. 1985) and Ralstonia eutropha H16 (Mayer and Cook 2009). Acetylornithine aminotransferase from MMA is closely related to validamycin aminotransferase from Streptomyces hygroscopicus 5008 and the putative 4-aminobutyrate aminotransferase from Deinococcus radiodurans (Bai et al. 2006). UDP-N-Acetyl-D-galactosamine is the amino sugar nucleotide donor of N-acetyl-Dgalactosamine residues for the biosynthesis of a component of several different kinds of cell surface structures and glycosylated proteins (Milewski et al. 2006). It is produced from UDPN-acetyl-D-glucosamine by action of UDP-N-acetyl-D-glucosamine 4-epimerase (EC 5.1.3.7) in the UDP-N-acetyl-D-galactosamine biosynthesis I. This enzyme encoded gene MM_1162 may be useful as a distinct cell wall marker for identification of closely related methanosarcinal species as it is not found in MAC and MBA. Jain et al (2014)., described the stability and robustness of archaeal membrane ether lipids and evolutionary aspects of the membrane composition of the last universal common ancestor. MMA uses siroheme, an iron-containing isobacteriochlorin, for heme b biosynthesis found in many archaea and sulfate reducing bacteria (Bali et al. 2014). Siroheme synthase is a single trifunctional enzyme exclusively found in the MMA, which catalyzes all four reactions in siroheme biosynthesis. In vitro enzyme activity assay demonstrated its functional characterization for the conversion of iron-coproporphyrin III into heme during the alternative heme biosynthesis pathway in MBA (Kühner et al. 2014). Nevertheless, the late heme genes are missing from the genomes of MMA, MBA and MAC (Deppenmeier et al. 2002; Galagan et al. 2008). The core genes involved in the biosynthetic pathways for methane (Bapteste et al. 2005), acetamido sugar (Namboori and Graham 2008) and N-linked glycoprotein (Larkin et al. 2013) have shown to share a common evolutionary origin in the euryarchaeal lineage. 5. Conclusions Comparative genomic analysis is useful for better understanding of its phylogenomic relatedness and metabolome complexity across methanosarcinal genomes. A strong genomic and metabolic proximity is noted between MMA and MAC at the molecular level. Core metabolic genes of MMA and MBA might be originated from MAC and shared within methanosarcinal genomes at a slow evolution rate. Current phylogenetic and functional markers would discriminate its recent ancestral status in the tree. Comparative metabolic 8
analysis disclosed the four novel putative metabolic pathways, including complete methanogenesis from acetate, indole-3-acetate biosynthesis V, 4-aminobutyrate degradation III, galactosamine biosynthesis I and siroheme biosynthesis in this genome. As the results, we revealed its methylotorphic capability, phyto-regulatory activity, unique cell wall composition (UDP-N-acetyl-D-galactosamine) and siroheme biosynthesis. Quantitative transcriptome and proteome data would provide a warrent for studying the functional evolution of its genome and to infer molecular regulatory networks to be conserved within methanosarcial genomes. Hence, our approach would provide a hint to discover novel putative metabolic pathways for its genome refinement and genome-scale metabolic reconstruction in the future. Acknowledgement The corresponding author is thankful to the University Grants Commission, New Delhi, India, for financial assistance (42-864/2013 (SR)) to carry out the work. Conflict of interest The authors confirm that this article content has no conflict of interest. References [1]. Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., et al. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402. doi: 10.1093/nar/25.17.3389. PMID: 9254694. [2]. Auch, A.F., Auch, A.F., Klenk, H.P., and Göker M. 2010. Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs. Stand Genomic Sci. 2:142–148. doi: 10.4056/sigs.541628. PMID: 21304686. [3]. Bai, L., Li, L., Xu, H., Minagawa, K., Yu, Y., Zhang, Y., Zhou, X., Floss, H.G., Mahmud, T., Deng, Z. 2006. Functional analysis of the validamycin biosynthetic gene cluster and engineered production of validoxylamine A. Chem Biol. 13:387-397. PMID: 16632251. [4]. Bali, S., Rollauer, S., Roversi, P., Raux-Deery, E., Lea, S.M., Warren, M.J., et al. 2014 Identification and characterization of the 'missing' terminal enzyme for siroheme biosynthesis in α-proteobacteria. Mol Microbiol. 92:153-163. doi: 10.1111/mmi.12542. PMID: 24673795. [5]. Bapteste, E., Brochier, C., and Boucher, Y. 2005. Higher-level classification of the Archaea: evolution of methanogenesis and methanogenic archaea. Archaea. 1:353-363. PMID: 15876569. [6]. Basta, T., Smyth, J., Forterre, P., Prangishvili, D., and Peng, X. 2009. Novel archaeal plasmid pAH1 and its interactions with the lipothrixvirus AFV1. Mol Microbiol. 71:23– 34. doi: 10.1111/j.1365-2958.2008.06488.x. PMID: 19007417. [7]. Benedict, M.N., Gonnerman, M.C., Metcalf, W.W., and Price, N.D. 2012. Genomescale metabolic reconstruction and hypothesis testing in the methanogenic archaeon Methanosarcina acetivorans C2A. J Bacteriol. 194:855–865. doi: 10.1128/JB.0604011. PMID: 22139506.
9
[8]. Biasini, M., Bienert, S., Waterhouse, A., Arnold, K., Studer, G., Schmidt, T., et al. 2014. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 42:W252-258. doi: 10.1093/nar/gku340. PMID: 24782522. [9]. Boucher,Y., Douady, C.J., Sharma, A.K., Kamekura, M., and Doolittle, W.F. 2004. Intragenomic heterogeneity and intragenomic recombination among haloarchaeal rRNA genes. J Bacteriol. 186:3980–3990. doi: 10.1128/JB.186.12.3980-3990.2004. PMID: 15175312. [10]. Bray, J.R., and Curtis, J.T. 1957. An ordination of the upland forest communities of Southern Wisconsin. Ecol Monograph. 27:325-349. doi: 10.2307/1942268. [11]. Caspi, R., Altman, T., Billington, R., Dreher, K., Foerster, H., Fulcher, C.A., et al. 2014. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases, Nucleic Acids Res. 42:D459-D471. doi: 10.1093/nar/gkt1103. PMID: 24225315. [12]. Chellapandi, P. 2011a. In silico description of cobalt and nickel assimilation systems in the genomes of methanogenic archaea. Syst Synth Biol. 5:105–114. doi: 10.1007/s11693-011-9087-2. PMID: 23205154. [13]. Chellapandi, P. 2011b. Molecular evolution of methanogenic archaea based on their metabolic facets. Front Biol. 6:490-503. doi: 10.1007/s11515-011-1154-2. [14]. Chellapandi, P. 2013. Computational studies on enzyme-substrate complexes of methanogenesis for revealing their substrate binding affinities to direct the reverse reactions. Prot. Pept. Lett. 20: 265-278. doi: 10.2174/0929866511320030005. PMID: 22591469. [15]. Chen, J., Huang, Y., Cohen, I. 2009. Pearson correlation coefficient. In Noise reduction in speech processing (pp. 1-4). Springer Berlin Heidelberg. doi: 10.1007/978-3-64200296-0_5. [16]. Cheng L, Dai L, Li X, Zhang H, Lu Y (2011) Isolation and characterization of Methanothermobacter crinale sp. nov., a novel hydrogenotrophic methanogen from the Shengli oil field. Appl Environ Microbiol 77: 5212-5219. [17]. Deppenmeier, U., Johann, A., Hartsch, T., Merkl, R., Schmitz, R.A., Martinez-Arias, R., et al. 2002. The genome of Methanosarcina mazei: evidence for lateral gene transfer between bacteria and archaea. J Mol Microbiol Biotechnol. 4:453-461. PMID: 12125824. [18]. Eggen, R.I., van Kranenburg, R., Vriesema, A.J., Geerling, A.C., Verhagen, M.F., Hagen, W.R., de Vos, W.M. 1996. Carbon monoxide dehydrogenase from Methanosarcina frisia Gö1. Characterization of the enzyme and the regulated expression of two operon-like cdh gene clusters. J Biol Chem. 271:14256-14263. PMID: 8662887. [19]. Feist, A.M., Scholten, J.C., Palsson, B.Ø., Brockman, F.J., and Ideker,T. 2006. Modeling methanogenesis with a genome-scale metabolic reconstruction of Methanosarcina barkeri. Mol Syst Biol. 2:2006.0004. doi: 10.1038/msb4100046. PMID: 16738551. 10
[20]. Galagan, J.E., Nusbaum, C., Roy, A., Endrizzi, M.G., Macdonald, P., FitzHugh, W., Calvo, S., Engels, R., Smirnov, S., Atnoor, D., Brown, A., Allen, N., Naylor, J., StangeThomann, N., DeArellano, K., Johnson, R., Linton, L., McEwan, P., McKernan, K., Talamas, J., Tirrell, A., Ye, W., Zimmer, A., Barber, R.D., Cann, I., Graham, D.E., Grahame, D.A., Guss, A.M., Hedderich, R., Ingram-Smith, C., Kuettner, H.C., Krzycki, J.A., Leigh, J.A., Li, W., Liu, J., Mukhopadhyay, B., Reeve, J.N., Smith, K., Springer, T.A., Umayam, L.A., White, O., White, R.H., Conway de Macario, E., Ferry, J.G., Jarrell, K.F., Jing, H., Macario, A.J., Paulsen, I., Pritchett, M., Sowers, K.R., Swanson, R.V., Zinder, S.H., Lander, E., Metcalf, W.W., Birren, B. 2002. The genome of M. acetivorans reveals extensive metabolic and physiological diversity. Genome Res. 12:532-542. PMID: 11932238. [21]. Gao, F. 2014. Recent advances in the identification of replication origins based on the Z-curve method. Curr Genomics. 15:104–112. Ao. doi: 10.2174/1389202915999140328162938. PMID: 24822028. [22]. Garushyants, S.K., Kazanov, M.D., Gelfand, M.S. 2015. Horizontal gene transfer and genome evolution in Methanosarcina. BMC Evol Biol. 15:102. PMID: 26044078. [23]. Gonnerman, M.C., Benedict, M.N., Feist, A.M., Metcalf, W.W., Price, N.D. 2013. Genomically and biochemically accurate metabolic reconstruction of Methanosarcina barkeri Fusaro, iMG746. Biotechnol J. 8:1070-1079. doi: 10.1002/biot.201200266. PMID: 23420771. [24]. Goyal N, Widiastuti H, Karimi IA, Zhoub Z (2014) A genome-scale metabolic model of Methanococcus maripaludis S2 for CO2 capture and conversion to methane. Mol BioSyst 10:1043. [25]. Goyal, N., Widiastuti, H., Karimi, I.A., Zhou, Z. 2014. A genome-scale metabolic model of Methanococcus maripaludis S2 for CO2 capture and conversion to methane. Mol BioSyst. 10:1043. doi: 10.1039/c3mb70421a. PMID: 24553424. [26]. Guo, F.B., Ou, H.Y., and Zhang, C.T. 2003. ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes. Nucleic Acids Res. 3:1780-9. doi: 10.1093/nar/gkg254. PMID: 12626720. [27]. Haas, B.J., Delcher, A.L., Wortman, J.R., Salzberg, S.L. 2004. DAGchainer: a tool for mining segmental genome duplications and synteny. Bioinformatics. 20:3643–3646. doi:10.1093/bioinformatics/bth397. PMID: 15247098. [28]. Hovey, R., Lentes, S., Ehrenreich, A., Salmon, K., Saba, K., Gottschalk, G., et al. 2005. DNA microarray analysis of Methanosarcina mazei Gö1 reveals adaptation to different methanogenic substrates. Mol Genet Genomics. 273:225-239. 10.1007/s00438-0051126-9. PMID: 15902489. [29]. Hutten, T.J., Bongaerts, H.C., van der Drift, C., Vogels, G.D. 1980. Acetate, methanol and carbon dioxide as substrates for growth of Methanosarcina barkeri. Antonie Van Leeuwenhoek. 46:601-610. PMID: 6786216. [30]. Juana, V. and Wilsona, C. 1999. RNA secondary structure prediction based on free energy and phylogenetic analysis. J Mol Biol. 289:935–947. doi:10.1006/jmbi.1999.2801. PMID: 10369773. 11
[31]. Kobayashi, M., Izui, H., Nagasawa, T., and Yamada, H. 1993. Nitrilase in biosynthesis of the plant hormone indole-3-acetic acid from indole-3-acetonitrile: cloning of the Alcaligenes gene and site-directed mutagenesis of cysteine residues. Proc Natl Acad Sci USA. 90:247-251. PMID: 8419930. [32]. Lai, M.C., Shu, C.M., Chiou, M.S., Hong, T.Y, Chuang, M.J., Hua, J.J. 1999. Characterization of Methanosarcina mazei N2M9705 isolated from an aquaculture fishpond. Curr Microbiol. 39:79-84. PMID: 10398831. [33]. Lane, N. 2014. Bioenergetic Constraints on the Evolution of Complex Life. Cold Spring Harb Perspect Biol. 2014:6:a015982. doi: 10.1101/cshperspect.a015982. PMID: 24789818. [34]. Larkin, A., Chang, M.M., Whitworth, G.E., and Imperiali, B. 2013. Biochemical evidence for an alternate pathway in N-linked glycoprotein biosynthesis. Nat Chem Biol. 9:367-373. doi: 10.1038/nchembio.1249. PMID: 23624439. [35]. Li, Q., Li, L.; Rejtar, T.; Karger, B.L.;Ferry, J.G. The proteome of Methanosarcina acetivorans. Part II, comparison of protein levels in acetate- and methanol-grown cells. J. Proteome Res., 2005, 4, 129–136. [36]. Lie, T.J.; Wood, G.E.; Leigh, J.A. Regulation of nif expression in Methanococcus maripaludis: roles of the euryarchaeal repressor NrpR, 2-oxoglutarate, and two operators. J. Biol. Chem., 2005, 280, 5236–5241. [37]. Lukacs, E. 1955. A Characterization of the gamma distribution. Ann Math Stat. 26:319324. [38]. Luo, H., Zhang, C.T., and Gao, F. 2014. Ori-Finder 2, an integrated tool to predict replication origins in the archaeal genomes. Front Microbiol, 5:482. http://www.jstor.org/stable/2236887. [39]. Lyons, E., Pedersen, B., Kane, J., and Freeling M. 2008. The value of nonmodel genomes and an example using SynMap within CoGe to dissect the hexaploidy that predates rosids. Tropical Plant Biol. 1, 181-190. doi: 10.1007/s12042-008-9017-y. [40]. Maeder, D.L., Anderson, I., Brettin, T.S., Bruce, D.C., Gilna, P., Han, C.S., et al. 2006. The Methanosarcina barkeri genome: Comparative analysis with Methanosarcina acetivorans and Methanosarcina mazei reveals extensive rearrangement within Methanosarcinal Genomes. J Bacteriol. 188:7922-7931. doi: 10.1128/JB.00810-06. PMID: 16980466. [41]. Matschiavelli, N., Oelgeschläger, E., Cocchiararo, B., Finke, J., Rother, M. 1012. Function and regulation of isoforms of carbon monoxide dehydrogenase/acetyl coenzyme A synthase in Methanosarcina acetivorans. J Bacteriol. 194:5377-5387. [42]. Mayer, J., and Cook, A.M. 2009. Homotaurine metabolized to 3-sulfopropanoate in Cupriavidus necator H16: enzymes and genes in a patchwork pathway. J Bacteriol. 191:6052-8. doi: 10.1128/JB.00678-09. PMID: 19648235. [43]. McInerney, M.J., Sieber, J.R., and Gunsalus, R.P. 2009. Syntrophy in anaerobic global carbon cycles. Curr Opin Biotechnol. 20:623–632. doi: 10.1016/j.copbio.2009.10.001. PMID: 19897353.
12
[44]. Meier-Kolthoff, J.P., Auch, A.F., Klenk, H.P., and Göker, M. 2013. Genome sequencebased species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics. 14:60. doi: 10.1186/1471-2105-14-60. PMID: 23432962. [45]. Milewski, S., Gabriel, I., and Olchowy, J. 2006. Enzymes of UDP-GlcNAc biosynthesis in yeast. Yeast 23, 1-14. doi: 10.1002/yea.1337. PMID: 16408321. [46]. Namboori, S.E., and Graham, D.E. 2008. Acetamido sugar biosynthesis in the Euryarchaea. J Bacteriol. 190, 2987-2996. doi: 10.1128/JB.01970-07. PMID: 18263721. [47]. Nazem-Bokaee, H., Gopalakrishnan, S., Ferry, J,G., Wood, T.K., and Maranas, C,D. 2016. Assessing methanotrophy and carbon fixation for biofuel production by Methanosarcina acetivorans. Microb Cell Fact. 15:10. doi: 10.1186/s12934-015-04044. PMID: 26776497. [48]. Petitjean, C., Deschamps, P., López-García, P., Moreira, D., and Brochier-Armanet, C. 2015. Extending the conserved phylogenetic core of Archaea disentangles the evolution of the third domain of Life. Mol Biol Evol. Pii, msv015. doi: 10.1093/molbev/msv015. PMID: 25660375. [49]. Pettersen, E.F., Goddard, T.D., Huang, C.C., Couch, G.S., Greenblatt, D.M., Meng, E.C., et al. 2004. UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem. 25:1605-12. doi: 10.1002/jcc.20084. PMID: 15264254. [50]. Pflüger, K., Ehrenreich, A., Salmon, K., Gunsalus, R.P., Deppenmeier, U., Gottschalk, G., et al. 2007. Identification of genes involved in salt adaptation in the archaeon Methanosarcina mazei Gö1 using genome-wide gene expression profiling. FEMS Microbiol Lett. 277:79-89. Doi: http://dx.doi.org/10.1111/j.1574-6968.2007.00941.x. PMID: 17986088. [51]. Ramos, F., el Guezzar, M., Grenson, M., and Wiame, J.M. 1985. Mutations affecting the enzymes involved in the utilization of 4-aminobutyric acid as nitrogen source by the yeast Saccharomyces cerevisiae. Eur J Biochem. 149:401-404. Doi: 10.1111/j.14321033.1985.tb08939.x. PMID: 3888627. [52]. Saitou, N., and Nei, M. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 4:406-425. PMID: 3447015. [53]. Satish Kumar, V., Ferry, J.G., and Maranas, C.D. 2011. Metabolic reconstruction of the archaeon methanogen Methanos arcina acetivorans. BMC Syst Biol. 5:28. doi: 10.1186/1752-0509-5-28. PMID: 21324125. [54]. Saum, R., Mingote, A., Santos, H., and Müller, V. 2009a. A novel limb in the osmoregulatory network of Methanosarcina mazei Gö1: N(epsilon)-acetyl-beta-lysine can be substituted by glutamate and alanine. Environ Microbiol. 11:1056-1065. PMID: 19452593. [55]. Saum, R., Mingote, A., Santos, H., and Müller, V. 2009b. Genetic analysis of the role of the ABC transporter Ota and Otb in glycine betaine transport in Methanosarcina mazei Gö1. Arch Microbiol. 191:291-30. doi: 10.1007/s00203-008-0449-4. PMID: 19096827. [56]. Spanheimer, R., and Müller V. 2008. The molecular basis of salt adaptation in Methanosarcina mazei Gö1. Arch Microbiol. 190: 271-279. doi: 10.1007/s00203-0080363-9. PMID: 18379758. 13
[57]. Stantscheff, R., Kuever, J., Rabenstein, A., Seyfarth, K., Dröge, S., and König, H. 2014. Isolation and differentiation of methanogenic Archaea from mesophilic corn-fed onfarm biogas plants with special emphasis on the genus Methanobacterium. Appl Microbiol Biotechnol. 98:5719-5735. doi: 10.1007/s00253-014-5652-4. PMID: 24639207. [58]. Stolyar, S.; Van Dien, S. Hillesland, K.L.; Pinel, N.; Lie, T.J.; Leigh, J.A. and Stahl, D.A.; Metabolic modeling of a mutualistic microbial community. Mol. Syst. Biol., 2007, 3, 92. [59]. Sumathi, K., Ananthalakshmi, P., Roshan, M.N., and Sekar, K. 2006. 3dSS: 3D structural superposition, Nucleic Acids Res. 34:128-132. doi: 10.1093/nar/gkl036. PMID: 16844975. [60]. Tajima, F. 1989. Statistical methods to test for nucleotide mutation hypothesis by DNA polymorphism. Genetics. 123:585-595. PMID: 2513255. [61]. Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M., and Kumar, S. 2011. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 28:27312739. doi: 10.1093/molbev/msr121. PMID: 21546353. [62]. Tang, H., Lyons, E., Pedersen, B., Schnable, J.C., Paterson, A.H., and Freeling, M. 2011. Screening synteny blocks in pairwise genome comparisons through integer programming. BMC Bioinformatics. 12:102. doi: 10.1186/1471-2105-12-102. PMID: 21501495. [63]. Thauer, R.K., Kaster, A.K., Seedorf, H., Buckel, W., and Hedderich, R. 2008. Methanogenic archaea: ecologically relevant differences in energy conservation. Nat. Rev. Microbiol. 6:579–591. doi: 10.1038/nrmicro1931. PMID: 18587410. [64]. Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., and Higgins, D.G. 1997. The CLUSTAL_X Windows Interface: Flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25:4876-4882. doi: 10.1093/nar/25.24.4876. PMID: 9396791. [65]. Ufnar, J.A., Ufnar, D.F., Wang, S.Y., and Ellender, R.D. 2007. Development of a swine-specific fecal pollution marker based on host differences in methanogen mcrA genes. Appl Environ Microbiol. 73:5209-5217. doi: 10.1128/AEM.00319-07. PMID: 17586669. [66]. Wei, L., Liu, Y., Dubchak, I., Shon, J., and Park, J. 2002. Comparative genomics approaches to study organism similarities and differences. J Biomed Inform. 35:142150. PMID: 12474427. [67]. Wrede, C., Walbaum, U., Ducki, A., Heieren, I., and Hoppert, M. 2013. Localization of methyl-coenzyme M reductase as metabolic marker for diverse methanogenic Archaea. Archaea. 2013:920241. doi: 10.1155/2013/920241. PMID: 23533332. [68]. Xu, Z., and Hao, B. 2009. CVTree update: a newly designed phylogenetic study platform using composition vectors and whole genomes. Nucleic Acids Res. 37:W174W178. doi: 10.1093/nar/gkp278. PMID: 19398429. [69]. Yamada, T., Tsukamoto, H., Shiraishi, T., Nomura, T., and Oku, H. 1990. Detection of indoleacetic acid biosynthesis in some species of Taphrina causing hyperplastic 14
[70]. [71].
[72]. [73].
diseases in plants. Ann Phytopathol Soc Jpn. 56:532-540. doi: 10.3186/jjphytopath.56.532. Yang, Z. 2007. PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol Biol Evol. 24: 1586-1591. doi: 10.1093/molbev/msm088. PMID: 17483113. Zhang, Y., Rodionov, D.A., Gelfand, M.S., and Gladyshev, V.N. 2007. Comparative genomic analyses of nickel, cobalt and vitamin B12 utilization. BMC Genomics. 10:78. doi: 10.1186/1471-2164-10-78. PMID: 19208259. Zhu, Y. 2013. Evolution After Whole-Genome Duplication: A Network Perspective. G3 (Bethesda). 3:2049–2057. doi: 10.1534/g3.113.008458. PMID: 24048644. Zuker, M. 2003. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31:3406-34015. doi: 10.1093/nar/gkg595. PMID: 12824337.
Fig.1 Phylogenomic tree constructed by a compositional vector algorithm using the whole genome sequences of MMA and other closely related methanogenic archaea. Branch lengths are proportional to evolutionary distances. Out-group organism was selected randomly for the classification.
15
Fig.2 Synonymous substitution histograms for analysis of syntenic homologs between MMA and MAC; MMA and MBA
16
Fig.3 Z-curve (AT, GC, RY and MK disparity curve) representation for comparing the location of DNA replication proteins and OriC across the genomes of MMA, MAC and MBA. Short vertical red line indicates the location of DNA replication proteins. The black 17
arrows pointing below are the oriCs with origin recognition box sequences. OriC sequence alignment is shown in below left.
Fig.4 Estimates of genetic diversity and Darwinian selection of phylogenetic and metabolic markers 18
Fig.5 Comparative genomic analyses to identify the metabolic relationships of genes (a), enzymes (b), central metabolism (c), and methanogenesis (d) across the MMA, MAC and MBA. 19
MMA: Methanosarcina mazei Go1; MAC: Methanosarcina acetivorans C2A; MBA: Methanosarcina barkeri strain fusaro
Fig.6 The novel putative metabolic pathways discovered in MMA based on the unique genes. In siroheme biosynthesis (6a), successive methylation at positions 2 and 7 of uroporphyrinogen III forms precorrin-1 and precorrin-2, respectively. Precorrin-2 is oxidized to sirohydrochlorin by precorrin-2 dehydrogenase (EC 1.3.1.76) and then chelated with 20
ferrous iron for the synthesis of siroheme by sirohydrochlorin ferrochelatase (EC 4.99.1.4). 4Aminobutyrate can be degraded into succinate semialdehyde and L-glutamate by acetylornithine aminotransferase (EC 2.6.1.19) in the presence of 2-oxoglutarate (6b). Succinate-semialdehyde dehydrogenase [NADP+] (EC 1.2.1.16) catalyses succinate semialdehyde into succinate, which further assimilated via incomplete reductive TCA cycle for energy-driven process.
Table 1 Correlation coefficient calculated for comparing intergenomic distance of MMA across the phylogenomic neighbors 21
Genome AFU MMA MAC MBA MBU MMH MRU MSI MTP NPH
AFU 1 0.43 0.33 0.35 0.45 0.51 0.35 0.38 0.48 0.5
MMA 0.43 1 0.8 0.83 0.64 0.67 0.49 0.49 0.56 0.4
MAC 0.33 0.8 1 0.87 0.53 0.58 0.42 0.38 0.49 0.33
MBA 0.35 0.83 0.87 1 0.53 0.58 0.42 0.41 0.55 0.37
MBU 0.45 0.64 0.53 0.53 1 0.77 0.4 0.42 0.55 0.48
22
MMH 0.51 0.67 0.58 0.58 0.77 1 0.48 0.5 0.61 0.49
MRU 0.35 0.49 0.42 0.42 0.4 0.48 1 0.77 0.49 0.27
MSI 0.38 0.49 0.38 0.41 0.42 0.5 0.77 1 0.52 0.26
MTP 0.48 0.56 0.49 0.55 0.55 0.61 0.49 0.52 1 0.37
NPH 0.5 0.4 0.33 0.37 0.48 0.49 0.27 0.26 0.37 1