Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA markers

Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA markers

Journal Pre-proof Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA mar...

8MB Sizes 0 Downloads 50 Views

Journal Pre-proof Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA markers

Priyanka Sah, Sangeeta Mandal, Rajeev K. Singh, Raj Kumar, Abhinav Pathak, Nimisha Dutta, J.K. Srivastava, Ved Prakash Saini, Kuldeep K. Lal, Vindhya Mohindra PII:

S2214-5400(19)30094-5

DOI:

https://doi.org/10.1016/j.mgene.2019.100635

Reference:

MGENE 100635

To appear in:

Meta Gene

Received date:

12 August 2019

Revised date:

21 October 2019

Accepted date:

11 November 2019

Please cite this article as: P. Sah, S. Mandal, R.K. Singh, et al., Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA markers, Meta Gene(2018), https://doi.org/10.1016/ j.mgene.2019.100635

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

© 2018 Published by Elsevier.

Journal Pre-proof Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA markers Priyanka Sah1,

2

, Sangeeta Mandal1 , Rajeev K. Singh1 , Raj Kumar1 , Abhinav Pathak1 ,

Nimisha Dutta1 , J.K. Srivastava2 , Ved Prakash Saini3 , Kuldeep K Lal1 and Vindhya Mohindra1 Affiliations and addresses of the authors: 1 ICAR- National Bureau of Fish Genetic Resources (NBFGR), Lucknow, India. 2

Amity Institute of Biotechnology, Amity University, Lucknow Campus, Lucknow, Uttar

Aquaculture Research and Seed Unit, Maharana Pratap University of Agriculture and

oo

3

f

Pradesh, India.

Technology, Udaipur, Rajasthan, India

pr

Corresponding author:

al

Pr

e-

Vindhya Mohindra ICAR-National Bureau of Fish Genetic Resources, Canal Ring Road, PO Dilkusha, Lucknow, Uttar Pradesh, India [email protected], [email protected] Telephone number-+91 9415234371

Conflict of Interest:

Jo u

rn

The authors declare that they have no conflict of interest.

S.NO Author(s) details 1 Authors Email Name 2 Co-authors Email Name Institutional e-mail Research gate profile 3 Co-authors Email Name Institutional e-mail 4 Co-authors Email Name Research gate profile 5 Co-authors Email Name 6 Co-authors Email Name

[email protected] Priyanka Sah [email protected] Dr. Sangeeta Mandal [email protected] https://www.researchgate.net/profile/Sangeeta_Mandal [email protected] Dr. Rajeev K Singh [email protected] [email protected] Raj Kumar https://www.researchgate.net/profile/Raj_Kumar49 [email protected] Nimisha Dutta [email protected] Dr. Abhinav Pathak

Journal Pre-proof

8

9

[email protected] Prof.(Dr.) J.K. Srivastava [email protected] [email protected] Ved Prakash Saini https://www.researchgate.net/profile/V_P_Saini [email protected] Dr. Kuldeep K Lal [email protected] https://www.researchgate.net/profile/Kuldeep_Lal [email protected] Dr. Vindhya Mohindra [email protected] https://www.researchgate.net/profile/Vindhya_Mohindra

f

9

Co-authors Email Name Institutional e-mail Co-authors Email Name Institutional e-mail Co-authors Email Name Institutional e-mail Research gate profile Authors Email Name Institutional e-mail Research gate profile

oo

7

Abstract:

pr

Tor tor, Tor mahseer, an endangered cyprinid species, is important as sport fish. Genetic divergence in natural populations of T. tor was investigated using two mitochondrial genes,

e-

Cytb (Cytochrome b) and ATPase6/8. Analysis of 140 sequences of Cytb (1121 bp) and

Pr

ATPase6/8genes (842 bp) genes revealed 12 and 7 haplotypes respectively, whereas 23 haplotypes were found in concatenated sequences (1963 bp). Sequence analysis of mitochondrial regions revealed balancing selection and displayed low nucleotide and

al

moderate haplotype diversities. Mantel tests identified a positive relationship between

rn

pairwise geographical and genetic distances for the study region. Results of AMOVA based on genes pointed out that the genetic variations were mainly due to variation in within

Jo u

populations (ATPase6/8: 62.73%, Cytb: 42.13%, concatenated sequences: 48.31%). The total Fst was found to be significant in both genes with a value of 0.5787 (p<0.05) and 0.3726 (p<0.05) for Cytb and ATPase6/8genes respectively, whereas 0.51687 (p<0.05) was found in combined sequences, which revealed sub-structuring in the T. tor natural populations. Population pair-wise Fst ranged from 0.00901 - 0.85631 for Cytb and 0.09910 - 0.64731 for ATPase6/8genes, whereas 0.02039 - 0.85436 in combined sequences. Results indicated the presence of four genetic stocks in the populations studied. The baseline information about stock characterisation in this study would be applicable for planning strategies for conservation, management and sustainable fisheries of this species.

Keywords: Tor tor, Cytb, ATPase 6/8, mtDNA, Polymorphism, Genetic variation Introduction

Journal Pre-proof The iconic mahseer, Tor tor (Hamilton, 1822), a native of the trans-Himalayan region (Shrestha, 1994), is found in fast-running rivers and streams with rocky bottoms in Nepal, Bangladesh, India, Bhutan and Pakistan (Desai, 2003, Khare et al., 2014). In India, T. tor is known from the Ganga (including sub-Himalayan range), Indus, Brahmaputra and Narmada river systems (Shrestha, 1994) and thus T. tor contributes significantly to the capture fisheries of cold as well as warm waters, however, there is no farming of this fish species Lal et al. (2013) reported the extended distribution of T. tor in the peninsular rivers, Godavari and Krishna, which poses an interesting question about its existence as a species native to these rivers ( Pinder et al., 2019). The wild population of T. tor is reported to have declined in

oo

f

current years, as it has been facing numerous challenges in its natural range of distribution, due to anthropogenic activities and/or natural causes (Pinder and Raghavan, 2013). As a consequence, it was given a status of ‘Endangered’ species in 2010, however currently, the

pr

species is assessed as data deficient under IUCN, 2018 (Rayamajhi, et al., 2010, 2018).

e-

For the scientific management of a species to avoid population decline and ensure resource sustainability, prior information about its genetic variability and stock structure is

Pr

important. Due to absence of introns in mitochondrial DNA and its efficient repair mechanisms, the mutation rate in mtDNA is much more frequent than that of nuclear genes

al

(Andalib et al., 2017). The faster evolutionary rate, which tend to enhance recent demographic events; and maternal inheritance make mitochondrial DNA a potential genetic

rn

marker for analyzing intraspecific variability as well as phylogenetic studies (William et al.,

Jo u

2005). In addition, mitochondrial genomes have 1/4 effective population sizes (Ne) than that of nuclear genes (Hard & Clark, 1989), which helps in the detection of selection pressure, and it may have bearing on the population structure of a species (Nekrutenko

et al., 2002).

However, the genetic level studies in T. tor are very limited. Pasi et al. (2013) employed, cytb (967 bp) to reveal the population structure of T. tor from three locations of Madhya Pradesh, India, whereas Sharma et al. (2016) observed no significant genetic distances in T. tor from wild and cultures conditions with dominant RAPD markers. Thus, choice of suitable markers is very important for a particular species, for genetic divergence studies. A combination of markers

is

preferable for resolving genetic connectivity and

historical demography,

particularly in fish species (Mohindra et al., 2019). Role of natural selection on evolution of the mitochondrial genome in vertebrates may be hinted by the analysis of mitochondrial genome (Consuegra et al., 2015).

Journal Pre-proof In the present study, two full length genes from mitogenome, i.e., cytb, ATPase6/8 and concatenated sequences were used to resolve genetic variation and divergence in T. tor from six locations of four rivers (Berach, Narmada, Penganga and Godavari), which will be helpful in developing management strategies in order to conserve for sustainable fisheries of this depleting species. Our aim was to determine spatial structure in the mtDNA diversity over the landscape of the species studied and

demographic history for status of species T. tor,

intraspecific

population structure and gene flow patterns in 6 populations of Tor tor, which belong to

oo

f

different river basins and status of T. tor in the peninsular river Godavari.

Materials and Methods: Design of the study

pr

In this study, 140 samples of T. tor were collected from 6 river locations (Table 1).

e-

Cytochrome b and ATPase 8 and 6 genes of these samples were amplified and sequenced. These sequences were analysed to determine spatial structure in the mtDNA diversity over

Pr

the landscape, demographic history, population structure and gene flow patterns in these

Sample collection:

al

populations of T. tor.

rn

Tissue samples (muscles and fin) from a total of 140 individuals of T. tor were

Jo u

collected, from commercial riverine catches of six different geographical sites, i.e, Berach (17), Narmada (3 locations; Tawa, Khargaon and Kolar ) (84), Penganga (18) and Godavari (21) (Table 1, Fig. 1) and fixed in 95% ethanol and stored at 4o C until use. River Berach is a seasonal tributary of river Banas and Chambal under Ganga river basin. The samples were collected from Madar dam, located at Salumber in district Udaipur of western Rajasthan, is an irrigation dam on river Berach River Narmada is one of only three major rivers in peninsular India and is the longest west flowing river originates near Anuppur district, Madhya Pradesh state, from Amarkantak Plateau, which flows westwards and drains into the Arabian Sea.The samples were collected from Tawa reservoir, Hoshangabad; Khargone and Kolar dam of Kolar tributary on the northern side of River Narmada. The Penganga River, a tributary of Godavari Basin, originates in the Ajantha ranges in Aurangabad district in Maharashtra state and after travelling 676 km confluence into Pranhita

Journal Pre-proof River. The Godavari is India's second longest river after the Ganga, which originates in Trimbakeshwar, Maharashtra state and flows eastward to drain into the Bay of Bengal. Mitochondrial Gene Amplification Tissue samples were used for total genomic DNA extraction using a modified PhenolChloroform method (Ruzzante et al., 1997). Cytb region was amplified with primers, L14724 and H15915 (Xiao et al., 2001) and ATPase8 and ATPase6 genes with primers, ATP8.2L8331 and COIII.2H9236 (Sivasundar et al., 2001). Primer sequences are given in Supplementary Table 1. Polymerase chain reactions (PCR) were carried out in 50 μl reaction mixtures, following conditions of Mandal et al., (2016) and checked on 2 % agarose gels.

f

The products were purified and sequenced bi-directionally using an automated capillary

oo

sequencer (ABI 3730Xl) following the manufacturer’s recommendations. Consensus DNA sequence synthesis and genotyping

pr

The sequences generated from both forward and reverse primers (Suppl. Fig. 1a and

e-

1b) were aligned using Clustal Omega (Thompson et al., 1997) and consensus sequences were synthesised manually. Differences in the nucleotide sequences were identified and

Pr

haplotypes were designated as unique sequences that differed from all others by at least one nucleotide change. Haplotypes were identified from

populations studied were identified

Statistical analysis

al

through by DnaSP (Librado and Rozas, 2009).

rn

Three data sets were generated (1) Cytochrome b (cytb) (2) ATPase 6 and 8

Jo u

(ATPase6/8) and (3) concatenated sequences of cytb ATPase6/8. The use of 3rd dataset was validated through phylogeny analysis of haplogroups by MEGAX (Tamura et al., 2007) with 1000 boot-strappings. Mantels test (Mantel 1967) was used to assess the presence of significance levels of genetic distance across the distribution range by overlaying on lands cape co-ordinates and their 3-D pattern of distribution plots through genetic landscape shapes (AIS, Miller, 2005), with parameters of a 50 × 50 grid and a distance weighting parameter (a) of 1.

Ka_Ks Calculator (Zhang et al., 2006) was used tocalculate the ratio of non-

synonymous substitutions (KA) to synonymous substitutions (KS) for identification of the selective pressures occurring in protein-coding genes and best model for it was identified through MEGA. MEGA were used to determine sequence composition and variability parameters. Haplotype diversity (h), nucleotide diversity (π), AMOVA and Fst values were calculated using Arlequin 3.11 (Excoffier et al., 2005). Gene flow estimates, i.e, number of migrants,

Journal Pre-proof (Nm) by the permutation tests with 1000 replicates (Librado et al., 2009) with Jukes and Cantor correction, was calculated by DnaSP (Librado and Rozas, 2009). The median-joining haplotype network was created in Network 4.6.1.1 (Bandelt et al., 1999) from haplotype data of mitochondrial genes from all data sets. For each population of T. tor, demographic history was assessed using neutrality tests and mismatch distributions of pair-wise differences in Arlequin. Two widely used statistical tests Fu’s Fs (Fu, 1997), Tajima’s D (Tajima, 1989) and raggedness index were calculated by Arlequin to determine the goodness of fit to a unimodal distribution (Harpending, 1994).

oo

f

Results: Sequence composition

Cytb gene: From the cytb gene sequences (1121 bp) for 140 T. tor specimens, 12 different

pr

haplotypes were obtained (NCBI Accession no. MN105957 - 68). Out of the 1121 bases, 12

e-

were variable with 6 parsimony informative sites and 6 singletons. The average nucleotide frequencies were: A: 30.0%; T: 27.7%; C: 28.5%, G: 13.9% and were found to be A+T rich

Pr

(57.1%).

ATPase6/8 genes: The amplified product contained ATPase 8 and 6 genes of 842 bp.

al

ATPase 8 region extended from 1–165 bp of the sequence and ATPase 6 from 159–842 bp with a coinciding region (7 bp) between two genes from 159–165 bp. The two regions were

rn

analysed together for determining variation in T. tor. A total of 7 different haplotypes were

Jo u

obtained (NCBI Accession no.MN105950-56). Out of the 842 bases, 15 were variable with 13 parsimony informative sites and 2 singletons. The average nucleotide frequencies were: A: 31.1%; T: 28.1%; C: 27.7%, G: 13.1% and were found to be A+T rich (57.2%). Concatenated sequences (Cytb + ATPase 6/8 genes): A total of 23 different haplotypes were obtained. Out of the 1963 bases, 18 were variable with 10 parsimony informative sites and 8 singletons. The average nucleotide frequencies were: A: 30.5%; T: 27.8%; C: 28.1%, G: 13.6% and were found to be A+T rich (57.2%). When Cytb and ATPase 6/8 sequences were analysed individually and concatenated for phylogenetic relationships among the haplotypes, two consistent haplogroups were observed in all the three data sets (Suppl. Fig. 2a and 2b, Tree with ATPase not shown.). Ka/Ks ratio on Kimuras- two parameter (K2P) model (0.5831) indicated constraining (purifying) selection. Spatial structure of mtDNA diversity within T. tor

Journal Pre-proof The distribution of mitochondrial sequence variation within the natural range of T. tor was explored to identify potential spatial structure. Mantel tests identified a positive relationship between pairwise geographical and genetic distances for the study region, as a whole (r = 0.3410775634, P=0 .00099900) and pattern of scatter plot is given in Fig.2. The analysis of genetic landscape shape (Fig. 3) strongly indicated a pattern of higher genetic differentiation among populations in the central Indian region of the study area (Suppl. Fig. 3). The positive peaks with high genetic divergence P1 (20.58 77.22) and P2 (21.27 78.195) were located in the regions between the collection points of {2, 3, 4} and {5, 6} (Suppl. Fig.

oo

Genetic and phylogenetic relationship among haplotypes

f

3).

The cytb gene showed among samples from six locations, the most common haplotype to be H02 (Suppl. Table 1), while H04 was the dominant haplotype in Penganga

pr

and Godavari. Total shared haplotypes were 4, i.e., H02, H03, H04 and H06. Population

e-

specific haplotypes were found in Berach (1), Penganga (1), Narmada-Tawa (3) and Narmada-Kolar (3). Samples from river Narmada-Tawa exhibited a maximum number of

Pr

haplotypes, i.e., 6. Haplotype network (Fig. 4a) demonstrated the formation of double clades, with H02 and H04 as the ancestral haplotypes, however, no particular clade contained

geographical area.

al

haplotypes from the distinct geographical region and were distributed in a different

rn

For ATPase6/8, haplotype H02 was the most common of the 7 haplotypes and two

Jo u

haplotypes were shared among all populations (Suppl. Table 1). Haplotypes H02 was shared between all six sites, whereas H01 was also shared between all sites, except the Godavari. Population specific haplotypes were found in Berach (1), Narmada-Tawa (2), NarmadaKhargone (1) and Narmada-Kolar (1). Haplotype network demonstrated the formation of double clades, with H01 and H02 as the ancestral haplotypes; however, no specific clade contains haplotypes from a single geographical region (Fig. 4b). For concatenated sequences, haplotypes H02, H07 and H09 was the most common haplotype of the 23 haplotypes and a total of six shared haplotypes i.e., H02, H03, H07, H08, H09 and H11 (Suppl. Table 1). Population specific haplotypes were found in Berach (1), Narmada-Tawa (2), Narmada-Khargone (2), Narmada-kolar (1) and Penganga (1). Samples from river Narmada-Tawa exhibited a maximum number of haplotypes, i.e., 10. It was interesting to find single haplotype H9 in Godavari river population. Haplotype network demonstrated the formation of two clades, with H02 with H07 in one cluster and H09 in

Journal Pre-proof another cluster, as the central haplotypes (Fig. 4c)., however, no particular clade contained haplotypes from the distinct geographical region and were distributed in a different geographical area. Nucleotide and haplotype diversity T. tor mitochondrial sequences revealed absence of haplotype and nucleotide diversities, as in Godavari samples, only one haplotype was found. Rest of the populations exhibited moderate haplotype and low nucleotide diversities for all populations (table 3). In cytb, haplotype diversity (h), except ranged 0.0000 (Godavari) to 0.6691 (Berach) and nucleotide diversity (π) was low, ranged from 0.0000 (Godavari) to 0.00114 (Penganga). In

oo

f

ATPase6/8, h varied between 0.0000 (Godavari) to 0.5903 (Narmada-Tawa) and π between 0.0000 (Godavari) to 0.0079 (Narmada-Tawa) (Table 2). In concatenated sequences, h varied between 0.0000 (Godavari) to 0.7768 (Narmada-Tawa) and π between 0.0000 (Godavari) to

pr

0.000756 (Penganga) (Table 2).

e-

Genetic variability and differentiation

AMOVA results for hierarchical partitioning of variation for cytb. ATPase and

Pr

concatenated sequences are shown in Table 3. For cytb, out of total variation, 54.26% was contributed by among groups, while 42.13% to within populations. The mean F st 0.5787

al

(p<0.05) was found to be significant. For ATPase6/8, out of total variation, only 37.59% was contributed by among groups and 62.73% to within populations. The mean F ST 0.3726 (p<

rn

0.05) was found to be significant. Whereas for concatenated sequences, 49.25% was

Jo u

contributed by among groups while 48.31% to within populations. The mean F st 0.51687 (p<0.05) was found to be significant. The

pairwise

Fst

between population samples using Cytb,

ATPase6/8

and

concatenated sequences revealed that out of 15 population pairs, significant divergent between 12,10 and 10,

population pairs, respectively. The population pairs which did not

show significant divergence with ATPase and concatenated are Berach-Kolar and PengangaGodavari. The genetic relationship between populations (FST values) is presented in Table 4 based on two markers. Neutrality and mismatch distribution In cytb, the population Narmada-Kolar showed negative significant D and Fu’s values with a unimodal pattern of mismatch distribution. However, none of the population showed significant values with ATPase6/8. However, significant Fu’s values were found for three populations, Narmada- Tawa, Khargone and Kolar with unimodal distribution. In overall

Journal Pre-proof population. Negative D was not significant; Fu’s significant while raggered index was also not significant, with unimodal distribution (Fig. 4). Gene flow estimates Gene flow estimates (number of migration, N m) in 4 rivers, also showed (Table 6) very high values between pairs of Narmada- Tawa, Khargone and Kolar populations, while low to moderate for rest of the population pairs. Discussion: The present study characterized T. tor from four Indian rivers and six collection sites with two mitochondrial markers cytb and ATPase6/8genes. Nucleotide sequences of cytb and

oo

f

ATPase6/8genes were rich in A+T, which showed conformity with previous studies for fish (Luhariya et al., 2012; Mandal et al., 2016). Congruence of phylogenetic pattern observed for individual cytb and ATPase6/8 genes and concatenated genes supports the analysis of

pr

concatenated sequences for the present study (Theirgart et al., 2014). Presence of purifying

e-

selection pressure on mitochondrial genes studied may be due to strong functional constraints in mitochondria and deleterious non-synonymous mutations accumulations are removed by

Pr

effective purifying selection (Sun et al., 2011).

From our studies in T. tor from natural populations studied, a pattern of isolation by

al

distance (IBD) suggests the influences of gene flow and drift on the distribution of its genetic

rn

variability. The scatterplot pattern revealed isolation of the samples, where genetic drift has occurred independently of geographic distance (Hutchison and Templeton, 1990). With the

Jo u

highest genetic divergence found be in central Indian region, it may be speculated to be the region of origin of the species and from there, it may have migrated to other regions/rivers. A highly negative value of Fu’s for overall population depicts alleles in excess, may be owing to recent population expansion or genetic hitchhiking. Non-significant raggedness values and unimodal pattern of mismatch distribution also supports the genetic signal of sudden population expansion for overall T. tor under study (Joshi et al., 2013). Haplotype network with two clusters, in which each displayed star like topology also indicated recent expansion of population, following population bottleneck (Slatkin and Hudson, 1991). Thus, T. tor as an endangered species (IUCN 2010), in isolated populations with genetic drift, may face inbreeding depression, thus calls for study of its population structure analysis, for implementation of conservation measures.

Journal Pre-proof Moderate haplotype and low nucleotide diversity was found to be characteristics of the populations of T. tor, exception of absence of genetic diversity in Godavari population. All populations, except Godavari, are with moderate haplotype and low nucleotide diversity values suggest existence of small populations, which have undergone recent population growth (Grant and Bowen, 1998). This is indicative of recent population bottleneck or founder effect. However, only Narmada-Kolar population possesses genetic signals of demographic expansion, inferred from unimodal pattern of mismatch distribution (Rogers and Harpending 1992), along with negative significant values for Fu’s F and non-significant Raggedness index.

This demographic expansion may result in an excess of rare

oo

f

polymorphisms, as this population has several private haplotypes. Moderate level of diversity may also be due to constraining selection detected. In general, population decline for all the populations, except Narmada-Kolar population, was suggested by the neutrality tests.

pr

The AMOVA results as well as total Fst values revealed sub-structuring in the T. tor

e-

natural populations. A majority of variation observed in within population as well as among groups, while low among populations. For relatively high levels of within-population variations between population is expected

Pr

variation but limited

in migratory fishes

(Vrijenhoek, 1998) and T. tor being a migratory fish species (Talwar and Jhingran, 1991), its

al

genetic variation showed a similar pattern. Migratory behaviour also indicates that gene flow can offset the pattern of genetic differentiation among the population (Nguyen et al., 2006).

rn

Pairwise Fst comparison based on cytb as well ATPase6/8 genes showed that lack of

Jo u

genetic divergence among the three locations of Narmada and may be considered a single genetic stock. It may be due to a high level of genetic connectivity because of interconnected tributaries of Narmada river system. Thus, overall, there is the evidence of the existence of four genetic stocks, Berach, Narmada, Penganga and Godavari populations. River Berach is significantly divergent from other rivers, as it is a separate river system and there is very low gene flow with other rivers. It is important to note here that the one of the dominant haplotype observed in Berach is also unique and not present in any collection from other location. The haplotype diversity observed in Berach sample is comparable to that found in other populations including River Narmada. This indicates that the dam harbours native population of the river Berach and is differentiated from its conspecifics from the rivers of other basins. Such differentiated populations with restricted distribution are important from conservation point of view. The low genetic divergence between the three collection locations in River Narmada is expected. The sites of collection in Narmada are

Journal Pre-proof interconnecting and as T. tor has migratory nature. (Desai 2003) and thus genetic mixing is possible. However, the small divergence between the three locations within Narmada and private haplotypes found in small frequency, indicate differentiating populations existing in the tributaries. Penganga is a tributary of Godavari, there may be events of population mixing in the past and this might have resulted in low divergence, though significant, than with other populations. However,It is surprising to observe the absence of genetic variability (only one haplotype) in the main river Godavari population in contrast to its northern tributary Penganga. The presence of T. tor in Godavari river basin, including river Penganga, was first

oo

f

reported by Lal et. al. (2012), as a new extended distribution in peninsular India. The haplotype network for both genes suggests a common ancestor for the lineages of samples from rivers Godavari and Penganga. The samples from river Penganga show diversity and on

pr

the other hand the samples of river Godavari are homozygous. This could be possible that the

e-

river Goadavri has restricted habitats, suitable for existence of mahseer and possibly have small populations. However, based on the present data it cannot be said that this is

Pr

consequence of an introduced population or the populations have constricted and leading to restricted gene flow and homozygosity. It is well known that small constricted population of

al

a few individuals, possibly due to migration or introduction, would result in inbreeding, following the small population founding effect (Chen et al., 2012), which would in turn give

rn

rise to a highly homozygous population.

Jo u

Conclusion:

The results demonstrated the utility of mitochondrial cytb and ATPase6/8to determine the demographic history and intra-specific genetic diversity and to discriminate genetic stocks in the wild population of T. tor. There is an evidence of distinct genetic structure in the wild populations of T. tor. The populations where decline in population size is observed, there is a possibility of inbreeding and genetic drift to further reduce the genetic diversity in these populations. Based on the results of the present study, development of conservation strategies to maintain the natural genetic diversity is suggested. Acknowledgment: The authors are grateful to Director, ICAR-NBFGR for providing facilities for this research. This work was a part of the ICAR plan funded project entitled, Outreach Activity on Fish Genetic Stocks, Phase II. Mr. R.S. Sah, Mr. Rajesh Kumar and Mr. Pradipta Paul are

Journal Pre-proof acknowledged for their role in sampling and other technical assistance. Mr. Kantharajan G. is acknowledged for his help in the location maps.

Conflict of interest: The authors declare that there is no financial or non-financial conflict of interest.

Compliance with Ethical Standards: Funding: This study was funded by, ICAR- National Bureau of Fish Genetic Resources (ICAR-NBFGR), Lucknow, India.

oo

f

Ethical approval:

The sampling protocols followed in the current study were approved by the Institutional Animal

Ethical

Committee

(IAEC),

ICAR-NBFGR,

India

vide

No.

pr

G/CPCSEA/IAEC/2015/2 dated 27 October, 2015.

Lucknow,

e-

No studies on live animals were performed, in the present investigation.

Pr

References:

al

Andalib S, Divani AA, Michel TM, Høilund-Carlsen PF, Vafaee MS, Gjedde A. 2017. Pandora's Box: mitochondrial defects in ischaemic heart disease and stroke. Expert Rev Mol Med.,19, e5.

Jo u

rn

Ballard, J.W.O. and Kreitman, M., 1995. Is mitochondrial DNA a strictly neutral marker?. Trends in Ecology & Evolution, 10(12), pp.485-488. Bandelt, H.J., Forster, P. and Rohl, A., 1999. Median-joining networks for inferring intraspecific phylogenies. Molecular Biology and Evolution, 16(1), pp.37-48. Chen, S.Y., Zhang, Y.J., Wang, X.L., Sun, J.Y., Xue, Y., Zhang, P., Zhou, H. and Qu, L.H., 2012. Extremely low genetic diversity indicating the endangered status of Ranodon sibiricus (Amphibia: Caudata) and implications for phylogeography. PLoS One, 7(3), p.e33378. Consuegra, S., John, E., Verspoor, E. and Garcia de Leaniz, C. 2015. Patterns of natural selection acting on the mitochondrial genome of a locally adapted fish species. Genetics, selection, evolution, 47(1): 58. Desai, V.R., 2003. Synopsis of biological data on the tor mahseer Tor tor (Hamilton, 1822). FAO Fisheries synopsis, 158. Excoffier, L., Laval, G. and Schneider, S., 2005. Arlequin (version 3.0): An integrated software package for population genetics data analysis. Evolutionary Bioinformatics, 1, p.117693430500100003.

Journal Pre-proof Fu, Y.X., 1997. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics, 147(2), pp.915-925. Grant, W.A.S. and Bowen, B.W., 1998. Shallow population histories in deep evolutionary lineages of marine fishes: insights from sardines and anchovies and lessons for conservation. Journal of Heredity, 89(5), pp.415-426. Hard, D.L. and Clark, A.G., 1997. Principles of population genetics. Sunderland, MA. Sinauer Associates, Inc., Publishers, pp.95-109. Harpending, H.C., 1994. Signature of ancient population growth in a low-resolution mitochondrial DNA mismatch distribution. Human Biology, pp.591-600.

pr

oo

f

Hutchison, D.W. and Templeton, A.R., 1999. Correlation of pairwise genetic and geographic distance measures: inferring the relative influences of gene flow and drift on the distribution of genetic variability. Evolution, 53(6), pp.1898-1914.

e-

Khare, P., Mohindra, V., Barman, A.S., Singh, R.K. and Lal, K.K., 2014. Molecular evidence to reconcile taxonomic instability in mahseer species (Pisces: Cyprinidae) of India. Organisms Diversity & Evolution, 14(3), pp.307-326.

al

Pr

Lal, K.K., Singh, R.K., Pandey, A., Gupta, B.K., Mohindra, V., Punia, P., Dhawan, S., Verma, J., Tyagi, L.K., Khare, P. and Jena, J.K., 2013. Distributional records of Tor mahseer Tor tor (Hamilton, 1822) from Southern India. Journal of Applied Ichthyology, 29(5), pp.1086-1090.

rn

Librado, P. and Rozas, J., 2009. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics, 25(11), pp.1451-1452.

Jo u

Luhariya, R.K., Lal, K.K., Singh, R.K., Mohindra, V., Punia, P., Chauhan, U.K., Gupta, A. and Lakra, W.S., 2012. Genetic divergence in wild population of Labeo rohita (Hamilton, 1822) from nine Indian rivers, analyzed through MtDNA cytochrome b region. Molecular Biology Reports, 39(4), pp.3659-3665. Mandal, S., Jena, J.K., Singh, R.K., Mohindra, V., Lakra, W.S., Deshmukhe, G., Kumar, R. and Lal, K.K., 2016. Genetic characterization of Silond catfish, Silonia silondia (Hamilton, 1822) inferred from two mitochondrial markers. Mitochondrial DNA Part A, 27(2), pp.10751079. Mantel, N., 1967. The detection of disease clustering and a generalized regression approach. Cancer research, 27(2 Part 1), pp.209-220. Miller, M.P., 2005. Alleles In Space (AIS): computer software for the joint analysis of interindividual spatial and genetic information. Journal of Heredity, 96(6), pp.722-724 Mohindra, V., Divya, B.K., Kumar, R., Singh, R.K., Dwivedi, A.K., Mandal, S., Masih, P., Lal, K.K. and Jena, J.K., 2019. Genetic population structure of a highly migratory Hilsa

Journal Pre-proof Shad, Tenualosa ilisha, in three river systems, inferred from four mitochondrial genes analysis. Environmental Biology of Fishes, 102:939–954. Nekrutenko, A., Makova, K.D. and Li, W.H., 2002. The KA/KS ratio test for assessing the protein-coding potential of genomic regions: an empirical and simulation study. Genome research, 12(1), pp.198-202. Nguyen, T.T., Ingram, B., Sungan, S., Gooley, G., Sim, S.Y., Tinggi, D. and De Silva, S.S., 2006. Mitochondrial DNA diversity of broodstock of two indigenous mahseer species, Tor tambroides and T. douronensis (Cyprinidae) cultured in Sarawak, Malaysia. Aquaculture, 253(1-4), pp.259-269.

oo

f

Pasi, K.S., Lakra, W.S., Bhatt, J.P., Goswami, M. and Malakar, A.K., 2013. Population structure of Tor tor inferred from mitochondrial gene cytochrome b. Mitochondrial DNA, 24(3), pp.290-296.

pr

Pinder, A.C. and Raghavan, R., 2013. Conserving the endangered mahseers (Tor spp.) of India: the positive role of recreational fisheries. Current Science, 104(11), pp.1472-1475.

e-

Pinder, A.C. and Dahanukar, N., 2019. Mahseer (Tor spp.) fishes of the world: status, challenges and opportunities for conservation. Reviws in Fish Biology and Fisheries, 29 ,pp. 417-452.

al

Pr

Rayamajhi A, Jha, B.R., Sharma, C.M.,2010. Tor tor. The IUCN red list of threatened species 2010: e.T166534A6231157.http://dx.doi.org/10.2305/IUCN.UK.2010-4.RLTS.T1665 34A6231157.en. Downloaded on 04 August 2019.

Jo u

rn

Rayamajhi, A., Jha, B.R., Sharma, C.M., Pinder, A., Harrison, A., Katwate, U. & Dahanukar, N. 2018. Tor tor. The IUCN Red List of Threatened Species 2018: e.T166534A126321898. http://dx.doi.org/10.2305/IUCN.UK.2018-2.RLTS. T166534A126321898.en. Downloaded on 04 August 2019. Rogers, A.R. and Harpending, H., 1992. Population growth makes waves in the distribution of pairwise genetic differences. Molecular Biology and Evolution, 9(3), pp.552-569. Ruzzante, D.E., Taggart, C.T., Cook, D. and Goddard, S.V., 1997. Genetic differentiation between inshore and offshore Atlantic cod (Gadus morhua) off Newfoundland: A test and evidence of temporal stability. Canadian Journal of Fisheries and Aquatic Sciences, 54(11), pp.2700-2708. Sharma,J., Alka, P., Pratibha, B. and Jitendra, K., 2016. Assessment of genetic diversity of mahseer (Tor tor) using RAPD markers from wild and cultured conditions in Madhya Pradesh. Journal of Experimental Zoology, India, 19(1), pp.23-29. Shrestha, T.K., 1994. Migration and spawning of golden mahseer in Himalayan waters of Nepal. Journal of Freshwater Biology, 6(1), pp.71-77.

Journal Pre-proof Sivasundar, A., Bermingham, E. and Ortí, G., 2001. Population structure and biogeography of migratory freshwater fishes (Prochilodus: Characiformes) in major South American rivers. Molecular Ecology, 10(2), pp.407-417. Slatkin M and Hudson RR. 1991. Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 129:555–562. Sun, Y.B., Shen, Y.Y., Irwin, D.M. and Zhang, Y.P., 2010. Evaluating the roles of energetic functional constraints on teleost mitochondrial-encoded protein evolution. Molecular Biology and Evolution, 28(1), pp.39-44. Tajima, F., 1989. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics, 123(3), pp.585-595.

oo

f

Talwar, P.K. and Jhingran, A.G., 1991. Inland fisheries of India and adjacent countries. Vol. I & II, Oxford & IBH Publishing Company. pp.1-1158.

e-

pr

Tamura, K., Dudley, J., Nei, M. and Kumar, S., 2007. MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Molecular Biology and Evolution, 24(8), pp.1596-1599.

Pr

Thiergart, T., Landan, G. and Martin, W.F., 2014. Concatenated alignments and the case of the disappearing tree. BMC Evolutionary Biology, 14(1), p.266.

Conservation genetics of freshwater fish. Journal of Fish

rn

Vrijenhoek, R.C., 1998. Biology, 53, pp.394-412.

al

Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F. and Higgins, D.G., 1997. The Clustal_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research, 25(24), pp.4876-4882.

Jo u

William J, Ballard O, Rand DM. 2005. The population biology of mitochondrial DNA and its phylogenetic implications. The Annual Reveiw of Ecology, Evolution and Systematics, 36 ,pp. 621–42. Xiao, W., Zhang, Y. and Liu, H., 2001. Molecular systematics of Xenocyprinae (Teleostei: Cyprinidae): taxonomy, biogeography, and coevolution of a special group restricted in East Asia. Molecular Phylogenetics and Evolution, 18(2), pp.163-173. Zhang, Z., Li, J., Zhao, X.Q., Wang, J., Wong, G.K.S. and Yu, J., 2006. KaKs_Calculator: calculating Ka and Ks through model selection and model averaging. Genomics, Proteomics & Bioinformatics, 4(4), pp.259-263.

rn

al

Pr

e-

pr

oo

f

Journal Pre-proof

Jo u

Fig. 1 Map showing sample collection sites for natural population of Tor tor for present study (refer table 1). Mapped in ArcMap V. 10.6 platform using the India-river basin data sets obtained from NRSC-India: Water Resources Information System (http://www.india-wris.nrsc.gov.in/)''.

Journal Pre-proof

Mantel Test Results 0.004 0.003 0.003 0.003 0.003 0.003

0.002 0.002 0.002 0.002 0.001 0.001 0.001 0.001 0.001 0.000 0.000 0 1

2

3

4 5 Geographical Distance

6

7

8

oo

f

0

e-

pr

Figure 2. Scatter plot of Mantel’s test depicting ccorrelation of genetic and geographical distances. Pattern revealed isolation of the samples, where genetic drift has occurred independently of geographic distance.

Pr

P1

rn

al

P2

Jo u

Genetic Distance

0.002

50 48 46 44 42 40 38 36 34 32 30 28 26 24 22 20 18 16 14 12 10 8 Y (Western edge)

6

4

2

5

4550 40 35 30 25 X (Southern edge) 20 15 10

Figure 3. Genetic landscape shape plot showing patterns of spatial genetic distance for 6 populations of Tor tor. X and Y axes correspond to geographic coordinates and the Z axis (height) corresponds to genetic distance between individuals

Journal Pre-proof

pr

oo

f

a, Cytochrome B

Jo u

rn

al

Pr

e-

b. ATPase6/8

c. Concatenated sequences

pr

oo

f

Journal Pre-proof

Jo u

rn

al

Pr

e-

Fig.4. Median joining Network obtained from Tor tor from six collection sites. (a) Cyt b haplotypes, (b) ATPase haplotypes (b) and concatenated (c) sequence haplotypes. The size of each haplotype circle denotes the number of observed individuals. Colors correspond to different regions. Values across lines represent base pair at which variability between adjacent haplotypes.

Fig. 4 Unimodal pattern of mismatch distribution and relative haplotype frequency for overall Tor tor under study.

Jo u

rn

al

Pr

e-

pr

oo

f

Journal Pre-proof

Journal Pre-proof Table 1: Sample collection sites for natural population of Tor tor, used in present study

River

Tributary/ reservoir

Ganga Basin Narmada Basin

River Berach, Berach Dam Narmada river , Tawa reservoir Narmada river

1

Udaipur, State Rajasthan

17

2

45

Kolar tributary, Kolar Dam Penganga River

4

Godavari River

6

Tawa, State Madhya Pradesh Khargone, State Madhya Pradesh Kolar, State Madhya Pradesh Satnala, Adilabad, State Telangana Nirmal, State Telangana

15 18

f

5

24

oo

3

N

21

latitude and longitude 26.47, 74.63 22.75, 77.72 22.17, 76.07 23.25, 77.40 19.79, 78.67 18.99, 78.37

Year of collectio n 2005 2015, 2017 2015 2003 2015, 2016 2017, 2018

pr

Godavari Basin

Collection Site

Godavari Overall

ATPase6/8 concatenated sequences 0.3235 ± 0.7353±0.0770 0.1359 0.5903 ± 0.7768±0.0421 0.0370 0.5543 ± 0.7754±0.0665 0.0525 0.5385 ± 0.6476±0.1338 0.1146 0.2092 ± 0.5294±0.1170 0.1163 0.0000 ± 0.0000±0.0000 0.0000 0.8226 +/0.0175

al

NarmadaTawa NarmadaKhargone NarmadaKolar Penganga

Cytochrome B 0.6691 ± 0.0591 0.4485 ± 0.0858 0.4819 ± 0.1106 0.3714 ± 0.1532 0.4248 ± 0.0993 0.0000 ± 0.0000

rn

Berach

Haplotype diversity

Jo u

Populations

Pr

e-

Table 2: Haplotype and nucleotide diversities calculated through analysis of Cytochrome B, ATPase6/8 and concatenated sequences in Tor tor

Nucleotide diversity Cytochrome B 0.00073 ± 0.00062 0.00071 ± 0.00058 0.00073 ± 0.00061 0.00036 ± 0.00039 0.00114 ± 0.00084 0.00000 ± 0.00000

ATPase6/8 0.00066 ± 0.00064 0.00079 ± 0.00069 0.00071 ± 0.00066 0.00076 ± 0.00070 0.00025 ± 0.00035 0.00000 ± 0.00000

concatenated sequences 0.00070±0.00051 0.00073±0.00050 0.00072±0.00051 0.00051±0.00041 0.00075±0.00053 0.00000 ± 0.00000 0.000959 +/0.000610

Journal Pre-proof Table 3: AMOVA results for natural population of Tor tor, used in present study.

Source Variation

Of D.F.

Percentage Of Fixation Indices Variation

P Values

36.804

0.45404 Va

54.26

FCT: 0.5425

0.01857

3

3.168

0.03026 Vb

3.62

FSC: 0.0791

0.01173

134

47.235

0.35250 Vc

42.13

FST : 0.5787

0.0000

139

87.207

0.83680

Among groups

2

11.537

0.14486 Va

37.59

FCT: 0.3759

0.01173

Among populations within groups Within Populations Total

3

0.640

-0.00124 Vb

-0.32

FSC: -0.0052

0.4252

134

32.395

62.73

FST: 0.3726

0.0000

139

44.571

Among populations within groups Within Populations Total

f

2

oo

Cytochrome B Among groups

Sum Of Variance Squares Components

Pr

e-

pr

ATPase6/8

0.24175 Vc

al

0.38537

2

48.317

0.59840 Va

49.25

FCT: 0.49246

0.02737

Among populations within groups Within Populations Total

3

3.830

0.02967 Vb

2.44

FSC: 0.04811

0.0000

134

78.667

0.58707 Vc

48.31

FST: 0.51687

0.01760

139

130.814

Jo u

Among groups

rn

Concatenated sequences

Journal Pre-proof Table 4. Pair wise FST for cytochrome B (below diagonal), ATPase 6/8 (above diagonal) regions and Concatenated sequences in T. tor. (*p<0.01)

Berach Populations

Narmada Tawa

Penganga

Khargone

Godavari

Kolar

0.18214*

0.28539*

0.12271

0.64731*

0.78377*

Tawa

0.21651*

-

0.0000

0.0000

0.25876*

0.39893*

Khargone

0.20244*

0.0000

-

0.00990

0.18878*

0.38032*

Kolar

0.21827*

0.04938

0.05918

-

0.39969*

0.61638*

Penganga

0.54421*

0.42541*

0.39188*

0.55113*

-

0.07140

Godavari

0.85631*

0.72032*

0.75800*

0.25579*

-

0.20713*

-

Khargone

0.23910*

0.0000

Kolar

0.16897

0.02039

Penganga

0.57686*

0.37504*

Godavari

0.83377*

0.63408*

rn Jo u

oo

-

0.04414

-

0.34250*

0.51889*

-

0.67410*

0.85436*

0.23436

al

Tawa

e-

-

Pr

Berach

0.92405*

pr

Concatenated sequences

f

-

Berach

-

Journal Pre-proof Table 5. Neutrality (Tajima's D and Fu's FS) tests for Tor tor from for natural population

Berach Tawa

Narmada Khargone

Kolar

Godavar i

17

45

24

15

18

21

Tajima's D

0.97880

-1.08869

-0.63496

-1.68500*

1.27889

0.00000

FS Raggedness index Pattern of mismatch distribution ATPase6/8

0.63824 0.18745

-1.84316 0.16869

-0.27379 0.16190

-2.36912*** 0.18122

3.52530 0.69179

0.00000 0.00000

Unimodal

Multimodal

Multimodal

Unimodal

Multimod al

-

1.04864* * -0.12611 0.37630

-0.03726

0.27046

-0.52899

0.00000

-0.20366 0.20139*

0.27741 0.21775

-0.03956 0.12631

-0.01058 0.38212

0.00000 0.00000

Unimodal

Unimodal

-

Multimod Pattern of al mismatch distribution Concatenated sequences 17

Tajima's D

-0.21079

FS

-1.59357

Raggedness index Pattern of mismatch distribution

e-

0.03649

Unimodal

45

24

15

18

21

-0.86549

-0.34412

-1.15871

0.82716

0.00000

-1.15831

-3.70710**

-3.00952**

-2.78160**

0.81219

0.00000

-13.47188***

0.07353

0.05039

0.04602

0.07864

0.30493**

0.00000

0.02967

Multimod al

Unimodal

Unimodal

Unimodal

Unimodal

-

Unimodal

Jo u

Sample size

Unimodal

Pr

FS Raggedness index

al

Tajima's D

rn

Sample size Cytochrome B

Overall

oo

Statistics

f

Penganga

pr

*(p<0.05) **(p<0.01) ) ***(p<0.005)

f

Journal Pre-proof

Berach Tawa 2.24508 inf 9.62634 0.67535 0.19414

Narmada Khargone 1.25199 inf 7.94920 0.77591 0.15963

pr

Population

oo

Table 6. Gene flow estimates (No. of Migration Nm*) for Cytochrome B (below diagonal), ATPase 6/8 (above diagonal) regions and concatenated sequences. Kolar 3.57457 inf 50.00705 0.40723 0.04110

Penganga Godavari

Jo u

rn

al

Pr

e-

0.27243 0.13794 Berach 1.80933 1.43229 0.75335 Tawa 1.96984 2.14854 0.81468 Khargone 1.79075 0.75098 0.31119 Kolar 0.41876 6.50323 Penganga 0.08390 1.45471 Godavari Concatenated sequences Berach 1.91400 Tawa 1.59115 Khargone inf 2.45915 Kolar 24.02027 10.82687 0.36676 0.83320 0.95984 0.46360 Penganga 0.09969 0.28854 0.24173 0.08524 1.63347 Godavari *Lynch and Crease 1990 (with Jukes and Cantor correction), obtained by the permutation test with 1000 replicates

Jo u

rn

al

Pr

e-

pr

oo

f

Journal Pre-proof

Journal Pre-proof Names of Authors: Priyanka Sah1,

2

, Sangeeta Mandal1 , Rajeev K. Singh1 , Raj Kumar1 , Abhinav Pathak1 ,

Nimisha Dutta1 , J.K. Srivastava2 , Ved Prakash Saini3 , Kuldeep K Lal1 and Vindhya Mohindra1 Title of the manuscript: Genetic structure of natural populations of endangered Tor mahseer, Tor tor (Hamilton, 1822) inferred from two mitochondrial DNA markers Affiliations and addresses of the authors: 1 ICAR- National Bureau of Fish Genetic Resources (NBFGR), Lucknow, India. Amity Institute of Biotechnology, Amity University, Lucknow Campus, Lucknow, Uttar

f

2

3

oo

Pradesh, India.

Aquaculture Research and Seed Unit, Maharana Pratap University of Agriculture and

pr

Technology, Udaipur, Rajasthan, India

e-

Corresponding author:

Jo u

Conflict of interest:

rn

al

Pr

Vindhya Mohindra ICAR-National Bureau of Fish Genetic Resources, Canal Ring Road, PO Dilkusha, Lucknow, Uttar Pradesh, India [email protected], [email protected] Telephone number-+91 9415234371

The authors declare that there is no financial or non-financial conflict of interest.

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5