Exon skipping creates novel splice variants of DMC1 gene in ruminants

Exon skipping creates novel splice variants of DMC1 gene in ruminants

Accepted Manuscript Exon skipping creates novel splice variants of DMC1 gene in ruminants S. Ahlawat, Scientist, M. Chopra, L. Jaiswal, R. Sharma, R. ...

2MB Sizes 4 Downloads 72 Views

Accepted Manuscript Exon skipping creates novel splice variants of DMC1 gene in ruminants S. Ahlawat, Scientist, M. Chopra, L. Jaiswal, R. Sharma, R. Arora, B. Brahma, S.V. Lal, S. De PII:

S0890-8508(16)30021-4

DOI:

10.1016/j.mcp.2016.03.001

Reference:

YMCPR 1203

To appear in:

Molecular and Cellular Probes

Received Date: 31 July 2015 Revised Date:

2 March 2016

Accepted Date: 2 March 2016

Please cite this article as: Ahlawat S, Chopra M, Jaiswal L, Sharma R, Arora R, Brahma B, Lal SV, De S, Exon skipping creates novel splice variants of DMC1 gene in ruminants, Molecular and Cellular Probes (2016), doi: 10.1016/j.mcp.2016.03.001. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Exon skipping creates novel splice variants of DMC1 gene in ruminants

1

S. Ahlawata, M. Choprab, L. Jaiswalb, R. Sharmaa, R. Aroraa, B. Brahmac, S.V. Lalb, S. Deb

2 3

National Bureau of Animal Genetic Resources, Karnal, 132001, India

RI PT

a

4

b

National Dairy Research Institute, Karnal, 132001, India

5 c

6

Krishi Vigyan Kendra, Bhaderwah, SKUAST-Jammu, 180016, India

SC

7 8

M AN U

9 10 11 12 13 14

TE D

15 16 17 18

EP

19 20 21 *

23

Sonika Ahlawat

24

Scientist, Animal Biotechnology Division, NBAGR

25

E-mail address: [email protected]

26

Tel.: +919416161369; Fax: +91 184 2267654

AC C

22

Corresponding author:

27

1

ACCEPTED MANUSCRIPT

1

Abstract Disrupted meiotic cDNA1 (DMC1) recombinase plays a pivotal role in homology search and

3

strand exchange reactions during meiotic homologous recombination. In the present study, full

4

length coding sequence of DMC1 gene was sequence characterized for the first time from four

5

ruminant species (cattle, buffalo, sheep and goat) and phylogenetic relationship of ruminant

6

DMC1 with other eukaryotes was analyzed. DMC1 gene encodes a putative protein of 340 amino

7

acids in cattle, sheep and buffalo and 341 amino acids in goat. A high degree of evolutionary

8

conservation at both nucleotide and amino acid level was observed for the four ruminant

9

orthologs. In cattle and sheep, novel alternatively spliced mRNAs with skipping of exons 7 and 8

10

(Transcript variant 1, TV1) were isolated in addition to the full length (FL) transcript. Novel

11

transcript variants with partial skipping of exon 7 and complete skipping of exon 8 (Transcript

12

variant 2, TV2) were found in sheep and goat. The presence of these variants was validated by

13

amplifying cDNA isolated from testis tissue of ruminants using two oligonucleotides flanking

14

the deleted region. To accurately estimate their relative proportions, real-time PCR was

15

performed using primers specific for each variant. Expression level of DMC1-FL was

16

significantly higher than that of TV1 in cattle and TV2 in goat (P < 0.05). Relative ratio for

17

expression of DMC1-FL: TV1: TV2 in sheep was 6.78: 1.43: 1. In-silico analysis revealed

18

presence of splice variants of DMC1 gene across other mammalian species underpinning the role

19

of alternative splicing in functional innovation.

21

SC

M AN U

TE D

EP

AC C

20

RI PT

2

Keywords: DMC1; splice variants; exon skipping; ruminants; meiosis; recombination

22 23

2

ACCEPTED MANUSCRIPT

1. Introduction

2

Sexually reproducing organisms have a specialized developmental pathway for

3

gametogenesis in which diploid cells undergo meiosis to produce haploid germ cells. Meiotic

4

recombination is a highly conserved process that is critical for the accurate segregation of

5

homologous chromosomes to opposite poles at the first meiotic division [1]. Recombination

6

ensures that the damage to the homologs is repaired and from an evolutionary perspective, it

7

contributes to genetic variability of a species in order for it to withstand the pressure of

8

natural selection. Meiotic recombination is initiated by the introduction of programmed

9

double-strand breaks (DSBs) catalyzed by an evolutionarily conserved, meiosis-specific,

10

topoisomerase-like protein, SPO11 [2]. Resection of the 5’ ends of the breaks results in 3’

11

single stranded tails which are bound by two conserved RecA-like proteins, RAD51 and

12

DMC1 in many organisms, including yeast and mammals [3, 4]. RAD51/DMC1-coated

13

nucleofilaments promote homology search and strand exchange, to form joint molecule

14

intermediates, and are then displaced from synapsing chromosomes as the recombination

15

intermediates are processed [5]. DMC1 is known to be required for proficient double-strand

16

break repair as well [6, 7]. Subsequent processing of the resulting recombination

17

intermediates results in formation of either crossovers or non-crossovers [8]. RAD51 is

18

required for homologous recombination pathways in both mitotic and meiotic cells [9]. In

19

contrast, the other recombinase, DMC1, is a meiosis-specific protein [3]. RAD51 and DMC1

20

have been shown to co-localize in side-by-side foci on meiotic chromatin, suggesting their

21

cooperative action during meiotic recombination [10]. Recent study by Brown et al. [11]

22

provided further insights into localization of RAD51 and DMC1 and proposed that these

AC C

EP

TE D

M AN U

SC

RI PT

1

3

ACCEPTED MANUSCRIPT

1

proteins co-occupy the 3′ termini of most double-strand breaks and the two ends of a DSB

2

are separated by distances of up to 400 nm. DMC1 gene was originally identified in Saccharomyces cerevisiae by using a screen for

4

meiosis-specific prophase-induced genes that cause a meiotic defect when disrupted [3]. That is

5

how its name "disrupted meiotic cDNA" or DMC1 originated. Subsequently it was isolated from

6

higher eukaryotes, such as humans and mice [12]. Importance of DMC1 in homologous

7

recombination in meiosis can be appreciated from the fact that targeted mutation of this gene

8

results in sterility in mice and the signs of poorly repaired double-strand breaks are apparent in

9

reproductive cells [6].

M AN U

SC

RI PT

3

Mutations in S. cerevisiae DMC1 lead to the accumulation of resected DSBs and prevent

11

complete meiosis in some strains [3]. However, in fission yeast, Schizosaccharomyces pombe,

12

deletion of DMC1 causes only a moderate reduction in crossing over and a slight reduction in

13

fertility, suggesting that RAD51 can partially substitute for DMC1 in inter-homolog

14

recombination [13]. Several organisms such as Drosophila melanogaster, Caenorhabditis

15

elegans, Neurospora crassa and Sordaria macrospora do not possess a DMC1 ortholog [14].

16

Why such an apparently important protein is absent in these organisms remains elusive.

EP

TE D

10

Meiotic recombination has several unique features that distinguish it from the

18

homologous recombination that occurs during mitosis. These include the strong preference for

19

inter-homolog recombination, and the coordination of recombination with meiosis specific,

20

higher-order chromosome structures, such as the synaptonemal complex which mediates the

21

stable pairing (synapsis) of homologous chromosomes during prophase I [15]. Role of DMC1

22

for these unique aspects of meiotic recombination has long been speculated.

AC C

17

4

ACCEPTED MANUSCRIPT

DMC1 gene has been identified and characterized in humans, mice, some aquatic

2

animals, crustaceans and chinese crab [16]. DMC1 orthologs have also been identified in plants

3

such as rice, maize and Arabidopsis [17]. However, there are no reports of characterization of

4

this gene in mammalian species other than humans and mice. Keeping in view the significance of

5

this gene in meiosis and homologous recombination, the present study aimed to clone and

6

sequence characterize full length coding sequence of meiotic recombinase DMC1 from four

7

ruminant species (cattle, buffalo, sheep and goat) and to analyze the diversity and phylogenetic

8

relationship of ruminant DMC1 with other eukaryotic species.

9 10 11

2. Material and Methods 2.1 Sample collection

M AN U

SC

RI PT

1

Testis tissue samples of cattle (Bos indicus), buffalo (Bubalus bubalis), sheep (Ovis aries)

13

and goat (Capra hircus) were collected from the Gazipur slaughter house, New Delhi, India,

14

with the help of an on-site veterinary officer. The fresh tissue samples were dispensed in

15

RNAlater (Sigma-Aldrich) and stored in deep freeze (-80 ͦC) until further use.

18

EP

17

2.2 RNA isolation and cDNA preparation Total RNA was isolated from tissue samples using TRIzol reagent (Sigma-Aldrich) and

AC C

16

TE D

12

19

treated with DNAse I (Fermentas) following manufacturer’s instructions. RNA integrity was

20

assessed by loading 2 ul RNA sample on 1.5 % agarose gel. The concentration of isolated RNA

21

was determined by measuring optical density at 260 nm using NanoDrop 1000

22

Spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA). About 1 ug of total

23

mRNA was reverse transcribed with Superscript III cDNA synthesis kit (Invitrogen Canada Inc.)

5

ACCEPTED MANUSCRIPT

1

using hexamer primers according to manufacturer’s instructions and cDNA was stored at -80°C

2

for further use.

3

2.3 RT-PCR and cloning

RI PT

4

The full length coding sequence of DMC1 gene was amplified from cDNA of testicular

6

tissue of cattle, buffalo, sheep and goat. The primers for this study were designed manually using

7

the conserved sequences from Bos taurus mRNA sequence (NM_001191338). Primer sequences

8

used for amplification are as follows: F1- ATGAAGGAGGATCAAGTTGTGCTGGAAGA and

9

R1- CTTCAGCTCCTAATAAGCACTAAGAAGCA. PCR was carried out on an Veriti 96 well

10

thermal cycler (Applied Biosystems) in 25 µL reaction mixture containing 2.5 µL of 10X Taq

11

reaction buffer, 0.5 µl of 10mM dNTP mix (Fermentas), 0.5 µl of 10µM for each primer and 0.25

12

units of Taq DNA polymerase (Sigma). The PCR reaction cycle was accomplished by

13

denaturation for 3 min at 94°C; 30 cycles of denaturation at 94°C for 30 sec, annealing at 57°C

14

for 30 sec, extension step at 72°C for 30 sec, with a final extension at 72°C for 5 min.

15

products were electrophoresed alongside DNA molecular weight marker in 1.5% agarose gel and

16

then stained with ethidium bromide. The amplified fragments were gel purified using PureLink

17

Quick Gel Extraction Kit (Invitrogen Canada Inc.) and subcloned into pTZ57R/T vector

18

(Thermo scientific). Recombinant clones were selected and plasmid DNA was extracted.

19

Multiple clones were sequenced using the Sanger sequencing to obtain the coding DNA

20

sequence of DMC1 in the four ruminant species.

22

M AN U

TE D

PCR

EP

AC C

21

SC

5

2.4 Real-time PCR

6

ACCEPTED MANUSCRIPT

Quantitative RT-PCR was carried out to determine the relative mRNA expression of DMC1

2

splice variants in testis tissues (3 each) of cattle, sheep and goat using primers described in Table

3

1. Housekeeping genes GAPDH and RPS15 were included as internal controls. Equal amount of

4

RNA (quantified using NanoDrop spectrophotometer) was used for cDNA preparation

5

(Superscript III cDNA synthesis kit; Invitrogen). All qRT-PCR reactions were carried out on

6

Light Cycler 480 II Real-Time PCR machine (Roche Diagnostics, USA). Each 10 µl reaction

7

mixture consisted of 2 µl cDNA template, 5 µl of 2X SYBR Green PCR Master Mix, 0.25 µl

8

each of forward and reverse primers (10 pmol/µl) and nuclease free water. All samples were run

9

in duplicate. Analysis of real-time PCR (qRT-PCR) was performed by delta-delta-Ct (∆∆Ct)

SC

M AN U

10

RI PT

1

method [18].

12

2.4 In silico analysis

13

2.4.1 Sequence retrieval

TE D

11

A comprehensive search of the sequence database on the NCBI website was carried out in

15

order to find and compare DMC1 gene orthologs among different species. Briefly, BLASTp [19]

16

search was performed using protein sequence corresponding to DMC1 in our study to retrieve

17

similar

18

(http://smart.emblheidelberg.de) and Pfam (pfam.sanger.ac.uk/search) were used to predict the

19

functional domains of the DMC1 protein.

21 22 23

sequences

from

other

eukaryotes

(yeast,

plants

and

animals).

SMART

AC C

20

EP

14

2.4.2 Evolutionary analysis Multiple

protein

sequence

alignment

was

performed

using

MAFFT

(http://mafft.cbrc.jp/alignment/server/) and Bioedit. Phylogenetic analysis was performed using 7

ACCEPTED MANUSCRIPT

Neighbour Joining and Maximum Likelihood algorithm with MEGA 6.0 package and in each

2

case, branch confidence value was calculated with bootstrapping with 1000 iterations using

3

‘‘pairwise deletion option’’ of amino acid sequences with gamma parameters. Here, the Jones,

4

Taylor and Thorton (JTT) model for amino acid sequences and gamma parameters was used.

RI PT

1

5 6

2.4.3 Protein tertiary structure prediction

To predict the DMC1 protein structure in the four ruminant species, comparative

8

modeling was performed using Modeller 9.15 (https://salilab.org/modeller/about_modeller.html)

9

and PyMol was used for structure visualization. To identify the target structure, BlastP was

10

performed with PDB database and protein structure with PDB ID 1V5W was taken as target

11

structure for modeling. Protein model was generated through many iterations by Modeller, which

12

produces homology models by satisfaction of spatial restraints.

13

prepared for each of the ruminant species. DOPE (Discrete Optimized Protein Energy) score

14

was calculated for all structures. Lower the energy score, higher is the stability of modeled

15

structure. The models returning to the minimum molpdfs and minimum DOPE score were

16

chosen as best probable structures. Z-score was calculated using ProSA-web server to recognize

17

the errors in experimental and theoretical models of the protein. The degree of similarity of two

18

protein structures was measured with the RMSD (Root-Mean-Square Distance).

20

M AN U

EP

TE D

Five protein models were

AC C

19

SC

7

3. Results

21

The complete open reading frame of DMC1 gene was amplified by RT-PCR from mRNA

22

isolated from testis tissue of cattle, buffalo, sheep and goat. The amplified products were

23

sequenced in both directions after cloning in pTZ57R/T vector and the DNA sequences from 8

ACCEPTED MANUSCRIPT

these four species were submitted to NCBI and are available under accession numbers:

2

KR935226-KR935230, KT318875-KT318877. The complete open reading frame of DMC1 gene

3

in cattle, buffalo and sheep was observed to be 1023 nucleotides in length encoding a putative

4

protein of 340 amino acids. In goats, the complete ORF was found to be 1026 nucleotides in

5

length encoding a putative protein of 341 amino acids. Comparative analysis of cDNA sequence

6

of four ruminant orthologs revealed limited sequence divergence at both the nucleotide and the

7

amino acid level indicating a high degree of evolutionary conservation. The sequence identity at

8

nucleotide and amino acid level between cattle and buffalo was 97.3% whereas between sheep

9

and goat, it was 95.3%. The homology of cattle and buffalo DMC1 amino acid sequence was

10

more with that of sheep as compared to goat. In cattle and sheep, novel alternatively spliced

11

mRNAs of 858 nucleotides were isolated in addition to the full length (FL) transcript.

12

Interestingly, novel transcript variants of 897 nucleotides were found in sheep and goat (Fig. 1).

13

However, in case of buffalo only full length transcript was observed even by increasing the

14

sample size to three.

SC

M AN U

TE D

15

RI PT

1

Evolutionary analysis of ruminant DMC1 with other animals, humans, fish, plants and yeast clearly separated Saccharomyces cerevisiae (yeast) gene from all other sequences (Fig. 2).

17

Plants such as Arabidopsis thaliana and Solanum lycopersicum formed a separate cluster which

18

was quite distinct from all other animal species. As is expected, Mus musculus and Rattus

19

norvegicus clustered together with 75% probability and Homo sapiens and Pan troglodytes with

20

84% probability. All the mammals were clearly separated from Danio rerio (Zebra fish),

21

Xenopus tropicalis (frog) and Gallus gallus (Junglefowl) with high confidence values. All the

22

ruminants (our sequences as well as sequences retrieved from NCBI) clustered together

23

indicating high sequence similarity.

AC C

EP

16

9

ACCEPTED MANUSCRIPT

Functional feature prediction by SMART and PDBsum revealed significant identity of cattle,

2

buffalo, sheep and goat DMC1 protein with humans as well as other animal species. Multiple

3

sequence alignment of the ruminant DMC1 protein with other orthologs also revealed significant

4

sequence homology (Fig. 3). DMC1 protein in humans and mice typically has 2 domains:

5

Domain I (amino acid residues 27-95) and domain II (amino acid residues 96-319) [12]. We also

6

observed these domains to be present in the DMC1 protein of ruminants. Domain I has a helix-

7

hairpin-helix (HhH) motif of 54 amino acids and the sequence of this motif was found to be

8

conserved in all orthologs. The domain II has Walker A and Walker B motifs which interact with

9

single and double stranded DNAs respectively. The sequence of these motifs were also found to

10

be highly conserved in all DMC1 homologs except for one amino acid change in Walker A motif

11

in Ovis aries sequences isolated in this study. The C-terminal region of DMC1 protein includes

12

amino acid residues 319-340 and its sequence was conserved in all DMC1 orthologs. Sequence

13

analysis of the full length transcripts and the two novel splice variants revealed that the transcript

14

variant of 858 nucletides identified in cattle and sheep had a deletion of amino acids at residues

15

142-196. This deletion lies in the domain II region of DMC1 protein. This deletion also altered

16

the junctional amino acids in the putative protein. The junctional amino acid at the start of the

17

deletion was glycine instead of valine at position 142 and serine at the end of the deletion at

18

position 196 was also replaced by glycine. As far as the second splice variant of 897 nucleotides

19

is concerned, there is deletion of residues 154-196 within the domain II region of DMC1 protein.

20

Here again, the junctional amino acid at position 196 is glycine in place of serine.

21

DMC1 gene in cattle consists of 13 exons (AC_00162) whereas in other three ruminant species

22

i.e. buffalo (NW_005785436), sheep (NC_019460) and goat (NC_022297), there are 14 exons,

23

out of which 13 code for functional protein. Comparison of the sequence of transcripts with

AC C

EP

TE D

M AN U

SC

RI PT

1

10

ACCEPTED MANUSCRIPT

1

DMC1 gene sequence in the analyzed species indicated that there is skipping of coding exons 7

2

and 8 in transcript variant 1 (TV1) and partial skipping of exon 7 and complete skipping of exon

3

8 in transcript variant 2 (TV2) (Fig. 4). To confirm the presence and location of different spliced forms of mRNAs, two

5

oligonucleotides flanking the deleted region were used to amplify cDNA isolated from testis

6

tissue of ruminants such that the full length transcript would yield a product of 326 bp and the

7

splice variants 1 and 2 would produce products of 161 and 200 bp respectively. The sequences of

8

the primers employed were: F2: TAAGTTGCTAGGAGGTGGAATTGAAAGCAT and R2:

9

TAAAGATGCCAGCTTCTTCATGGAACTTTGC. The position of these primers is depicted in

10

Fig. 4. We observed amplified products of expected sizes from the different transcripts of DMC1

11

gene in cattle, buffalo, sheep and goat. Sequencing of the products also confirmed that they were

12

indeed the different splice variants of DMC1 and that there is loss of two exons in the variants

13

(Fig. 5).

TE D

M AN U

SC

RI PT

4

Expression patterns of full length transcript and different splice variants were assessed by

15

real-time PCR in testis tissue of cattle, sheep and goat using primers described in Table 1. The

16

results revealed that the expression level of DMC1-FL was significantly higher than that of TV1

17

in cattle and TV2 in goat (P < 0.05). In sheep, DMC1-FL was most abundantly expressed

18

followed by TV1 and TV2 (DMC1 FL: TV1: TV2 = 6.78: 1.43: 1) (Fig. 6).

AC C

19

EP

14

The tertiary structure for the different variants of DMC1 protein was predicted for the four

20

ruminant species using Modeller 9.15 as the structure is not yet available in the PDB database.

21

Errors in experimental and theoretical models of the predicted protein were recognized by

22

calculating the z-score using ProSA-web server. The structure for DMC1 protein could be reliably

23

predicted with satisfactory Z score and the protein was found to contain Walker A, Walker B and

11

ACCEPTED MANUSCRIPT

HhH motifs that are peculiar features of this protein. We found that tertiary structure of full length

2

sequences in cattle, buffalo and sheep had 9 α-helices and 8 β-sheets whereas goat protein

3

structure was found to contain 9 α-helices and 7 β-sheets because of difference in sequence at the

4

C terminal end. Transcript variants revealed loss of some α-helices and β-sheets. Transcript

5

variant 1 had 7α-helices and 6 β-sheets whereas transcript variant 2 had 6 α-helices and 7 β-

6

sheets. We found that few residues in ATP binding sites {128 (F), 132(K), 133(T), 158 (F),

7

160(D), 162(E), 223(D)} and multimer BRC sites {157 (I), 159 (F) , 167(P), 170(L), 185 (L) ,

8

188(V), 189 (L), 191(A), 202 (L), 203 (L), 204(D), 205(Y), 206(V), 207(A), 209(K), 251(S) , 254

9

(Q), 255(K), 258(E), 259 (E)} were absent in both the variants because of deletion of 2 exons in

10

the crucial domain II region of the protein (Supplement Fig. S1-S4). Both template and query

11

structures were superimposed to calculate the similarity of target and template structure.

12

Complete protein sequence had approximately 92% and partial sequences had 79% residues

13

similarity, suggesting reliable 3D structure. Hence, the representative structure verified by

14

PDBsum, PROCHEK, ProSA z-score was satisfactory and can thus be considered as a reliable

15

source for further analyses. DOPE score, z score and Molpdf for different variants are depicted in

16

Table 2.

18 19

EP

AC C

17

TE D

M AN U

SC

RI PT

1

4. Discussion

Homologous recombination (HR) is an indispensable process for the maintenance of

20

genomic integrity and for creation of genetic diversity [9]. During HR reaction, the double strand

21

break ends introduced by the SPO11 complex undergo 5′ strand resection to generate 3′ single

22

stranded DNA tails that are bound by either of the two conserved recombinases, DMC1 and

23

RAD51. Both RAD51 and DMC1 are homologs of Escherichia coli RecA gene which is among

12

ACCEPTED MANUSCRIPT

the most studied recombination gene encoding a DNA-dependent ATPase that binds to single-

2

stranded DNA and promotes strand invasion and exchange between homologous DNA

3

molecules [4]. RAD51 and DMC1 are composed of two domains: the highly conserved ATPase

4

domain, which makes up the core oligomeric structure, and the N-terminal domain (NTD), which

5

is not conserved in the bacterial RecA protein. The ATPase domain is the core catalytic domain

6

important for DNA binding and ATP hydrolysis [20]. DNA binding is mediated by N-terminal

7

domain in RAD51 and DMC1 and by C-terminal domain in RecA. Specifically, it is the helix–

8

hairpin–helix (HhH) motif in the NTDs of RAD51 and DMC1 which mediates dsDNA binding

9

[20, 21]. Two oligomeric DNA binding forms, an octameric ring and a helical filament have

SC

M AN U

10

RI PT

1

been reported for DMC1 whereas the binding form in case of RAD51 is the helical filament. In the present study, open reading frame of DMC1 gene was characterized for the first

12

time in four ruminant species (cattle, buffalo, sheep and goat). The ORF of DMC1 was predicted

13

to encode a putative protein of 340 amino acids in cattle, buffalo and sheep and 341 amino acids

14

in goats. Two novel transcript variants of 858 and 897 nucleotides were also identified in these

15

ruminants which encode proteins of 285 and 298 amino acids respectively. Transcript variant 1

16

(858 nucleotides) which was observed in cattle and sheep has previously been reported in

17

humans and mice [12] whereas the other variant (897 nucleotides) which was isolated in sheep

18

and goat has been identified for the first time in mammals. Another study [7] also confirmed

19

alternative splicing of DMC1 mRNAs in mice where targeted disruption of DMC1 gene resulted

20

in an arrest of meiosis of germ cells in both sexes. Western blotting was performed using rabbit

21

anti-DMC1 antibody on the nuclear extracts of wild type and DMC1 deficient mouse testis. The

22

nuclear extracts from wild-type and heterozygous mouse testes revealed two bands

23

corresponding to full length DMC1 and DMC1-D (a product from alternatively spliced allele)

AC C

EP

TE D

11

13

ACCEPTED MANUSCRIPT

1

proteins, respectively as had been previously reported [12]. In contrast, the nuclear extract from

2

the DMC1-deficient mouse testis did not show these bands. In silico analysis revealed that alternatively spliced variants with skipping of coding

4

exons 7 and 8 are predicted in a wide range of mammalian species such as Bos mutus, Pan

5

troglodytes and Pan paniscus in addition to those reported in Homo sapiens and Mus musculus.

6

Splice variants with skipping of some other exons have also been predicted in other species as

7

shown in Table 3. However, in some species such as Rattus norvegicus, Equus asinus and Canis

8

lupus familaris, there is no report available so far on alternate splicing. By real time PCR, we

9

found that mRNA expression levels of DMC1-FL were significantly higher than TV1 and TV2

10

in cattle, sheep and goat suggesting that DMC1-FL is the most abundantly expressed isoform. It

11

is worth emphasizing that evolutionary conservation of splice variants of DMC1 gene across

12

different taxa possibly reinforces a major physiological role for these isoforms since alternative

13

splicing is a major mechanism for augmentation of transcriptome and proteome diversity [22].

TE D

M AN U

SC

RI PT

3

Sequence analysis revealed that all the variants of DMC1 identified in the present study

15

have intact N-terminal domain with highly conserved HhH motif. Researchers in the past have

16

reported that high flexibility or multiple conformations of DMC1 are because of the N-terminal

17

domain [20]. It has been experimentally validated that deletion mutant of DMC1 which lacks N-

18

terminal 81 amino acid residues is completely defective in both single-stranded and double-

19

stranded DNA binding activities [20]. Since the variants identified in the present study have

20

intact N-terminal domain, it indicates that DNA binding activity is probably retained in the splice

21

variants. DMC1 transcript variant 1 does not contain the region at amino acids positions 142-196

22

and transcript variant 2 lacks region at amino acids positions 154-196 of ruminant DMC1. These

23

deletions lie in the critical domain II region that is conserved in the RecA-like gene family. Also

AC C

EP

14

14

ACCEPTED MANUSCRIPT

the deleted region locates close to Walker A and Walker B motifs which are necessary for ATP

2

binding, therefore the DMC1 splice variants (1 and 2) may have a low binding affinity for

3

nucleotides and/or low ATPase activity. Domain II region also has multimer BRC interface that

4

binds to BRCA2 tumour suppressor protein which is involved in maintaining the integrity of the

5

genome by participating in error-free double-strand break repair [23]. BRCA2 is a universal

6

regulator of RAD51/DMC1 recombinase actions. In the novel splice variants identified in the

7

present study, some of the residues which are part of the multimer BRC interface have been

8

deleted. So it is logical to speculate that these variants may have reduced binding affinity for

9

BRCA2.

M AN U

SC

RI PT

1

Amino acid residues from 319-340 comprise the C-terminal region of DMC1 protein

11

which was found to be conserved in all transcript variants of DMC1. Sequence conservation of

12

this region in all eukaryotic DMC1 orthologs suggests that this region may be important in

13

recombination by interacting with other components of the recombination machinery, one of

14

them being the tumor suppressor protein p53. It is a multifunctional molecule that regulates

15

meiotic cell-cycle and DNA repair. It binds to C-terminal domain of mammalian DMC1 and is

16

involved in homologous recombination and maintenance of genomic stability during meiosis

17

[24]. Therefore probably the novel transcript variants retain the ability to interact with other

18

proteins.

EP

AC C

19

TE D

10

It is interesting and intriguing to note that since the transcript variants are conserved in

20

different mammalian species such as humans, mice [12], cattle, sheep and goat (our study) and

21

also predicted in other animal species, it implies that these variants with the deletion in domain II

22

region which harbors the ATPase domain might have novel roles to play in recombination.

23

Kowalczykowski and Krupp [25] reported that ATPase activity is not required for the strand

15

ACCEPTED MANUSCRIPT

exchange reaction with bacterial RecA. On similar lines, DMC1 splice variants may have

2

attenuated ATPase activity but could still be capable of strand exchange. Put together, these

3

observations suggest that the novel transcript variants of DMC1 with intact N-terminal and C-

4

terminal domains represent proteins with diverse biological functions or with altered functional

5

activities and represent an interesting area to explore for researchers in future.

RI PT

1

6

8

SC

7

5. Conclusion

The present study for the first time determined the full length coding sequence of DMC1 gene in

10

four ruminant species (buffalo, cattle, sheep and goat). Evolutionary analysis between the

11

different orthologs suggested that these proteins are conserved from yeast to humans. Two novel

12

transcript variants with deletion in the ATPase domain were also characterized for the first time

13

in ruminants. Since these transcripts are also observed in other mammalian species such as

14

humans and mice and predicted in some other species, it indicates that they are evolutionarily

15

conserved and may have novel/diverse biological functions in recombination.

TE D

M AN U

9

EP

16

Conflict of interest statement

18

The authors declare that they have no competing interests.

19

AC C

17

20

Acknowledgements

21

This study was supported by the Indian Council of Agricultural Research, New Delhi, India.

22

23 16

ACCEPTED MANUSCRIPT

1

References [1] Henry JM, Camahort R, Rice DA, Florens L, Swanson SK, Washburn MP, et al.

3

Mnd1/Hop2 Facilitates DMC1-Dependent Interhomolog Crossover Formation in Meiosis

4

of Budding Yeast. Mol Cell Biol 2006; 26(8):2913–2923.

5

[2] Keeney S. SPO11 and the formation of DNA double-strand breaks in meiosis.

6

Genome Dynam Stabil 2008; 2:81–123.

7

[3] Bishop DK, Park D, Xu L, Kleckner N. DMC1: a meiosis-specific yeast homolog of

8

E. coli RecA required for recombination, synaptonemal complex formation, and cell cycle

9

progression. Cell 1992; 69:439–456.

M AN U

SC

RI PT

2

10

[4] Shinohara A, Ogawa H, Ogawa T. RAD51 protein involved in repair and

11

recombination in S. cerevisiae is a RecA-like protein. Cell 1992; 69:457–470.

12

[5] Hong EL, Shinohara A,

13

promotes renaturation of single-strand DNA (ssDNA) and assimilation of ssDNA into

14

homologous super-coiled duplex DNA. J Biol Chem 2001; 276:41906–41912.

15

[6] Pittman DL, Cobb J, Schimenti KJ, Wilson LA, Cooper DM, Brignull E, et al.

16

Meiotic prophase arrest with failure of chromosome synapsis in mice deficient for

17

DMC1, a germ line specific RecA homolog. Mol Cell 1998; 1:697–705.

19 20

TE D

EP

AC C

18

Bishop DK. Saccharomyces cerevisiae DMC1 protein

[7] Yoshida K, Kondoh G, Matsuda Y, Habu T, Nishimune Y, Morita T. The mouse RecA-like gene DMC1 is required for homologous chromosome synapsis during meiosis. Mol Cell 1998; 1:707–718.

21

[8] de Massy B. Initiation of meiotic recombination: how and where? Annu Rev Genet

22

2013; http://dx.doi. org/ 10.1146/annurev-genet-110711-155423.

17

ACCEPTED MANUSCRIPT

[9] Krogh BO, Symington LS. Recombination proteins in yeast. Annu Rev Genet 2004;

2

38:233–271.

3

[10] Bishop DK. RecA homologs DMC1 and RAD51 interact to form multiple nuclear

4

complexes prior to meiotic chromosome synapsis. Cell 1994; 79:1081–1092.

5

[11] Brown MS, Grubb J, Zhang A, Rust MJ, Bishop DK. Small RAD51 and DMC1

6

complexes often co-occupy both ends of a meiotic DNA double strand break. PLoS

7

Genet 2015; 11(12): e1005653.

8

[12] Habu T, Taki T, West A, Nishimune Y, Morita T. The mouse and human homologs

9

of DMC1, the yeast meiosis-specific homologous recombination gene, have a common

M AN U

SC

RI PT

1

unique form of exon-skipped transcript in meiosis. Nucleic Acid Res 1996; 24:470–477.

11

[13] Grishchuk AL, Kohli J. Five RecA-like proteins of Schizosaccharomyces pombe are

12

involved in meiotic recombination. Genetics 2003; 165:1031–1043.

13

[14] Neale MJ, Keeney S. Clarifying the mechanics of DNA strand exchange in meiotic

14

recombination. Nature 2006; 442:153–158.

15

[15] Roeder GS. Meiotic chromosomes: it takes two to tango. Genes Dev 1997; 11:2600–

16

2621.

17

[16] Liu Y, Cui Z. Molecular Cloning and Characterization of DMC1 from the Chinese

18

Mitten Crab (Eriocheir sinensis). Int J Aquacult Fish Sci 2015; 1(1):024–029.

20

EP

AC C

19

TE D

10

[17] Etedali F, Baghban Kohnehrouz B, Valizadeh M, Gholizadeh A, Malboobi MA. Genome wide cloning of maize meiotic recombinase DMC1 and its functional structure

21

through molecular phylogeny. Genet Mol Res 2011; 10(3):1636–1649.

22

[18] Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time

23

quantitative PCR and the 2−∆∆Ct method. Methods 2001; 25:402–408.

18

ACCEPTED MANUSCRIPT

[19] Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment

2

search tool. J Mol Biol 1990; 215(3):403–10.

3

[20] Kinebuchi T, Kagawa W, Kurumizaka H, Yokoyama S. Role of the N-terminal

4

Domain of the Human DMC1 Protein in Octamer Formation and DNA Binding. J Biol

5

Chem 2005; 280(31):28382–28387.

6

[21] Aihara H, Ito Y, Kurumizaka H, Yokoyama S, Shibata T. The N-terminal domain of

7

the human RAD51 protein binds DNA: structure and a DNA binding surface as revealed

8

by NMR. J Mol Biol 1999; 290:495–504.

9

[22] Keren H, Lev-Maor G, Ast G. Alternative splicing and evolution: diversification,

M AN U

SC

RI PT

1

exon definition and function. Nat Rev Genet 2010; 11:345–355.

11

[23] Venkitaraman AR. Cancer susceptibility and the functions of BRCA1 and BRCA2.

12

Cell 2002; 108:171–182.

13

[24] Habu T, Wakabayashi N, Yoshida K, Yomogida K, Nishimune Y, Morita T. p53

14

protein interacts specifically with the meiosis-specific mammalian RecA-like protein

15

DMC1 in meiosis. Carcinogenesis 2004; 25(6):889–893.

16

TE D

10

17

EP

[25] Kowalczykowski SC, Krupp RA. DNA-strand exchange promoted by RecA protein in the absence of ATP: implications for the mechanism of energy transduction in proteinpromoted nucleic acid transactions. Proc Natl Acad Sci U S A 1995; 92(8):3478-3482.

AC C

18

19

ACCEPTED MANUSCRIPT

Table 1. Primers for Real-time PCR of DMC1 variants in ruminants Variant

Primer sequence

Product size

RI PT

(bp)

Wild type

F- ACAGAAGCTTTTGGAGAATTTCGAACTGG

151

F- TTTCCATATCACCACTGGGAGCCAGGA

152

cattle and sheep

R- CAAGTAGCTCCATCTGATGTTCACCACAG

Transcript variant 2:

F- GCTAGGAGGTGGAATTGAAAGCATGGC

152

sheep and goat

R- CAAGTAGCTCCATCTGATGTTCACCTCCT

AC C

EP

TE D

M AN U

Transcript variant 1:

SC

R- CCGAAGGCGATCTGGACGGAAAGTATT

ACCEPTED MANUSCRIPT

Table 2. Energy minimization score using different methods Molpdf

DOPE score

z-score

Cattle

2292.52222

28590.24805

-4.58

Cattle_TV1

2240.96851

RI PT

Structure

-23632.13086

-3.13

2093.68140

-29477.53125

-4.81

Sheep

2181.68262

-28506.82227

-4.07

Sheep_TV1

2018.29260

-21557.39453

-3.03

Sheep_TV2

2083.81421

-24983.40039

-3.35

Goat

2066.36719

-28991.46680

-4.49

Goat_TV2

2008.12634

-22084.85156

-3.95

AC C

EP

TE D

M AN U

SC

Buffalo

ACCEPTED MANUSCRIPT

Table 3. Alternative splicing of DMC1 gene in different species Accession

Total exon

Coding

Skipped

number

count

exon count

coding exon

RI PT

Species

in variant

Bos taurus

NM_001191338

13

Bos indicus_NDRI

KT318875

13

Bos indicus_NDRI_TV

KR935227

11

Bubalus bubalis

XM_006071328

14

Bubalus bubalis_NDRI

KR935226

13

13

Ovis aries

XM_012175647

14

13

Ovis aries_NDRI

KT318876

13

13

Ovis aries_NDRI_TV1

KR935228

11

11

Exons 7 and 8

12

12

Partial exon 7

Capra hircus

SC

13

M AN U

TE D KT318877

EP

Ovis aries_NDRI_TV2

13

11

13

and complete exon 8

XM_005681072

14

13

XM_005681073

11

11

Capra hircus_NDRI

KR935229

13

13

Capra hircus_NDRI_TV

KR935230

12

12

AC C

Capra hircus_TV

Exons 7 and 8

Exons 7 and 8

Partial exon 7 and complete exon 8

Bos mutus

XM_005899439

14

13

ACCEPTED MANUSCRIPT

XM_005899440

11

11

Homo sapiens

NM_007068

14

13

Homo sapiens_TV

NM_001278208

11

11

Mus musculus

NM_010059

14

13

Mus musculus_TV1

NM_001278226

12

Mus musculus_TV2

NR_103477

9

Sus scrofa

XM_005655492

14

Sus scrofa_TV

XM_005655493

12

Exons 7 and 8

Exons 7 and 8

RI PT

Bos mutus_TV

11

Exons 7 and 8

9

Exons 6-9, 11

M AN U

SC

13 11

Exons 12 and 13

XM_009438403

14

13

Pan troglodytes_TV1

XM_009438407

13

12

Exon 3

Pan troglodytes_TV2

XM_009438408

12

11

Exons 7 and 8

Pan paniscus

XM_003821593

14

13

XM_008975092

13

12

Exon 6

XM_008975093

12

11

Exons 7 and 8

XM_002914548

14

13

Pan paniscus_TV1 Pan paniscus_TV2

EP

Ailuropoda melanoleuca

TE D

Pan troglodytes

XM_011218836

13

12

Equus asinus

XM_014851322.1

13

13

Rattus norvegicus

NM_001130567.1

15

13

Canis lupus familaris

XM_005625878

14

13

AC C

Ailuropoda melanoleuca _TV

Exon 3

Lane 1: DMC1-FL (1023 bp) and TV1 (858 bp) in cattle

M AN U

Lane 2: DMC1-FL in buffalo

SC

Fig. 1. Gel image of RT-PCR product of DMC1 gene in ruminants

RI PT

ACCEPTED MANUSCRIPT

Lane 3: DMC1-FL (1026 bp), TV2 (897 bp) in goat

Lane 4: DMC1-FL, TV2 (897 bp) and TV1 (858 bp) in sheep

AC C

EP

TE D

M: 100 bp DNA marker, bright bands correspond to 500 bp and 1000 bp

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig. 2. Phylogenetic tree of DMC1 in eukaryotes constructed by neighbour joining method

AC C

(Poission correction with gamma parameters)

ACCEPTED MANUSCRIPT HhH motif

RI PT

MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLPEAKADKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTEFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTEFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEVKVDKIKEAANKHIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLPEAKADKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTEFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLAAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKQKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEDESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVAEEPGFQDEEESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVAEEPGFQDEEESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVQEESGFQDDEESLFQDIDLLQKHGINMADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVEKIKEAANKLIEPGFLTAFQYSERRKMV MK--EDQVVQEESGFQDDEESLFQDIDLLQKHGINMADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVEKIKEAANKLIEPGFLTAFQYSERRKMV MKVLEEQTLEEESGYQDDEESFFQDIELLQKHGINVADIKKLKSVGICTVKGIQMTTRRALCNIKGLSEAKVDKIKEAAGKLLTCGFQTASEYCIKRKQV MK--EDQVVQEESGFQDEEESLFQDIDLLQKHGINMADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVEKIKEAANKLIEPGFLTAFQYSEKRKMV MKAMEDQVVQEESGYHDDEESFFQDIDLLQKHGINVADIKKLKSVGICTIKGVQMTTRRALCNVKGLSEVKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVLEEPGFQDEEESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MK--EDQVVVEEPGFQDEEESLFQDIDLLQKHGINVADIKKLKSVGICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMV MKSMEDQVVEEDVGFHDE-ESFFHDIEMLQKQGINVADIKKLKSVGICTIKGIQMTTRKALCNIKGLSEAKVEKIKEAANKVIEPGFLTAFEYSAKRRMV

Walker A

TE D

M AN U

FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHAPCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGGFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGGFRTGKTQLSHTLCVTAQLPGAGGYSG------------------------------------------GEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIYTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHAPCG-------------------------------------------------------EH FHITTGSQEFDKLLGGGIESMAITEAFGGFRTGKTQLSHTLC-------------------------------------------------------GEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSG------------------------------------------GEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLC-------------------------------------------------------GEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLC-------------------------------------------------------GEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHNAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLC-------------------------------------------------------GEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCG-------------------------------------------------------EH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYPGGKIIFIDTENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCG-------------------------------------------------------EH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGTGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHEAVLDNVLYARAYTSEH FHITTGSLEFDKLLGGGVESMAITEAFGEFRTGKTQLSHTLCVTAQLPGEYGYTGGKVIFIDTENTFRPERLKDIADRFNVDHEAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGADGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGPKGYTGGKIIFIDTENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYSGGKIIFIDTENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEH FHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYPGGKIIFIDTENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEH FHISTGSQEFDKLLGGGIESMAITETFGEFRTGKTQLAHTLCVTAQLPGPNGYTGGKIIFIDTENTFRPDRLHDIADRFSVDHDAVLDNVLYARAYTSEH

EP

B.indicus_NDRI O.aries_NDRI O.aries_NDRI_TV2 B.Bubalis_NDRI B.indicus_NDRI_TV1 O.aries_NDRI_TV1 C.hircus_NDRI C.hircus_NDRI_TV2 B.taurus B.bubalis B.mutus B.mutus_TV C.hircus C.hircus_TV O.aries O.aries_TV H.sapiens_TV H_sapiens M.musculus_TV M.musculus D.rerio R.norvegicus G.gallus C.L.familiaris P.troglodytes X.tropicalis

SC

B.indicus_NDRI O.aries_NDRI O.aries_NDRI_TV2 B.Bubalis_NDRI B.indicus_NDRI_TV1 O.aries_NDRI_TV1 C.hircus_NDRI C.hircus_NDRI_TV2 B.taurus B.bubalis B.mutus B.mutus_TV C.hircus C.hircus_TV O.aries O.aries_TV H.sapiens H_sapiens_TV M.musculus M.musculus_TV D.rerio R.norvegicus G.gallus C.L.familiaris P.troglodytes X.tropicalis

Walker B

QMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQNLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGRHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMTLFRVDFSGRGESAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMTLFRVDFSGRGESAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQNLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGRHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMTLFRVDFSGRGESAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKILRGYNVAVFVTNQMTADPRAPMTFQADPKNPFGGHISGHHCF QMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKEPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDFVAAKFHEEGGVFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGAGMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIVDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPVGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHASQMELLDYVAAKFHEEPGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGATMTFQADPKKPIGGHILAHAS-

AC C

B.indicus_NDRI O.aries_NDRI O.aries_NDRI_TV2 B.Bubalis_NDRI B.indicus_NDRI_TV1 O.aries_NDRI_TV1 C.hircus_NDRI C.hircus_NDRI_TV2 B.taurus B.bubalis B.mutus B.mutus_TV C.hircus C.hircus_TV O.aries O.aries_TV H.sapiens H_sapiens_TV M.musculus M.musculus_TV D.rerio R.norvegicus G.gallus C.L.familiaris P.troglodytes X.tropicalis

TTRISLREGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISSRKGRGELRIAQIYGSPEMPENEATFAITAGGIGDAKE TTRISSRKGRGELRIAQIYGSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFTITAGGIGDAKE TTRISLREGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISSRKGRGELRIAQIYGSPEMPENEATFAITAGGIGDAKE TTRISLAKRKRKLEMPKIYDSPEIAENEANLAITAGGIGNAKR TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFTITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRAELRIAKIFDSPHMPENEATFAITAGGITDAKD TTRISLRKGRGELRIAKIYDSPEMPENEATFAITTGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITPGGIGDAKE TTRVSLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE TTRISLRKGRGELRIAKIYDSPEMPENEATFAITSGGINDAKE

SC

B.indicus_NDRI O.aries_NDRI O.aries_NDRI_TV2 B.Bubalis_NDRI B.indicus_NDRI_TV1 O.aries_NDRI_TV1 C.hircus_NDRI C.hircus_NDRI_TV2 B.taurus B.bubalis B.mutus B.mutus_TV C.hircus C.hircus_TV O.aries O.aries_TV H.sapiens H_sapiens_TV M.musculus M.musculus_TV D.rerio R.norvegicus G.gallus C.L.familiaris P.troglodytes X.tropicalis

RI PT

ACCEPTED MANUSCRIPT

M AN U

Fig. 3. Multiple sequence alignment of DMC1 gene in different eukaryotes. HhH motif in domain I and Walker A & Walker B motifs in domain II are highlighted by horizontal bars above the sequence alignment.

AC C

EP

TE D

ATP binding sites are marked by asterisks and BRCA2 interface by bolts.

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig. 4. Schematic representation of DMC1 gene and different splice variants

TE D

The position of primers used for cloning (F1 and R1) and for confirmation of variants (F2 and

AC C

EP

R2) has been indicated with arrows.

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig. 4. Gel image of RT-PCR product of DMC1 gene in ruminants confirming deletion of two exons

TE D

Lane 1: DMC1-FL (326 bp) and TV1 (161 bp) in cattle Lane 2: DMC1-FL in buffalo

Lane 3: DMC1-FL, TV2 (200 bp) in goat

EP

Lane 4: DMC1-FL, TV2 (200 bp) and TV1 (161 bp) in sheep

AC C

M: 100 bp DNA marker, bright band corresponds to 500 bp

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

Fig. 6. Expression profiles of DMC1 splice variants in sheep, goat and cattle (P < 0.05)

ACCEPTED MANUSCRIPT

Highlights •

DMC1 recombinase was sequence characterized for the first time in four ruminant species. Comparative analysis of ruminant DMC1 orthologs revealed limited sequence

RI PT



divergence.

Novel alternatively spliced mRNAs with skipping of exons 7 and 8 were also isolated.



Presence of variants was validated by amplifying cDNA using primers flanking the

SC



deleted region.

Real-time PCR was performed to estimate relative proportion of each variant.



Alternative splicing was observed to be an evolutionarily conserved process across

M AN U



AC C

EP

TE D

species.