Accepted Manuscript Study of microRNA mediated gene regulation in Striga hermonthica through in-silico approach
Swati Srivastava, Ashok Sharma PII: DOI: Reference:
S2352-2151(17)30020-X doi:10.1016/j.aggene.2017.09.004 AGGENE 55
To appear in: Received date: Revised date: Accepted date:
20 February 2017 12 July 2017 25 September 2017
Please cite this article as: Swati Srivastava, Ashok Sharma , Study of microRNA mediated gene regulation in Striga hermonthica through in-silico approach. The address for the corresponding author was captured as affiliation for all authors. Please check if appropriate. Aggene(2017), doi:10.1016/j.aggene.2017.09.004
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT Study of microRNA mediated gene regulation in Striga hermonthica through in-silico approach Swati Srivastava and Ashok Sharma* Biotechnology Division, CSIR-Central Institute of Medicinal and Aromatic Plants, Post Office CIMAP, Lucknow, India *Corresponding Author:
[email protected] Phone no.: 0522-2718517
AC C
EP T
ED
MA
NU
SC
RI
PT
Fax no.: 0522-2342666
ACCEPTED MANUSCRIPT
Abstract Striga hermonthica is a parasitic plant that attacks mostly cereal crops for its growth and development. S. hermonthica parasitism affects two thirds of the arable land and over 100 milion people. It is also well known as a folk medicine due to presence of flavonoids, terpenes, saponins, cardiac glycosides, alkaloids, tannins and coumarins miRNAs are important gene regulatory elements involved in almost all biological processes during
PT
different biotic and abiotic stresses in plants. Mobile small RNA are reported to move bidrection between parasitic plant and host plants. The study was to investigate the miRNA
RI
of S. hermonthica and miRNA mediated regulation of host plant Oryza sativa genes. In-silico
SC
identification of miRNAs revealed 13 conserved miRNA families (miR1846c-3p, miR1848, miR1851, miR1857-5p, miR2102-3p, miR2864.1, miR417, miR437, miR444e, miR529a,
NU
miR810b.2, miR156e, miR5564b). Approximately, 185 genes playing diverse roles have been predicted as targets for the identified 12 miRNA families (miR1848, miR1851, miR1857-5p, miR2102-3p, miR2864.1, miR417, miR437, miR444e, miR529a, miR810b.2,
MA
miR156e, miR5564b). Manipulation in miRNAs or their targets may be utilized for
ED
developing better crop protection strategies.
Keywords: Striga hermonthica, In-silico analysis, miRNA prediction, Target genes, mobile
EP T
small RNAs
Abbrevations: miRNA: MicroRNA, MFE: Minimal Folding Energy, MFEI: Minimal
Sequence
AC C
Folding Index, GO: Gene Ontology, EST: Expressed Sequence Tag, GSS: Genome Survey
1. Introduction
Striga hermonthica belongs to family Orobanchaceae (Minorsky 2015; Spallek et al., 2013, Joel 2000). S. hermonthica is reported to attack cereal crops specially sorghum, pearl millet, rice, maize, tobacco and sugarcane in Africa, India, Asia, Australia and some part of the USA (Musselman 1980). S. hermonthica is also a well-known medicinal plant which has been
used
widely as traditional medicine
in Africa with a broad
spectrum of
pharmacological impacts against many physiological and infectious diseases in both animals and
humans (Hiremath et al., 2000; EL-Kamali 2009; Koua et al., 2011a,b; Okpako and
Ajaiyeoba, 2004; Khan et al., 1998; Choudhury et al., 2000; Baoua et al., 1980; Koua et al.,
ACCEPTED MANUSCRIPT 2011a,b). Communication from cell to cell occurs between organisms that have parasitic, pathogenic or symbiotic relationship. This type of
communication involves transportion of
regulatory molecules across the cellular boundaries between the host and its interacting pathogens, pests and parasitic or symbionts (Weiberg et al., 2015). Mobile small RNAs (such as siRNA
and miRNA) have been indicated to function in communiction between
hosts and their intracting
pathogens, pests and parasites (Brosnan and Voinnet, 2011;
Greenwood et al., 2016; Molnar et al., 2011). The small endogenous non-coding RNAs the regulation of expression of protein
PT
(ncRNAs) (21-25 nt) play an important role in
coding and noncoding genes in plants, animals and humans (Leucci et al., 2013). The
RI
function of ncRNAs have shown some differences between plants and animals in using
SC
different ribonuclease, genomic arrangement of miRNA genes, target recognition and mode of evolutionary emergence (Axtell et al., 2011). The mechanism of its interaction with
NU
lncRNA and siRNA in targeting DNA methylation is also different in plants and animals. miRNA regulates the gene expression by targeting complementary site at 3’ untranslated regions (UTRs) of specific mRNAs for post-transcriptional repression or degradation (He
MA
and Hannon, 2004; Sala-Cirtog et al., 2015, Reinhart et al., 2002). These miRNAs have multiple functions, including regulation of mRNA expression and siRNA biogenesis as well
ED
as various stress responses. Several methods have been reported for the identification of miRNAs such as genetic screening, direct cloning after isolation of small RNAs, computational approach and expressed sequence tags (ESTs) analysis (Zhang et al., 2006).
EP T
In this study, we have used computational approach for the identification of miRNAs families expressed in S. hermonthica and predicted their putative targets (one of reported host plants). The available EST and GSS data of S. hermonthica provide an opportunity for conserved miRNA and their putative target transcripts. In miRBase
AC C
the prediction of
database, 7385 mature plant miRNA are reported from 72 plant species including many medicinally as well as economically important crop species (Sala-Cirtog et al., 2015). However, no miRNAs are available for S. hermonthica in miRBase database. In S. hermonthica, 67,955 and 4356 sequences of ESTs and GSS data, respectively, were available in the NCBI database. In our study, we used EST and GSS data for in-silico identification and
characterization of conserved miRNAs and their predicted target
transcripts. The identification of miRNA and corresponding targets is necessary for the understanding of their role in controlling different biological functions (Pio et al., 2014; Qu et al., 2011).
ACCEPTED MANUSCRIPT 2. Material and methods
2.1 Computational analysis of sequence data EST
and
GSS
data
of
S.
hermonthica
were
retrieved
from
NCBI
(http://www.ncbi.nlm.nih.gov/nucest). To remove the redundancy among 72,311 (67,955 EST and 4,356 GSS) sequences, CAP3 tool (Huang and Madan, 1999) was used which resulted into 8226 contigs. BLASTX (e-value<=1e-5) was performed against protein UniprotKB/Swissprot
(release
2010_12)
and
UniProtKB/TrEMBL
PT
databases
(release
2011_01) to remove the protein-coding sequences. The non-coding contigs were further used
RI
for the identification of miRNAs. miRNAs and their corresponding targets were identified by
SC
C-mii (version 1.11) (Numnark et al., 2012). The putative miRNA candidates were scanned against the available miRNAs in Oryza sativa (660), Sorghum bicolor (172), Zea mays (172)
NU
from miRBase using BLASTN (e-value cut-off = 10). The RNA database, Rfam 10 (Griffiths-Jones et al., 2003) was used to predict primary and precursor structure of identified miRNAs. UNAFold parameters as applied were maximum base pair distance 3000,
MA
maximum bulge/interior loop size 30 and single tread run 37o C temperature.
ED
2.2 Prediction of potential S. hermonthica miRNAs and their target transcripts For a homology search based miRNA identification, miRNAs of O. sativa, S. bicolor, Z. mays were selected as a reference dataset. Following criteria were considered for a miRNA
EP T
candidate: 1) The length of the predicted mature miRNAs should be 19–25 nucleotides, 2) maximum four mismatches were allowed for the predicted mature miRNAs against reference miRNA, 3) localization of the mature miRNA within stem–loop structure should be one arm,
AC C
4) not more than 5 mismatches were allowed between miRNA sequence and the guide miRNA sequence, 5) AU content should be high and 6) minimal folding free energy (MFE) and high MFE index (MFEI) value of the secondary structure should be highly negative. The results were again validated at sequence and structure level through miRNA-dis (Liu et al., 2015). The identified miRNA candidates were used for the target identification against the same transcripts which were used for miRNA identification through psRNATarget (Dai and Zhao, 2011). Following criteria were set for the prediction of miRNA target genes: 1) not more than four mismatches were allowed between predicted mRNAs and target gene, 2) no mismatches were allowed for 10th and 11th positions of complementary site as it is considered as a cleavage site and 3) maximum 4 GU pair was allowed in the complimentary alignment.
ACCEPTED MANUSCRIPT 2.3 Interaction study of predicted miRNA and their target transcripts For understanding the interaction of predicted miRNA and their target transcripts, biological networks were constructed through Cytoscape 3.2 (Shannon et al., 2003).
2.4 Gene ontology and pathway for predicting target transcripts To understand the function of identified miRNAs and their regulating targets, the gene ontology analysis of the identified target transcript was performed by Blast2GO (Conesa and
PT
Götz, 2008). Furthermore to analyze the regulation of identified miRNA pathway analysis
RI
was performed by using KASS server (Moriya et al., 2007).
NU
3.1 miRNA identification and characterization
SC
3. Results
Twenty-three miRNA families were predicted from 8226 contigs through C-mii tool. These predicted miRNAs were considered further for reducing the duplicacy and improving the
MA
accuracy. The results were again validated through miRNA-dis. Out of 23 miRNA families,
Table1
ED
only 13 miRNA families were validated and considered for further study (Table1).
Homolog of S. hermonthica miRNA, predicted miRNA. Homolog
EP T
Contigs
Contig8091 Contig7944 Contig2922 Contig832 Contig2694 Contig6749 Contig4047 Contig3652 Contig388 Contig267 Contig2681 Contig4481 Contig6798
AC C
Pre dicte d miRNA family miR1846c-3p miR1848 miR1851 miR1857-5p omiR2102-3p miR2864.1 miR417 miR437 miR444e miR529a miR810b.2 miR156e miR5564b
miRNA
osa-miR1846c-3p osa-miR1848 osa-miR1851 osa-miR1857-5p osa-miR2102-3p osa-miR2864.1 osa-miR417 osa-miR437 osa-miR444e osa-miR529a osa-miR810b.2 sbi-miR156e sbi-miR5564b
Predicted miRNA sequence 5': GGCCCCCGGGCUGCUCGCUGG :3' 5': CCGUGCCCGCGCGCGCGUGCU :3' 5': CCUUUGGGAUGGCAUCUUGGA :3' 5': UGGCAUUUUUGGAGCGGGAGG :3' 5': CUCGGUGCCGGUUACGGCGGCG :3' 5': GUUUGCUGCCCUUGGUUCGGA :3' 5': UCAGGUUCAUCAACAUCGUCC :3' 5': GGUUGAAGUGAAUUUGUUUCA :3' 5': AAAGCAAUUGAAGUUUGACUU :3' 5': UUUGGAGAUGUUAUUUUGGUAA :3' 5': CUGUACCCUCUCUGGUUAUC :3' 5':AAGUGAUUUAAUUUUACCUUU:3' 5': CUGUACCCUCUCUGGUUAUC :3'
3.2 Analysis of length variation of miRNAs Variation in pre-miRNA
range of 43 to 1105 nt with an average length of 171 nt was
observed. On the other hand, the variation from 20 to 22 nt was observed in mature miRNA sequences as earlier reported (Fig. 1).
NU
SC
RI
PT
ACCEPTED MANUSCRIPT
3.3 AU and GC content study
MA
Fig. 1. pre-miRNA and mature miRNA sequence lengths.
ED
Higher AU content is an important feature to measure the difference between miRNA with other non-coding RNAs. In our results, we have also observed that predicted miRNAs
EP T
showed AU content and GC content in range of 18.71 to 75.71 and 24.29 to 81.29 with an
AC C
average value of 48.90 and 51.09, respectively (Fig. 2).
Fig. 2. Show predicted mature-miRNA nucleotide content, AU and GC content.
3.4 MFE and MFEI calculations
ACCEPTED MANUSCRIPT MFE (Minimum Folding Energy) is an important parameter for determining the secondary structure of pre-miRNA. The highly negative MFE indicates thermodynamically stable secondary structure of the corresponding sequences. In our analysis, the range of MFE was between 16 to 447.1 (-kcal/mol) with an average 68.32 (-kcal/mol).
MFEI (Minimum
Folding Free Enrgy Index) is considered to distinguish pre-miRNA from other coding and non coding RNA and RNA fragments. MFEI value ranged from 0.609 to 1.069 (-kcal/mol)
3.5 miRNA target identification identified
miRNA
families
were
ED
The
MA
Fig. 3. Predicted miRNA MFE and MFEI.
NU
SC
RI
PT
with an average of 0.68 (-kcal/mol) (Fig. 3).
considered
further
for
corresponding
target
identification. Out of 13, 12 miRNA families were found to have 185 target transcripts
EP T
through psRNATargetScan. The targets for miR1846c could not be identified. As it is reported that miRNA have tendency to potentially regulate multiple distinct gene suggesting that these genes are subjected to combitorial control by miRNAs (Srivastava et al., 2016).
AC C
Out of 12 miRNA families, 11 were analyzed to regulate more than one gene, except miR529a (Fig. 4).
NU
SC
RI
PT
ACCEPTED MANUSCRIPT
MA
Fig. 4. No. of predicted miRNA target transcripts.
3.6 Interaction study, pathway and annotation for predicting target transcripts We constructed a regulatory network among 12 predicted miRNA families with their
ED
corresponding target transcripts (Fig. 5a). The pathway analysis of 185 target transcripts was also performed. As a result, 46 target transcripts were found to be involved in different
EP T
pathways (Table 2). Further, we have done annotation of these 46 target transcripts which were found to be involved
in 120 molecular functions, 105 biological functions and 54
cellular components (Table 3, Supplementary 1). A regulatory network of 46 target
AC C
transcripts involved in different function with their corresponding 12 miRNA families was constructed (Fig. 5b).
RI
PT
ACCEPTED MANUSCRIPT
Fig. 5. (a) Interaction of potential predicted miRNAs and their target genes, yellow triangle
SC
show miRNAs family and blue rectangle represents target genes and edges show UPE value. (b) light green triangle represents miRNAs and orange octagon represents corresponding
NU
target functions and edges represents inhibition type.
MA
Table 2
Functional annotation and pathway analysis of target transcripts. Targe t Acc.
UPE
Predicted target sequence
KO
pathway
Name
Inhibition type
osa-miR1848
Os07t0471000-01
22.746
GCACGCACGCGUGGGCGCGU
K08852
ERN1
Cleavage
osa-miR1848
Os01t0685850-00
22.23
GGCGCGCG-GCGCGGGCACGG
K02133
ko04141, ko04210, ko04932, ko05010 ko00190, ko05010, ko05012, ko05016
Cleavage
osa-miR1851 osa-miR1851
Os07t0178600-01 Os04t0381000-01
22.574 17.997
UCCAAGAUGCCAUUUCAAAGA UUGAAGAUGCUGUUCCAAAGG
K09419 K17065
osa-miR1857-5p osa-miR1857-5p osa-miR1857-5p osa-miR1857-5p
Os05t0134300-01 Os05t0386000-03 Os12t0123500-01 Os10t0369900-01
18.006 16.993 15.859 17.1
CUCUUGCUCCAAUAAUGUCA UCUUCUGCUCCCAAGGUGCCA CUACCGCUCCAAAGGUGCCC CUCCCGCGCCAACAAUGUCA
K20827 K20782 K14641 K01858
ko03000 ko04139, ko04214, ko04621, ko04668 ko01000 ko00514 ko00230, ko00240 ko00521, ko00562
AT PeF1B, AT P5B, AT P2 HSFF DNM1L
Cleavage T ranslation Cleavage Cleavage
osa-miR2102-3p
Os02t0599100-00
24.517
CGCUGCCGUAGCCGGUACCGUG
K11137
ko03460, ko04150
osa-miR2102-3p
Os02t0598800-00
24.517
CGCUGCCGUAGCCGGUACCGUG
K11137
ko03460, ko04150
osa-miR2102-3p osa-miR2102-3p
Os04t0186800-00 Os03t0626700-01
24.861 24.508
CGCCGCCGUCACCGGCAUCGCG CGCCGCCGUCACCGUCGUCGAG
K08176 K03327
ko02000 ko02000
osa-miR2102-3p osa-miR2864.1
Os09t0468800-00 Os08t0447000-01
21.257 19.224
CGCCGCCGUCGCCGCCGCCGAG CCCAGCCAAGGACAGCAAGC
K10664 K00058
osa-miR417 osa-miR417
Os01t0172900-00 Os03t0402000-01
13.781 14.432
UGAAACAAAUUAAAUUCAACG GGAAUUAAUUCACUUCAAUC
K01184 K20302
ko01000 ko00260, ko00680, ko01200, ko01230 ko00040 ko04131
osa-miR417 sbi,osa-miR437 sbi,osa-miR437 sbi,osa-miR437
Os01t0743200-02 Os10t0542900-01 Os01t0382000-01 Os03t0826300-00
12.42 14.879 12.576 14.168
GAAGCAAAGUCACUUUAAUU AAGGCAAAUAUCAAUUGCUUU GGUUAAAGUUAAAUUGCUUU GGUCAAUGUUCAAUUGUUUU
K01051 K20547 K13449 K15356
ko00040 ko00520, ko04016 ko04016, ko04075, ko04626 ko02000
osa-miR444e osa-miR444e osa-miR444e
Os03t0323200-03 Os02t0796500-03 Os02t0796500-02
22.384 21.817 21.817
GAGCUUGCUCUAGCAACAGCA
K03403 K14491 K14491
ko00860 ko04075 ko04075
RPAP2 HPAT APY1_2 INO1, ISYNA1 T ELO2, T EL2 T ELO2, T EL2 PHO84 T C.MAT E, SLC47A, norM, mdtK, dinF AT L6S serA, PHGDH E3.2.1.15 T RAPPC3, BET 3 E3.1.1.11 CHIB PR1 VRG4, GONST 1 chlH, bchH ARR-B ARR-B
AC C
EP T
ED
Pre dicte d miRNA family
GAACUUAGCUCUAGCAGCUGCA GAACUUAGCUCUAGCAGCUGCA
Cleavage Cleavage
Cleavage Cleavage Cleavage Cleavage
Cleavage T ranslation T ranslation Cleavage Cleavage Cleavage T ranslation Cleavage Cleavage Cleavage Cleavage
ACCEPTED MANUSCRIPT Os03t0711250-01 Os03t0424800-00
16.832 20.324
GAACAUAUUUCAGCAACUGCA AACAUCCUUCAGCAGCUGCA
K00888 K02966
ko00562, ko04011, ko04070 ko03010
osa-miR444e
Os03t0424500-01
22.44
AACAUCCUUCAGCAGCUGCA
K02966
ko01524, ko04016
osa-miR444e osa-miR444e osa-miR444e
Os04t0556000-02 Os04t0556000-01 Os03t0246800-03
18.3 18.3 20.626
GAUCUUGCACCAGAAACUGCA GAUCUUGCACCAGAAACUGCA AACUU-CUUCAGCAGCUGCA
K17686 K17686 K18442
ko04131 ko00520 ko00001
osa-miR529a osa-miR810b.2 osa-miR810b.2 osa-miR810b.2 sbi, zma-miR156e sbi, zma-miR156e
Os07t0632000-01 Os12t0607100-02 Os07t0658400-01 Os07t0658400-02 Os07t0681600-01 Os01t0769000-01
24.263 20.77 18.498 18.498 19.444 21.573
GACAAGCAGAGCGGGUACAG UAGGUGAAGUUGGAUCACUU GAAGGUGAACUAGAAUCACUU GAAGGUGAACUAGAAUCACUU UUGUCAGCCCUCUUCUGUCA GUACCGGCUCUCUUCUGUUG
K01183 K03242 K00284 K00284 K03654 K12617
ko00630, ko00910 ko03018 ko03018 ko01000 ko03018 ko03018
sbi, zma-miR156e sbi, zma-miR156e sbi, zma-miR156e sbi-miR5564b sbi-miR5564b sbi-miR5564b sbi-miR5564b sbi-miR5564b
Os01t0842500-01 Os06t0367100-02 Os06t0367100-01 Os10t0560900-01 Os10t0147250-00 Os02t0654400-01 Os06t0166400-01 Os04t0469500-01
19.759 20.391 20.391 23.978 24.802 23.944 20.943 17.393
UUGCUAGCUUGCUUCUGUUG CUAUCAGCUCUAUUCUGUUG CUAUCAGCUCUAUUCUGUUG UCAAGCUGUUCGACGGAAUGG GAAGGUGUUCGAGGAAAUGG CAGGCUGUUGGAGGAAGUGA CAAGGUGUUCGGCGAAAUGC CGAGCUGUUCCAGCAAAUGC
K05909 K00700 K00700 K00652 K17964 K03129 K09286 K00850
sbi-miR5564b sbi-miR5564b
Os10t0138100-01 Os01t0851700-01
22.162 17.153
CAACCUGUUCGACGAAAUGU CAAGCGGUUCAAGGAAAUGG
sbi-miR5564b
Os04t0490800-01
22.016
CAAGCUGAUCGACGGAGUGC
ko01000 ko00500 ko00500 ko00780 H00072, H01354, H01368 ko03022, ko05016, ko05168 ko03000 ko00010, ko00030, ko00051, ko00052, ko00680, ko01200, ko01230, ko03018, ko04152, ko05230 ko00906 ko00500, ko04910, ko04922, ko04931 ko00630, ko01200
SC
RI
PT
osa-miR444e osa-miR444e
NU
K09842 K00688 K19269
PI4KA RP-S19e, RPS19 RP-S19e, RPS19 copA, AT P7 copA, AT P7 ARFGEF, BIG E3.2.1.14 EIF2S3 E1.4.7.1 E1.4.7.1 recQ PAT L1, PAT 1 E1.10.3.2 GBE1, glgB
Cleavage Cleavage Cleavage Cleavage Cleavage Cleavage T ranslation Cleavage T ranslation T ranslation Cleavage Cleavage
bioF LRPPRC T AF4 EREBP pfkA, PFK
T ranslation T ranslation T ranslation Cleavage Cleavage T ranslation Cleavage T ranslation
AAO3 PYG, glgP
Cleavage T ranslation
PGP, PGLP
Cleavage
MA
UPE:Unpaired energy (UPE) required to open secondary structure around miRNA’s target site on mRNA, KO: KEGG orthology. Table 3
S. No.
miRNA
ED
Annotation of predicted target transcripts. No. of
Molecular functions
Biological processes
Cellular components
9
9
4
5
6
2
2
6
9
3
4
4
6
5
5
pathways miR1848
2
miR1851
3
miR1857-5p
4
miR2102-3p
5
miR2864.1
4
5
5
0
6
miR417
2
9
11
3
miR437
5
0
0
3
8
miR444e
11
15
22
14
9
miR529a
2
3
2
1
10
miR810b.2
2
7
6
0
11
miR156e
3
21
21
9
12
miR5564b
5
30
19
9
AC C
7
7
EP T
1
ACCEPTED MANUSCRIPT 4. Discussion
S. hermonthica as per recent classification has been placed in the family Orobanchaceae (Spallek et al., 2013). Which contains the highest number of parasitic species (Bennett and Mathews, 2006). S. hermonthica is well known as a hemi-parasitic weed which damages major cereal crops. It also plays an important role as a folk medicine for broad spectra of diseases (Holbrook-Smith et al., 2016; Koua et al., 2011; Okpako and Ajaiyeoba, 2004). We of this plant and
predict their role in the
PT
predicted and analyzed the non-coding RNAs
regulation of mRNA in host plant (Oryza sativa). No information of miRNA of S. hermontiga
RI
is available in miRBase database. Our study to identify miRNAs and their target sequences
SC
was based on homology method from EST and GSS data of S. hermonthica. For miRNA identification, we used homology identification with already reported miRNAs in Oryza
NU
sativa (660), Sorghum bicolor (172) and Zea mays (176). The reason for using miRNAs of crop plants was because S. hermonthica is a parasitise on these plants. In our analysis, 13 miRNA families were identified. Out of which 11 miRNA families (miR1846c-3p, miR1848,
MA
miR1851, miR1857-5p, miR2102-3p, miR2864.1, miR417, miR437, miR444e, miR529a, miR810b.2) have shown homology with O. sativa and two miRNA families (miR156e,
ED
miR5564b) with S. bicolor. The identified mature miRNAs were 20-22 nt which were in accordance with mature miRNAs of potato, soyabean, switch grass, mentha, ginger and chicory (Dehury et al., 2013; Singh and Sharma, 2014; Srivastava et al., 2016). It has been
EP T
reported that, the stability of secondary structure of RNA molecule depends upon the minimal folding free energy (MFE). Lower value of MFE indicate the stability of the RNA molecule. The MFE value of miRNAs are reported to be lower than other RNAs. In our result, the
AC C
average MFE of S. hermonthica miRNAs was -68.32 kcal/mol which is lower as compared to tRNA (- 27 kcal/mol) and sRNA (-33 kcal/mol) (Zhang et al., 2006). The MFE values of S. hermonthica miRNAs were found to be more negative as compared to cotton (-35.19 kcal/mol), helianthus (-35.83 kcal/mol) and ginger (−33.55 kcal/mol) (Barozai et al., 2012; Singh and Sharma, 2014; Zhang et al., 2006). Thus, the identified miRNAs in S hermionthica were significantly stable. The difference between miRNAs and other non-coding and coding RNAs also depend upon MFEI value (Srivastava et al., 2016). The identified MFEI value of pre-miRNAs in S. hermonthica have shown a significantly higher value than other RNAs (Srivastava et al., 2016). Our analysis indicated that S. hemonthica miRNAs were involved in different
biological
process,
molecular
function
and
cellular
component
(Table
3,
Supplementary 1). The identified miR1848 was observed to regulate 8 targets. Out of 8, 2
ACCEPTED MANUSCRIPT putative targets serine/threonine protein kinase domain containing protein and hypothetical protein have shown their involvement in different regulations such as protein processing in endoplasmic reticulum, apoptosis, non-alcoholic fatty liver disease (NAFLD), Alzheimer's disease, oxidative phosphorylation, Parkinson's disease and Huntington's disease. In previous studies, it was reported that miR1848 play a major role in the wax biosynthesis pathway, leaf senescence through phytohormone signaling pathways, APETALA2 (AP2), zinc finger proteins, salicylic acid-induced protein 19 (SIP19), auxin response factors (ARF) and NAC
PT
transcription factors, in rice (Xia et al., 2015; Xu et al., 2014). miR1851 was also predicted to regulate 8 targets. Out of 8 targets, 3 have shown similarity with heat shock transcription 29
(Fragment),
DRP3A
(DYNAMIN-RELATED
PROTEIN
RI
factor
3A)
and
GTP
SC
binding/GTPase/phosphoinositide binding protein showing the involvement in mitophagy yeast, apoptosis - fly, NOD-like receptor signaling pathway,
TNF signaling pathway. In
NU
earlier studies, it was reported that WRKY transcription factor 54, vacuolar-sorting receptor 6, subtilisin, MYB and scarecrow-like protein were targeted by osa-miR1851 (Xu et al., 2014). miR1857-5p was found to regulate 12 targtes. Out these 4 targets, protein of unknown
3.6.1.5)
(ATP-diphosphatase)
diphosphohydrolase)
and
similar
(adenosine
to
diphosphatase)
myo-inositol-1-phosphate
ED
(EC
MA
function DUF408 family protein, similar to predicted protein, similar to apyrase precursor (ADPase)
synthase
showed
(ATPtheir
involvement in different pathways such as O-glycan biosynthesis, purine metabolism, pyrimidine
metabolism,
streptomycin
biosynthesis
and
inositol phosphate
metabolism.
EP T
miR1857-5p was earlier, reported to play an important role in expression of drought-tolerant up-ground tissues and down-regulated or repressed with drought treatment (Candar-Cakir et al., 2016). Thirteen targets were observed to be regulated by miR2102-3p. Out of these five
AC C
have shown hypothetical conserved gene, telomere length regulation protein, conserved domain-domain
containing
protein,
similar
to
OSIGBa0113D21.1
protein and
Multi
antimicrobial extrusion protein MatE family protein and Similar to RING-H2 finger protein ATL5F showing their involvement in fanconi anemia pathway and mTOR signaling pathway. miR2102-3p has been reported to participate in the regulation of TCP family transcription factor containing protein (Xue et al., 2009). miR2864.1 was found to regulate 3 target transcripts. Out of these 1 target transcripts was found similar to D-3-phosphoglycerate dehydrogenase involved in glycine, serine and threonine metabolism, methane metabolism, carbon metabolism, biosynthesis of amino acids. It has been reported that FCA gene Os09g03610 controling flowering time is cleaved by osa-miR2864.1 at two different sites (Devi et al., 2016). Predicted miR417 of S. hermonthica was observed to regulate putative 7
ACCEPTED MANUSCRIPT tragets.
Out of these some targets, glycoside hydrolase, family 28 domain containing protein,
TRAPP I complex, Bet3 domain containing protein and similar to pectinesterase have shown their involvement in pentose and glucuronate interconversions. Previous studies have reported that differential expression profile of the miR417 exhibits a negative regulation over seed germination under salt stress condition (Das et al., 2015). We observed miR437 to regulated 4 putative targets. Out of which, 3 were coding for, similar to chitinase, similar to pathogenesis-related protein PRB1-2 precursor and similar to predicted protein involved in
PT
different pathways such as amino sugar and nucleotide sugar metabolism, MAPK signaling pathway, plant hormone signal transduction and plant-pathogen interaction. An earlier study
RI
reported that miR437 regulates Glu receptor proteins (Sunkar 2005). Predicted miR444e was
subunit H family protein,
expressed,
SC
observed to target 26 transcripts. 9 targets were coding for, similar to magnesium-chelatase B-type response regulator, cytokinin signaling,
NU
hypothetical gene, similar to 40S ribosomal protein S19-3, similar to 40S ribosomal protein S19-3, Similar to heavy metal ATPase, heavy metal P-type ATPase, xylem loading of copper and similar to guanine nucleotide-exchange protein GEP2. These targets have shown
their
MA
involvement in different pathways such as porphyrin and chlorophyll metabolism, Plant hormone signal transduction, inositol phosphate metabolism, MAPK signaling pathway -
ED
yeast, phosphatidylinositol signaling system, ribosome, platinum drug resistance, MAPK signaling pathway, brefeldin A-inhibited guanine nucleotide-exchange protein and membrane trafficking. Earlier, miR444e is reported to regulated MADS-box factor 27 and MADS-box Our results suggested that the predicted miR529a regulated
EP T
factor 57 (Sunkar et al., 2008).
only 1 target, similar to xylanase inhibitor protein I precursor showed involvement in amino sugar and nucleotide sugar metabolism. Earlier studies showed the involvement of miR529a
AC C
in the regulation of drought tolerance in rice (Zhou et al., 2010). miR810b.2 was observed to target 12 transcripts. Out of which 3 have shown similar to eukaryotic translation initiation factor 2 gamma subunit, Similar to ferredoxin-dependent glutamate synthase and similar to Fd-GOGAT protein (Fragment). These targets were found to involve in RNA transport, glyoxylate and dicarboxylate metabolism and nitrogen metabolism. In previous studies, it is reported that miR810b.2 was involved in DNA polymerase alpha catalytic subunit (Cheah et al., 2015). miR156e was target shown to 9 transcript sequences coding for DNA helicase, ATP-dependent, RecQ type domain containing protein, topoisomerase II-associated protein PAT1 domain containing protein, similar to laccase (EC 1.10.3.2), similar to starch branching enzyme III
and glycoside hydrolase, subgroup, catalytic core domain containing protein.
These targets were found to be involved in pathways such as RNA degradation and starch
ACCEPTED MANUSCRIPT and sucrose metabolism. In previous studies miR156e was reported to be involved in overexpression of production of a bushy mutant and regulation of strigolactones (SLs) pathway (Chen et al., 2015). miR5564b regulated 82 different targets. Out of these 8 targets coded for pyridoxal phosphate-dependent transferase, major region, subdomain 1 domain containing
protein,
hypothetical conserved
gene,
transcription initiation factor TFIID
component TAF4 domain containing protein, similar to TINY-like protein (AP2 domain containing protein RAP2.10) (Fragment), similar to OSIGBa0124N08.2 protein, similar to
PT
aldehyde oxidase-2, similar to cytosolic starch phosphorylase (Fragment) and haloacid dehalogenase-like hydrolase domain containing protein. miR5564b was reported to be
RI
involved in biotin metabolism, pyruvate dehydrogenase complex deficiency, leigh syndrome,
SC
cytochrome c oxidase (COX) deficiency, basal transcription factors, huntington's disease, herpes simplex infection, glycolysis/gluconeogenesis, pentose phosphate pathway, fructose
NU
and mannose metabolism, galactose metabolism, methane metabolism, carbon metabolism, biosynthesis of amino acids, RNA degradation, AMPK signaling pathway, central carbon metabolism in cancer, carotenoid biosynthesis, starch and sucrose metabolism, insulin pathway,
glucagon
signaling
pathway,
MA
signaling
insulin
resistance,
glyoxylate
and
dicarboxylate metabolism and carbon metabolism. Previous studies have been reported that
ED
miR5564b was involved in stress tolerance in Sorghum bicolor (Zhang et al., 2011).
EP T
5. Conclusion
Computational approach for small RNA identification plays an important role to study mechanisms of miRNAs and their targets playing important regulatory functions. The
AC C
identification of S. hermonthica miRNAs and their predicted target genes adds to our understanding
the molecular mechanisms of RNA communication and exchange between
both parasitic and host plants. The identified miRNAs and their putative targets may lead to an improved
understanding of the molecular mechanism allowing S. hermonthica to
parasitize host plants We reported 13 conserved miRNA families and 12 miRNA families were found to have function in different pathways. It was overall observed that predicted miRNA was mainly involved in serine family amino acid biosynthetic process, signal transduction,
cytokinin-activated
signaling pathway, metal ion transport, abscisic acid
biosynthetic process, xanthine catabolic process, auxin biosynthetic processresponse to water deprivation etc. These 12 miRNAs of
S. hermonthica may structurally mimic host plants
miRNAs and hamper the regulation of host plant mRNA involved in growth, development
ACCEPTED MANUSCRIPT and
defense
system.
Computational methods of miRNA prediction combined
with
transcriptomics detail of organism on non model organism, could be of help to assess whether parasitic plants have evolved mechanisms to exchange
molecules to their advantage with
their host plants (Ichihashi et al., 2015; Weiberg et al., 2015, Knip et al., 2014).
Author contribution statement AS and SS designed and conceptualized the research, SS preformed the
identification,
PT
analysis, assembly, annotation and wrote the manuscript. AS critically revised the article. All
RI
authors read and approved the manuscript.
SC
Acknowledgements We would like to thank EST database of NCBI for making the data freely available to the scientific community. Financial assistance under BTISnet programme
NU
of DBT, New Delhi is gratefully acknowledged.
MA
Reference
AC C
EP T
ED
Axtell, M.J., Westholm, J.O., Lai, E.C., 2011. Vive la différence: biogenesis and evolution of microRNAs in plants and animals. Genome Biol. 12, 221, doi:10.1186/gb-2011-12-4221. Baoua, M., Bessiere, J.M., Pucci, B., Rigaud, J.P., 1980. Alkaloids from Striga hermonthica. Phytochemistry. 19, 718. Barozai, M.Y., Baloch, I.A., Din, M., 2012. Identification of MicroRNAs and their targets in Helianthus. Mol. Biol. Rep. 39, 2523-2532, doi:10.1007/s11033-011-1004-y. Brosnan, C.A., Voinnet, O., 2011. Cell-to-cell and long-distance siRNA movement in plants: mechanisms and biological implications. Curr. Opin. Plant Biol. 14, 580–587. doi:10.1016/j.pbi.2011.07.011 Choudhury, M.K., Phillips, A.L., Mustapha, A., 1998. Pharmacological studies of Striga senegalensis Benth. (Scrophulariaceae) as an Abortifacient. Phytother Res. 12, 141143. Conesa, A., Götz, S., 2008. Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int J Plant Genomics, 2008, 619832. doi:10.1155/2008/619832 Dai, X., Zhao, P.X., 2011. psRNATarget: a plant small RNA target analysis server.Nucleic Acids Res. 39, W155-159, doi:10.1093/nar/gkr319. Dehury, B., Panda, D., Sahu, J., Sahu, M., Sarma, K., Barooah, M., Sen, P., Modi, M., 2013. In silico identification and characterization of conserved miRNAs and their target genes in sweet potato (Ipomoea batatas L.) expressed sequence tags (ESTs). Plant Signal Behav. 8, e26543. El-Kamali H.H., 2009, Ethnopharmacology of medicinalp used in North Kordofan (Western Sudan). Ethnobot. leaflets.13, 203-210. Greenwood, J.M., Ezquerra, A.L., Behrens, S., Branca, A., Mallet, L., 2016. Current analysis of host?parasite interactions with a focus on next generation sequencing data. Zoology 119, 298–306. doi:10.1016/j.zool.2016.06.010
ACCEPTED MANUSCRIPT
AC C
EP T
ED
MA
NU
SC
RI
PT
Griffiths-Jones, S., Bateman, A., Marshall, M., Khanna, A., Eddy, S.R., 2003. Rfam: an RNA family database. Nucleic Acids Research. 31(1), 439-441. He, L., Hannon, G.J., 2004. MicroRNAs: small RNAs with a big role in gene regulation, Nat. Rev. Genet. 5(7), 522–531. Huang, X., Madan, A., 1999. CAP3: A DNA sequence assembly program. Genome Res. 9, 868–877. Ichihashi, Y., Mutuku, J.M., Yoshida, S., Shirasu, K., 2015. Transcriptomics exposes the uniqueness of parasitic plants. Brief. Funct. Genomics 14, 275–282. doi:10.1093/bfgp/elv001 Joel, D.M., 2000. The long-term approach to parasitic weed control:manipulation of specific developmental mechanisms of the parasite. Crop Prot. 19, 753–758. Kakrana, A., Hammond, R., Patel, P., Nakano, M., Meyers, B.C., 2014. sPARTA: a parallelized pipeline for integrated analysis of plant miRNA and cleaved mRNA data sets, including new miRNA target-identification software, Nucl. Acids Res. 42(18), e139. Khan, I.Z., Aqil, M., Kolo, B., 1998. 5-hydroxy-6,8-dimethoxyflavone7,4`-O-diglucoside from Striga hermonthica (Del.) Benth. Ultra Sci. Phys. Sci. 10, 278–280. Knip, M., Constantin, M.E., Thordal-Christensen, H., 2014. Trans-kingdom Cross-Talk: Small RNAs on the Move. PLoS Genet. 10, e1004602. doi:10.1371/journal.pgen.1004602 Koua, F.H.M., Abbass, F.M., Elgaali, E.I., Khalafallah, M.M., Babiker, H.A.A, 2011a. In vitro host-free seed culture, callus development and organogenesis of an obligatory root-parasite Striga hermonthica (Del.) Benth: the witch weed and medicinal plant. Int. J. Plant Biol. 2, 34–39. Koua, F.H.M., Babiker, H.A.A., Halfawi, A., Ibrahim, R.O., Abbas, F.M., Elgaali, E.I., Khalafallah, M.M., 2011b. Phytochemical and biological study of Striga hermonthica (Del.) Benth Callus and Intact. Plant Res. Pharma. Biotechnol. 3, 85-92. Leucci, E., Patella, F., Waage, J., Holmstrøm, K., Lindow, M., Porse, B., Kauppinen, S., Lund, A.H., 2013. microRNA-9 targets the long non-coding RNA MALAT1 for degradation in the nucleus, Sci. Rep. 3, 2535. Liu, B., Fang, L., Chen, J., Liu, F., Wang, X., 2015. miRNA-dis: microRNA precursor identification based on distance structure status pairs. Mol. BioSyst.,11, 1194-1204 Minorsky, P.V., 2015. On the Inside. Plant Physiol. 168, 765-766, doi: 10.1104/pp.15.00897. Molnar, A., Melnyk, C., Baulcombe, D.C., 2011. Silencing signals in plants: a long journey for small RNAs. Genome Biol. 12, 215. doi:10.1186/gb-2010-11-12-219 Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A.C., Kanehisa, M., 2007. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, W182– W185. doi:10.1093/nar/gkm32 Numnark, S., Mhuantong, W., Ingsriswang, S., Wichadakul, D., 2012. C-mii: a tool for plant miRNA and target identification. BMC Genomics. 13 Suppl 7, S16. doi:10.1186/14712164-13-S7-S16 Okpako, L.C., Ajaiyeoba, E.O, 2004. In vitro and in vivo antimalarial studies of Striga hermonthica and Tapinanthus sessilifolius extracts. Afr. J. Med. Sci. 33, 73-75. Pio, G., Ceci, M., Elia, D.D. Loglisci, C., Malerba, D., 2013 Integrating microRNA target predictions for the discovery of gene regulatory networks: a semi-supervised ensemble learning approach. BMC Bioinformatics, 14, S8. Press, M.C., Phoenix, G.K., 2005. Impacts of parasitic plants on natural communities: Tansley review. New Phytol. 166, 737–751. doi:10.1111/j.1469-8137.2005.01358.x
ACCEPTED MANUSCRIPT
AC C
EP T
ED
MA
NU
SC
RI
PT
Qu, T., Liu, R., Wang, W., An, L., Chen, T., Liu, G., Zhao, Z., 2011. Brassinosteroids regulate pectin methylesterase activity and AtPME41 expression in Arabidopsis under chilling stress. Cryobiology, 63,111-117. Reinhart, B.J., Weinstein, E.G., Rhoades, M.W., Bartel, B., Bartel, D.P., 2002. MicroRNAs in plants. Genes Dev. 16, 1616-1626, doi:10.1101/gad.1004402. Sala-Cirtog, M., Marian, C., Anghel, A., 2015. New insights of medicinal plant therapeutic activity—The miRNA transfe. Biomedicine & Pharmacotherapy, 74, 228-232, doi:10.1016/j.biopha.2015.08.016. Shannon, P. et al., 2003. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. Singh, N., Srivastava, S., Shasany, A.K., Sharma, A., 2016. Identification of miRNAs and their targets involved in the secondary metabolic pathways of Mentha spp. Comput. Biol. Chem. 64, 154-162. Spallek, T., Mutuku, M., Shirasu, K., 2013. The genus Striga: a witch profile. Mol. Plant Pathol. 14, 861–869. doi:10.1111/mpp.12058 Srivastava, S., Singh, N., Srivastava, G., Sharma, A., 2016. miRNA mediated gene regulatory network analysis of Cichorium intybus (chicory). Agri. Gene. 3, 37-45, doi: 10.1016/j.aggene.2016.11.003. Weiberg, A., Bellinger, M., Jin, H., 2015. Conversations between kingdoms: small RNAs. Curr. Opin. Biotechnol. 32, 207–215. doi:10.1016/j.copbio.2014.12.025 Zhang, B., Pan, X., Cobb, G.P., Anderson, T.A., 2006. Plant microRNA: a small regulatory molecule with big impact, Dev. Biol. 289, 3–16 Zhang, B., Pan, X., Cannon, C.H., Cobb, G.P., Anderson, T.A., 2006. Conservation and divergence of plant microRNA genes. Plant J. Cell Mol. Biol. 46, 243–259. doi:10.1111/j.1365-313X.2006.02697.x Zhang, B.H., Pan, X.P., Cox, S.B., Cobb, G.P., Anderson, T.A., 2006. Evidence that miRNAs are different from other RNAs. Cell Mol Life Sci, 63, 246-254.
ACCEPTED MANUSCRIPT Abbrevations: miRNA: MicroRNA, MFE: Minimal Folding Energy, MFEI: Minimal Folding Index, GO: Gene Ontology,
PT
EST: Expressed Sequence Tag,
AC C
EP T
ED
MA
NU
SC
RI
GSS: Genome Survey Sequence
ACCEPTED MANUSCRIPT
AC C
EP T
ED
MA
NU
SC
RI
PT
Graphical abstract
ACCEPTED MANUSCRIPT Highlights
PT
RI SC NU MA ED
EP T
Mobile small RNAs play an important role in regulation of trans- kingdom genes. Thirteen conserved miRNAs identified in Striga hermonthica. These thirteen miRNAs showed homology with their host plants Oryza sativa and Sorghum bicolor. Out of thirteen miRNAs, 12 miRNAs predicted to regulate 185 target mRNA of Oryza sativa. Predicted miRNA of S. hermonthica might be used as a tool for protection of cereal crops damage.
AC C