Journal Pre-proof Five mitochondrial genomes of black fungus gnats (Sciaridae) and their phylogenetic implications
Xiaoqian Miao, Junhao Huang, Frank Menzel, Qingyun Wang, Qiaoyu Wei, Xiao-Long Lin, Hong Wu
PII: S0141-8130(19)38697-0
DOI: https://doi.org/10.1016/j.ijbiomac.2020.01.271
Reference: BIOMAC 14576
To appear in: International Journal of Biological Macromolecules
Received date: 26 October 2019
Revised date: 26 January 2020
Accepted date: 27 January 2020
Please cite this article as: X. Miao, J. Huang, F. Menzel, et al., Five mitochondrial genomes of black fungus gnats (Sciaridae) and their phylogenetic implications, International Journal of Biological Macromolecules (2018), https://doi.org/10.1016/j.ijbiomac.2020.01.271
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
© 2018 Published by Elsevier.
Five mitochondrial genomes of black fungus gnats (Sciaridae) and their phylogenetic implications
Xiaoqian Miao a, Junhao Huang a,*, Frank Menzel b, Qingyun Wang a, Qiaoyu Wei a, Xiao-Long Lin c, Hong Wu a
a Department of Forestry Protection, School of Forestry and Biotechnology, Zhejiang A&F University, 666 Wusu street, Linan, Hangzhou, Zhejiang 311300, China
b Senckenberg Deutsches Entomologisches Institut, Eberswalder Straße 90, 15374 Müncheberg, Germany. E-mail: [email protected]
c College of Life Sciences, Nankai University, Tianjin 300071, China. E-mail: [email protected]
* Corresponding author. E-mail: [email protected], Tel: 86-571-63732758, Fax: 86-571-63740898
Abstract: Sciaridae is a species-rich family with a worldwide distribution that includes important agricultural pests of cultivated mushrooms and greenhouse plants. Here we sequenced five nearly complete mitochondrial genomes representing three subfamilies of Sciaridae. The lengths of these mitogenomes range from 13,849 bp to 16,923 bp, with 13 protein-coding genes (PCGs), 20–22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and a control region (CR). Compared with other dipteran species, rearrangements are more common in Sciaridae. Inversion or transposition of trnL2 is observed frequently, as are rearrangements in the tRNA clusters trnI-trnQ-trnM, trnW-trnC-trnY, and trnA-trnR-trnN-trnS1-trnE-trnF. Phylogenetic relationships within the family were reconstructed from these newly sequenced species combined with the published mitogenomes of related families. The analyses recovered the topology within Sciaroidea as Cecidomyiidae + (Sciaridae + Keroplatidae). Relationships recovered within Sciaridae were Sciarinae + (‘Pseudolycoriella group’ + Megalosphyinae).
Keywords: Sciaroidea; molecular phylogeny; Diptera
1. Introduction
The adults of black fungus gnats (Diptera: Sciaroidea: Sciaridae) are mostly tiny flies with a slender dark-colored body, long legs and simple wings [1, 2]. They are widely distributed across the world, with more than 2,800 species [3]. Sciarids are found in various terrestrial habitats ranging from caves to high-altitude mountains, but primarily in forests and other moist, shady areas [4]. Larvae feed on fungi, detritus, rotten wood and organic matter in soil, playing a role as decomposers [1, 5, 6]. Nevertheless, several sciarid species are also known as economically significant pests. The larvae of Bradysia cause serious damage to onions, carrots, and edible mushrooms [1]. In addition, adult flies serve as vectors for the pathogen Fusarium foetens on ornamental begonias, causing serious economic loss [7, 8].
Phylogenetic relationships within Sciaridae are still controversial despite more than two hundred years of research [9, 10]. Menzel & Mohrig (2000) used 160 characters to construct a morphological phylogenetic tree of Sciaridae, suggesting that this family includes Sciarinae, Megalosphyinae, Cratyninae and a clade composed of the Pseudolycoriella group plus the Corynoptera sensu lato group. However, all species included in Menzel & Mohrig (2000) were confined to the Palearctic region. Although Vilkamaa & Hippa (2004) and Hippa & Vilkamaa (2005, 2006) reconstructed morphological trees for Cratyna sensu lato and for fossil Sciaroidea, phylogenetic relationships within the broader family remain unresolved because of limited taxon representation and homoplastic characters. Recent molecular phylogenetic trees [15, 16] based on a few loci supported the major clades Sciarinae, Cratyninae, Megalosphyinae and the Pseudolycoriella group that were found in the morphological work by Menzel & Mohrig (2000), plus a Chaetosciara group, which was later recognized as a new subfamily, Chaetosciarinae [17]. Topological relationships within Sciaridae based on either morphological characters or a few molecular loci were incongruent, and were unstable when additional taxa or genes were included [16].
In general, the mitochondrial genome of most insects is a closed circular DNA molecule ranging in size from 14 to 20 kb that encodes a common set of 37 genes (13 protein-coding genes, 22 tRNA genes, and two rRNA genes) and an A+T-rich region [18, 19, 20]. Because the mitochondrial genome is small, with conserved gene content, high evolutionary rates, maternal inheritance and rare recombination, it has been widely used for species identification and molecular evolutionary studies of Diptera [21, 22, 23, 24, 25]. Owing to the rapid progress of sequencing technology, studies of Diptera mitogenomes have covered most families, especially medically significant taxa such as mosquitoes (Culicidae) [27, 28, 29] and economically important species of leaf-miners (Agromyzidae) and fruit flies (Tephritidae) [23, 30, 31, 32]. Many other common families have been sequenced as well [e.g. 33, 34, 35]. However, knowledge of the mitochondrial genome of Sciaridae is still limited: only one species has been published, and that mitogenome is incomplete, with ten tRNA genes missing [24].
In this study, we sequenced nearly complete mitochondrial genomes from five species representing three sciarid subfamilies using next-generation sequencing. We compared mitogenomes within Sciaridae and with other families of Sciaroidea, and analyzed phylogenetic relationships among the Sciaroidea based on nine mitochondrial genomes.
2. Materials and methods
2.1 DNA extraction and species identification
Representative specimens were identified based on morphology: Bradysia sp. (representative of B. pallipes species group); Dolichosciara megumiae; Sciara ruficauda (representative of S. ruficauda species group); Trichosia lengersdorfi (representative of subgenus Trichosia Winnertz s. str.); Pseudolycoriella sp. (representative of Ps. bruckii species group) [11, 36, 37, 38, 39]. The unnamed species from the genera Bradysia Winnertz and Pseudolycoriella Menzel & Mohrig are new to science and will be described elsewhere in a taxonomic paper. Information on the specimens studied (e.g. collecting sites, GenBank accession numbers) is summarized in Table 1. All specimens were preserved in absolute alcohol and stored at −20°C before DNA extraction. Total genomic DNA was extracted from a single specimen using the DNeasy Blood & Tissue kit (Qiagen, Hilden, Germany) following the manufacturer's instructions. The concentration of extracted genomic DNA was quantified with a Qubit 3.0 (Invitrogen, Life Technologies, Carlsbad, CA, USA). Voucher specimens are deposited in the School of Forestry and Biotechnology, Zhejiang A&F University, China (Specimen number: SCMI1–SCMI5).
2.2 Mitochondrial genome sequencing, assembly and annotation
Genomic DNA libraries were constructed for each sample using the Illumina® VAHTS™ Universal DNA Library. Indexed libraries were sequenced directly on an Illumina NovaSeq platform with 150 bp paired-end reads by Novogene (Tianjin, China).
FastQC version 0.11.3 was used to evaluate read quality [40], and low-quality reads and sites were filtered with Trimmomatic version 3.2.57 [41]. To simplify de novo assembly, target mitochondrial genome sequences were extracted by BLAST version 3.6 [42] run against a local database composed of GenBank accessions of Sciaroidea mitochondrial genomes. Whole mitochondrial genomes were assembled with SPAdes version 3.13.1 using default settings [43, 44]. Mitochondrial genome annotations were predicted by the MITOS webserver [45] using the invertebrate mitochondrial genetic code. tRNA genes were confirmed with tRNAscan-SE [46]. The 5′ and 3′ ends of each protein-coding or rRNA gene were checked in Geneious Prime against homologous genes from Diptera. All mitochondrial genomes were submitted to GenBank (accession numbers: MN161585–MN161589).
Table 1. Information on the mitochondrial genomes used in this study. Note: "-" represents an unknown collecting site.
| Species | Family | Subfamily | GenBank accession number | Collecting site | Reference |
|---|---|---|---|---|---|
| Bradysia sp. | Sciaridae | Megalosphyinae | MN161585 | Zhejiang, China | This study |
| Dolichosciara megumiae | Sciaridae | Megalosphyinae | MN161588 | Zhejiang, China | This study |
| Pseudolycoriella sp. | Sciaridae | Undescribed subfamily (named ‘Pseudolycoriella group’) | MN161587 | Sichuan, China | This study |
| Sciara ruficauda | Sciaridae | Sciarinae | MN161586 | Zhejiang, China | This study |
| Trichosia lengersdorfi | Sciaridae | Sciarinae | MN161589 | Zhejiang, China | This study |
| Bradysia tilicola | Sciaridae | Megalosphyinae | GQ387651 | Providence, RI, USA | Beckenbach and Joy [47] |
| Arachnocampa flava | Keroplatidae | Arachnocampinae | JN861748 | - | Beckenbach [48] |
| Rhopalomyia pomum | Cecidomyiidae | Cecidomyiinae | GQ387649 | Kamloops, BC, Canada | Beckenbach and Joy [47] |
| Orseolia oryzae | Cecidomyiidae | Cecidomyiinae | KM888183 | Hyderabad, India | Atray et al. [49] |

2.3 Genome feature analysis
Base composition and codon usage of PCGs were calculated in MEGA7 [50]. Compositional skew was calculated according to the formulas AT-skew = (A−T) / (A+T) and GC-skew = (G−C) / (G+C) [51]. Rates of synonymous (Ks) and non-synonymous (Ka) substitutions for each PCG were calculated in DnaSP 5.0 [52].
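The skew formulas above can be illustrated with a minimal Python sketch. This is not the software used in the study (base composition was calculated in MEGA7); the function names `skews_from_counts` and `skews_from_sequence` are hypothetical, and the toy sequence is invented for illustration. The percentages in the usage lines are the Bradysia sp. values reported in Table 2.

```python
from collections import Counter


def skews_from_counts(a, t, g, c):
    """Return (AT-skew, GC-skew) from base counts or percentages.

    AT-skew = (A - T) / (A + T); GC-skew = (G - C) / (G + C).
    Assumes A + T > 0 and G + C > 0 (no guard against division by zero).
    """
    return (a - t) / (a + t), (g - c) / (g + c)


def skews_from_sequence(seq):
    """Count A, T, G and C in a nucleotide string and return both skews."""
    counts = Counter(seq.upper())
    return skews_from_counts(counts["A"], counts["T"], counts["G"], counts["C"])


# Whole-genome base percentages of Bradysia sp. as listed in Table 2.
at_skew, gc_skew = skews_from_counts(a=39.4, t=35.7, g=10.3, c=14.6)
print(round(at_skew, 4), round(gc_skew, 4))

# Invented toy sequence: A=3, T=2, G=1, C=2.
print(skews_from_sequence("AAATTGCC"))
```

A positive AT-skew indicates more A than T on the analyzed strand, and a negative GC-skew indicates more C than G, which is how the values in Tables 2 and 3 are read.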
2.4 Phylogenetic inference
A total of nine species were analyzed: the five species of Sciaridae sequenced in this study and four additional species retrieved from GenBank (Table 1), namely Bradysia amoena (Winnertz, 1867) [= Bradysia tilicola (Loew, 1850)] (Sciaridae), Arachnocampa flava (Harrison, 1966) (Keroplatidae), and Rhopalomyia pomum (Gagné, 1975) and Orseolia oryzae (Wood-Mason, 1889) (both Cecidomyiidae, as outgroup). MAFFT version 7.205 was used to align each protein-coding gene separately [47, 53]. Nucleotide sequences of PCGs were aligned by codon using the G-INS-I algorithm [53, 54]. Aliscore version 2.2 and Alicut version 3.2 were applied to identify and mask ambiguously aligned sites to reduce noise [55]. FASconCAT-G version 1.0 was used to concatenate genes into data matrices [56]. Partitioning schemes and substitution models were selected using PartitionFinder version 1.1.1 [57]. Initial partitions were by codon position and gene for the nucleotide analyses (13 PCGs × 3 codon positions = 39 partitions), and by gene for the amino acid analyses (13 partitions) (Supplement Table S1). Two inference methods, maximum likelihood (ML) and Bayesian inference (BI), were performed using the IQ-TREE online server [58] (http://iqtree.cibiv.univie.ac.at) and MrBayes version 3.2.6 [59, 60], respectively. Two data matrices, DNA and amino acid sequences from all protein-coding genes, were used for phylogenetic analysis.
3. Results
3.1 Genome structure and organization
Mitochondrial genome lengths of the newly sequenced Sciaridae range from 15,167 bp to 16,170 bp (Table 2). Each mitogenome contains the typical set of 37 genes, including 13 PCGs, two rRNAs, and 22 tRNAs, with two exceptions: Bradysia sp. and Dolichosciara megumiae lack trnL2, and Bradysia sp. also lacks trnI. These genomes were sequenced nearly completely; the CR failed to assemble completely in every species. A + T content is lower than 80% in all species of Sciaridae. AT-skew values are positive (from 0.0165 to 0.1050) in all species except Pseudolycoriella sp. (-0.0101) and Arachnocampa flava (-0.0341), indicating a bias towards A rather than T, while GC-skew values are negative in all species (from -0.2233 to -0.1100), indicating a bias towards C.
Table 2. Base composition of the mitochondrial genomes in Sciaroidea (assembled genome). Note: AT-skew and GC-skew were calculated to analyze the A + T and G + C bias of PCGs, with the formula AT-skew= [A−T] / [A+T] and GC-skew= [G−C] / [G+C], respectively.
| Species | T(U)% | C% | A% | G% | AT-skew | GC-skew | Length (bp) |
|---|---|---|---|---|---|---|---|
| Bradysia sp. | 35.7 | 14.6 | 39.4 | 10.3 | 0.0493 | -0.1727 | 15,512 |
| Dolichosciara megumiae | 37.9 | 12.9 | 39.5 | 9.7 | 0.0207 | -0.1416 | 15,931 |
| Pseudolycoriella sp. | 39.9 | 11.6 | 39.1 | 9.3 | -0.0101 | -0.1100 | 15,981 |
| Sciara ruficauda | 38.8 | 12.4 | 40.1 | 8.7 | 0.0165 | -0.1754 | 15,167 |
| Trichosia lengersdorfi | 38.5 | 12.6 | 40.9 | 8.0 | 0.0302 | -0.2233 | 16,170 |
| Rhopalomyia pomum | 40.6 | 8.3 | 44.6 | 6.5 | 0.0469 | -0.1216 | 14,503 |
| Bradysia tilicola | 38.5 | 12.1 | 39.8 | 9.5 | 0.0166 | -0.1204 | 13,849 |
| Arachnocampa flava | 42.4 | 10.6 | 39.6 | 7.3 | -0.0341 | -0.1844 | 16,923 |
| Orseolia oryzae | 38.3 | 8.1 | 47.4 | 6.2 | 0.1050 | -0.1329 | 15,286 |
The mitochondrial genomes of all five Sciaridae species include 13 PCGs. Nine genes (nd2, cox1, cox2, atp8, atp6, cox3, nd3, nd6 and cytb) are located on the J strand, while the other four (nd1, nd5, nd4, and nd4l) are located on the N strand. Total PCG lengths range from 11,161 to 11,268 bp (Table 3). A + T content across PCGs is 73.5% in Bradysia sp., 74.3% in Dolichosciara megumiae, 76.8% in Trichosia lengersdorfi, 76.5% in Pseudolycoriella sp. and 76.8% in Sciara ruficauda, with an average of 75.6% (Table 3).
Table 3. Base composition of all protein-coding genes (PCGs) in the mitochondrial genomes of Sciaridae. Note: AT-skew and GC-skew were calculated to analyze the A + T and G + C bias of PCGs, with the formula AT-skew= [A−T] / [A+T] and GC-skew= [G−C] / [G+C], respectively.
| Species | T% | C% | A% | G% | AT-skew | GC-skew | Length (bp) |
|---|---|---|---|---|---|---|---|
| Bradysia sp. | 41.1 | 14.1 | 30.8 | 14.0 | -0.1433 | -0.0036 | 11,249 |
| Dolichosciara megumiae | 42.6 | 12.9 | 31.7 | 12.8 | -0.1467 | -0.0039 | 11,215 |
| Pseudolycoriella sp. | 44.4 | 11.7 | 32.1 | 11.9 | -0.1608 | 0.0085 | 11,268 |
| Sciara ruficauda | 44.4 | 11.7 | 32.4 | 11.5 | -0.1563 | -0.0086 | 11,161 |
| Trichosia lengersdorfi | 44.2 | 11.8 | 32.6 | 11.4 | -0.1510 | -0.0172 | 11,201 |
3.2 Gene arrangement
Compared to the ancestral gene arrangement of insects, only Sciara ruficauda retains the typical gene positions and orientation (Figure 1). Each of the other five Sciaridae species shows rearrangement of tRNAs. In both Dolichosciara megumiae and Bradysia sp., trnL2 is missing, trnC-trnY is translocated into the tRNA cluster between nad3 and nad5, and trnR-trnN is translocated to a position between nad6 and cob. trnI was not found in either Bradysia sp. or Bradysia tilicola [= B. amoena]. Differences were also found in Pseudolycoriella sp. and Trichosia lengersdorfi. In Pseudolycoriella sp., trnC moved from its typical location to a position between trnQ and trnM, while trnY and trnL2 transposed to the position between nad2 and the CR. In Trichosia lengersdorfi, trnC transposed from its typical position between nad2 and cox1 to the block of tRNA genes between the CR and nad2, and the trnA and trnR genes switched positions. Rearrangements in sciarids mostly involve trnL2 and the tRNA clusters trnI-trnQ-trnM, trnW-trnC-trnY, and trnA-trnR-trnN-trnS1-trnE-trnF.
Figure 1. Mitochondrial gene arrangement patterns of Sciaridae. Note: blue rectangles represent the 13 PCGs (cox1-cox3: cytochrome oxidase subunits; cytb: cytochrome b; nad1-nad6: NADH dehydrogenase components; atp6, atp8: ATP synthase subunits 6 and 8); green rectangles represent tRNAs and red rectangles represent rrnL and rrnS (ribosomal RNAs). Single letters identify the transfer RNA genes. Broken lines represent unsequenced regions of the mitochondrial genome, and arrows indicate the direction and destination of tRNA gene transfers.
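One simple way to make the rearrangements described in Section 3.2 explicit is to list which neighbouring gene pairs of a reference order are no longer adjacent in a second order. The Python sketch below is an illustration only, not a method used in this study: the function names are hypothetical, strand orientation is ignored, and the query order is a simplified example that applies just the trnA/trnR switch reported for Trichosia lengersdorfi to the cluster between nad3 and nad5.

```python
def adjacencies(order):
    """Return the set of unordered neighbouring gene pairs in a linear gene order."""
    return {frozenset(pair) for pair in zip(order, order[1:])}


def lost_adjacencies(reference, query):
    """List neighbouring pairs of `reference` that are not neighbours in `query`.

    Pairs are reported in reference order; gene orientation is not considered.
    """
    present = adjacencies(query)
    return [
        (a, b)
        for a, b in zip(reference, reference[1:])
        if frozenset((a, b)) not in present
    ]


# Typical insect order of the tRNA cluster between nad3 and nad5.
reference = ["nad3", "trnA", "trnR", "trnN", "trnS1", "trnE", "trnF", "nad5"]

# Illustrative query: the same cluster with trnA and trnR switched.
query = ["nad3", "trnR", "trnA", "trnN", "trnS1", "trnE", "trnF", "nad5"]

print(lost_adjacencies(reference, query))
```

Each lost pair marks a breakpoint, so a single switch of two neighbouring genes produces two breakpoints while the pair itself stays adjacent.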
3.3 Phylogenetic relationships
Phylogenetic trees (Figure 2) were reconstructed based on nine nearly complete mitochondrial genomes of Sciaroidea, representing three families. Similar topologies were recovered from both data matrices (AA-PCGs, NU-PCGs) and both inference methods (BI and ML). All analyses support the monophyly of Sciaridae (AA-PCGs: posterior probability (PP)=1, bootstrap support (BS)=100; NU-PCGs: PP=1, BS=100). Within Sciaridae, the relationships of the three subfamilies are Sciarinae + (‘Pseudolycoriella group’ + Megalosphyinae). Sciara ruficauda and Trichosia lengersdorfi form Sciarinae with strong support (PP=1, BS>70), while Dolichosciara megumiae, Bradysia tilicola and Bradysia sp. form the subfamily Megalosphyinae with strong support (AA-PCGs: PP=1/1, BS=84/100; NU-PCGs: PP=1/1, BS=79/100), and Pseudolycoriella sp. is sister to Megalosphyinae.
Figure 2. Phylogenetic relationships of Sciaroidea inferred from mitochondrial genomes. Note: The numbers separated by "/" near the nodes represent support values of BI and ML analyses based on the AA-PCG (in purple) and NU-PCG (in yellow) matrices, respectively, while "*" represents the support values PP=1 and BS=100.
4. Discussion
The monophyly of Sciaridae has been demonstrated in recent studies [15, 16, 61] and was well supported in this study. The topology of Sciaridae was recovered as (Sciarinae + (‘Pseudolycoriella group’ + Megalosphyinae)), consistent with previous morphological and molecular studies [11, 15, 16]. The Sciarinae and Megalosphyinae are highly supported, consistent with previous research [15, 16].
The present phylogenetic system of Diptera is composed of five major clades: the "lower" Diptera groups Tipulomorpha, Culicomorpha, Psychodomorpha and Bibionomorpha, and the "higher" Diptera, the Brachycera [24, 62]. Sciaroidea is usually regarded as the most speciose superfamily of the Bibionomorpha [63, 64]. Most mitogenome rearrangements found in Diptera are duplications or inversions of tRNA genes [65, 66]. For example, in Paracladura trichoptera (Tipulomorpha: Trichoceridae), rearrangements involve both tRNA and protein-coding genes [48]; trnS is inverted in Anopheles quadrimaculatus and A. gambiae (Culicomorpha: Culicidae) [27, 28]. Novel gene orders were found in Chrysomya (Brachycera: Calliphoridae), with a trnI duplication on either side of the CR [67, 68, 69]. Compared with other dipteran species, rearrangements are more common in Sciaridae: inversion or transposition of trnL2 is frequent, as are rearrangements in the tRNA clusters trnI-trnQ-trnM, trnW-trnC-trnY, and trnA-trnR-trnN-trnS1-trnE-trnF.
Although the composition of the Sciaroidea is still controversial [13, 14, 61, 70], studies based on mitochondrial and nuclear loci show Cecidomyiidae nested within Sciaroidea [47, 61, 62]. Within Sciaroidea, PCGs are not translocated within the mitogenome. Rearrangements in the mitochondrial genomes of Cecidomyiidae are notably complex for Diptera, including losses of tRNA genes and inversions of trnI, trnC, trnY, trnE, trnN, trnP and trnT [47]. Rearrangements in Sciaridae are comparatively simple, involving trnL2, trnC, trnY, trnR and trnN, while in Keroplatidae only a trnE inversion has been recorded, in Arachnocampa flava [48].
5. Conclusion
In this study, we sequenced five nearly complete mitochondrial genomes using next-generation sequencing technologies. Our phylogenetic results support previous studies based on molecular data [15] and morphological characters [11]; however, the taxon selection is still limited. Whole mitogenomes will serve as a useful dataset for further study of the genetics, systematics and phylogeny of Sciaridae.
Acknowledgments
We are grateful to Dr. Pu Tang and Ms. Bo-Ying Zheng (Zhejiang University, China) for their assistance with data analysis and manuscript improvement. We also thank Andrew Liston (Senckenberg Deutsches Entomologisches Institut, Müncheberg, Germany) for checking the English and Mr. Xue Yang and Mr. Zuluan Chen (Zhejiang A&F University, China) for their help with the identification. This study was supported by the National Natural Science Foundation of China (NSFC, grant no. 31872270).
References
[1] M. Arimoto, R. Uesugi, N. Hinomoto, M. Sueyoshi, S.I. Yoshimatsu, Molecular marker to identify the fungus gnat, Bradysia sp. (Diptera: Sciaridae), a new pest of Welsh onion and carrot in Japan, Appl. Entomol. Zool. 53 (2018) 419–424.
[2] N.E.H. El Ouazzani, K. Heller, K. Kettani, The first checklist of black fungus gnats (Diptera: Sciaridae) of Morocco, Ann. Soc. Ent. Fr. (NS) 55 (2019) 274–290.
[3] X. Yang, K. Shi, K. Heller, F. Menzel, Morphology and DNA barcodes of two species of Bradysia Winnertz from China (Diptera, Sciaridae), with the description of Bradysia minorlobus Yang, Shi & Huang sp. n., Zootaxa 4612 (2019) 85–94.
[4] A. Köhler, W. Mohrig, Additions to the New Zealand fauna of black fungus gnats (Diptera: Sciaridae), with descriptions of six new species, N. Z. Entomol. 39 (2016) 91–109.
[5] D.K. Yeates, D. Bickel, D.K. McAlpine, D.H. Colless, Chapter Eight. Diversity, Relationships and Biogeography of Australian Flies, In: Diptera Diversity: Status, Challenges and Tools (2010) 227–256.
[6] W. Mohrig, K. Heller, H. Hippa, P. Vilkamaa, F. Menzel, Revision of the black fungus gnats (Diptera: Sciaridae) of North America, Studia dipt. 19 (2013) 141–286.
[7] K. Scarlett, L. Tesoriero, R. Daniel, D. Guest, Sciarid and shore flies as aerial vectors of Fusarium oxysporum f. sp. cucumerinum in greenhouse cucumbers, J. Appl. Entomol. 138 (2014) 368–377.
[8] W.H. Elmer, Preventing spread of Fusarium wilt of Hiemalis begonias in the greenhouse, Crop Protection 27 (2008) 1078–1083.
[9] R. Frey, Entwurf einer neuen Klassifikation der Mückenfamilie Sciaridae, Notul. Ent. 22 (1942) 5–44.
[10] J.W. Meigen, Klassifikazion und Beschreibung der europäischen zweiflügligen Insekten (Diptera Linn.), Karl Reichard (1804) 1–152.
[11] F. Menzel, W. Mohrig, Revision der paläarktischen Trauermücken (Diptera: Sciaridae), Studia dipt. Suppl. 6 (2000) 1–761.
[12] P. Vilkamaa, H. Hippa, The genus Xenosciara gen. n. and the phylogeny of the Sciaridae (Diptera), Zootaxa 699 (2004) 1–24.
[13] P. Vilkamaa, H. Hippa, The genus Sciarotricha gen. n. (Sciaridae) and the phylogeny of recent and fossil Sciaroidea (Diptera), Insect Syst. Evol. 36 (2005) 121–143.
[14] H. Hippa, P. Vilkamaa, Phylogeny of the Sciaroidea (Diptera): the implication of additional taxa and character data, Zootaxa 1132 (2006) 63–68.
[15] S. Shin, S. Jung, F. Menzel, K. Heller, H. Lee, S. Lee, Molecular phylogeny of black fungus gnats (Diptera: Sciaroidea: Sciaridae) and the evolution of larval habitats, Mol. Phylogenet. Evol. 66 (2013) 833–846.
[16] P. Vilkamaa, H.G. Rudzinski, N. Burdikova, J. Ševčík, Phylogenetic position of Aerumnosa Mohrig (Diptera, Sciaridae) as revealed by multigene analysis, with the description of four new Oriental species, Zootaxa 4399 (2018) 248–260.
[17] S. Shin, H. Lee, S. Lee, Proposal of a new subfamily of Sciaridae (Diptera: Sciaridae), with description of one new species from South Korea, Zootaxa 4543 (2019) 127–136.
[18] J.L. Boore, Animal mitochondrial genomes, Nucleic Acids Res. 27 (1999) 1767–1780.
[19] D.R. Wolstenholme, Animal mitochondrial DNA: structure and evolution, Int. Rev. Cytol. 141 (1992) 173–216.
[20] S.L. Cameron, Insect mitochondrial genomics: implications for evolution and phylogeny, Annu. Rev. Entomol. 59 (2014) 95–117.
[21] A.C. Lessinger, A.C. Martins Junqueira, T.A. Lemos, E.L. Kemper, F.R. Da Silva, A.L. Vettore, A.M.L. Azeredo-Espin, The mitochondrial genome of the primary screwworm fly Cochliomyia hominivorax (Diptera: Calliphoridae), Insect Mol. Biol. 9 (2000) 521–529.
[22] J. Krzywinski, O.G. Grushko, N.J. Besansky, Analysis of the complete mitochondrial DNA from Anopheles funestus: an improved dipteran mitochondrial genome annotation and a temporal dimension of mosquito evolution, Mol. Phylogenet. Evol. 39 (2006) 417–423.
[23] F. Yang, Y.Z. Du, L.P. Wang, J.M. Cao, W.W. Yu, The complete mitochondrial genome of the leafminer Liriomyza sativae (Diptera: Agromyzidae): Great difference in the A+T-rich region compared to Liriomyza trifolii, Gene 485 (2011) 7–15.
[24] B.M. Wiegmann, M.D. Trautwein, I.S. Winkler, N.B. Barr, J.W. Kim, C. Lambkin, B.M. Wheeler, Episodic radiations in the fly tree of life, Proc. Nat. Acad. Sci. 108 (2011) 5690–5695.
[25] X. Zhang, Z. Kang, S. Ding, Y. Wang, C. Borkent, T. Saigusa, D. Yang, Mitochondrial Genomes Provide Insights into the Phylogeny of Culicomorpha (Insecta: Diptera), Int. J. Mol. Sci. 20 (2019) 747.
[26] M.P. Ramakodi, B. Singh, J.D. Wells, F. Guerrero, D.A. Ray, A 454 sequencing approach to dipteran mitochondrial genome research, Genomics 105 (2015) 53–60.
[27] C.B. Beard, D.M. Hamm, F.H. Collins, The mitochondrial genome of the mosquito Anopheles gambiae: DNA sequence, genome organization, and comparisons with mitochondrial sequences of other insects, Insect Mol. Biol. 2 (1993) 103–124.
[28] S.E. Mitchell, A.F. Cockburn, J.A. Seawright, The mitochondrial genome of Anopheles quadrimaculatus species A: complete nucleotide sequence and gene organization, Genome 36 (1993) 1058–1073.
[29] A. De Oliveira Aragão, J.P.N. Neto, A.C.R. Cruz, S.M.M. Casseb, J.F. Cardoso, S.P. da Silva, E.A.Y. Ishikawa, Description and phylogeny of the mitochondrial genome of Sabethes chloropterus, Sabethes glaucodaemon and Sabethes belisarioi (Diptera: Culicidae), Genomics 111 (2019) 607–611.
[30] Z. Zhao, T.J. Su, D. Chesters, S.Y. Ho, C.D. Zhu, X.L. Chen, C.T. Zhang, The mitochondrial genome of Elodia flavipalpis Aldrich (Diptera: Tachinidae) and the evolutionary timescale of tachinid flies, PLOS ONE 8 (2013) e61814.
[31] J.Y. Chen, Y.W. Chang, S.Z. Zheng, M.X. Lu, Y.Z. Du, Comparative analysis of the Liriomyza chinensis mitochondrial genome with other Agromyzids reveals conserved genome features, Sci. Rep. 8 (2018) 8850.
[32] L.T. da Costa, C. Powell, S. van Noort, C. Costa, M. Sinno, V. Caleca, B. van Asch, The complete mitochondrial genome of Bactrocera biguttula (Bezzi) (Diptera: Tephritidae) and phylogenetic relationships with other Dacini, Int. J. Biol. Macromol. 126 (2019) 130–140.
[33] Z. Kang, X. Li, D. Yang, The complete mitochondrial genome of Dixella sp. (Diptera: Nematocera, Dixidae), Mitochondr. DNA Part A 27 (2016) 1528–1529.
[34] J. Ren, Q. Yang, S. Gao, Z. Pan, W. Chang, D. Yang, The mitochondrial genome of Limonia phragmitidis (Diptera: Limoniidae), Mitochondr. DNA Part B 4 (2019) 719–720.
[35] L. Wang, X. Li, D. Yang, The complete mitochondrial genome of Silba sp. (Diptera: Lonchaeidae), Mitochondr. DNA Part B 4 (2019) 2694–2695.
[36] K. Heller, A. Köhler, F. Menzel, K.M. Olsen, Ø.I. Gammelmo, Two formerly unrecognized species of Sciaridae (Diptera) revealed by DNA barcoding, Norw. J. Entomol. 63 (2016) 96–115.
[37] M. Sasakawa, Fungus gnats associated with flowers of the genus Arisaema (Araceae) Part 3. Sciaridae (Diptera), Jpn. J. Entomol. 62 (1994) 667–681.
[38] F. Menzel, W. Mohrig, Beiträge zur Taxonomie und Faunistik der paläarktischen Trauermücken (Diptera, Sciaridae). Teil VI - Neue Ergebnisse aus Typenuntersuchungen und die daraus resultierenden taxonomisch-nomenklatorischen Konsequenzen, Studia dipt. 5 (1998) 351–378.
[39] J. Winnertz, Beitrag zu einer Monographie der Sciarinen, Braumüller (1867) 1–187.
[40] S. Andrews, FastQC: a quality control tool for high throughput sequence data. Babraham Bioinformatics, online (2010) [Mar 2016].
[41] A.M. Bolger, M. Lohse, B. Usadel, Trimmomatic: a flexible trimmer for Illumina sequence data, Bioinformatics 30 (2014) 2114–2120.
[42] W.J. Kent, BLAT - the BLAST-like alignment tool, Genome Res. 12 (2002) 656–664.
[43] A. Bankevich, S. Nurk, D. Antipov, A.A. Gurevich, M. Dvorkin, A.S. Kulikov, A.V. Pyshkin, SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing, J. Comput. Biol. 19 (2012) 455–477.
[44] S. Nurk, A. Bankevich, D. Antipov, A. Gurevich, A. Korobeynikov, V. Lapidus, R. Stepanauskas, Assembling genomes and mini-metagenomes from highly chimeric reads, In: Annual International Conference on Research in Computational Molecular Biology (2013) 158–170.
[45] M. Bernt, A. Donath, F. Jühling, F. Externbrink, C. Florentz, G. Fritzsch, P.F. Stadler, MITOS: improved de novo metazoan mitochondrial genome annotation, Mol. Phylogenet. Evol. 69 (2013) 313–319.
[46] T.M. Lowe, S.R. Eddy, tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence, Nucleic Acids Res. 25 (1997) 955–964.
[47] A.T. Beckenbach, J.B. Joy, Evolution of the mitochondrial genomes of gall midges (Diptera: Cecidomyiidae): rearrangement and severe truncation of tRNA genes, Genome Biol. Evol. 1 (2009) 278–287.
[48] A.T. Beckenbach, Mitochondrial genome sequences of Nematocera (lower Diptera): evidence of rearrangement following a complete genome duplication in a winter crane fly, Genome Biol. Evol. 4 (2012) 89–101.
[49] I. Atray, J.S. Bentur, S. Nair, The Asian rice gall midge (Orseolia oryzae) mitogenome has evolved novel gene boundaries and tandem repeats that distinguish its biotypes, PLOS ONE 10 (2015) e0134625.
[50] K. Tamura, G. Stecher, D. Peterson, A. Filipski, S. Kumar, MEGA6: molecular evolutionary genetics analysis version 6.0, Mol. Biol. Evol. 30 (2013) 2725–2729.
[51] N.T. Perna, T.D. Kocher, Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes, J. Mol. Evol. 41 (1995) 353–358.
[52] J. Rozas, DNA sequence polymorphism analysis using DnaSP, In: Bioinformatics for DNA sequence analysis (2009) 337–350.
[53] K. Katoh, K. Misawa, K.I. Kuma, T. Miyata, MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform, Nucleic Acids Res. 30 (2002) 3059–3066.
[54] Q. Li, S.J. Wei, P. Tang, Q. Wu, M. Shi, M.J. Sharkey, X.X. Chen, Multiple lines of evidence from mitochondrial genomes resolve phylogenetic relationships of parasitic wasps in Braconidae, Genome Biol. Evol. 8 (2016) 2651–2662.
[55] P. Kück, ALICUT: a Perlscript which cuts ALISCORE identified RSS. Department of Bioinformatics, Zoologisches Forschungsmuseum A. Koenig (ZFMK), Bonn, Germany, version 2 (2009).
[56] P. Kück, G.C. Longo, FASconCAT-G: extensive functions for multiple sequence alignment preparations concerning phylogenetic studies, Front. Zool. 11 (2014) 81.
[57] R. Lanfear, B. Calcott, S.Y. Ho, S. Guindon, PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses, Mol. Biol. Evol. 29 (2012) 1695–1701.
[58] D.T. Hoang, O. Chernomor, A. Von Haeseler, B.Q. Minh, L.S. Vinh, UFBoot2: improving the ultrafast bootstrap approximation, Mol. Biol. Evol. 35 (2017) 518–522.
[59] F. Ronquist, M. Teslenko, P. Van Der Mark, D.L. Ayres, A. Darling, S. Höhna, J.P. Huelsenbeck, MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space, Syst. Biol. 61 (2012) 539–542.
[60] A. Stamatakis, Using RAxML to infer phylogenies, Bioinformatics 51 (2015) 6–14.
[61] J. Ševčík, D. Kaspřák, M. Mantič, S. Fitzgerald, T. Ševčíková, A. Tóthová, M. Jaschhof, Molecular phylogeny of the megadiverse insect infraorder Bibionomorpha sensu lato (Diptera), PeerJ 4 (2016) e2563.
[62] B.M. Wiegmann, D.K. Yeates, A.H. Kirk-Spriggs, B.J. Sinclair, Phylogeny of the Diptera, Manual of Afrotropical Diptera 1 (2017) 253–265.
[63] P.D. Hebert, S. Ratnasingham, E.V. Zakharov, A.C. Telfer, V. Levesque-Beaudin, M.A. Milton, J.R. deWaard, Counting animal species with DNA barcodes: Canadian insects, Philos. T. R. Soc. B. 371 (2016) 20150333.
[64] J. Morinière, M. Balke, D. Doczkal, M.F. Geiger, L.A. Hardulak, G. Haszprunar, S. Schmidt, A DNA barcode library for 5,200 German flies and midges (Insecta: Diptera) and its implications for metabarcoding-based biomonitoring, Mol. Ecol. Res. (2019).
[65] S.J. Wei, M. Shi, M.J. Sharkey, C. van Achterberg, X.X. Chen, Comparative mitogenomics of Braconidae (Insecta: Hymenoptera) and the phylogenetic utility of mitochondrial genomes with special reference to Holometabolous insects, BMC Genomics 11 (2010) 371.
[66] D.J. Yu, L. Xu, F. Nardi, J.G. Li, R.J. Zhang, The complete nucleotide sequence of the mitochondrial genome of the oriental fruit fly, Bactrocera dorsalis (Diptera: Tephritidae), Gene 396 (2007) 66–74.
[67] J.R. Stevens, H. West, R. Wall, Mitochondrial genomes of the sheep blowfly, Lucilia sericata, and the secondary blowfly, Chrysomya megacephala, Med. Vet. Entomol. 22 (2008) 89–91.
[68] A.C.M. Junqueira, A.C. Lessinger, T.T. Torres, F.R. da Silva, A.L. Vettore, P. Arruda, A.M.L. Azeredo-Espin, The mitochondrial genome of the blowfly Chrysomya chloropyga (Diptera: Calliphoridae), Gene 339 (2004) 7–15.
[69] L.A. Nelson, C.L. Lambkin, P. Batterham, J.F. Wallman, M. Dowton, M.F. Whiting, S.L. Cameron, Beyond barcoding: A mitochondrial genomics approach to molecular phylogenetics and diagnostics of blowflies (Diptera: Calliphoridae), Gene 511 (2012) 131–142.
[70] D.D. Amorim, E. Rindal, Phylogeny of the Mycetophiliformia, with proposal of the subfamilies Heterotrichinae, Ohakuneinae, and Chiletrichinae for the Rangomaramidae (Diptera, Bibionomorpha), Zootaxa 1535 (2007) 1–92.
Author statement
Xiaoqian Miao: Writing, Investigation, Formal analysis
Junhao Huang: Conceptualization, Writing-Review, Funding acquisition
Frank Menzel: Reviewing
Qingyun Wang: Visualization
Qiaoyu Wei: Data curation, Software, Validation
Xiao-Long Lin: Methodology, Reviewing
Hong Wu: Supervision