Genetic diversity and population structure of Commelina communis in China based on simple sequence repeat markers

Genetic diversity and population structure of Commelina communis in China based on simple sequence repeat markers

Journal of Integrative Agriculture 2018, 17(10): 2292–2301 Available online at www.sciencedirect.com ScienceDirect RESEARCH ARTICLE Genetic diversi...

714KB Sizes 0 Downloads 70 Views

Journal of Integrative Agriculture 2018, 17(10): 2292–2301 Available online at www.sciencedirect.com

ScienceDirect

RESEARCH ARTICLE

Genetic diversity and population structure of Commelina communis in China based on simple sequence repeat markers YANG Juan1, 2, YU Hai-yan2, LI Xiang-ju2, DONG Jin-gao1 1

Key Laboratory of Hebei Province for Plant Physiology and Molecular Pathology, College of Life Science, Hebei Agricultural University, Baoding 071001, P.R.China

2

Key Laboratory of Weed and Rodent Biology and Management, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, P.R.China

Abstract Commelina communis (Asiatic dayflower) is a troublesome weed in China. Genetic variation of 46 C. communis populations from different collection sites in our country was investigated using 12 simple sequence repeat (SSR) primer pairs. Polymorphism analysis results showed high level of genetic diversity among these populations. The alleles (bands) were amplified by these primer pairs. The polymorphic proportion was 18.25%, and the average polymorphism information content was 0.1330. The highest effective number of alleles was 1.9915 at locus YP33, and the lowest value was 1.0000 at both loci YP25 and YP31. C. communis showed major average observed heterozygosity value (0.8655) than that of average expected heterozygosity (0.1330). C. communis populations were divided into three groups on the basis of unweighted pair-group method with arithmetic mean cluster analysis (Dice genetic similarity coefficient=0.772) and genetic structure analysis (K=3), and a principal coordinate analysis. The results of this study further illustrated that C. communis populations contained abundant genetic information, and the 12 SSR markers could detect the microsatellite loci of C. communis genomic DNA. These results might indicate that C. communis maintains high genetic diversity among different populations. Keywords: Commelina communis, UPGMA, population structure, principal component analysis, microsatellite

monocot plant belonging to the Commelinaceae family. It

1. Introduction Commelina communis, commonly known as the Asiatic dayflower or bamboo leaf weed, is an annual to perennial,

germinates in late spring and mainly affects the growth of maize, soybean, wheat, vegetables, and other crops (Li 1998; Mishra et al. 2002). For example, 98% of maize growers and soybean growers in Northeast China rank C. communis as the top of three troublesome weeds in recent years because of the succession of weed community of the fields (Ma et al. 2010).

Received 29 December, 2017 Accepted 26 February, 2018 YANG Juan, E-mail: [email protected]; Correspondence LI Xiang-ju, Tel: +86-10-62813309, E-mail: [email protected]; DONG Jin-gao, Tel: +86-312-7528266, E-mail: [email protected] © 2018 CAAS. Publishing services by Elsevier B.V. All rights reserved. doi: 10.1016/S2095-3119(18)61906-9

C. communis is native to temperate Northeast Asia. Moreover, it is cosmopolitan and has been broadly distributed all over the world, especially in the Northern Hemisphere (Santiago and Micheal 2009). The phenotypic plasticity of C. communis allows itself to maintain dominance in some wicked environments, so that it has a wide

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

ecological tolerance to climatic and soil factors (Kutbay and Uckan 1998). A few reasons underlie the serious abundance of C. communis. On one hand, C. communis has an aggressive growth habit, creeping along the soil and rooting adventitiously at the nodes, thus increasing the potential for survival (Huang et al. 2000; Pyšek 2001; Li et al. 2008; Santiago and Micheal 2009). On the other hand, C. communis is an out-crossing species. That is, a single plant can produce 5–80 bisexual flowers in each life cycle (Li et al. 2016). Additionally, C. communis can reproduce by both vegetative reproduction and seed propagation, and like other Commelinaceae species, C. communis is known for its opportunistic germination throughout the growing season (Prostko and Webster 2007; GómezVargas 2012). Furthermore, C. communis is tolerant to most herbicides popularly used in maize and soybean field such as imazethapyr, acetochlor, fluroxypyr, glyphosate, etc. (Ma et al. 2009; Santiago and Micheal 2009). The control efficacy of a herbicide varied on C. communis populations in different locations, causing this species escaping from herbicide treatments in some areas. At present, farmers in China are mainly focused on increasing the dosages of herbicides in the management of C. communis without knowing the diversity of populations, bring environment pollution and the carry over to the subsequent crops (Santiago and Micheal 2009). Genetic variation within local populations and shared among populations in a geographic area determines the potential for evolutionary adaptation to changing environmental conditions. The potential for weed populations which was adapted to management practices is dependent on population genetic variation. Thus, knowledge of weed genetic diversity is essential for designing effective long-term weed management programs. Simple sequence repeats (SSRs), also known as microsatellites, are shortly repeated DNA sequences generally with 1–6 base pairs (the majority has 2–4 base pairs) in length per unit (Liu et al. 2016). SSR is distributed extensively throughout the genomic DNA. In addition, SSR has several advantages, including typically codominant, highly polymorphic, reproducible, and highly accessible (Zane et al. 2002; Selkoe and Toonen 2006). Studies on genetic diversity and genetic structure based on SSR were conducted on plants, mushrooms, animals, and other organisms. For example, a total of 13 highly polymorphic and transferable cross-species SSR primers were evaluated for genetic diversity in Solanum elaeagnifolium populations, these primers laid the foundation for studying the extent of genetic diversity in S. elaeagnifolium populations (Zhu et al. 2012). The phylogenetic relationship of Echinochloa species was also studied based on 23 SSR primer pairs and 24 morphological traits successfully (Lee et al. 2016).

2293

In this study, the genetic diversity of C. communis populations was analyzed using SSR fluorescent markers. The specific objective of this study was to quantify the genetic diversity of C. comminus, which may serve as potential information between the correlating gene diversity parameters and herbicide use patterns.

2. Materials and methods 2.1. Plant materials Samples of C. communis representing populations were collected from 8 provinces in China, mainly from corn fields through “Z” sampling or from roadsides by randomly sampling (Table 1). A total of 20 plants were selected from each population for analysis after one year vegetative reproduction.

2.2. DNA extraction C. communis genomic DNA was extracted from fresh leaves using the Tiangen DNA Extraction Kit (Tiangen Biotech (Beijing) Co., Ltd., China). The quality and concentration of DNA were detected using 1% agarose and Nano Drop 2000 (Thermo Scientific, USA), respectively. Furthermore, the concentration was diluted to 30–50 ng μL–1 using TE buffer.

2.3. SSR primer screening and PCR amplification A total of 34 SSR primers derived from the C. communis genome were used in this study (Li et al. 2015). The PCR reaction system was carried out in a 20-μL volume containing 1 μL of 30–50 ng genomic DNA, 0.5 μL or 0.6 μL of forward primer and reverse primer (1.0×10–8 mol L–1), 10 μL 2×Taq PCR Master Mix (0.1 U Taq polymerase μL–1, 5.0×10–4 mol L–1 dNTP each, 2.0×10–2 mol L–1 Tris-HCl (pH 8.3), 0.1 mol L–1 KCl, 3.0×10–3 mol L–1 MgCl2; Tiangen, Beijing, China). The PCR amplification conditions were as follows: 95°C for 3 min, 94°C for 30 s, annealing temperature for 30 s, 72°C for 30 s or 1 min, 35 cycles from steps 2 to 4, and final extension steps at 72°C for 15 or 30 min. The SSR primers with specific amplification and high polymorphism were chosen as candidate makers from a total of 34 SSR primers derived from C. communis reported in reference (Li et al. 2015). Among these SSR markers, the 12 primers of polymorphisms were screened and labelled with fluorescent dyes 6-FAM, or HEX by Invitrogen Biotechnology Co., Ltd. (Shanghai, China). The annealing temperature was optimized in accordance with each specific primer. The high polymorphism primer sequences and loci in assembled scaffolds are shown in Table 2.

2294

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

Table 1 Information of Commelina communis populations Code Population Collected locations 1 HLJ-1 Experimental base of Agricultural Bureau of Nenjiang County, Heihe City, Heilongjiang Province 2 HLJ-2 Experimental base of Agricultural Technology Promotion Centre of Wudalianchi County, Heihe City, Heilongjiang Province 3 HLJ-3 Experimental base of Agricultural Technology Promotion Centre of Bei’an County, Heihe City, Heilongjiang Province 4 HLJ-4 Zhaoguang Farm, Bei’an County, Heihe City, Heilongjiang Province 5 HLJ-5 Experimental base of Heihe Branch of Heilongjiang Academy of Agricultural Sciences, Heihe City, Heilongjiang Province 6 JL-1 Minzhu Village, Xiwei Town, Yitong Manchu Autonomous County, Siping City, Jilin Province 7 JL-2 Hongguang Village, Xiwei Town, Yitong Manchu Autonomous County, Siping City, Jilin Province 8 JL-3 Zhuangjia Village, Kaoshan Town, Yitong Manchu Autonomous County, Siping City, Jilin Province 9 JL-4 Mangzhang Village, Kaoshan Town, Yitong Manchu Autonomous County, Siping City, Jilin Province 10 JL-5 Yitong Town, Yitong Manchu Autonomous County, Siping City, Jilin Province 11 JL-6 Experimental base of Tonghua Academy of Agriculture Sciences, Meihekou County, Tonghua City, Jilin Province 12 JL-7 Dongfeng County, Liaoyuan City, Jilin Province 13 JL-8 Dongliao County, Liaoyuan City, Jilin Province 14 JL-9 Liuhe County, Tonghua City, Jilin Province 15 JL-10 Huinan County, Tonghua City, Jilin Province 16 JL-11 Panshi County, Jilin City, Jilin Province 17 JL-12 Heilinzi town, Gongzhuling County, Siping City, Jilin Province 18 JL-13 Experimental base of Jilin Academy of Agriculture Sciences, Gongzhuling County, Siping City, Jilin Province 19 LN-1 Niugangzi Village, Yinzhou District, Tieling City, Liaoning Province 20 LN-2 Qianbali Village, Yinzhou District, Tieling City, Liaoning Province 21 LN-3 Baowei Village, Caohe Town, Fengcheng County, Dandong City, Liaoning Province 22 LN-4 Sanguan Village, Dabu Mongolian Nationality Town, Fengcheng County, Dandong City, Liaoning Province 23 LN-5 Dayingzi Village, Baoshan Town, Fengcheng County, Dandong City, Liaoning Province 24 LN-6 Experimental base of Jinzhou Academy of Agriculture Sciences, Jinzhou City, Liaoning Province 25 LN-7 Taoli Village, Fuxin Town, Fuxin Mongolian Autonomous County, Fuxin City, Liaoning Province 26 LN-8 Zhalan Village, Dasijiazi Town, Zhangwu County, Fuxin City, Liaoning Province 27 LN-9 Beiguan Village, Heishan Town, Heishan County, Jinzhou City, Liaoning Province 28 HB-1 Experimental base of Handan Academy of Agriculture Sciences, Handan City, Hebei Province 29 HB-2 Experimental base of Tangshan Academy of Agriculture Sciences, Tangshan City, Hebei Province 30 HB-3 Qianxi County, Tangshan City, Hebei Province 31 HB-4 Dayangguanying Village, Yejituo Town, Qian’an County, Tangshan City, Hebei Province 32 HB-5 Tianzhuang Village, Xingcheng Town, Qianxi County, Tangshan City, Hebei Province 33 HB-6 Liangshan Village, Liangshan Town, Changli County, Qinhuangdao City, Hebei Province 34 HB-7 Huaguoshan Village, Gehetou Town, Qinglong Manchu Autonomous County, Qinhuangdao City, Hebei Province 35 HB-8 Experimental base of Agricultural Bureau of Pingquan County, Chengde City, Hebei Province 36 HB-9 Experimental base of Hebei North University, Zhangjiakou City, Hebei Province 37 HHB-1 Hushan Village, Chengguan Town, Zhushan County, Shiyan City, Hubei Province 38 HHB-2 Gongjiapu Village, Zhifang Street, Jiangxia District, Wuhan City, Hubei Province 39 HHB-3 Jinshui Farm, Jiangxia District, Wuhan City, Hubei Province 40 JS-10 Qiuzhuang Village, Zhu Town, Liuhe District, Nangjing City, Jiangsu Province 41 ZJ-1 Fuyang District, Hangzhou City, Zhejiang Province 42 GZ-1 Baiyun Town, Pingba County, Anshun City, Guizhou Province 43 GZ-2 Maguan Town, Puding County, Anshun City, Guizhou Province 44 GZ-6 Dashan Town, Pan County, Liupanshui City, Guizhou Province 45 GZ-9 Shizi Town, Pingba County, Anshun City, Guizhou Province 46 GZ-10 Jingyang Village, Gudong Town, Majiang County, Qiandongnan Miao and Dong Autonomous Prefecture, Guizhou Province

2.4. Detection of PCR products PCR products were detected using 1.8% agarose gel

Habitat Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Road sides Corn field Corn field Corn field Road sides Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Corn field Road sides Road sides Corn field Corn field Corn field Road sides Corn field Corn field Corn field Road sides Road sides Road sides Road sides Road sides Road sides Road sides

electrophoresis, then, they were distinguished with primer dimers. Primers that displayed polymorphism were screened and labeled with different fluorescent dyes. These

2295

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

Table 2 Characteristics of 12 simple sequence repeat (SSR) markers Primer name YP7 YP10 YP13 YP19 YP20 YP22 YP25 YP26 YP31 YP33 YP35 YP37

Primer sequence (5´→3´) F: AAGCACTCTCCACTGTAAT R: AGAAATGTAGGGGATAAAC F: AAGCACTCTCAACTGAAAAT R: CAGTACAAGGAGACATAGA F: CATGTTATGGTTGCAAGCTC R: ACCGATATATGATCCCGTCC F: AGGTTGAGTGGATAAAATG R: GAAGGAGAAAGGTGATGAG F: CTCCAACCTTGCCATCCTAC R: TCTGGTGCTCCAAATCCTGC F: AGTTGGCAAAAGAAAGGGA R: GCAGTATGTGACGGAGGGT F: AAATACGTCAAATGTTCTG R: ACCTGAGTCAACGACAAAC F: ACAAAGAGGGGAGGCAACA R: GGAGAAAGGGGAAAGGTGA F: AAATTCAAATTCGATAGCA R: AGAGTTTATATTTTGTGGA F: ATTATCTCTATGCCATCTCC R: TTCTCTCCCTACACTTTTCT F: GTTAGTCGAGTTACCAGCCT R: GAAATGTTCTACGCCAATAG F: CAACAAATTGTTTTGAGAAT

fluorescent markers were used to detect the specific loci of C. communis populations. The HUM-STR method was applied for electrophoretic analysis (capillary temperature 60°C; sample injection 2.0 kV, 30 s; electrophoresis 4.8 kV, 65 min).

2.5. Analysis of genetic diversity The amplified production using fluorescent SSR primers were detected by ABI PRISM 3730xl DNA Sequencer with GS500 (Applied Biosystems, USA) as an internal size standard. The allelic sizes were determined using GeneMarker version 2.2.0 (Applied Biosystems). The effective allelic sizes were transformed into a [0, 1] matrix by CEQ fragment analysis software (Beckman Coulter, USA). To evaluate the genetic diversity and genotypes of C. communis populations, we used the program POPGENE32 (version 1.31) to calculate the following parameters: observed number of alleles (Na) per locus, effective number (Ne) of alleles per locus, expected heterozygosity (He), observed heterozygosity (Ho), and Shannon’s information index (I) (Nei 1978; Francis et al. 1999). The Dice genetic similarity coefficient values were used to describe the genetic relationship of populations by the unweighted pair-group method with arithmetic averaging (UPGMA) cluster analysis of the NTsys-pc software (version 2.10s) (Rolf 1998). The analysis of

Allele size range (bp) 116–150

Tm (°C) 53

5´ modify 6-Fam

125–157

49.4

Hex

194–212

55

Hex

194–210

54

Hex

133–151

57

6-Fam

238–268

57

6-Fam

114–158

50

Hex

216–262

56

6-Fam

68–106

48

6-Fam

287–307

51.9

6-Fam

226–256

51.7

Hex

213–231

52.4

Hex

molecular variance (AMOVA) in “among groups”, “among populations” and “within populations” values were estimated by GenALEx software (version 6.503) through 999 data replications (http://www.anu.edu.au/BoZo/GenAlEx). The admixture model and the option of correlated allele frequencies were used to analyze the genetic structure of the population by software STRUCTURE 2.3.4 (Falush et al. 2003; Pritchard et al. 2007). The parameters Burnin Period and Markov Chain Monte Carlo (MCMC) were set as 5 000. The value of the Ln (likelihood) was calculated, in which K changed from 1 to 10. Each K-value was run thrice, and the final result was the average value (Song et al. 2003). Additionally, principal coordinate analysis (PCoA) was conducted using the Dice genetic distance matrix and the Dcenter, Eigen, and Mxplot proceures of NTsys-pc software (version 2.10s).

3. Results 3.1. Polymorphism analysis of fluorescent SSR markers The genomic DNAs of 46 C. communis populations were amplified using 12 SSR fluorescent primers (Table 3). Results showed that a total of 652 bands were amplified, containing 119 polymorphism bands. The number of bands varied from 6 (YP19, YP20, YP31, and YP35) to 16 (YP26),

2296

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

with an average of 10 bands (alleles) per primer. The polymorphic band proportion (PBP) ranged from 8.99% (YP33) to 32.61% (YP25), and the average PBP was 18.25%. The polymorphism information content (PIC) for each maker was less than 0.5, with an average value of 0.1330. The highest PIC was 0.4978 (YP33). The allele sizes, Ne, He, Ho, and I of 12 SSR primers in all C. communis populations are shown in Table 4. A total of 94 alleles ranging in size from 68 to 307 bp were detected by CEQ fragment analysis software (Beckman Coulter, USA). The value of He in different loci varied from 0.0000 (loci YP25 and YP31) to 0.4979 (locus YP33) with an average of 0.1330. The highest value of Ne was 1.9915 at locus P33, and the lowest value was 1.0000 at loci YP25 and YP31. Nevertheless, the Ho value in different loci varied Table 3 PCR amplified results of 12 simple sequence repeat (SSR) markers1) Primer names YP7 YP10 YP13 YP19 YP20 YP22 YP25 YP26 YP31 YP33 YP35 YP37 Total Mean

Amplified bands 53 48 62 64 50 49 46 50 46 89 47 48 652 54

Polymorphic bands 12 12 9 6 6 13 15 16 6 8 6 10 119 10

PBP

PIC

22.64 25.00 14.52 9.38 12.00 26.53 32.61 32.00 13.04 8.99 12.77 20.83 – 18.25

0.1406 0.0425 0.2874 0.3148 0.0832 0.0631 0 0.1027 0 0.4978 0.0215 0.0425 – 0.1330

1)

PBP, polymorphic band proportion in Commelina communis; PIC, polymorphism information content in C. communis. –, no data.

from 0.4967 (locus YP33) to 1.0000 (loci YP25 and YP31) with an average of 0.8655. The I and Nei’s indexes, ranging from 0.0000 to 0.6910 and from 0.0000 to 0.4674, achieved an average value of 0.2266 and 0.0915, respectively. These results indicate that primer YP33 exhibited more significant polymorphism at locus in C. communis genomic DNA.

3.2. Cluster analysis The Dice similarity coefficient was calculated using NTsys-pc software (version 2.10s) on the basis of SSR markers. The genetic similarity coefficient ranged from 0.760 to 0.946. A dendrogram based on SSR data was generated (Fig. 1). Furthermore, the genetic relationships were elucidated by the dendrogram. On the basis of the cluster analysis, the 46 C. communis populations could be divided into three major groups (I–III) when the Dice genetic similarity coefficient was 0.772. Group I clustered the five populations from Heilongjiang Province, displaying their similar genetic backgrounds. Group II clustered 33 populations with C. communis from Jilin, Liaoning, Hebei, Jiangsu, and Zhejiang provinces. The populations collected from Hubei and Guizhou provinces was clustered in group III. These results illustrated that clustered populations in the same group had similar genetic background. There is a certain regional specialization in the distribution of C. communis populations. The closer the geographical location, the higher value the genetic similarity coefficient is. AMOVA values showed that a high partition of the variation is attributed to “among populations”. It was showed that 29% of the total variation came from groups clustered by UPGMA method. A total of 51% of the total variation was due to differences among populations, nevertheless, 20% was attributed to the individuals within populations (Table 5).

Table 4 Genetic variation of microsatellite loci1) Loci YP7 YP10 YP13 YP19 YP20 YP22 YP25 YP26 YP31 YP33 YP35 YP37 Mean 1)

Allele size (bp) 116–150 125–157 194–212 194–210 133–151 238–268 114–158 216–262 68–106 287–307 226–256 213–231 –

Ne 1.1636 1.0444 1.4032 1.4593 1.0907 1.0673 1.0000 1.1146 1.0000 1.9915 1.0220 1.0444 1.2001

He 0.1406 0.0425 0.2873 0.3147 0.0832 0.0631 0.0000 0.1028 0.0000 0.4979 0.0215 0.0425 0.1330

Ho 0.8579 0.9570 0.7095 0.6818 0.9159 0.9362 1.0000 0.8961 1.0000 0.4967 0.9783 0.9570 0.8655

I 0.2691 0.1047 0.4620 0.4943 0.1788 0.1437 0.0000 0.2111 0.0000 0.6910 0.0600 0.1047 0.2266

Nei’s 0.0761 0.0217 0.1739 0.1957 0.0435 0.0326 0.0000 0.0543 0.0000 0.4674 0.0109 0.0217 0.0915

Ne, effective number of alleles in Commelina communis; He, expected heterozygosity C. communis; Ho, observed heterozygosity in C. communis; I, Shannon’s information index in C. communis; Nei’s, Nei’s gene diversity in C. communis. –, no data.

2297

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

1 2 5 3 4 6 26 29 9 15 16 25 12 14 7 30 11 17 21 27 34 22 33 35 36 8 13 23 10 24 18 28 40 41 19 20 31 32 37 39 44 43 38 45 42 46

10

0.76

0.78

0.81

0.83

0.85

0.88

0.90

0.93

I

II

III

0.95

Coefficient

Fig. 1 Dendrogram of Commelina communis populations based on simple sequence repeat (SSR) data using the unweighted pair-group method with arithmetic averaging (UPGMA) method (codes as in Table 1).

Table 5 Analysis of molecular variance (AMOVA) among Commelina communis populations and clustered groups1) Source of variation2) Among groups Among populations Within populations Total 1)

2)

d.f. 2 43 46 91

MS 39.337 6.660 1.087

SS 78.673 286.392 50.000 415.065

% Total 29 51 20 100

Est. Var. 1.602 2.787 1.087 5.476

Prob <0.001 <0.001 <0.001 <0.001

d.f., degrees of freedom; MS, mean squared deviations; SS, sum of squares; Est. Var., estimate variance; Prob, the significance of the variance components after 999 random permutations. Values among groups, among populations and within populations are considered highly significant (P<0.001).

There were 3 groups based on UPGMA method according to geographical location and latitude from northeast to south in China. These populations which collected from Heilongjiang Province clustered together in group I, and Heilongjiang Province was a higher latitude area of our country. Jilin, Liaoning, Hebei, Jiangsu and Zhejiang provinces were located in the middle of China, and 33 populations from these provinces clustered group II. Additionally, the group III contained 8 populations collected from Hubei and Guizhou provinces, where were lower latitudes in China.

revealed similar relationships with cluster analysis (Fig. 2-B). When component probabilities exceed 0.6, group I on the basis of structural analysis achieved a high similarity to clustered group I. Moreover, the populations were distributed in group II as the population distribution in the clustering group II. Furthermore, group III showed a close relationship with the population distribution in the clustering group III.

3.3. Genetic structure analysis

Principal component analysis was carried out on 46 populations on the basis of the SSR data matrix (Fig. 3). The variances of the first and second principal components were 80.71 and 2.44%, respectively, accounting for 83.15% of the total variation. The 2D scatterplot of 46 populations was produced on the basis of the first two principal components. Furthermore, the positions where the close relationship indicated with each other were intensive. By contrast, the relationship was alienated. Principal component analysis divided the C. communis

We first selected the optimum K-value (Fig. 2-A) to estimate the population genetic structure of C. communis. Fig. 2-A showed that a significant turning point and the maximum likelihood values appeared at K=3 (ΔK=m(|L´´K)|)/s[L(K)]) (Evanno et al. 2005). Therefore, a genetic structure analysis was shown in Fig. 2-B. All populations were divided into three groups (I–III). Generally speaking, genetic structure analysis also

3.4. Principal component analysis

2298

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

Numbers

A

180 160 140 120 100 80 60 40 20 0

Component probability

B 1.0 0.8 0.6 0.4 0.2 0

ΔK

0

1

2

3

4

5 K

6

7

8

9

10

1(1) 3(1) 5(1) 7(1) 9(1) 11(1) 13(1) 15(1) 17(1) 19(1) 21(1) 23(1) 25(1) 27(1) 29(1) 31(1) 33(1) 35(1) 37(1) 39(1) 41(1) 43(1) 45(1) 2(1) 4(1) 6(1) 8(1) 10(1) 12(1) 14(1) 16(1) 18(1) 20(1) 22(1) 24(1) 26(1) 28(1) 30(1) 32(1) 34(1) 36(1) 38(1) 40(1) 42(1) 44(1) 46(1)

Group I

Group II

Group III

Fig. 2 Population structure model of 46 Commelina communis populations. A, group numbers of C. communis populations based on the mapping method. The abscissa is the K value (K=1, 2, 3, …, 10). When K=3, a significant turning point and maximum likelihood in the scatterplot are achieved. B, population structure diagram of 46 C. communis populations. The proportion of each color shows the probability of being divided into the corresponding group for each variety (groups I–III) (codes as in Table 1).

3

0.24

4 1 5 21

0.10

30

33 36 28 26 1512

–0.04 9 29 25

–0.18

2 23 35

27

22

7 17

11

Dim-2

13

I

6

31

10 18 8 34 24 16

20 40

32 41

14

45

19

II

III

44 38

42

37

43

46

–0.32 –0.18

39

–0.04

0.10 Dim-1

0.23

0.37

Fig. 3 2D scatterplot based on the first and second principal components of 46 Commelina communis populations (codes as in Table 1).

populations into three groups (Fig. 3). Group I, which

analysis could directly reveal the genetic relationship among

originates from Heilongjiang Province, was involved with a

different C. communis populations.

closer relations hip than other groups. Group II included the populations from Jilin, Liaoning, Hebei, Jiangsu, and Zhejiang provinces. C. communis populations from Hubei

4. Discussion

Province and Guizhou Province were clustered in group

Capillary electrophoresis is a fast-developing molecular

III. These results indicated that the principal component

marker technology in recent years. This technology holds the

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

advantage of having a fast, efficient, and highly automated process. It has become a hot spot and focus in the field of life science research. Many studies showed that capillary electrophoresis could provide more comprehensive and accurate information on length polymorphism and sequence polymorphism in DNA fragment polymorphism analysis than those provided by existing technologies. Liang et al. (2015) reported that germplasm resources, genetic diversity, and relationships among 133 persimmon populations from China, Japan, and America were analysed using the capillary electrophoresis method, and in the process, 158 alleles and 610 genotypes from 17 SSR loci were obtained. In our study, the genetic diversity of these C. communis populations was analysed by capillary electrophoresis and 12 SSR primers with highly polymorphic, nuclear, singlelocus, and codominant makers were selected from 34 SSRs developed initially for C. communis to thoroughly assess the genetic diversity and population structure of the plants from different locations (Li et al. 2015). The method could successfully distinguish the genotypes of C. communis used in the analysis of genetic diversity. The genetic diversity and the degree of genetic differentiation were researched from the DNA level in our study. A high degree of polymorphism among C. communis populations was found. The similar results were obtained through UPGMA cluster analysis, genetic structure analysis, and principal component analysis, in which 46 C. communis populations were divided into three major groups indicating that the each method could analyse the genetic diversity of C. communis populations. Li et al. (2016) found that geographical location played a more important role than soil composition on structuring genetic variation of pseudometallophyte among C. communis populations. In our research, AMOVA values showed that the “among populations” variation was the most major factor than other variation sources among C. communis populations genetic variation among three groups. As a result, our data further suggested that geographical location might play a more important role than those of other factors in these accessed populations. These results were consistent with previous studies. Li et al. (2016) reported that the metallicolous of C. communis populations with a higher number of private alleles usually showed a higher level of genetic variability. The PBP of C. communis populations was 8.99% by primer YP33 amplified, which was the highest value among all the primers (Table 3). Additionally, the genetic variation of microsatellite loci was analysed. As a result, primer YP33 achieved an increased effective detection. The YP33 locus gave the highest allele size (287–307 bp), the effective number of alleles (1.9915), expected heterozygosity

2299

(0.4979), Shannon’s information index (0.6910), and Nei’s gene diversity (04674). However, the observed heterozygosity was the lowest at 0.4967 (Table 4). These results might be related to C. communis genomic mutations. The similar results were shown in parasitic weeds. The genetic diversity of 50 Orobanche cumana populations using 15 SSR makers was assessed, and the result revealed low inter- and intra-population variability (Pineda-Martos et al. 2013). The dimensional genetic structure of C. communis might be determined by the gene flow. With food transport, the seeds and plant of C. communis were brought from one place to another place. In this study, we detected the shared ancestry in C. communis populations by STRUCTURE software. Several populations might share alleles with other populations. For example, group II contained 33 populations from Jilin, Liaoning, Hebei, Jiangsu, and Zhejiang provinces, and the distribution of these populations is wide, from northeast China to the Yangtze River. Therefore, the results of genetic structure suggested that regional specialization may substantially contribute to the high genetic diversity in the populations of C. communis. Studying the genetic diversity and exploring the genetic relationship and the specialization of C. communis in different regions will provide a theoretical basis for delaying the resistance of C. communis and managing C. communis. Seed banks involved significantly large amounts of genetic diversity (Mandák et al. 2012). Each C. communis plant can produce 500–1 000 seeds in accordance with laboratory statistics. Mature C. communis seeds easily fall on the ground and buried into the soil. These seeds form a massive seed bank, maintaining genotypes over time and containing significantly large genetic diversity information. More than 80% of the seeds can germinate even after a few years in the soil seed bank (Takabayashi and Nakayama 1978). We are currently investigating the relationship of the C. communis seed bank with population genetic diversity and intend to report on this subject in the future.

5. Conclusion The genetic diversity and population structure of 46 C. communis populations were successfully analyzed using 12 SSR fluorescent markers with capillary electropherogram. The mean number of polymorphism bands was 10, and the polymorphic proportion was 18.25%. The average value of PIC was 0.1330. The effective number of alleles was 1.9915 at locus YP33; this value was the highest. The He in different loci varied from 0.0000 to 0.4979, with an average of 0.1330. Meanwhile, the Ho in different loci varied from 0.4967 to 1.0000, with an average of 0.8655. The I and

2300

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

Nei’s indexes, ranging from 0.0000 to 0.6910 and from 0.0000 to 0.4674, respectively, possessed average values of 0.2266 and 0.0915, respectively. On the basis of UPGMA cluster analysis, genetic structure analysis, and principal component analysis, these populations were divided into three major groups.

Acknowledgements This study was funded by the National Key Research and Development Program of China (2016YFD0300701) and the earmarked fund for China Agriculture Research System (CARS-25). The authors gratefully acknowledge Dr. Cui Hailan, Dr. Yu Huilin, Dr. Huang Hongjuan, Dr. Huang Zhaofeng, and Research Assistant Quan Zonghua (Institute of Plant Protection, Chinese Academy of Agricultural Sciences), Prof. Zhou Xiaogang (Institute of Plant Protection, Sichuan Academy of Agricultural Sciences, China), for their help in collecting plant materials and technical assistance. In addition, we also acknowledge the editors honestly.

References Evanno G, Regnaut S, Goudet J. 2005. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Molecular Ecology, 14, 2611–2620. Falush D, Stephens M, Pritchard J K. 2003. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics, 164, 1567–1587. Francis C Y, Rong C Y, Boyle T. 1999. POPGENE: Microsoft Window-Based Freeware for Population Genetic Analysis. Version 1.31. Molecular Biology and Biotechnology Centre, University of Alberta, Edmonton, Canada. Gómez-Vargas J M. 2012. Glyphosate-tolerant Asiatic dayflower (Commelina communis L.): Ecological, biological and physiological factors contributing to its adaptation to Iowa agronomic systems. MSc thesis, Iowa State University, Ames, America. Huang C Y, Chen T B, Wang Y, Sun B H. 2000. Weed survey of soybean field in north region of Heilongjiang province. Soybean Science, 19, 341–345. (in Chinese) Kutbay H G, Uckan F. 1998. Phenotypic plasticity in Turkish Commelina communis L. (Comelinacea) populations. Turkish Journal of Botany, 22, 199–204. Lee E J, Nah G, Yook M J, Lim S H, Park T S, Lee D Y, Kim D S. 2016. Phylogenetic relationship of Echinochloa species based on simple sequence repeat and phenotypic marker analyses. Weed Science, 64, 441–454. Li J K, Song Y P, Xu H, Zhu Y J, Tang L L. 2015. Development and characterization of microsatellite

loci for the pseudometallophyte Commelina communis (Commelinaceae). Application in Plant Science, 3, 1400098. Li J K, Xu H, Song Y P, Tang L L, Gong Y B, Yu R L, Shen L, Wu X L, Liu Y D, Zeng W M. 2016. Geography plays a more important role than soil composition on structuring genetic variation of pseudometallophyte Commelina communis. Frontiers in Plant Science, 7, 1085–1097. Li X Y, Tao B, Li Y H, Qiu L J. 2008. Genetic diversity in dayflower at six locations of Heilongjiang Province by using RAPD markers. Crops, 20, 21–25. (in Chinese) Li Y H. 1998. Weed Republicae Popularis Sinicae. China Agricultural Press, Beijing. (in Chinese) Liang Y Q, Han W J, Sun P, Liang J J, Wuyun T N, Li F D, Fu J M. 2015. Genetic diversity among germplasms of Diospyros kaki based on SSR markers. Scientia Horticulturae, 186, 180–189. Liu X B, Feng B, Li J, Yan C, Yang Z L. 2016. Genetic diversity and breeding history of winter mushroom (Flammulina velutipes) in China uncovered by genomic SSR markers. Gene, 591, 227–235. Ma H, Guan C H, Tao B. 2009. The tolerance to imazethapyr in different leaf stages of dayflower, Commelina communis L. Acta Phytophylacica Sinica, 36, 450–454. (in Chinese) Ma H, Guan C H, Tao B. 2010. Tolerance to imazethapyr and physiological difference of dayflower (Commelina communis L.) at different leaf stages. Chinese Journal of Oil Crop Sciences, 32, 136–138. (in Chinese) Mandák B, Zákravský P, Mahelka V and Plačková I. 2012. Can soil seed banks serve as genetic memory? A study of three species with contrasting life history strategies. PLoS ONE, 7, e49471. Mishra J S, Singh V P, Yaduraju N T. 2002. Interference of common dayflower (Commelina communis L.) in soybean (Glycine max L.). Indian Journal of Weed Science, 34, 295–296. Nei M. 1978. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics, 89, 583–590. Pineda-Martos R, Velasco L, Fernández-Escobar J, FernándezMartínez J M, Pérez-Vich B. 2013. Genetic diversity of Orobanche cumana populations from Spain assessed using SSR markers. Weed Research, 53, 279–289. Pritchard J K, Stephens M, Donnelly P. 2007. Inference of population structure using multilocus genotype data. Molecular Ecology Resources, 7, 574–578. Prostko E P, Webster T M. 2007. Tropical Spiderwort Identification and Control in Georgia Field Crops. University of Georgia, Athens, America. Pyšek P. 2001. Past and future of predictions in plant invasions: A field test by time. Diversity & Distributions, 7, 145–151. Rolf F J. 1998. Numerical Taxonomy and Multivariate Analysis System. Version 2.0. Exeter Publishing, Setauket, New York, America.

YANG Juan et al. Journal of Integrative Agriculture 2018, 17(10): 2292–2301

Santiago M U, Micheal D K. 2009. Response of Asiatic dayflower (Commelina communis) to glyphosate and alternatives in soybean. Weed Science, 57, 74–80. Selkoe K A, Toonen R J. 2006. Microsatellites for ecologists: A practical guide to using and evaluating microsatellite markers. Ecology Letters, 9, 615–629. Song Y, Fan L, Chen H, Zhang M Y, Ma Q Q, Zhang S L, Wu J. 2014. Identifying genetic diversity and a preliminary core collection of Pyrus pyrifolia cultivars by a genome-wide set of SSR markers. Scientia Horticulturae, 167, 5–16.

2301

Takabayashi M, Nakayama K. 1978. Longevity of buried weed seeds in soil. Journal of Weed Science and Technology, 23, 32–36. Zane L, Bargelloni L, Patarnello T. 2002. Strategies for microsatellite isolation: A review. Molecular Ecology, 11, 1–16. Zhu X C, Wu H W, Raman H, Lemerle D, Stanton R, Burrows G E. 2012. Evaluation of simple sequence repeat (SSR) markers from Solanum crop species for Solanum elaeagnifolium. Weed Research, 52, 217–223.

Section editor WAN Fang-hao Managing editor ZHANG Juan