Genome-wide characterization and comparative analysis of the MLO gene family in cotton

Genome-wide characterization and comparative analysis of the MLO gene family in cotton

Accepted Manuscript Genome-wide characterization and comparative analysis of the MLO gene family in cotton Xiaoyan Wang, Qifeng Ma, Lingling Dou, Zhen...

7MB Sizes 204 Downloads 190 Views

Accepted Manuscript Genome-wide characterization and comparative analysis of the MLO gene family in cotton Xiaoyan Wang, Qifeng Ma, Lingling Dou, Zhen Liu, Renhai Peng, Shuxun Yu PII:

S0981-9428(16)30056-0

DOI:

10.1016/j.plaphy.2016.02.031

Reference:

PLAPHY 4429

To appear in:

Plant Physiology and Biochemistry

Received Date: 12 December 2015 Revised Date:

1 February 2016

Accepted Date: 23 February 2016

Please cite this article as: X. Wang, Q. Ma, L. Dou, Z. Liu, R. Peng, S. Yu, Genome-wide characterization and comparative analysis of the MLO gene family in cotton, Plant Physiology et Biochemistry (2016), doi: 10.1016/j.plaphy.2016.02.031. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Research Article Genome-wide characterization and comparative analysis of the

RI PT

MLO gene family in cotton

Xiaoyan Wang1+, Qifeng Ma2+, Lingling Dou2, Zhen Liu1, Renhai Peng1* and Shuxun Yu2*

Anyang Institute of Technology, College of Biology and Food Engineering, Anyang, Henan, 455000, China; 2 State Key Laboratory of Cotton Biology, Institute of Cotton Research of CAAS, Anyang, Henan, 455000, China

M AN U

Xiaoyan Wang: [email protected] Qifeng Ma: [email protected] Lingling Dou: [email protected] Zhen Liu: [email protected] Renhai Peng: [email protected] Shuxun Yu: [email protected]

SC

1

These authors contributed equally to this work

*

Corresponding author: Renhai Peng, [email protected] Anyang Institute of Technology, College of Biology and Food Engineering, Anyang, Henan, 455000, P. R. China

1

Tel: +86-372-2909876

EP

Fax: +86-372-2525377

TE D

+

*

Corresponding author: Shuxun Yu, [email protected] State Key Laboratory of Cotton Biology, Institute of Cotton Research of CAAS, Anyang, Henan 455000, P. R. China

AC C

2

Tel: +86-372-2525365

Fax: +86-372-2525363

ACCEPTED MANUSCRIPT 1

Abstract In plants, MLO (Mildew Locus O) gene encodes a plant-specific seven transmembrane (TM) domain protein involved in

3

several cellular processes, including susceptibility to powdery mildew (PM). In this study, a genome-wide characterization of

4

the MLO gene family in G. raimondii L., G. arboreum L. and G. hirsutum L. was performed. In total, 22, 17 and 38

5

homologous sequences were identified for each species, respectively. Gene organization, including chromosomal location,

6

gene clustering and gene duplication, was investigated. Homologues related to PM susceptibility in upland cotton were

7

inferred by phylogenetic relationships with functionally characterized MLO proteins. To conduct a comparative analysis

8

between MLO candidate genes from G. raimondii L., G. arboreum L. and G. hirsutum L., orthologous relationships and

9

conserved synteny blocks were constructed. The transcriptional variation of 38 GhMLO genes in response to exogenous

10

application of salt, mannitol (Man), abscisic acid (ABA), ethylene (ETH), jasmonic acid (JA) and salicylic acid (SA) was

11

monitored. Further studies should be conducted to elucidate the functions of MLO genes in PM susceptibility and

12

phytohormone signalling pathways.

13

Keywords: Gossypium, MLO, Gene family, Synteny blocks, Abiotic Stress, Phytohormone

M AN U

SC

RI PT

2

14

16

1. Introduction

TE D

15

Powdery mildew (PM) is an obligate fungal pathogen that causes PM disease in a broad range of plants, including important crops such as pepper, tomato, apple, strawberry, and cotton [1]. It is difficult to diagnose at the early stages of the

18

disease, and it can easily spread unnoticed. According to previous observations, PM disease primarily affects the leaves of

19

sea-island cotton and upland cotton. In general, PM presents similar symptoms in cotton: white or brown spots on leaf tissues,

20

particularly at the bottom of the plant, whereas upper leaves exert some resistance. Afterwards, tissue death of diseased spots

21

causes infected leaves to crinkle, curl, and prematurely drop. Although blossoms and fruits are not the initial PM fungal

22

targets, they can also become infected.

AC C

23

EP

17

Mildew locus O (MLO) proteins belong to a plant-specific protein family containing seven transmembrane (TM) domains

24

[2, 3]. In addition, a C-terminal calmodulin-binding domain (CaMBD) and an extracellular N-terminus [3, 4] have been

25

identified in this family. PM resistance was first characterized in barley plants in 1942 and the immunity was acquired

26

because of the absence of a susceptibility gene which was named Mildew Locus O (MLO). Recessive MLO gene mutations

27

confer durable broad-spectrum resistance to all discovered isolates of barley powdery mildew fungus Blumeria graminis f. sp

28

hordei (Bgh) [2, 3]. Then, the discovery and identification of PM disease resistance in other plant species, such as 1

ACCEPTED MANUSCRIPT 1

Arabidopsis [5], pea [6] and tomato [7], has confirmed that PM resistance deriving from loss-of-function mutations in MLO

2

functional orthologue is a common phenomenon. Therefore, broad-spectrum PM resistance in plants could be introduced by

3

silencing of MLO gene [8]. Calmodulin-binding of MLO proteins promotes PM susceptibility in barley [9]. Moreover, pharmacological studies have

5

suggested that the influx of Ca2+ ions is important for MLO protein function [4]. Therefore, Ca2+ may be a candidate signal

6

because plant cells generate a transient Ca2+ signal in response to pathogen attack [10]. In addition, a complex mechanism

7

may exist during the interaction between MLO genes and PM. Nevertheless, there is limited information on the precise

8

mechanism of MLO proteins. It has been revealed that PM fungi target MLO proteins as an access to trigger pathogenesis

9

because vesicle-associated and actin-dependent defence pathways are negatively regulated by functional MLO proteins in

10

the circumstance of attempted PM penetration. Studies in tomato [7], barley [11], pepper [12], and grape [13] confirmed that

11

early stages of PM infection are associated with up-regulated expression of MLO susceptibility-related gene, with a peak at

12

six hours after inoculation.

M AN U

SC

RI PT

4

Since the HvMLO gene was first identified in barley [2], MLO genes have been discovered in Arabidopsis thaliana [14],

14

Oryza sativa [15], Vitis vinifera [16], Triticum aestivum [17], Glycine max [18], Cucumis sativus [19], Malus domestica [20]

15

and Solanum lycopersicum [21]. More detailed studies have uncovered that medium-sized gene family of MLO is

16

plant-specific and MLO-based PM resistance is not confined to monocotyledones, but is also discovered in distantly related

17

dicotyledones [22]. For example, mutant alleles of AtMLO2, one of 15 MLO genes present in Arabidopsis thaliana, caused

18

partial resistance to the adapted strains such as Golovinomyces orontii and G. cichoracearum. Complete PM resistance was

19

produced when two other homologous genes AtMLO6 and AtMLO12 were also mutated [5]. Subsequently, two studies

20

showed that loss-of-function of the SlMLO1 gene was the cause of resistance to PM disease in tomato [7, 23]. It was

21

demonstrated that pea PM resistance was associated with loss-of-function mutations in an MLO-homologous locus [6]. The

22

results of virus-induced gene silencing suggested that both CaMLO1 and CaMLO2 are involved in the susceptibility of

23

pepper to the PM fungus Leveillula taurica [23]. Recently, NtMLO1, which is predicted to be an orthologue of tomato

24

SlMLO1 and pepper CaMLO2, was shown to be involved in PM susceptibility [24].

EP

AC C

25

TE D

13

Apart from susceptibility/resistance to PM disease in both monocotyledonous and dicotyledonous plants, increasing reports

26

have suggested that MLO may be involved in a variety of developmental processes. Leaf mesophyll cells in MLO barley

27

mutants have been shown to undergo spontaneous cell death, which is an indication of accelerated leaf senescence [2, 11].

28

MLO family members in Arabidopsis presented tissue-specific expression patterns and silencing of AtMLO7 involved in

29

pollen tube reception by the embryo sac led to decreased fertility [25]. Two additional Arabidopsis genes, AtMLO4 and 2

ACCEPTED MANUSCRIPT 1

AtMLO11, control root architecture, as null mutants generate asymmetrical root growth and exaggerated curvature [26]. More

2

results have revealed that MLO family members are involved in diverse abiotic stresses for Capsicum annuum CaMLO2

3

intensely induced upon exogenous treatment of pepper leaves with the phytohormone abscisic acid (ABA) and drought stress,

4

is shown to act as a suppressor of ABA signalling to prevent water loss from leaves under drought conditions [27]. MLO genes have been intensively studied in many monocots and dicots, but very little research has focused on cotton.

6

Published genomic data on G. raimondii (DD; 2n = 26) [28], and G. arboretum (AA; 2n = 26) [29] as well as G. hirsutum

7

(AADD; 2n = 52) [30] provide an opportunity to conduct a comprehensive overview of the MLO gene family in diploid and

8

tetraploid cotton species. In this study, we characterized the MLO gene family in these three species with respect to their

9

structural, genomic and gene-expression features. Moreover, we assessed the orthologous relationships between the G. raimondii L., G. arboreum L. and G. hirsutum L. genomes.

11

2. Materials and methods

12

2.1. In silico identification and annotation

13

M AN U

10

SC

RI PT

5

Genomic databases of G. raimondii L. (D5, JGI__v2.1), G. arboreum L. (A2, BGI _v1.0) and G. hirsutum L. (AD1, BGI _v1.0), available at the CottonGen website (https://www.cottongen.org/) [31], were downloaded for the identification of

15

MLO homologue nucleotide and protein sequences. Then, several local BLAST searches using the Arabidopsis AtMLO1

16

amino acid sequence as a query were performed. Candidates with an E-value less than 1.0e-20 were estimated to be MLO

17

homologs, and their gene coding regions, genomic DNA and deduced amino acid sequences were acquired. Conserved MLO

18

domains within the acquired MLO sequences were confirmed by searching NCBI's conserved domain database

19

(http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) and Pfam's protein domain families (http://pfam.xfam.org/). The

20

presence and number of TM helices in the proteins of interest were predicted using the online software TMHMM

21

(http://www.cbs.dtu.dk/services/TMHMM-2.0/).

22

2.2. Gene organization

EP

AC C

23

TE D

14

The chromosomal localization of each MLO gene in G. raimondii L., G. arboreum L. and G. hirsutum L. was deduced

24

based on the available genomic information at the CottonGen database. Mapchart 2.2 software was used to visualize the

25

distribution of MLO genes on the chromosomes, with the exception of a small portion of genes that have not been localized to

26

a chromosome [32]. Introns and exons of each MLO gene were determined by comparing the cDNAs with their

27

corresponding genomic DNA sequences. Intron/exon composition and position were analysed by the Gene Structure Display

28

Server (GSDS) tool (http://gsds.cbi.pku.edu.cn/). 3

ACCEPTED MANUSCRIPT The MLOs of G. hirsutum were first aligned by Clustal W2 at EMBL-EBI (http://www.ebi.ac.uk/Tools/msa/clustalw2/).

2

Gene duplication events were identified when the following conditions were fulfilled: (1) the alignment covered more than

3

80% of the longer gene, (2) the identity of the aligned regions was greater than 80% of the alienable region, and (3) only one

4

duplication event was taken into account for tightly linked genes [33].

5

2.3. Phylogenetic analysis

6

RI PT

1

A total of 38 GhMLO amino acid sequences together with 36 other MLO homologues from 9 dicot and monocot species were used to construct phylogenetic trees. Amino acid sequences of 36 known MLOs from Arabidopsis thaliana [14],

8

Hordeum vulgare [2], Oryza sativa [15], Zea mays [14], Solanum lycopersicum [21], Pisum sativum [6], Vitis vinifera [16],

9

Malus domestica [20] and Capsicum annuum [23] were obtained based on the published information. A total of 74 MLO

SC

7

protein sequences were included to perform multiple alignments using ClustalW [34] with the default parameters. A

11

neighbour-joining phylogenetic tree was constructed by MEGA 6.0 software [35] with the pairwise deletion option and

12

Poisson correction model. Bootstrapping (1000 replicates) was used to evaluate the degree of support for a particular

13

grouping pattern in the phylogenetic tree.

14

2.4. Detection of synteny blocks

Conserved synteny blocks between MLO candidate genes from G. raimondii L., G. arboreum L. and G. hirsutum L. were

TE D

15

M AN U

10

inferred by running the OrthoClusterDB tool available at GDR (http://genome.sfu.ca/cgi-bin/orthoclusterdb/runortho.cgi)

17

[36]. To run OrthoCluster, the user must provide n (n ≥ 2) genome files and one correspondence file with their corresponding

18

orthologous relationships as input. Thus, orthologous groups of the MLO family among the three Gossypium species were

19

previously generated using the OrthoMCL database (http://orthomcl.org/orthomcl/) [37].

20

2.5. Plant materials, stress and phytohormone treatments

AC C

21

EP

16

Seeds of cotton plants from the CCRI 36 cultivar (G. hirsutum L.) were surface-sterilized in 75% (v/v) absolute alcohol for

22

30 s and in 0.1% HgCl2 (m/v) for 3 min. After washing them in sterilized double-distilled water (ddH2O), seeds were sown

23

onto sterilized Murashige & Skoog (MS) solid medium (pH 5.8) containing 1.5% sucrose and 0.7% agar. The inoculated

24

culture tubes were placed in a growth chamber with 150 µmol m-2 s-1 fluorescent light at 25–26°C. Seven-day-old seedlings

25

with expanding cotyledon were transplanted into new culture tubes supplemented with 200 µM sodium chloride (NaCl), 200

26

mM mannitol (Man), 50 µM ABA, 200 µM ethylene (ETH), 100 µM jasmonic acid (JA) or 1 mM salicylic acid (SA).

27

Treated and control plants were grown under the above conditions, and leaf samples were collected after three weeks. Three

28

biological replicates were included, and each sample contained three young leaves collected from a single plant. Collected 4

ACCEPTED MANUSCRIPT 1

leaves were frozen in liquid nitrogen, and stored at -80°C until RNA extraction.

2

2.6. Real-time quantitative RT-PCR

3

Total RNA was extracted using the EASY spin plus RNA reagent kit RN38 (AIDLAB, Beijing, China) according to the manufacturer's instructions. Poly (dT) cDNA was synthesized using the Superscript III First-Strand Synthesis System

5

(Invitrogen, USA). Primers (Table S1) for transcript analysis were designed with Premier 6.0 of Primer Designing Tool. The

6

Histone 3 (AF024716) gene was used as an internal control. Quantitative Real Time-PCR (qPCR) was performed on an ABI

7

7500 system (Applied Biosystems, USA) using SYBR Green I (with Rox) reagents to detect target products.

8

The running programs were as follows: holding stage at 50°C for 2 min, 94°C for 10 min, followed by 40 cycles at 95°C for

9

15 s, 60°C for 1 min. Then a melting curve was generated from 65 to 95°C to examine the specificity of target sequences.

SC

RI PT

4

To measure differential expression of MLO genes, QRT-PCR data was processed by 2-

11

analysed by Student's t test.

12

Ethics statement

13

△△CT

method [38] and statistically

M AN U

10

We did not make use of human or vertebrate animal subjects and/or tissue in our research.

3. Results and discussion

15

3.1. In silico characterization of Gossypium MLO homologues

TE D

14

Conditional searches for Gossypium MLO homologues produced 38 significant matches in G. hirsutum L., 22 in G.

17

raimondii L. and 17 in G. arboreum L. (Table 1-3). Predicted MLO genes GhMLO1–GhMLO38, GrMLO1–GrMLO22 and

18

GaMLO1–GaMLO17 were numbered depending on their chromosomal location. We concluded that more than 90 percent of

19

the 77 MLOs encoded proteins ranging from 400 to 600 amino acids, whereas six (one from G. raimondii, one from G.

20

arboreum and four from G. hirsutum) had markedly different lengths as compared to MLO homologues reported in the

21

genomes of Arabidopsis [14], Vitis vinifera [13] and Cucumis sativus [19], i.e., they were less than 300 or more than 600

22

amino acids. The TMHMM2 programme predicted different orientations and numbers of transmembrane (TM) helices in the

23

polypeptides. The number of TM domains varied from two in GhMLO8 and GaMLO4 to eight in ten of the 77 MLOs (Table

24

1–3). However, 28 of the 77 MLO proteins had seven TM domains, which are conserved with respect to MLO family

25

members in monocot and dicot plant species [3]. Comparison of the 77 MLO cDNAs to their genomic DNA sequences

26

revealed that the number of exons varied from 8 (GhMLO8) to 18 (GhMLO22). Nearly half of the MLO genes (32 out of 77)

27

contained 15 exons, which seems to be a common feature of the MLO gene family [14]. Details including the length of the

AC C

EP

16

5

ACCEPTED MANUSCRIPT 1

77 sequences, the location of MLO domains and Accession Numbers are provided in Tables 1–3.

2

Table 1 Members of GrMLO gene family as predicted in G. raimondii cv. Shixiya1 genome Name

Accession NO.a

Chr.

CDS

AA

Exons

length

length

TMb

MLO domain

MLO domain

c

location

length

Gorai.001G130800.1

1

1755

584

15

7

6-499

494

GrMLO2

Gorai.001G200000.1

1

1857

618

14

8

29-553

525

GrMLO3

Gorai.002G072500.1

2

1602

533

15

4

66-460

395

GrMLO4

Gorai.002G113800.1

2

1644

547

14

6

GrMLO5

Gorai.004G029300.1

4

1737

578

15

7

GrMLO6

Gorai.004G106900.1

4

1314

437

14

8

GrMLO7

Gorai.005G074900.1

5

1494

497

14

6

GrMLO8

Gorai.005G241700.1

5

1710

569

15

6

GrMLO9

Gorai.006G088500.1

6

1314

437

11

4

GrMLO10

Gorai.007G193200.1

7

1683

560

15

GrMLO11

Gorai.007G250700.1

7

1392

463

13

GrMLO12

Gorai.009G078600.1

9

1764

587

15

GrMLO13

Gorai.009G078700.1

9

1596

531

GrMLO14

Gorai.009G118700.1

9

1455

GrMLO15

Gorai.010G000800.1

10

1530

GrMLO16

Gorai.010G205000.1

10

1707

GrMLO17

Gorai.010G205100.1

10

1239

GrMLO18

Gorai.011G030800.1

11

1545

GrMLO19

Gorai.011G089600.1

11

1437

GrMLO20

Gorai.011G240700.1

11

1731

GrMLO21

Gorai.012G004100.1

12

GrMLO22

Gorai.013G197800.1

13

RI PT

GrMLO1

481

9-497

489

4-420

417

9-470

462

11-507

497

7-404

398

9-456

448

SC

5-485

6

6-419

414

7

11-505

495

15

7

11-485

475

484

14

7

6-450

445

509

15

7

8-454

447

568

15

8

18-475

458

412

12

5

6-337

332

514

14

7

20-460

441

478

14

6

6-425

420

576

15

8

6-495

490

1737

578

15

7

9-494

486

1515

504

13

7

7-455

449

TE D

M AN U

6

a

Available at https://www.cottongen.org/data/download/genome_JGI.

b

Presence and number of transmembrane (TM) helices in the proteins was predicted using the online software of TMHMM

9

Table 2 Members of GaMLO gene family as predicted in G. arboretum genome

EP

3 4 5 6 7 8

(http://www.cbs.dtu.dk/services/TMHMM-2.0/). c

Presence of conserved MLO domains within the acquired MLO sequences was confirmed by searching in NCBI's conserved domain database

AC C

(http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi).

Name

Accession NO.a

Chr.

CDS length

AA length

Exons

TMb

MLO domain locationc

MLO domain length

GaMLO1

Cotton_A_06751

3

1671

556

15

7

9-475

467

GaMLO2

Cotton_A_19376

4

1596

532

15

6

5-470

466

GaMLO3

Cotton_A_00762

5

1710

569

15

6

11-507

497

GaMLO4

Cotton_A_08798

5

1224

407

11

2

4-389

386

GaMLO5

Cotton_A_36285

6

1302

433

13

5

4-423

420

GaMLO6

Cotton_A_12678

7

1791

596

14

6

5-536

532

GaMLO7

Cotton_A_06313

7

1692

563

15

7

50-490

441

GaMLO8

Cotton_A_26172

8

1488

495

13

7

18-402

385

GaMLO9

Cotton_A_07533

9

1467

488

14 6

6

6-464

459

ACCEPTED MANUSCRIPT 1545

514

14

7

20-460

441

GaMLO11

Cotton_A_23415

9

1704

567

15

7

6-484

479

GaMLO12

Cotton_A_11046

10

1902

633

15

7

1-551

551

GaMLO13

Cotton_A_11047

10

1602

533

15

8

11-486

476

GaMLO14

Cotton_A_15078

10

1395

464

12

8

1-462

462

GaMLO15

Cotton_A_20369

12

1680

559

15

5

7-475

469

GaMLO16

Cotton_A_22367

13

1518

505

13

7

7-455

449

GaMLO17

Cotton_A_39162

Ca1

1674

557

15

8

14-494

481

RI PT

9

a

Available at https://www.cottongen.org/data/download/genome_BGI_A2.

b

Presence and number of transmembrane (TM) helices in the proteins was predicted using the online software of TMHMM

(http://www.cbs.dtu.dk/services/TMHMM-2.0/). c

Presence of conserved MLO domains within the acquired MLO sequences was confirmed by searching in NCBI's conserved domain database

SC

(http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi).

Table 3 Members of the GhMLO gene family as predicted in G. hirsutum cv. TM-1 genome Accession NO.a

Chr.

GhMLO1

CotAD_67162

1

1332

GhMLO2

CotAD_67021

1

1377

GhMLO3

CotAD_28782

2

1569

GhMLO4

CotAD_16944

2

1545

GhMLO5

CotAD_56441

2

1842

GhMLO6

CotAD_36554

3

1416

GhMLO7

CotAD_62838

4

GhMLO8

CotAD_31993

6

GhMLO9

CotAD_04379

8

GhMLO10

CotAD_12423

9

GhMLO11

CotAD_12424

9

GhMLO12

CotAD_63577

11

GhMLO13

CotAD_64129

GhMLO14

CotAD_30773

GhMLO15

CotAD_75839

GhMLO16

length

AA

Exons

TMb

MLO domain

MLO domain

locationc

length

M AN U

CDS

Name

length

13

5

6-399

394

458

11

5

9-352

344

522

15

7

9-449

441

514

15

6

9-432

424

613

15

7

5-552

548

471

14

5

4-453

450

1302

433

13

5

4-423

420

768

255

8

2

7-253

247

1281

426

13

7

7-481

475

1347

448

11

6

11-401

391

1695

564

15

7

11-482

472

TE D

443

1470

489

14

6

6-465

460

12

1089

362

12

4

60-358

299

13

1491

496

12

6

7-446

440

14

1674

557

15

8

14-494

481

CotAD_36153

14

1767

588

15

7

6-503

498

GhMLO17

CotAD_29126

15

1845

614

15

7

5-552

548

GhMLO18

CotAD_45096

17

1266

421

14

7

4-404

401

GhMLO19

CotAD_31735

18

1407

468

11

6

11-406

396

GhMLO20

CotAD_05932

18

1416

471

14

5

4-444

441

GhMLO21

CotAD_31530

19

1524

507

14

7

14-414

401

GhMLO22

CotAD_20256

19

2079

692

18

8

7-463

457

GhMLO23

CotAD_09253

20

1740

579

16

6

9-492

484

GhMLO24

CotAD_72948

20

1236

411

11

4

44-367

324

GhMLO25

CotAD_08062

22

1662

553

14

7

7-469

463

GhMLO26

CotAD_01552

22

1212

403

12

5

2-369

368

EP

6 7

Cotton_A_03932

AC C

1 2 3 4 5

GaMLO10

7

ACCEPTED MANUSCRIPT CotAD_54310

22

1596

531

15

7

11-485

475

GhMLO28

CotAD_54311

22

1725

574

15

6

11-492

482

GhMLO29

CotAD_15574

24

1470

489

14

6

6-436

431

GhMLO30

CotAD_06176

26

1692

563

15

5

9-482

474

GhMLO31

CotAD_00651

scaffold26.1

1305

434

14

7

20-433

414

GhMLO32

CotAD_07261

scaffold72.1

1437

478

13

4

9-397

389

GhMLO33

CotAD_08394

scaffold190.1

1407

468

11

6

11-406

396

GhMLO34

CotAD_49330

scaffold1917.1

1227

408

9

4

7-338

332

GhMLO35

CotAD_39473

scaffold2046.1

1407

468

13

7

8-438

431

GhMLO36

CotAD_71931

scaffold3483.1

1272

423

11

5

14-330

317

GhMLO37

CotAD_75625

scaffold3566.1

1725

574

15

8

29-509

481

GhMLO38

CotAD_74071

scaffold4982.1

1659

552

15

7

6-469

464

RI PT

GhMLO27

Available at https://www.cottongen.org/data/download/genome_BGI_AD1.

b

Presence and number of transmembrane (TM) helices in the proteins was predicted using the online software of TMHMM

6

3.2. Genomic organization of Gossypium MLO homologues

SC

1 2 3 4 5

a

(http://www.cbs.dtu.dk/services/TMHMM-2.0/).

Presence of conserved MLO domains within the acquired MLO sequences was confirmed by searching in NCBI's conserved domain database

(http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi).

M AN U

c

According to the report by Kohel [39], chromosomes 1 to 13 of tetraploid cotton were derived from the A subgenome,

8

and chromosomes 14 to 26 originated from the D subgenome. We were able to map 68 out of 77 MLOs onto chromosomes

9

in G. raimondii, G. arboreum, or G. hirsutum (Fig. 1). Generally, one or two members were located on most chromosomes,

10

with the exception of G. raimondii and G. arboretum, in which chromosomes 9 and 10 contained three genes each.

11

Furthermore, chromosome 22, originating from chromosome 9 of the D subgenome in G. hirsutum, contained four MLOs.

12

The majority of the 77 Gossypium MLO family members occurred as singletons, with the exception of five groups,

13

GhMLO10–GhMLO11

14

CotAD_54311),

15

(Gorai.010G205000.1–Gorai.010G205100.1) and GaMLO16–GaMLO17 (Cotton_A_11046–Cotton_A_11047), which were

16

organized as adjacent homologues with the distance ranging from 9.4 kb to 30.8 kb (Table S4). We identified two groups in

17

G. hirsutum and G. raimondii and one in G. arboretum. Interestingly, all five gene clusters were located on chromosome 9

18

or 10 (chr22 originating from subD_chr9), suggesting that there was a slight bias for the MLO homologue location.

NO.

CotAD_12423–CotAD_12424),

EP

(Accession

TE D

7

(Gorai.009G078600.1–Gorai.009G078700.1),

(CotAD_54310–

GrMLO16–GrMLO17

AC C

GrMLO12–GrMLO13

GhMLO27–GhMLO28

19

Previous studies have indicated that genomic changes, including chromosomal rearrangement, gene duplication and gene

20

expression changes, occurred during the formation of polyploid species [40]. To elucidate the expanded mechanism of the

21

MLO gene family in G. hirsutum, we investigated genomic organization of GhMLO homologues. First, we performed

22

multiple and pairwise alignments of 38 GhMLO sequences. After comprehensive analysis of pairwise alignments and the 8

ACCEPTED MANUSCRIPT physical location of each GhMLO gene, we detected 12 pairs of homologous genes and 2 tandem duplications (GhMLO1–

2

GhMLO2/GhMLO24–GhMLO23; GhMLO10–GhMLO11/GhMLO27–GhMLO28). Related information of homologous

3

genes and duplication events were presented in Table 4.

4 5 6 7

Fig. 1. Chromosomal localization of GrMLOs (A), GaMLOs (B), and GhMLOs (C). The relative sizes (unit, Mb) of G. raimondii (chr.D01–D13), G. arboreum (chr.A01–A13) and G. hirsutum (chr.AD01–AD26) chromosomes were consistent with published genomic data.

8

3.3. Phylogenetic analysis

9

AC C

EP

TE D

M AN U

SC

RI PT

1

We performed a phylogenetic study on the newly identified GhMLO proteins. The dataset covered 38 GhMLO proteins,

10

the complete Arabidopsis thaliana MLO protein family AtMLO1–15 [14], and a series of MLO homologues that have been

11

functionally associated with PM susceptibility from grapevine (Vitis vinifera) [16], apple (Malus domestica) [20], barley 9

ACCEPTED MANUSCRIPT (Hordeum vulgare) [2], rice (Oryza sativa) [15], pepper (Capsicum annuum) [23], pea (Pisum sativum) [6], maize (Zea mays)

2

[14] and tomato (Solanum lycopersicum) [21]. Phylogenetic analysis of total 74 MLO proteins resulted in seven distinct

3

clades (Fig. 2). Clades I to VI were assigned according to a previous study of AtMLO homologues and grapevine VvMLOs

4

[14, 16]. Two additional clades (VII and VIII) included Rosaceae (P. persica, F. vesca and M. domestica) MLO homologues

5

only, as reported by Pessina et al. [20]. Four additional G. hirsutum MLO homologues GhMLO8, GhMLO13, GhMLO22

6

and GhMLO31 were grouped in clade VII with apple MdMLO18, suggesting the existence of more than six clades in the

7

plant MLO gene family. Eight G. hirsutum MLO homologues (GhMLO11, GhMLO16, GhMLO21, GhMLO25, GhMLO28,

8

GhMLO34, GhMLO36 and GhMLO38) clustered together in clade V with other MLO proteins, AtMLO2, AtMLO6,

9

AtMLO12, tomato SlMLO1, pea PsMLO1, pepper CaMLO1 and CaMLO2, which have been experimentally shown to be

SC

RI PT

1

10

required for PM susceptibility (e.g., [5, 6, 23, 41]).

11

Table 4 Homologous genes and duplication events detected in GhMLOs Pair 1

GhMLO3

522

2

GhMLO5

613

3

GhMLO6

471

4

GhMLO7

433

5

GhMLO8

255

6

GhMLO9

504

7

GhMLO12

8

GhMLO15

9

GhMLO19

10

GhMLO21

11

GhMLO25

12

GhMLO30

12 13

Identity

GhMLO4

514

95.14

GhMLO17

614

97.72

GhMLO20

471

97.03

GhMLO18

421

93.82

GhMLO22

692

95.29

GhMLO14

496

96.17

489

GhMLO29

489

97.14

557

GhMLO37

574

96.41

468

GhMLO33

468

98.29

507

GhMLO36

423

93.85

553

GhMLO34

408

90.69

563

GhMLO32

478

98.12

GhMLO1/ GhMLO2

443/458

GhMLO24/ GhMLO23

411/579

80.78/96.07

GhMLO10/ GhMLO11

448/564

GhMLO27/ GhMLO28

531/574

95.76/95.21

AC C

2

AA Length

EP

Event 1

SeqB

M AN U

AA Length

TE D

SeqA

Two homologues, GhMLO5 and GhMLO17 were found to grouped in clade IV, which contained all monocot MLO

14

proteins, such as barley HvMLOs, maize ZmMLO1 and rice OsMLOs functionally acting as PM susceptibility factors [14].

15

Consistent with this finding, one MLO protein from the dicot species V. vinifera (VvMLO14) [16], one homologue from F.

16

vesca (FvMLO17) [20] and one from P. persica (PpMLO12) [20] also clustered in clade IV. Such clustering results raise the

17

question of whether exclusively monocot MLO proteins cluster in clade IV. Analysis of phylogenetic relationships revealed

18

that ten G. hirsutum MLO homologues were clustered in clade V and IV, which harboured all dicot and monocot MLO

19

proteins functionally related to PM susceptibility, thus making them susceptibility factor candidates. The phylogenetic 10

ACCEPTED MANUSCRIPT analysis performed here confirmed the presence of clade VII, first reported in Rosaceae by Pessina et al. [20]. Additional

2

studies should focus on the functional characterization of cotton MLO homologues grouped in clades IV, V and VII.

4 5 6 7 8 9 10 11 12 13 14

AC C

3

EP

TE D

M AN U

SC

RI PT

1

Fig. 2. Phylogenetic analysis of MLO proteins. The phylogenetic tree represents a consensus tree with branch lengths proportional to sequence distance. Numbers indicate bootstrap values (from 1000 replicates) that support the respective branch. The dataset includes 38 GhMLOs (GhMLO1–38) and 36 other MLO proteins from Arabidopsis thaliana, grapevine (Vitis vinifera), apple (Malus domestica), barley (Hordeum vulgare), rice (Oryza sativa), pepper (Capsicum annuum), pea (Pisum sativum), maize (Zea mays) and tomato (Solanum lycopersicum). Genbank accession numbers of translated MLO proteins used in phylogenetic analysis: AtMLO1 (Z95352); AtMLO2–15 (AF369563–AF369576); VvMLO3 (CAO18135); VvMLO4 (CAO21819); VvMLO6 (CAO66388); VvMLO9 (CAN84002); VvMLO13(CAO68971); VvMLO14(CAO66265); VvMLO17 (CAO68972); MdMLO5 (MDP0000163089); MdMLO7 (MDP0000123907); MdMLO11 (MDP0000239643); MdMLO18 (MDP0000928368); MdMLO19 (MDP0000168714); HvMLO (CAB06083); HvMLO-h1 (CAB08860); OsMLO1 (CAB08606); OsMLO3 (BAG93853); CaMLO1 (AAX31277); CaMLO2 (AFH68055); PsMLO1 (ACO07297); ZmMLO1 (AAK38337); SiMLO1 (AAX77013). 11

ACCEPTED MANUSCRIPT 1

We conducted further multiple alignments among MLO proteins in clade V to identify conserved domains (Fig. 3). Twelve proteins from 5 species presented a high degree of conservation in their seven predicted TM domains, which define this

3

protein family [14]. We also identified a calmodulin-binding domain consisting of a stretch of approximately 10–15 amino

4

acids proximal to TM domain 7 [4]. Moreover, two other conserved regions within the C-terminus of several MLO proteins

5

have been suggested to modulate PM infection [42]. Peptide domain I is characterized by the presence of serine (S),

6

threonine (T) and proline (P) residues, whereas peptide domain II contains the consensus motif D/E-F-S/T-F (Fig. 3). All of

7

the GhMLO proteins within clade V contain the two conserved domains mentioned above, except GhMLO25 contains a

8

modified motif II of I-F-S-L.

9

3.4. Synteny block detection

SC

RI PT

2

Previous studies have indicated that allotetraploid cotton species were derived from an interspecific hybridization event

11

between A and D-genome diploid species [40]. The recent availability of genome sequences for G. raimondii, G. arboreum

12

and G. hirsutum offers great potential for comparative genomics studies, which aim to provide insights into structures and

13

functions of genomic features. First, we identified a total of 517 orthologous relationships between G. raimondii, G.

14

arboreum and G. hirsutum MLO homologues (Table S2). Because of homologous genes and duplication events in the G.

15

hirsutum genome, numerous many-to-one relationships were identified. Orthologues are genes in different species that evolve

16

from one single gene in their last common ancestor. Such genes often retain identical biological roles in the present-day

17

organisms. A perfect synteny block is a conserved block of genes that share exactly the same order and strandedness and

18

contain no mismatches compared with the chromosomes of related species.

TE D

Then we predicted 25, 28 and 18 conserved non-nested synteny blocks between G. hirsutum and G. raimondii, G. hirsutum

EP

19

M AN U

10

and G. arboretum, and G. raimondii and G. arboretum, respectively (Fig. 4). Notably, 12 conserved synteny blocks were

21

discovered among the three Gossypium genomes (Fig. 5). Thirteen blocks were not included in Fig. 4 and 5 because they

22

involved genes that could not be localized to a specific chromosome. These conserved segments contain different numbers of

23

genes, ranging from 1 to 3. The size distribution of conserved non-nested blocks is shown in Fig. 6, and detailed information

24

about each block is shown in Table S3. Desirable blocks were detected because of a close evolutionary relationship among

25

these three species. In addition, because of the existence of homologous genes, some many-to-one relationships were

26

generated in some blocks.

AC C

20

12

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

1

13

ACCEPTED MANUSCRIPT Fig. 3. Multiple sequence alignment of GhMLO proteins and selected MLO proteins in clade V based on Fig. 2. Arabidopsis thaliana (AtMLO2, AtMLO6 and AtMLO12), Solanum lycopersicum SiMLO1 (AAX77013) and Capsicum annuum CaMLO2 (AFH68055)) have been functionally characterized as susceptibility genes. Vitis vinifera VvMLO3 and VvMLO13 [16] clustered in clade V. The multiple sequence alignment was generated by CLUSTALX2 using default parameters. The positions of seven TM regions (TM1–7) inferred from the experimentally determined topology of HvMLO [2] and the approximate position of the calmodulin-binding domain (CaMDB) were previously defined [4]. Two additional conserved domains I and II were previously identified [42], and the above-mentioned domains were indicated by lines above the sequences.

10

A total of 83 conserved non-nested synteny blocks were predicted after pairwise comparative analysis of MLO

11

homologues. In particular, genes situated on G. raimondii chromosomes 2, 5, 7, 9 and 11 are predicted to have

12

corresponding orthologues on G. hirsutum chromosomes 2, 20, 18, 22 and 22, respectively, whereas genes on G. arboreum

13

chromosomes 5, 7, 9 and 10 are suggested to originate from conserved blocks on G. hirsutum chromosomes 20, 2, 19 and 22,

14

respectively (Fig. 4). The corresponding chromosomes that contain the largest number and highest density of perfectly

15

conserved synteny blocks in G. hirsutum, G. raimondii and G. arboreum are chr2- chr2D- chr7A, chr20- chr5D- chr5A and

16

chr22- chr9D- chr10A. These data suggest that genes within these conserved blocks may be co-regulated by specific locus

17

control regions (LCRs), which can control the expression of a group of genes. This finding indicates conservation and

18

rearrangements of certain chromosome segments, which play an important role in the evolution or adaptive changes of these

19

three close species.

SC

M AN U

TE D EP AC C

20

RI PT

1 2 3 4 5 6 7 8 9

14

1 2 3 4 5 6

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig. 4. Circos diagram of synteny blocks identified between G. raimondii, G. arboreum, and G. hirsutum MLOs. The chromosomes of G. raimondii (D01–D13), G. arboreum (A01–A13), A subgenome (AD01–AD13) of G. hirsutum, and D subgenome (AD14–AD26) of G. hirsutum were filled with light red, light green, dark green and dark red, respectively. A total of 61 coloured lines connecting two chromosomal regions denote syntenic regions between G. raimondii, G. arboreum, and G. hirsutum. Ten blocks were not included because involved genes were not localized to definitive chromosomes.

7 8

15

1 2 3 4 5 6 7

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig.5. Circos diagrams of synteny blocks detected among GhMLOs, GrMLOs and GaMLOs. The chromosomes of G. raimondii (D01–D13), G. arboreum (A01–A13), A subgenome (AD01–AD13) of G. hirsutum, and D subgenome (AD14– AD26) of G. hirsutum were filled with light red, light green, dark green and dark red, respectively. One coloured circle with three lines indicates a synteny block among the three genomes. Three blocks were not depicted because they did not demonstrate positioning of related genes.

16

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig. 6. Size distribution of conserved non-nested synteny blocks, obtained by OrthoCluster, preserving gene order. We did not permit mismatches and did not consider strandedness. In total, 25, 28 and 18 conserved non-nested synteny blocks were predicted between G. hirsutum and G. raimondii (AADD- DD), G. hirsutum and G. arboretum (AADD- AA), G. raimondii and G. arboretum (DD- AA), respectively. Moreover, 12 conserved blocks of MLO candidate genes were found among the three genomes (AADD- DD- AA). These conserved segments contain different numbers of genes, ranging from 1 to 3 genes.

8

3.5. Gene expression analysis during leaf development

TE D

1 2 3 4 5 6 7

To estimate the temporal expression patterns of GhMLO genes during leaf development of upland cotton, we analysed

10

publicly available RNA-Seq data containing 3,624 differentially expressed genes during leaf development of 15-, 25-, 35-,

11

45-, 55-, and 65-day-old plants [43]. Four genes, GhMLO15, GhMLO21, GhMLO25 and GhMLO38, were differentially

12

expressed during the six leaf developmental stages from young and mature to senescent phases (Fig. 7). The relative

13

expression of GhMLO15 and GhMLO21 in leaves was higher than that of GhMLO25 and GhMLO38. GhMLO15 was

14

significantly up-regulated at almost all phases, although it sharply decreased at the 35-day-old stage. During leaf growth

15

stages, GhMLO21 transcripts continuously increased, especially after 35 days. GhMLO25 and GhMLO38 expression slightly

16

increased overall during leaf development. In summary, four differentially expressed genes were up-regulated during leaf

17

development, indicating that they could be involved in the regulation of leaf senescence.

18

3.6. Responses of GhMLOs to stress and phytohormonal stimuli

19 20

AC C

EP

9

To determine whether GhMLOs are involved in environmental stress or phytohormone signalling pathways, we examined gene expression of GhMLOs in response to exogenous application of salt, Mannitol, ABA, ETH, JA or SA. Experimental 17

ACCEPTED MANUSCRIPT data demonstrated that only five genes were not differentially expressed and twenty genes were sharply up-regulated or

2

down-regulated under treatments (Fig. 8). All data of gene expression were presented in Table S5. Transcripts of eleven

3

genes (Fig. 8A, B and C) were increased by more than ten-fold compared to non-treated plants, whereas six genes (Fig. 8D

4

and E) were found to be significantly suppressed in leaves under some of these conditions.

AC C

5

EP

TE D

M AN U

SC

RI PT

1

6 7 8 9

Fig. 7. Transcriptional variation of four G. hirsutum MLO genes during leaf development of 15-, 25-, 35-, 45-, 55-, and 65-day-old plants. Published RNA-Seq data [43] containing 3,624 differentially expressed genes during leaf development were analysed. Four genes, GhMLO15, GhMLO21, GhMLO25 and GhMLO38 were differentially expressed during six leaf developmental stages from young and mature to senescent phases.

10

Previous analysis of phylogenetic relationships revealed that ten G. hirsutum MLO homologues were clustered in clade V

11

and IV, which harboured all dicot and monocot MLO proteins functionally related to PM susceptibility [5, 6, 14, 23, 41].

12

Experimental results showed that all of them were differentially expressed under stress or phytohormone application. Three

13

(GhMLO6, GhMLO18 and GhMLO25) out of 38 genes were intensely responsive to ABA treatment. ABA is an important

14

phytohormone that can convert the initial stress signal, such as drought or high salinity, into a cellular response [44, 45]. 18

ACCEPTED MANUSCRIPT Intense induction of these three genes by ABA suggests that they are specifically involved in the response of the ABA

2

signalling pathway. GhMLO6, GhMLO11, GhMLO17 and GhMLO28 were dramatically up-regulated in upland cotton

3

leaves cultured in the presence of 200 µM ETH. ETH is a gas phytohormone with significant functions throughout the

4

whole dicotyledon and monocotyledon life cycle, ranging from growth and development to a variety of stress responses [46,

5

47]. In addition, three genes (GhMLO9, GhMLO17 and GhMLO23) showed opposite expression pattern in response to

6

exogenous JA and SA.

RI PT

1

The functions of MLO family members in modulating plant defence responses against PM and regulating cell death has

8

been confirmed [2, 5-7, 11, 24, 41]. However, accumulating evidence has suggested that MLO may be involved in a variety

9

of abiotic stresses [11, 27, 48-49]. Four GmMLOs from soybean were responsive to various abiotic stresses and

SC

7

phytohormone treatments [48]. The results of virus-induced silencing of CaMLO2 in chili pepper and over-expression in

11

Arabidopsis support that CaMLO2 participate in drought stress regulation, acting as a suppressor of ABA signalling [27].

12

The expression of HbMLO1 from the rubber tree was intensely induced by diverse phytohormones (including ethephon, JA,

13

SA, ABA, indole-3-acetic acid, and gibberellic acid), H2O2, and wounding treatments, but no intense response to PM

14

infection was found [45]. In the current study, various abiotic stresses and phytohormone treatments induced or suppressed

15

the expression of 33 GhMLOs. The clear response of GhMLOs to stress conditions or phytohormone supplement suggests

16

that they may participate in the salt, Man, ABA, ETH, JA and SA responsive signalling pathways. We propose that future

17

studies should focus on elucidating the roles of the MLO gene family in response to environmental stimuli.

18

4. Conclusion

TE D

Our work led to the identification of 22 MLO homologues in G. raimondii L., 17 in G. arboreum L. and 38 in G. hirsutum

EP

19

M AN U

10

L. The majority of the 77 Gossypium MLO members were organized as singletons, with the exception of five gene clusters.

21

After comprehensive analysis of pairwise alignments and the physical location of each GhMLO gene, we detected 12 pairs

22

of homologous genes and 2 tandem duplications. Clearly, the phylogenetic analysis performed in this study confirmed the

23

presence of clade VII, which was previously reported in the Rosaceae MLO family. A total of 83 conserved non-nested

24

synteny blocks were predicted after pairwise comparative analysis of MLO homologues among the three Gossypium species.

25

Four genes (GhMLO15, GhMLO21, GhMLO25 and GhMLO38) were differentially expressed during six leaf developmental

26

stages from young and mature to senescent phases. The general and intense response of GhMLOs to stress conditions or

27

phytohormone supplement suggests that MLO gene family may participate in the salt, Man, ABA, ETH, JA and SA

28

responsive signalling pathways in upland cotton.

AC C

20

19

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

1

2 20

ACCEPTED MANUSCRIPT 1 2 3 4 5 6 7

Fig. 8. Relative expression analysis of twenty G. hirsutum MLO genes in response to abiotic treatments and phytohormone application. Seven-day-old seedlings were cultured within MS solid medium as a control (CK) or supplemented with 50 µM ABA, 200 µM sodium chloride (NaCl), 200 µM ethylene (ETH), 100 µM JA, 1 mM SA or 200 mM mannitol (Man). Treated and control plants were grown under the same conditions, and leaf samples were collected after three weeks of treatments. Three biological repeats were performed, and each sample contained three young leaves collected from one single plant. Data in the graph were mean values with standard deviation (error bar) from three replicates. Statistical analysis was conducted by Student’s t-test (** P<0.01, * P<0.05).

9

RI PT

8

Additional files: Table S1 Quantitative PCR primers of target genes used in the study.

11

Table S2 Ortholgous relationships generated between G. raimondii, G. arboreum and G. hirsutum MLO homologs.

12

Table S3 Information of synteny blocks predicted between G. hirsutum, G. raimondii and G. arboretum MLO homologs.

13

Table S4 Gene clusters found in three cotton species of 77 MLO homologs.

14

Table S5 Relative expression analysis of 38 G. hirsutum MLO genes in response to abiotic treatments or phytohormone

15

application.

16

Contributions:

M AN U

SC

10

Conceived and designed the experiments: Renhai Peng, Xiaoyan Wang and Shuxun Yu. Performed the experiments:

18

Xiaoyan Wang, Qifeng Ma and Lingling Dou. Analyzed the data: Xiaoyan Wang and Qifeng Ma. Contributed

19

reagents/materials/analysis tools: Renhai Peng and Shuxun Yu. Wrote the paper: Xiaoyan Wang. Edited the manuscript:

20

Xiaoyan Wang and Zhen Liu.

21

Acknowledgements

EP

We would like to thank doctoral candidate Xihua Li for her assistance of Circos software. The work described in this paper

AC C

22

TE D

17

23

was supported by the National High-tech Research and Development Projects of China (2013AA102601) and funded by the

24

Major Projects of Anyang City Science and Technology Plan (ANKE20140208).

25

Reference

26 27 28 29 30 31

[1] D.A. Glawe, The powdery mildews: a review of the world's most familiar (yet poorly known) plant pathogens, Annu Rev Phytopathol, 46 (2008) 27-51. [2] R. Buschges, K. Hollricher, R. Panstruga, G. Simons, M. Wolter, A. Frijters, R. van Daelen, T. van der Lee, P. Diergaarde, J. Groenendijk, S. Topsch, P. Vos, F. Salamini, P. Schulze-Lefert, The barley Mlo gene: a novel control element of plant pathogen resistance, Cell, 88 (1997) 695-705. [3] A. Devoto, P. Piffanelli, I. Nilsson, E. Wallin, R. Panstruga, G. von Heijne, P. Schulze-Lefert, Topology, subcellular 21

ACCEPTED MANUSCRIPT localization, and sequence diversity of the Mlo family in plants, J Biol Chem, 274 (1999) 34993-35004. [4] M.C. Kim, S.H. Lee, J.K. Kim, H.J. Chun, M.S. Choi, W.S. Chung, B.C. Moon, C.H. Kang, C.Y. Park, J.H. Yoo, Y.H. Kang, S.C. Koo, Y.D. Koo, J.C. Jung, S.T. Kim, P. Schulze-Lefert, S.Y. Lee, M.J. Cho, Mlo, a modulator of plant defense and cell death, is a novel calmodulin-binding protein. Isolation and characterization of a rice Mlo homologue, J Biol Chem, 277 (2002) 19304-19314. [5] C. Consonni, M.E. Humphry, H.A. Hartmann, M. Livaja, J. Durner, L. Westphal, J. Vogel, V. Lipka, B. Kemmerling, P. Schulze-Lefert, S.C. Somerville, R. Panstruga, Conserved requirement for a plant host cell protein in powdery mildew

RI PT

pathogenesis, Nat Genet, 38 (2006) 716-720.

[6] S. Pavan, A. Schiavulli, M. Appiano, A.R. Marcotrigiano, F. Cillo, R.G. Visser, Y. Bai, C. Lotti, L. Ricciardi, Pea powdery mildew er1 resistance is associated to loss-of-function mutations at a MLO homologous locus, Theor Appl Genet, 123 (2011) 1425-1431.

[7] Y. Bai, S. Pavan, Z. Zheng, N.F. Zappel, A. Reinstadler, C. Lotti, C. De Giovanni, L. Ricciardi, P. Lindhout, R. Visser, K.

SC

Theres, R. Panstruga, Naturally occurring broad-spectrum powdery mildew resistance in a Central American tomato accession is caused by loss of mlo function, Mol Plant Microbe Interact, 21 (2008) 30-39.

[8] S. Pavan, E. Jacobsen, R.G. Visser, Y. Bai, Loss of susceptibility as a novel breeding strategy for durable and broad-spectrum resistance, Mol Breed, 25 (2010) 1-12.

M AN U

[9] M.C. Kim, R. Panstruga, C. Elliott, J. Muller, A. Devoto, H.W. Yoon, H.C. Park, M.J. Cho, P. Schulze-Lefert, Calmodulin interacts with MLO protein to regulate defence against mildew in barley, Nature, 416 (2002) 447-451. [10] H. Xu, M.C. Heath, Role of calcium in signal transduction during the hypersensitive response caused by basidiospore-derived infection of the cowpea rust fungus, Plant Cell, 10 (1998) 585-598.

[11] P. Piffanelli, F. Zhou, C. Casais, J. Orme, B. Jarosch, U. Schaffrath, N.C. Collins, R. Panstruga, P. Schulze-Lefert, The barley MLO modulator of defense and cell death is responsive to biotic and abiotic stress stimuli, Plant Physiol, 129 (2002) 1076-1085.

TE D

[12] Z. Zheng, T. Nonomura, K. Boka, Y. Matsuda, R.G. Visser, H. Toyoda, L. Kiss, Y. Bai, Detection and quantification of Leveillula taurica growth in pepper leaves, Phytopathology, 103 (2013) 623-632. [13] A. Feechan, A.M. Jermakow, I.B. Dry, Grapevine MLO candidates required for powdery mildew pathogenicity?, Plant Signal Behav, 4 (2009) 522-523.

[14] A. Devoto, H.A. Hartmann, P. Piffanelli, C. Elliott, C. Simmons, G. Taramino, C.S. Goh, F.E. Cohen, B.C. Emerson, P.

EP

Schulze-Lefert, R. Panstruga, Molecular phylogeny and evolution of the plant-specific seven-transmembrane MLO family, J Mol Evol, 56 (2003) 77-88.

[15] Q. Liu, H. Zhu, Molecular evolution of the MLO gene family in Oryza sativa and their functional divergence, Gene, 409 (2008) 1-10.

AC C

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44

[16] A.M.J. Angela Feechan , Laurent Torregrosa, Ralph Panstruga, Ian B. Dry Identification of grapevine MLO gene candidates involved in susceptibility to powdery mildew, Functional Plant Biology 35 (2008) 1255-1266. [17] S. Konishi, T. Sasakuma, T. Sasanuma, Identification of novel Mlo family members in wheat and their genetic characterization, Genes Genet Syst, 85 (2010) 167-175. [18] R. Deshmukh, V.K. Singh, B.D. Singh, Comparative phylogenetic analysis of genome-wide Mlo gene family members from Glycine max and Arabidopsis thaliana, Mol Genet Genomics, 289 (2014) 345-359. [19] S.J. Zhou, Z. Jing, J.L. Shi, Genome-wide identification, characterization, and expression analysis of the MLO gene family in Cucumis sativus, Genet Mol Res, 12 (2013) 6565-6578. [20] S. Pessina, S. Pavan, D. Catalano, A. Gallotta, R.G. Visser, Y. Bai, M. Malnoy, H.J. Schouten, Characterization of the MLO gene family in Rosaceae and gene expression analysis in Malus domestica, BMC Genomics, 15 (2014) 618. [21] W.Y. Chen YB, Zhang H, Genome-wide analysis of the mildew resistance locus (MLO) gene family in tomato (Solanum lycopersicum L.), Plant Omics Journal, 7 (2014) 87-93. 22

ACCEPTED MANUSCRIPT [22] J. Acevedo-Garcia, S. Kusch, R. Panstruga, Magical mystery tour: MLO proteins in plant immunity and beyond, New Phytol, 204 (2014) 273-281. [23] Z. Zheng, T. Nonomura, M. Appiano, S. Pavan, Y. Matsuda, H. Toyoda, A.M. Wolters, R.G. Visser, Y. Bai, Loss of function in Mlo orthologs reduces susceptibility of pepper and tomato to powdery mildew disease caused by Leveillula taurica, PLoS One, 8 (2013) e70723. [24] M. Appiano, S. Pavan, D. Catalano, Z. Zheng, V. Bracuto, C. Lotti, R.G. Visser, L. Ricciardi, Y. Bai, Identification of NtMLO1, Transgenic Res, 24 (2015) 847-858.

RI PT

candidate MLO powdery mildew susceptibility genes in cultivated Solanaceae and functional characterization of tobacco [25] H.A.H. Zhongying Chen, Ming-Jing Wu, Erin J. Friedman, Jin-Gui Chen, Matthew Pulley, Paul Schulze-Lefert, Ralph Panstruga, Alan M. Jones, Expression analysis of the AtMLO gene family encoding plant-specific seven-transmembrane domain proteins, Plant Molecular Biology, 60 (2006) 583-597.

[26] Z. Chen, S. Noir, M. Kwaaitaal, H.A. Hartmann, M.J. Wu, Y. Mudgil, P. Sukumar, G. Muday, R. Panstruga, A.M. Jones, Two

SC

seven-transmembrane domain MILDEW RESISTANCE LOCUS O proteins cofunction in Arabidopsis root thigmomorphogenesis, Plant Cell, 21 (2009) 1972-1991.

[27] C.W. Lim, S.C. Lee, Functional roles of the pepper MLO protein gene, CaMLO2, in abscisic acid signaling and drought sensitivity, Plant Mol Biol, 85 (2014) 1-10.

M AN U

[28] A.H. Paterson, J.F. Wendel, H. Gundlach, H. Guo, J. Jenkins, D. Jin, D. Llewellyn, K.C. Showmaker, S. Shu, J. Udall, M.J. Yoo, R. Byers, W. Chen, A. Doron-Faigenboim, M.V. Duke, L. Gong, J. Grimwood, C. Grover, K. Grupp, G. Hu, T.H. Lee, J. Li, L. Lin, T. Liu, B.S. Marler, J.T. Page, A.W. Roberts, E. Romanel, W.S. Sanders, E. Szadkowski, X. Tan, H. Tang, C. Xu, J. Wang, Z. Wang, D. Zhang, L. Zhang, H. Ashrafi, F. Bedon, J.E. Bowers, C.L. Brubaker, P.W. Chee, S. Das, A.R. Gingle, C.H. Haigler, D. Harker, L.V. Hoffmann, R. Hovav, D.C. Jones, C. Lemke, S. Mansoor, M. ur Rahman, L.N. Rainville, A. Rambani, U.K. Reddy, J.K. Rong, Y. Saranga, B.E. Scheffler, J.A. Scheffler, D.M. Stelly, B.A. Triplett, A. Van Deynze, M.F. Vaslin, V.N. Waghmare, S.A. Walford, R.J. Wright, E.A. Zaki, T. Zhang, E.S. Dennis, K.F. Mayer, D.G. Peterson, D.S. fibres, Nature, 492 (2012) 423-427.

TE D

Rokhsar, X. Wang, J. Schmutz, Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton [29] F. Li, G. Fan, K. Wang, F. Sun, Y. Yuan, G. Song, Q. Li, Z. Ma, C. Lu, C. Zou, W. Chen, X. Liang, H. Shang, W. Liu, C. Shi, G. Xiao, C. Gou, W. Ye, X. Xu, X. Zhang, H. Wei, Z. Li, G. Zhang, J. Wang, K. Liu, R.J. Kohel, R.G. Percy, J.Z. Yu, Y.X. Zhu, S. Yu, Genome sequence of the cultivated cotton Gossypium arboreum, Nat Genet, 46 (2014) 567-572.

EP

[30] F. Li, G. Fan, C. Lu, G. Xiao, C. Zou, R.J. Kohel, Z. Ma, H. Shang, X. Ma, J. Wu, X. Liang, G. Huang, R.G. Percy, K. Liu, W. Yang, W. Chen, X. Du, C. Shi, Y. Yuan, W. Ye, X. Liu, X. Zhang, W. Liu, H. Wei, S. Wei, S. Zhu, H. Zhang, F. Sun, X. Wang, J. Liang, J. Wang, Q. He, L. Huang, J. Cui, G. Song, K. Wang, X. Xu, J.Z. Yu, Y. Zhu, S. Yu, Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution, Nat Biotechnol, 33 (2015) 524-530.

AC C

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44

[31] J. Yu, S. Jung, C.H. Cheng, S.P. Ficklin, T. Lee, P. Zheng, D. Jones, R.G. Percy, D. Main, CottonGen: a genomics, genetics and breeding database for cotton research, Nucleic Acids Res, 42 (2014) D1229- D1236. [32] R.E. Voorrips, MapChart: software for the graphical presentation of linkage maps and QTLs, J Hered, 93 (2002) 77-78. [33] Z. Gu, A. Cavalcanti, F.C. Chen, P. Bouman, W.H. Li, Extent of gene duplication in the genomes of Drosophila, nematode, and yeast, Mol Biol Evol, 19 (2002) 256-262. [34] J.D. Thompson, D.G. Higgins, T.J. Gibson, CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Res, 22 (1994) 4673-4680. [35] K. Tamura, G. Stecher, D. Peterson, A. Filipski, S. Kumar, MEGA6: Molecular Evolutionary Genetics Analysis version 6.0, Mol Biol Evol, 30 (2013) 2725-2729. [36] M.P. Ng, I.A. Vergara, C. Frech, Q. Chen, X. Zeng, J. Pei, N. Chen, OrthoClusterDB: an online platform for synteny blocks, BMC Bioinformatics, 10 (2009) 192. 23

ACCEPTED MANUSCRIPT [37] S. Fischer, B.P. Brunk, F. Chen, X. Gao, O.S. Harb, J.B. Iodice, D. Shanmugam, D.S. Roos, C.J. Stoeckert, Jr., Using OrthoMCL to assign proteins to OrthoMCL-DB groups or to cluster proteomes into new ortholog groups, Curr Protoc Bioinformatics, Chapter 6 (2011) Unit 6 12 11-19. [38] K.J. Livak, T.D. Schmittgen, Analysis of relative gene expression data using real-time quantitative PCR and the 2-

△△CT

method, Methods, 25 (2001) 402-408. [39] R. Kohel, Genetic nomenclature in cotton, J Hered, 64 (1973) 291-295. [40] R.C. Cronn, R.L. Small, J.F. Wendel, Duplicated genes evolve independently after polyploid formation in cotton, Proc Natl

RI PT

Acad Sci U S A, 96 (1999) 14406-14411.

[41] D.S. Kim, B.K. Hwang, The pepper MLO gene, CaMLO2, is involved in the susceptibility cell-death response and bacterial and oomycete proliferation, Plant J, 72 (2012) 843-855.

[42] R. Panstruga, Discovery of novel conserved peptide domains by ortholog comparison within plant multi-protein families, Plant Mol Biol, 59 (2005) 485-500.

SC

[43] M. Lin, C. Pang, S. Fan, M. Song, H. Wei, S. Yu, Global analysis of the Gossypium hirsutum L. Transcriptome during leaf senescence by RNA-Seq, BMC Plant Biol, 15 (2015) 43.

[44] M. Gonzalez-Guzman, G.A. Pizzio, R. Antoni, F. Vera-Sirera, E. Merilo, G.W. Bassel, M.A. Fernandez, M.J. Holdsworth, M.A. Perez-Amador, H. Kollist, P.L. Rodriguez, Arabidopsis PYR/PYL/RCAR receptors play a major role in quantitative

M AN U

regulation of stomatal aperture and transcriptional response to abscisic acid, Plant Cell, 24 (2012) 2483-2496. [45] H. Li, H. Jiang, Q. Bu, Q. Zhao, J. Sun, Q. Xie, C. Li, The Arabidopsis RING finger E3 ligase RHA2b acts additively with RHA2a in regulating abscisic acid signaling and drought response, Plant Physiol, 156 (2011) 550-563. [46] D. Van Der Straeten, M. Van Montagu, The molecular basis of ethylene biosynthesis, mode of action, and effects in higher plants, Subcell Biochem, 17 (1991) 279-326.

[47] Y.M. Qin, C.Y. Hu, Y. Pang, A.J. Kastaniotis, J.K. Hiltunen, Y.X. Zhu, Saturated very-long-chain fatty acids promote cotton fiber and Arabidopsis cell elongation by activating ethylene biosynthesis, Plant Cell, 19 (2007) 3692-3704.

TE D

[48] Q. Shen, J. Zhao, C. Du, Y. Xiang, J. Cao, X. Qin, Genome-scale identification of MLO domain-containing genes in soybean (Glycine max L. Merr.), Genes Genet Syst, 87 (2012) 89-98.

[49] B. Qin, F. Zheng, Y. Zhang, Molecular cloning and characterization of a Mlo gene in rubber tree (Hevea brasiliensis), J Plant

EP

Physiol, 175 (2015) 78-85.

AC C

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27

24

ACCEPTED MANUSCRIPT Highlights 1. Our work led to the identification of 77 MLO homologues in three Gossypium species.

RI PT

2. A total of 83 non-nested synteny blocks were predicted among the three species.

3. Four genes were differentially expressed during six leaf developmental stages.

4. 33 genes were induced or suppressed in response to stress or phytohormone

AC C

EP

TE D

M AN U

SC

treatments.

ACCEPTED MANUSCRIPT

Contributions:

AC C

EP

TE D

M AN U

SC

RI PT

Conceived and designed the experiments: Renhai Peng, Xiaoyan Wang and Shuxun Yu. Performed the experiments: Xiaoyan Wang, Qifeng Ma and Lingling Dou. Analyzed the data: Xiaoyan Wang and Qifeng Ma. Contributed reagents/materials/analysis tools: Renhai Peng and Shuxun Yu. Wrote the paper: Xiaoyan Wang. Edited the manuscript: Xiaoyan Wang and Zhen Liu.