Variability in amylose content of Bangladeshi rice cultivars due to unique SNPs in Waxy allele

Variability in amylose content of Bangladeshi rice cultivars due to unique SNPs in Waxy allele

Accepted Manuscript Variability in amylose content of Bangladeshi rice cultivars due to unique SNPs in Waxy allele Saima Shahid, Rokeya Begum, Samsad ...

3MB Sizes 0 Downloads 25 Views

Accepted Manuscript Variability in amylose content of Bangladeshi rice cultivars due to unique SNPs in Waxy allele Saima Shahid, Rokeya Begum, Samsad Razzaque, Jesmin, Zeba I. Seraj PII:

S0733-5210(16)30128-X

DOI:

10.1016/j.jcs.2016.07.006

Reference:

YJCRS 2178

To appear in:

Journal of Cereal Science

Received Date: 19 August 2015 Revised Date:

2 July 2016

Accepted Date: 7 July 2016

Please cite this article as: Shahid, S., Begum, R., Razzaque, S., Jesmin, , Seraj, Z.I., Variability in amylose content of Bangladeshi rice cultivars due to unique SNPs in Waxy allele, Journal of Cereal Science (2016), doi: 10.1016/j.jcs.2016.07.006. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT 1

Variability in amylose content of Bangladeshi rice cultivars due to unique SNPs in Waxy allele

2

Saima Shahida,c,1, Rokeya Beguma,c, Samsad Razzaquea, Jesminb and Zeba I. Seraja*

3

a

4

Dhaka, Dhaka-1000, Bangladesh

5

b

6

Bangladesh

Plant Biotechnology Laboratory, Department of Biochemistry and Molecular Biology, University of

RI PT

Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka-1000,

7 8

c

Contributed equally

SC

9

*Corresponding Author: Zeba I. Seraj, Plant Biotechnology Laboratory, Department of Biochemistry

11

and Molecular Biology, University of Dhaka, Dhaka-1000, Bangladesh. Tel: +880-2-8614708; fax:

12

+880-2-8615583/9127051. Email: [email protected].

M AN U

10

13 14

1

15

Life Sciences, Penn State University, University Park, PA 16802, USA

TE D EP AC C

16

Present address: Plant biology PhD program, Department of Biology, and the Huck Institutes of the

1

ACCEPTED MANUSCRIPT List of Abbreviations

18

AC = Amylose content

19

bp =Base pairs

20

cDNA = Complementary DNA

21

CTAB = Cetyltrimethylammonium bromide

22

DNA =Deoxyribonucleic acid

23

dNTP = Deoxynucleoside triphosphate

24

ESE = Exonic splicing enhancer

25

GBSS-I = Granule-bound starch synthase I

26

G3PDH = Glyceraldehyde-3-phosphate dehydrogenase

27

indel = Insertion/deletion

28

kb = Kilobase pairs

29

mM =Millimolar

30

mRNA = Messenger RNA

31

ng = Nanogram

32

pmol = Picomole

33

QTL = Quantitative trait loci

34

RNA = Ribonucleic acid

35

RT-PCR = Reverse transcription polymerase chain reaction

36

SF2/ASF = Splicing factor-2/ alternative splicing factor

37

SNP = Single nucleotide polymorphism

38

UTR =Untranslated region

39

Wx = Waxy

40

µl = Microlitre

41

µM = Micromolar

AC C

EP

TE D

M AN U

SC

RI PT

17

42 43 44 2

ACCEPTED MANUSCRIPT 45 46

Abstract

47 Waxy gene (Granule Bound Starch Synthase I) is responsible for amylose synthesis in the rice

49

endosperm. Several mutations in this gene have been shown to be responsible for variable amylose

50

content (AC) phenotypes. The G/T mutation in 5′ splice site of Waxy intron 1 has been traced as the

51

origin of the glutinous rice phenotype and differentiates low AC from non-glutinous intermediate/high

52

AC rice. Sequencing of Waxy promoter and 5′ noncoding regions from 22 rice cultivars showed that

53

the evolutionary pattern of all Bangladeshi non-glutinous and most glutinous rice accessions are in

54

line with the general patterns of South and Southeast Asia. However, three cultivars Khara Beruin,

55

Modhu Beruin White and Kathali Beruin Red with low to very low amylose lacked the G/T splice site

56

mutation. These were more closely related to non-glutinous cultivars based on their SNP patterns in

57

promoter and noncoding regions. Further sequencing revealed a unique C deletion at a pyrimidine

58

tract of intron 5 of these three cultivars that may cause slippage of intron splicing. Additional SNPs at

59

intron 9 and 10 were also identified among these cultivars. These Bangladeshi-genotype-specific

60

mutations could be the cause of waxy or low amylose phenotypes in these glutinous accessions.

SC

M AN U

TE D

Key words: Amylose content; Waxy gene; Glutinous and non-glutinous rice; SNPs

EP

62

AC C

61

RI PT

48

3

ACCEPTED MANUSCRIPT 1. Introduction

64

Starch in rice endosperm contains amylose and amylopectin polysaccharides. Amylose is principally a

65

linear molecule containing α(1→4) linked glucose units and makes up approximately 0-30% of total

66

starch. In contrast amylopectin is a branched molecule which contributes to approximately 70-100%

67

of total starch in rice endosperm (Martin and Smith, 1995). Higher amylose levels (20–30%) are

68

observed in many indica rice varieties in South Asia (Morishima et al., 1992). Lower amylose levels

69

(10–20%) are more common in japonica varieties that predominate in East Asia (Yamanaka et al.,

70

2004). Amylose content (AC) can be thus used to classify milled rice samples into different categories

71

such as glutinous/waxy (0–5% amylose), low AC (6–18%), intermediate AC (19–23%), or high AC

72

(>23%) types (Bergman et al., 2004). Low AC is usually associated with tender, cohesive, and glossy

73

cooked rice while high AC is associated with firm, fluffy, and separated grains in cooked rice (Juliano

74

et al., 1981). Therefore, AC is considered as one of the major characteristics for assessing rice

75

cooking and eating qualities. In addition, low AC rice varieties have long been used as a tool of

76

improving grain quality through conventional breeding as these represent an intermediate type

77

between glutinous and non-glutinous rice varieties (Dong, 2000; Sato et al., 2002).

SC

M AN U

TE D

78

RI PT

63

Genetic and molecular marker-based QTL analyses have revealed that the wide range of AC variation

80

in rice endosperm is mainly controlled by a major locus (Wx or Waxy gene) and multiple minor loci

81

(Fan et al., 2005; Inukai et al., 2000). The Wx gene in rice encodes the granule-bound starch synthase

82

I (GBSS-I) enzyme responsible for amylose biosynthesis (Okagaki and Wessler, 1988). The origin of

83

the glutinous phenotype such as those found in japonica rice has been traced to the Wxb allele, which

84

contains a G/T mutation in intron 1 splice donor site of the Wx gene (Olsen and Purugganan, 2002).

85

Presence of this splice donor site mutation leads to inefficient splicing of Wx pre-mRNA followed by

86

reduced expression of functional GBSS-I (Cai et al., 1998). In contrast, non-glutinous phenotype

87

observed in many indica rice varieties has been traditionally associated with Wxa allele, which lacks

88

the intron 1 G/T mutation. However many Northeast Asian rice varieties carrying this intron 1 G/T

89

mutation still show non-glutinous phenotype. This suggests that partial suppression of the intron 1

90

G/T mutation may have an important role in the development of non-glutinous rice of this region

AC C

EP

79

4

ACCEPTED MANUSCRIPT (Olsen and Purugganan, 2002). Amylose content of rice endosperm is therefore a major domestication

92

trait that has evolutionary importance. Among the minor alleles affecting amylose content, a

93

polymorphic CTn microsatellite in the 5′-untranslated region (UTR) of Wx exon 1 has been shown to

94

be correlated with various AC classes (Ayres et al., 1997). Other SNPs in coding regions of different

95

Wx alleles lead to decreased AC by dropping the binding efficiency of GBSS-I to starch granules,

96

changing amino acid at respective sites (Liu et al., 2009; Sato et al., 2002) and altering expression

97

pattern of GBSS-I in endosperm and anthers (Mikami et al., 1999). In addition to Wx, mutations in

98

genes such as Dull (Zeng et al., 2007) and shrunken (Asaoka et al., 1993) also indirectly affect AC in

99

rice.

SC

RI PT

91

100

Bangladesh is the world’s fourth largest rice producing country. It is considered as an enriched rice

102

germplasm reservoir with over 6500 wild, landraces and modern high yielding varieties (Elias et al.,

103

2011). Majority of the rice species in Bangladesh are indica varieties with high amylose content and

104

non-glutinous phenotype (Olsen and Purugganan, 2002). Several glutinous rice varieties, which are

105

locally known as ‘Beruin’ or sticky rice, are also cultivated in the Northeastern region of Bangladesh

106

(Dipti et al., 2003). These Beruin cultivars generally have low amylose content and are popular for

107

making traditional dishes on special occasions. However the molecular basis of low amylose content

108

in these local Beruin varieties is unknown. In this study we examined the nucleotide diversity at the

109

Wx locus of selected Bangladeshi glutinous Beruin and non-glutinous rice cultivars to identify the

110

underlying cause of AC variation in these genotypes. We also compared these Bangladeshi varieties

111

with known Wx haplotypes in an attempt to understand their evolutionary pattern across Asia.

113

TE D

EP

AC C

112

M AN U

101

2. Materials and Methods

114 115

2.1. Plant materials

116

Seeds of eleven Beruin cultivars, which are representative of the diverse traditional landraces unique

117

to the Northeastern region of Bangladesh (Sylhet), were collected from the local farmers. An equal

118

number of non-glutinous cultivars was then selected to allow identification of SNPs underlying the

5

ACCEPTED MANUSCRIPT glutinous phenotype in Bangladeshi cultivars (Fig. 1A and 1B). Nine of the selected non-glutinous

120

cultivars were from the same region as the Beruin cultivars. Two other popular aromatic cultivars,

121

Kalijira and Kataribhog, were collected from the Northwestern region of Bangladesh (Dinajpur) to

122

expand the genetic diversity of sampled non-glutinous cultivars. All of the selected cultivars are

123

traditional transplanted Aman or monsoon season varieties. The collected seeds were multiplied

124

during the planting season in the net house. Three plants of each cultivar were grown in a single pot to

125

collect immature seeds.

RI PT

119

126 2.2. Estimation of amylose content

128

The amylose content (AC) of all cultivars was determined according to previously published protocol

129

(Juliano et al., 1981). Nazirshail rice flour (AC 25%, Bangladesh Rice Research Institute) was used as

130

a control and potato starch flour (E. Merck, Germany) was used for generating the standard curve.

M AN U

131

SC

127

2.3. Genomic DNA Extraction

133

Genomic DNA was extracted from leaves of 3-week-old pooled plants (10-12 plants per cultivar)

134

using CTAB extraction method and in some cases by using the Qiagen DNeasy® plant mini kit. DNA

135

of all cultivars was quantified by spectrophotometry. Quality of extracted DNA was checked by 1%

136

agarose gel electrophoresis and compared with known concentration of Lambda DNA (Invitrogen,

137

USA).

EP AC C

138

TE D

132

139

2.4. PCR amplification and DNA sequencing

140

A 2.7 kb upstream regulatory region of the Wx gene including promoter, exon 1, intron 1 and part of

141

exon 2 was PCR amplified and sequenced in all 22 Bangladeshi rice cultivars. PCR amplification was

142

performed in two steps to amplify overlapping ~1.5 kb segments of the above upstream regulatory

143

region using primers described by Olsen and Purugganan (2002) (Supplementary Table 1). For each

144

primer set, 15 µl PCR reaction mixture containing 60 ng of DNA template, 1 × PCR buffer, 1.67 mM

145

MgCl2, 100 µM of each dNTP, 0.33 pmol Forward /Reverse primer, 1 unit Taq DNA polymerase and

146

0.1 unit Pfx® DNA polymerase (Invitrogen, USA) was used. Amplification conditions were 94 °C for

6

ACCEPTED MANUSCRIPT 147

5 min, followed by 35 cycles of 95 °C for 1 min, 65.2 °C for 1 min, 72 °C for 1 min and a final

148

extension of 72 °C for 8.30 min.

149 For further characterization of the Wx coding region in Shamudrophena, Kathali Beruin Red, Modhu

151

Beruin White, Khara Beruin and Mou Beruin, two different segments were amplified and sequenced

152

using four sets of primers (Supplementary Table 1). The first segment (~0.2 kb) included part of exon

153

2, which is known to contain a 23 bp duplication in some glutinous varieties. This segment was

154

amplified using primer pair sequences Glu-23F and Glu-23R as described by Wanchana et al. (2003).

155

The other ~2 kb segment including exons 3-10 was amplified using three overlapping primer pairs

156

(Supplementary

157

(http://frodo.wi.mit.edu/primer3/) based on sequence of Nipponbare Wx locus (OS06G0133000) in

158

Gramene database (www.gramene.org). PCR amplification conditions were the same as for the Wx

159

upstream regulatory region except the annealing temperature, which was optimized for each primer

160

pair (Supplementary Table 1).

1).

These

were

designed

using

primer3

program

TE D

161

primers

M AN U

Table

SC

RI PT

150

All PCR amplicons were purified using QiAquick gel extraction kit (Qiagen, Germany) and directly

163

sequenced in ABI 3730XL genetic analyzer using both forward-reverse and internal primers

164

(Supplementary Table 1). Each fragment was sequenced twice for SNP confirmation. All of the

165

sequences from this study have been deposited in GenBank (accession no. JF834042.1-JF834063.1

166

and KP675770-KP675774).

AC C

167

EP

162

168

2.5. Sequence analysis

169

Wx upstream regulatory region sequences from 22 Bangladeshi rice landraces were compared to 18

170

previously reported haplotypes across Asia (Genbank accessions AY136760–AY136784). All

171

multiple sequence alignments were generated using ClustalW (Thompson et al., 2002). SNP/indels

172

were identified using TASSEL (Bradbury et al., 2007). Neighbour-joining tree of aligned Wx

173

upstream regulatory region (2.7 kb) sequences was constructed using MEGA 6.06 software package

174

(Tamura et al., 2013) with Kimura two-parameter model and complete deletion of alignment gaps.

7

ACCEPTED MANUSCRIPT Bootstrap test with 1000 replicates was performed to ensure confidence in phylogeny. A median-

176

joining haplotype network was also generated based on the aligned sequences using SplitsTree 4.13.1

177

(Huson and Bryant, 2006). Motif change patterns due to SNPs in Wx promoter region and intron 1 of

178

Bangladeshi cultivars were predicted using the PLACE database (www.dna.affrc.go.jp/PLACE).

179

Amino acid changes due to SNP in sequenced Wx coding exons from selected Bangladeshi landraces

180

were predicted using the ExPASy translation tool (http://web.expasy.org/translate/). SplicePredictor

181

(http://bioservices.usd.edu/splicepredictor/) was used to predict splice-site modifying SNPs in intronic

182

regions of sequenced varieties. RegRNA (http://regrna.mbc.nctu.edu.tw/) was used to predict cis-

183

regulatory elements involved in mRNA splicing or transcriptional regulation.

184

SC

RI PT

175

2.6. Semi-quantitative RT-PCR and gene expression

186

Total RNAs from rice endosperm at 16-18 days after flowering were extracted using TRIZOL. First

187

strand cDNA were synthesized from total RNA following manufacturer’s protocol (Invitrogen, USA).

188

G3PDH (glyceraldehyde-3-phosphate dehydrogenase) gene-specific primers (Supplementary Table 1)

189

were used to optimize the cDNA concentration for estimating the Wx mRNA level. Nanodrop

190

spectrophotometer was used to measure and optimize the working concentration of the cDNA. Wx

191

mRNA level was assessed using cycle-dependent semi-quantitative RT-PCR with the WxRT1 primer

192

pair (Supplementary Table 1) for 30, 32 and 35 cycles. Expression levels of samples were estimated

193

by visual inspection after electrophoresis in a 0.9% agarose gel based on the band intensity of the

194

loading control G3PDH. Two sets of primer pairs - Wx-RT2 and Wx-RT3 (Supplementary Table 1)

195

were used to confirm the G/T SNP at the 5´-splice donor site of Wx mRNA of Bangladeshi cultivars.

196

Wx-RT2 primer pair was used to determine the splicing efficiency of the first intron in the selected

197

landraces. Nipponbare was used as a control as it contains G/T SNP at intron 1 splice donor site of Wx

198

gene (Liu et al., 2009).

AC C

EP

TE D

M AN U

185

199 200

3. Results

201 202

3.1. Amylose content of Bangladeshi cultivars

8

ACCEPTED MANUSCRIPT In order to identify the molecular basis of glutinous rice in Bangladesh, we first evaluated the amylose

204

content (AC) of 22 Beruin and non-Beruin traditional landraces that are locally considered as

205

glutinous and non-glutinous cultivars respectively. These selected cultivars were sampled from

206

intensive rice cultivation zones in Northeastern and Northwestern regions of Bangladesh that are

207

known to harbour many diverse landraces (Elias et al., 2011). Based on their AC, the selected

208

cultivars were assigned into three major categories - intermediate, low and glutinous/waxy. All non-

209

Beruin Bangladeshi cultivars in this study had intermediate AC ranging from 19.1% to 23.3%

210

(Fig.1A), confirming that these are indeed non-glutinous. In contrast, only seven out of eleven Beruin

211

varieties, traditionally associated as glutinous or sticky rice, had amylose content within the waxy

212

range (0-5.4%), as shown in Fig. 1B. Among the remaining four Beruin varieties, Kathali Beruin Red

213

and Kalo Beruin belonged to the low AC group (AC 13.7% and 8.2% respectively). The two other

214

cultivars Modhu Beruin Red and Push Beruin (AC 23.4% and 22.6% respectively) had intermediate

215

AC, similar to the Bangladeshi non-glutinous cultivars.

M AN U

SC

RI PT

203

216

3.2. Nucleotide variation at Wx upstream regulatory region of Bangladeshi cultivars

218

We compared the ~2.7 kb Wx upstream noncoding region sequences from selected Bangladeshi

219

cultivars to Olsen and Purugganan (2002) reported progenitor haplotypes F (non-glutinous) and G

220

(glutinous). These progenitor haplotypes differ only in the intron 1 G/T splice donor site mutation

221

(Olsen and Purugganan, 2002). Overall, Bangladeshi cultivars had 29 single nucleotide

222

polymorphisms (SNP) and 11 insertions/deletions (indel) in the sequenced upstream regulatory region

223

(Fig. 1C). Out of these, 15 SNP and 3 indels were found in the promoter regions and 13 SNP and 7

224

indels in intron 1. These SNP and indels in promoter and intron 1 were more frequent in non-

225

glutinous cultivars than the Beruin cultivars. In total, seven different microsatellite alleles: CT8, CT10,

226

CT11, CT12, CT16, CT17 and CT18, were identified in the Wx exon 1 (encoding 5′ untranslated region) of

227

Bangladeshi cultivars (Figure 1C). An additional SNP within Wx exon 2 that also codes for 5′ UTR

228

was observed only in one of the non-glutinous Bangladeshi cultivars.

229

All of the Bangladeshi non-glutinous cultivars in this study had G-SNP at the 5′ splice donor site of

AC C

EP

TE D

217

9

ACCEPTED MANUSCRIPT Wx intron 1 (Fig. 1C) similar to the known non-glutinous haplotype F. The CT10 or CT11 alleles within

231

Wx exon 1 were observed in six of these non-glutinous Bangladeshi cultivars. The CT17 allele was

232

present in cultivars with variable AC, ranging from intermediate AC in Bangladeshi non-glutinous

233

cultivars to low and waxy AC content in the Beruin cultivars (Fig. 1). The CT12/ CT18 and CT8/ CT16

234

alleles were specific to non-glutinous and glutinous Bangladeshi cultivars, respectively.

RI PT

230

235

G/T SNP at the 5' splice site of Wx intron 1 could explain the waxy or low AC phenotype (AC 4.4-

237

8.2%) of six out of eleven Bangladeshi Beruin cultivars. These cultivars also carried CT17 allele in

238

exon 1 similar to the known glutinous progenitor haplotype G (Fig. 1C). Among these six Beruin

239

cultivars, Pak Beruin, Kathali Beruin white and Kalo Beruin had an additional novel G/A SNP at

240

position 718 in the Wx promoter region (Fig. 1C). The remaining five Beruin cultivars lacked the G/T

241

SNP in intron 1 splice donor site. Among these, Push Beruin and Modhu Beruin Red also showed

242

intermediate amylose phenotype (AC 22.6% and AC 23.4% respectively) like non-glutinous

243

Bangladeshi cultivars in this study (Fig. 1A). These varieties also had other nucleotide variations in

244

Wx regulatory region that were similar to the Bangladeshi non-glutinous cultivars (Fig. 1C). This

245

confirmed that Push Beruin and Modhu Beruin Red might be locally misnamed as glutinous varieties

246

even though they have non-glutinous phenotype.

M AN U

TE D

EP

247

SC

236

The other three Beruin cultivars that lacked the G/T SNP in splice donor site (Khara Beruin, Modhu

249

Beruin White and Kathali Beruin Red) had waxy or low AC phenotype (Fig. 1B). Among these only

250

Khara Beruin showed several unique mutations in the Wx upstream noncoding region (T/A SNP at

251

216 position in promoter, G/T SNP at position 1934 in intron 1). This indicated that polymorphisms

252

other than G/T SNP in intron 1 splice donor site and CTn allele in exon 1 may play a major role in

253

regulating Wx gene expression in these three genotypes.

AC C

248

254 255

3.3. Evolutionary relatedness of Bangladeshi cultivars

256

An unrooted phylogenetic tree based on the aligned ~2.7 kb Wx upstream regulatory region sequences

257

of selected Bangladeshi cultivars and 18 other reported haplotypes (Olsen and Purugganan, 2002) was

10

ACCEPTED MANUSCRIPT constructed to explore the evolutionary relationships among these accessions. These 18 reported

259

haplotypes represent the landrace diversity across Asia. In the phylogenetic tree, six Bangladeshi

260

glutinous cultivars (Akia Beruin White, Akia Beruin Red, Mou Beruin, Kalo Beruin, Kathali Beruin

261

White and Pak Beruin) clustered with reported glutinous haplotypes G, I, K, L and Q at high bootstrap

262

(98%) value (Fig. 2). Pak Beruin, Kalo Beruin, Kathali Beruin White along with a non-glutinous

263

cultivar Shamudrophena also formed a sub clade depending on the G/A SNP in position 718 of the Wx

264

promoter region (Fig. 2). These cultivars may carry the same Wx allele or with an additional mutation

265

(718 G/A).

RI PT

258

SC

266

On the other hand, five glutinous Beruin cultivars that lacked the intron 1 G/T SNP were grouped

268

with reported non-glutinous haplotypes and Bangladeshi non-glutinous cultivars. Intermediate AC

269

cultivars Push Beruin and Modhu Beruin Red clustered with reported non-glutinous haplotypes B, A

270

and Bangladeshi non-glutinous cultivars (Fig. 2). Waxy cultivar Khara Beruin clustered with reported

271

non-glutinous haplotypes E and D at 64% bootstrap and with non-glutinous haplotype C at 93%

272

bootstrap (Fig. 2). Based on the aligned Wx upstream regulatory region sequences, waxy AC Modhu

273

Beruin White and low AC Kathali Beruin Red appeared to be very close to intragenic recombinant

274

haplotype R and non-glutinous haplotype S, respectively, even though neither shared similar amylose

275

phenotype with the corresponding haplotypes (Fig. 2). Bangladeshi non-glutinous cultivars were

276

clustered with reported non-glutinous haplotypes but were more close to the haplotype F, A and B

277

(Fig. 2). In addition, Chinigura, Kalijira and Kataribhog distinctly clustered with non-glutinous

278

progenitor haplotype F at 48% bootstrap (Fig. 2). It might be possible that the Bangladeshi non-

279

glutinous cultivars originally carried the non-glutinous progenitor Waxy allele and then evolved into

280

haplotypes A and B through subsequent mutations at promoter region and intron 1.

AC C

EP

TE D

M AN U

267

281 282

3.4. Nucleotide variations at coding region of Wx gene in selected Beruin cultivars

283

Since G/T SNP at intron 1 splice donor site was not enough to explain the waxy AC of Modhu Beruin

284

White and Khara Beruin and low AC of Kathali Beruin Red, we further sequenced two segments in

285

the coding region of Wx gene. Mou Beruin (AC 4.4%) and Shamudrophena (AC 23.4%) were

11

ACCEPTED MANUSCRIPT considered as references for Bangladeshi glutinous and non-glutinous cultivars respectively for this

287

analysis. The first sequenced segment in Wx gene included part of coding region of exon 2 (~196 bp)

288

which has been reported to contain a 23 bp frame-shift duplication in some glutinous varieties that

289

lead to non-functional Wx proteins (Mikami et al., 2008; Wanchana et al., 2003). Among the selected

290

cultivars, this 23 bp duplication was only identified in Mou Beruin, consistent with its glutinous

291

phenotype (Table 1). The second segment analyzed in Wx gene spanned ~1.9 kb genomic region

292

including 3' end of exons 2-10 and introns 2-10. Only the non-glutinous reference Shamudrophena

293

among the five cultivars carried A/C SNP in exon 6 (position 671 from start codon), which resulted in

294

nonsynonymous amino acid change from tyrosine to serine (Table 1). This SNP has been shown to be

295

associated with intermediate AC (21-24%) previously (Dobo et al., 2010). Two other coding region

296

SNPs (T/C in exon 9, C/T in exon 10) were identified in the three selected Beruin cultivars but not in

297

the non-glutinous or glutinous reference (Table 1). Exon 9 SNP (T/C, position 1109 from start codon)

298

was found in all three selected cultivars (Table 1). The exon 9 SNP results in a synonymous change in

299

amino acid and has been previously reported as a silent mutation (Larkin and Park, 2003). Exon 10

300

SNP (C/T) was found only in low AC Kathali Beruin Red (Table 1). Exon 10 SNP (C/T) caused a

301

nonsynonymous change from a nonpolar amino acid proline (CCT) to polar amino acid Serine (TCT),

302

and it has been reported that this SNP is linked to high amylose content (Larkin and Park, 2003).

303

Therefore these exon-specific mutations were not directly responsible for the variant AC phenotype

304

traits observed for the selected Beruin cultivars.

SC

M AN U

TE D

EP

AC C

305

RI PT

286

306

Most interestingly, Kathali Beruin Red, Modhu Beruin White and Mou Beruin had overall four SNPs

307

and two indels in Wx introns which were absent in the Bangladeshi non-glutinous and glutinous

308

references (Table 1). Among these variations, C deletion in intron 5, G/A (at positions 81 and 95 from

309

5' splice site of intron 10) and ATA deletion at position 104-106 in intron 10 were identified in all

310

these three cultivars. On the other hand, G/A in intron 10 (position 118) was found only in low AC

311

Kathali Beruin Red and A/G at position 29 from 5' splice site of intron 9 was only observed in the

312

waxy AC Khara Beruin and Modhu Beruin White. A search for putative cis-elements that might be

313

affected by these SNPs/indel revealed that the C indel was located within a poly-C tract in intron 5,

12

ACCEPTED MANUSCRIPT 314

and this indel resulted in absence of a putative INTRONLOWER motif in these three cultivars

315

(Supplementary Table 2). Also, the G/A indel in intron 10 (at position 95 from 5' splice site) resulted

316

in a missing putative ESE-SF2/ASF motif (Exonic Splicing Enhancer) in these three cultivars

317

(Supplementary Table 2).

RI PT

318 Further homology search using NCBI BLASTn revealed that the sequenced coding regions of Khara

320

Beruin and Modhu Beruin White shared ~99% identity with previously reported Wxop (opaque) allele

321

containing cultivars ARC10818, ACC35618 and ARC6622 (Genbank accessions AB281448.1,

322

AB281447.1, and AB281453.1) from Nepal and India (Mikami et al., 2008, 1999). These Wxop

323

accessions did not contain the G/T SNP at the 5´ splice donor site of intron 1 and also had

324

nonsynonymous A/G SNP in exon 4 at position 521 from start codon (Mikami et al., 2008, 1999).

325

However the latter exon 4 SNP was not present in the Khara Beruin and Modhu Berun White (Table

326

1). Comparison of the rice grains revealed that only Khara Beruin had a completely opaque or chalky

327

endosperm, while Modhu Beruin White had partially chalky endosperm (Fig. 3), which supports the

328

low to very low amylose content of these varieties.

M AN U

TE D

329

SC

319

3.5. Wx gene expression in selected Beruin cultivars without G/T SNP in splice donor site

331

Semi-quantitative RT-PCR was used to estimate total expression levels of Wx gene in Modhu Beruin

332

White (waxy AC), Kathali Beruin Red (low AC), Push Beruin and Modhu Beruin Red (intermediate

333

AC). Nipponbare was used as a control since it is known to show decreased splicing efficiency of Wx

334

transcript due to G/T SNP at intron 1 splice donor site (Liu et al., 2009). The results showed that Wx

335

transcript levels in Push Beruin, Modhu Beruin Red and Kathali Beruin Red were relatively much

336

higher compared to the control Nipponbare, (Fig. 4A). This suggests that the splicing efficiency of Wx

337

transcript is not affected in Push Beruin, Modhu Beruin Red as well as Kathali Beruin Red. This

338

observation is consistent with the absence of G/T SNP at intron 1 and the intermediate-to-low AC in

339

these cultivars. The expression level of Wx transcript in Modhu Beruin White (intron 1 splice site

340

mutation absent) was also higher than that of Nipponbare but lower than the aforementioned three

341

Beruin cultivars (Fig. 4A). The transcript level however correlates with the waxy AC of Modhu

AC C

EP

330

13

ACCEPTED MANUSCRIPT Beruin White.

343

RT-PCR with primers specific for unspliced variant of Wx transcript (Liu et al., 2009) confirmed the

344

absence of intron 1 containing 1.1 kb fragment in all of the selected Beruin cultivars (Fig. 4B). A

345

second set of RT-PCR primers previously described by Prathepha (2007) was used to re-confirm the

346

absence of G/T SNP at intron 1 splice donor site of Wx gene of these selected cultivars (Fig. 4C).

347

Nipponbare showed a predominant band at 120 bp and another band at 210 bp, while the selected

348

Beruin cultivars only showed the 210 bp band (Fig. 4C). This is consistent with previous observations

349

that cultivars carrying intron 1 G/T SNP showed a predominant band at 120 bp with an additional

350

band at 210 bp, indicating at least two alternative variants of Wx transcript, but cultivars without the

351

mutation showed only a predominant band at 210 bp (Prathepha, 2007).

SC

RI PT

342

353

M AN U

352 4. Discussion

354

4.1. Distribution of Bangladeshi rice cultivars in context with Asian rice cultivars

356

In this study, 22 Bangladeshi glutinous and non-glutinous cultivars were placed in specific positions

357

along with 18 previously reported Wx haplotypes (Olsen and Purugganan, 2002) based on

358

phylogenetic- and haplotype network analyses of Wx upstream regulatory region sequences (Fig. 2

359

and 5). The previously reported Wx haplotypes were identified using 105 accessions from different

360

countries of Asia (Olsen and Purugganan, 2002). Bangladeshi non-glutinous cultivars Kalijira,

361

Kataribhog and Chinigura have the same Wx allele as haplotype F with no additional mutation. As

362

such, these cultivars were placed with non-glutinous progenitor haplotype F. Previously, Olsen and

363

Purugganan (2002) also showed that 39% of accessions in Southeast Asia and 15% of the South Asian

364

accessions contain this F haplotype. Four non-glutinous cultivars (Lathial White, Lathial Red,

365

Girishail, and Raujan 2) and one intermediate amylose containing cultivar Modhu Beruin Red bear

366

closer identity to haplotype A and were placed with it (Fig. 5).

AC C

EP

TE D

355

367 368

Three other Bangladeshi non-glutinous cultivars - Balam White, Balam Red and Raujan, and one

369

intermediate amylose containing cultivar Push Beruin were placed in the same node as haplotype B

14

ACCEPTED MANUSCRIPT (Fig. 5). Non-glutinous haplotypes A and B were derived from haplotype F by subsequent mutations

371

at both promoter and intron1 region. Haplotype A is more frequent in Southeast Asia and haplotype B

372

is more frequent in South Asia (Olsen and Purugganan, 2002). Therefore, Bangladeshi non-glutinous

373

and Beruin rice cultivars with intermediate AC carry the general pattern prevalent in South and

374

Southeast Asia.

RI PT

370

375

Among the Bangladeshi glutinous cultivars, Mou Beruin, Akia Beruin White and Akia Beruin Red

377

have the same sequence identities as the glutinous progenitor haplotype G in Wx upstream region

378

(Fig. 5). Most of the glutinous rice accessions in Southeast, North and South Asia have been reported

379

to carry haplotype G. Bangladeshi glutinous cultivars Kalo Beruin, Pak Beruin and Kathali Beruin

380

White contained an additional G/A SNP at position 718 of Wx promoter along with the G/T SNP at

381

the 5′ splice donor site of intron 1 (Fig. 1). These cultivars clustered with the glutinous haplotype

382

group (L, K, O and I), which also have additional mutations located at promoter and intron 1 but at

383

different positions (Fig. 5). The 718 (G/A) SNP has not yet been reported and is therefore unique to

384

Bangladeshi glutinous cultivars as well as the non-glutinous Shamudrophena.

TE D

M AN U

SC

376

385

From the median-joining network analysis (Fig. 5), low AC Bangladeshi Kathali Beruin Red was

387

found to be close to the non-glutinous haplotype S. Two other waxy AC cultivars Khara Beruin and

388

Modhu Beruin White were placed with non-glutinous haplotype C and recombinant haplotype R in

389

the Wx haplotype network (Fig. 5). This suggests that some Bangladeshi glutinous Beruin cultivars

390

may have a different allele compared to glutinous cultivars of Asia responsible for waxy or low AC

391

phenotype.

AC C

392

EP

386

393

4.2. Bangladesh genotype-specific changes

394

Several studies have reported specific SNP or indels in Wx exons that decrease AC in rice seeds by

395

reducing the binding of GBSS-I to starch granules or creating stop codon that terminates the

396

translation of Wx transcript (Liu et al., 2009; Wanchana et al., 2003). However, waxy Khara Beruin,

397

Modhu Beruin White and Kathali Beruin did not have any of those SNPs. Pairwise sequence

15

ACCEPTED MANUSCRIPT comparisons of these cultivars with Olsen and Purugganan (2002) reported haplotypes showed that

399

some mutations at both promoter and intron 1 region were shared between Khara Beruin and

400

haplotype C (99.93% sequence identity), Modhu Beruin White and haplotype R (99.19% identity) as

401

well as Kathali Beruin Red and haplotype S (99.26% identity). Some of these mutations were found to

402

be located within putative cis-regulatory motifs in the Wx promoter site in Beruin cultivars

403

(Supplementary Table 2). Most interestingly, these three cultivars have a C deletion at a

404

polypyrimidine tract of intron 5 as well as two G/A SNPs and an ATA deletion at intron 10. This C

405

deletion in intron 5 consequentially causes loss of the putative INTRONLOWER motif in that region.

406

This may cause slippage of intron splicing and subsequent translation (Brown, 1986; Sridharan and

407

Singh, 2007). Also, the G/A indel in intron 10 (at position 95 from 5' splice site) which affected a

408

putative Exonic Splicing Enhancer motif might cause exon skipping in these three cultivars

409

(Supplementary Table 2). Therefore, these mutations might be another cause of the low AC in Khara

410

Beruin, Modhu Beruin White and Kathali Beruin Red despite lack of the G/T SNP at the intron 1

411

splice donor site, which is known as the main cause of waxy or low amylose phenotype.

M AN U

SC

RI PT

398

TE D

412

Waxy amylose containing Khara Beruin and Modhu Beruin White also contained one SNP at intron 9

414

(G/A) and showed 99% similarity with opaque or chalky endosperm containing rice cultivars found in

415

Nepal, India, Myanmar and Indonesia. On the other hand, low amylose containing Kathali Beruin Red

416

contained one G/A SNP in position 118 in intron 10 that might create an alternative splice acceptor

417

site in intron 10 (Supplementary Table 2). This SNP in intron 10 has not been reported previously.

418

These SNPs at introns 9 and 10 also might have an effect on splicing as well as gene expression and

419

varying amylose between waxy Khara Beruin, Modhu Beruin and Low AC Kathali Beruin Red.

420

However, the transcript levels of the Wx gene in these three Beruin cultivars were not low compared

421

to Nipponbare, which carries the G/T splice site SNP in intron 1. Whether the above SNPs in the

422

promoter and introns alone are responsible for regulating Wx transcript levels in these cultivars or

423

other players are involved requires further investigation.

AC C

EP

413

424 425

5. Conclusion

16

ACCEPTED MANUSCRIPT Bangladeshi non-glutinous and glutinous cultivars with intermediate AC carried the general pattern of

427

nucleotide variation in Wx upstream region as observed in South and Southeast Asian cultivars. Some

428

exceptional Wx alleles were also identified in Bangladeshi glutinous cultivars Khara Beruin, Modhu

429

Beruin White and Kathali Beruin Red. These cultivars lack the G/T splice site mutation in intron 1

430

region as non-glutinous cultivars, but yet have waxy and/or low amylose phenotype. However these

431

cultivars have some SNPs in promoter region that may alter putative cis-regulatory motifs involved in

432

Wx gene regulation. Additionally, a putative INTRONLOWER motif (consensus sequence for plant

433

introns) and an Exonic Splicing Enhancer motif were lost in these cultivars due to indels in Wx introns

434

5 and 10 respectively. Khara Beruin and Modhu Beruin White (waxy AC) also contained a SNP at

435

intron 9 and Kathali Beruin Red (low AC) contained two SNPs at introns 6 and 10. It can be

436

hypothesized that the intron- and promoter-specific mutations found in these three cultivars could be

437

the cause of the waxy or low amylose phenotype. This hypothesis however needs to be investigated

438

further through functional characterization of these motifs and confirmation of alternative Wx variants

439

due to misregulation of splicing.

TE D

440

M AN U

SC

RI PT

426

Acknowledgement

442

Funding for this research, including fellowships for RB and SS and consumables and equipment, was

443

received from the Bangladesh Chapter of USDA under the 416-(B) grant aid. BAS-USDA-PALS

444

project also provided a 6-month extension fellowship to RB. We would like to thank Dr. Abdul

445

Chaudhury and Munir Hasan for providing information on the Beruin cultivars of Bangladesh and

446

encouraging us to undertake the work, Md. Sazzadur Rahman (Senior Scientific Officer, Bangladesh

447

Rice Research Institute) for helping with seed collection and Md. Shamim Hossain for taking care of

448

the plants.

AC C

EP

441

449 450

References

451

Asaoka, M., Okuno, K., Yano, M., Fuwa, H., 1993. Effects of Shrunken and Other Mutations on the

452 453

Properties of Rice Endosperm Starch. Starch/Stärke 45, 383–387. Ayres, N.M., McClung, A.M., Larkin, P.D., Bligh, H.F.J., Jones, C.A., Park, W.D., 1997.

17

ACCEPTED MANUSCRIPT 454

Microsatellites and a single-nucleotide polymorphism differentiate apparentamylose classes

455

in an extended pedigree of US rice germ plasm. Theoretical and Applied Genetics 94, 773–

456

781.

458

Bergman, C., Bhattacharya, K., Ohtsubo, K., 2004. Rice end-use quality analysis. In: Champagne, E.T. (Ed.), Rice Chemistry and Technology. AACC, MN, USA, pp. 415–472

RI PT

457

459

Bradbury, P.J., Zhang, Z., Kroon, D.E., Casstevens, T.M., Ramdoss, Y., Buckler, E.S., 2007.

460

TASSEL: software for association mapping of complex traits in diverse samples.

461

Bioinformatics 23, 2633–2635.

463

Brown, J.W.S., 1986. A catalogue of splice junction and putative branch point sequences from plant introns. Nucleic Acids Research 14, 9549–9559.

SC

462

Cai, X.L., Wang, Z.Y., Xing, Y.Y., Zhang, J.L., Hong, M.M., 1998. Aberrant splicing of intron 1

465

leads to the heterogeneous 5′ UTR and decreased expression of waxy gene in rice cultivars of

466

intermediate amylose content. The Plant Journal 14, 459–465.

468

Dipti, S., Bari, M., Kabir, K., 2003. Grain quality characteristics of some Beruin rice varieties of Bangladesh. Pakistan Journal of Nutrition 2, 242-245.

TE D

467

M AN U

464

Dobo, M., Ayres, N., Walker, G., Park, W.D., 2010. Polymorphism in the GBSS gene affects amylose

470

content in US and European rice germplasm. Journal of Cereal Science 52, 450–456.

471

Dong, B., 2000. The potential commercial value of Yunnan “Zhefang rice.” Yunnan Agric. Sci.

473 474 475

Technol. 5, 13–15.

Elias, S.M., Hasan, A.K.M.M., Seraj, Z.I., 2011. Microsatellite marker diversity and sequence

AC C

472

EP

469

polymorphism in the red gene locus of indigenous rice populations of Bangladesh. Plant Systematics and Evolution 296, 157–165.

476

Fan, C.C., Yu, X.Q., Xing, Y.Z., Xu, C.G., Luo, L.J., Zhang, Q., 2005. The main effects, epistatic

477

effects and environmental interactions of QTLs on the cooking and eating quality of rice in a

478

doubled-haploid line population. Theoretical and Applied Genetics 110, 1445–1452.

479 480 481

Huson, D.H., Bryant, D., 2006. Application of Phylogenetic Networks in Evolutionary Studies. Molecular Biology and Evolution. 23, 254–267. Inukai, T., Sako, A., Hirano, H.Y., Sano, Y., 2000. Analysis of intragenic recombination at wx in rice:

18

ACCEPTED MANUSCRIPT 482

Correlation between the molecular and genetic maps within the locus. Genome 43, 589–596.

483

Juliano, B.O., Perez, C.M., Blakeney, A.B., Castillo, T., Kongseree, N., Laignelet, B., Lapis, E.T.,

484

Murty, V.V.S., Paule, C.M., Webb, B.D., 1981. International Cooperative Testing on the

485

Amylose Content of Milled Rice. Starch/Stärke 33, 157–162.

487

Larkin, P.D., Park, W.D., 2003. Association of waxy gene single nucleotide polymorphisms with

RI PT

486

starch characteristics in rice (Oryza sativa L.). Molecular Breeding 12, 335–339.

Liu, L., Ma, X., Liu, S., Zhu, C., Jiang, L., Wang, Y., Shen, Y., Ren, Y., Dong, H., Chen, L., others,

489

2009. Identification and characterization of a novel Waxy allele from a Yunnan rice landrace.

490

Plant Molecular Biology 71, 609–626.

SC

488

Martin, C., Smith, A.M., 1995. Starch biosynthesis. Plant Cell 7, 971–985.

492

Mikami, I., Aikawa, M., Hirano, H.Y., Sano, Y., 1999. Altered tissue-specific expression at the Wx

493

M AN U

491

gene of the opaque mutants in rice. Euphytica 105, 91–97.

Mikami, I., Uwatoko, N., Ikeda, Y., Yamaguchi, J., Hirano, H., Suzuki, Y., Sano, Y., 2008. Allelic

495

diversification at the wx locus in landraces of Asian rice. Theoretical and Applied Genetics

496

116, 979–989.

499 500 501 502 503 504

relatives. Oxford Surveys in Evolutionary Biology 8, 135–135. Okagaki, R.J., Wessler, S.R., 1988. Comparison of non-mutant and mutant waxy genes in rice and

EP

498

Morishima, H., Sano, Y., Oka, H.I., 1992. Evolutionary studies in cultivated rice and its wild

maize. Genetics 120, 1137–1143. Olsen, K.M., Purugganan, M.D., 2002. Molecular evidence on the origin and evolution of glutinous

AC C

497

TE D

494

rice. Genetics 162, 941–950.

Prathepha, P., 2007. Identification of variant transcripts of waxy gene in non-glutinous rice (O. sativa L.) with different amylose content. Pakistan Journal of Biological Sciences 10, 2500–2504.

505

Sato, H., Suzuki, Y., Sakai, M., Imbe, T., 2002. Molecular Characterization of Wx-mq, a Novel

506

Mutant Gene for Low-amylose Content in Endosperm of Rice (Oryza sativa L.). Breeding

507

Science 52, 131–135.

508 509

Sridharan, V., Singh, R., 2007. A conditional role of U2AF in splicing of introns with unconventional polypyrimidine tracts. Molecular and Cellular Biology 27, 7334–7344.

19

ACCEPTED MANUSCRIPT 510 511

Tamura, K., Stecher, G., Peterson, D., Filipski, A., Kumar, S., 2013. MEGA6: molecular evolutionary genetics analysis version 6.0. Molecular Biology and Evolution 30, 2725–2729. Thompson, J.D., Gibson, T.J., Higgins, D.G., 2002. Multiple Sequence Alignment Using ClustalW

513

and ClustalX. In: Baxevanis, A.D., Stein, L.D., Stormo, G.D., Yates, J.R. (Eds.), Current

514

Protocols in Bioinformatics. John Wiley & Sons, New York, pp. 2.3.1–2.3.22.

515 516

RI PT

512

Wanchana, S., Toojinda, T., Tragoonrung, S., Vanavichit, A., 2003. Duplicated coding sequence in the waxy allele of tropical glutinous rice (Oryza sativa L.). Plant Science 165, 1193–1199. Yamanaka, S., Nakamura, I., Watanabe, K.N., Sato, Y.I., 2004. Identification of SNPs in the waxy

518

gene among glutinous rice cultivars and their evolutionary significance during the

519

domestication process of rice. Theoretical and Applied Genetics 108, 1200–1204.

SC

517

Zeng, D., Yan, M., Wang, Y., Liu, X., Qian, Q., Li, J., 2007. Du1, encoding a novel Prp1 protein,

521

regulates starch biosynthesis through affecting the splicing of Wxb pre-mRNAs in rice (Oryza

522

sativa L.). Plant Molecular Biology 65, 501–509.

M AN U

520

AC C

EP

TE D

523

20

ACCEPTED MANUSCRIPT Figure Legends

2

Fig. 1: Amylose content and nucleotide variation in Wx upstream regulatory region of twenty-two

3

Bangladeshi rice cultivars. Comparison of amylose contents in (A) non-glutinous and (B) Beruin

4

(locally known as glutinous) cultivars revealed three distinct categories. Nucleotide polymorphisms in

5

~2.7 kb Wx upstream noncoding region of these varieties with respect to known non-glutinous and

6

glutinous progenitor haplotypes (Olsen and Purugganan, 2002) are shown in (C).

RI PT

1

7

Fig. 2: Phylogenetic tree based on Wx upstream regulatory region showing the relationship among

9

Bangladeshi cultivars and other reported haplotypes. Neighbour-joining method along with Kimura 2-

10

parameter distance were utilized for phylogenetic analysis. Bootstrap values (>40%) for 1000

11

replicates are shown at the nodes.

M AN U

12

SC

8

13

Fig. 3: Chalky endosperm of Khara Beruin, partially chalky in Modhu Beruin White and nearly

14

transparent endosperm of non-glutinous cultivar Kataribhog.

TE D

15

Fig. 4: Expression of Wx gene in Beruin cultivars lacking G/T SNP in intron 1 splice donor site. (A)

17

Semi-quantitative RT-PCR analysis of Wx gene expression in developing seeds of selected Beruin

18

cultivars. Nipponbare, which carries G/T SNP in intron 1, was used as a control. RT- PCR with two

19

different primer sets (B and C) re-confirmed that intron 1 is efficiently spliced in the selected Beruin

20

cultivars without the G/T splice site SNP.

AC C

21

EP

16

22

Fig. 5: Median-joining network for Bangladeshi cultivars and reported haplotypes based on Wx

23

upstream regulatory region. Here magenta and blue colors indicate Bangladeshi glutinous Beruin and

24

non-glutinous cultivars, respectively. Purple color indicates cultivars that were newly assigned as non-

25

glutinous (this study) despite being locally known as Beruin or sticky rice. Squares and diamonds

26

represent previously reported non-glutinous and glutinous haplotypes across Asia (Olsen and

27

Purugganan, 2002), respectively. Circles represent nodes in the network. Node size is proportional to

28

number of cultivars/haplotypes present in each node.

1

ACCEPTED MANUSCRIPT

Table 1. Nucleotide polymorphisms in Wx coding region of selected Bangladeshi cultivars.

RI PT

Region in Position Position Polymorph Reported Shamudro Kathali Modhu Khara Mou Wx locus from from 5' -ism type Wx allele -phena Beruin Beruin Beruin Beruin start splice Name (nonRed White (glutinous codon site of glutinous reference) intron reference) Exon 2 100 23 bp wx,a No No No No Yes duplication Exon 4 476 SNP (ns) Wxhp(A/G)b A A A A A Exon 4 497 SNP (ns) Wxmq(G/A) G G G G G

SC

c

SNP (ns) Wx (A/G)a A A SNP(ns) Wxmq(T/C)c T T SNP (ns) Wxin(A/C) d C A SNP (s) T C SNP (ns) C T 94 Indel C 29 SNP A A 81 SNP G A 95 SNP G A 104- 106 indel ATA 118 SNP G A Here ‘ns’ and ‘s’ refers to nonsynonymous and synonymous amino acid 521 574 671 1109 1266

M AN U

Exon 4 Exon 5 Exon 6 Exon 9 Exon 10 Intron 5 Intron 9 Intron 10

op

A A A T T T A A A C C T C C C C G G A A A G A A G ATA G G G changes, respectively.

TE D

Sites without polymorphism in any of the cultivars are shaded in grey. a(Mikami et al., 2008,

AC C

EP

1999) ,b(Liu et al., 2009), c(Sato et al., 2002), d(Dobo et al., 2010)

ACCEPTED MANUSCRIPT

Non-glutinous cultivars 23.1

23.3 23.1 22.5 23.1 23.3

21.48

25

21.48

23.4 22.6

20

15

15 10

5

5

0

4.5 4.5 5.4

5.2 4.9 4.4

0

Low AC (6-18%)

C 1

2

Promoter

5 6 7

8

9

23 56 57 67 216 255 256 307 333 350 387 398 475 534 677 718 1250 1283

Exon 1 5ʹ UTR

10

11 12 13

14

Intron 1

Exon 2 5ʹ UTR

A A A A A G G G G G G G

C C C C C T T T T T T T

T T T T T T T T T T T T

G G G G G G G

C C C C C C C

G G G G G G G G G G G G

T T T T T C C C C C C C

T T T T T G G G G G G G

T T T T T C C C C C C C

A A A A A G G G G G G G

C C C C C T T T T T T T

G G G G G A A A A A A A

T -

G G G G A G G G G G G G

A A A A A G G G G G G G

C 16-20 G G T A 5 A C A G C A C 18 G G T A 5 A C A G C A C 18 G G T A 5 A C A G C A C 17 G G T A 6 A C A G C A C 17 G G T A 5 A C A G C A T 12 G A T A 6 G T G - T C T 11 G A T A 6 G T G - T C T 11 G A T A 6 G T G - T C T 11 G A T A 6 G T G - T C T 10 G A T A 5 G C G G T C T 10 G A - - 5 A C G G T C T 10 G A - - 5 A C G G T C

T T T T T G G G G G G G

G G G G G G G G G G G G

C C C C C T T T T T T T

T T T T T C C C C C C C

C -

A A A A A A A A A A -

A A A A A A A A A G A A

T T T T T C C C C C C C

T T

G G G G G G G G G G A G

C C C C C C C C C A A A

T T T T T T T C T C C C

A A A A A A A G A G G G

C C C C C C C T C T T T

T T T T T T T A T T T T

G G

C C

G G G G G G G G G G G C

T T T T T T T T T T C C

T T T T T T T G T T G G

T T T T T T T C T T C C

A A A A A A A A A A G G

C C C C C C C T C C T T

G G G G G G G A G G A A

T -

G G G G A A G A A G G A

A A A A A A G G A G G G

C 16-18 T G T A 5 A C A G C A C 17 T G T A 5 A C A G C A C 17 T G T A 5 A C A G C A C 17 T G T A 5 A C A G C A C 17 T G T A 5 A C A G C A C 17 T G T A 5 A C A G C A T 16 G A T A 5 G C G - T C T 8 G A T A 6 G T G - T C C 17 T G T A 5 A C A G C A T 11 G A T A 6 G T G - T C T 11 G A T A 6 G T G - T C T 10 G A - - 5 A C A G T C

T T T T T T G G T G G G

G G G G G G G T G G G G

C C C C C C T T C T T T

T T T T T T C C T C C C

-

A A A A A A A A A A -

A A A A A A A A A A A A

T T T T T T C C T C C C

T

G G G G G G G G G G G G

TE D

T T T T T C C C C C C C

EP

Haplotype G Akia Beruin White Akia Beruin Red Mou Beruin Pak Beruin Kathali Beruin White Modhu Beruin White Khara Beruin Kalo Beruin Kathali Beruin Red Modhu Beruin Red Push Beruin

4

C C C C C A A A A C A A

AC C

Non-glutinous

Haplotype F Kataribhog Chinigura Kalijira Shamudrophena Lathial White Lathial Red Girishail Raujan 2 Balam Red Balam White Raujan

Beruin (local glutinous)

Type Cultivar name

3

4.2

Waxy AC (0-5%)

M AN U

~2.7kb

1505,(CT)n

Wx Exons

SC

Ak ia

Ka

Intermediate AC (19-23%)

8.2

RI PT

10

13.7

2733

19.1

20.5

1600, Splice site 1685 1686 1687 1747, (AATT)n 1819 1846 1863 1902 1914 1917 1927 1934 1951 1953 2016 2171 2285 2304 2584

20

20.7

Amylose Content (%) B Ak eru ia in Be W ru hite M in R ou e Ka d th P Be al ak r u M i B B in od e r er u uin uin Be W ru h Kh in W ite ar a hit Ka K Be e a r th a lo ui M li B Be n od e r u hu ru in Be in R ru e Pu in d sh Re Be d ru in

25

Beruin (local glutinous) cultivars

B

ta ri C bho hi ni g gu Sh am K ra a ud liji La rop ra th he ia na La l W th hit ia e lR G ed iri R sha au il Ba jan la 2 Ba m la Re m d W hi t R e au ja n

Amylose Content (%)

A

ACCEPTED MANUSCRIPT

64

Bangladeshi non-glutinous Beruin cultivars Bangladeshi non-glutinous cultivars Known non-glutinous haplotypes Non-glutinous progenitor haplotype

82

Modhu Beruin White Kathali Beruin Red

52 64

AC C

EP

0.001

66

TE D

50

Haplotype S Khara Beruin Haplotype C Haplotype E Haplotype D Push Beruin Balam Red Balam White Haplotype B Raujan Haplotype A Modhu Beruin Red Raujan 2 Girishail 74 Lathial Red Lathial White

Clade 1 CT16-20

Sub clade 1.2 G at Intron 1 splice site (except haplotype R)

M AN U

93

59

Sub clade 1.1 T at Intron 1 splice site (except Shamudrophena)

RI PT

Glutinous progenitor haplotype Bangladeshi glutinous Beruin cultivars

SC

Haplotype K Haplotype L Akia Beruin Red Mou Beruin Haplotype M 44 Haplotype N 64 Haplotype Q Akia Beruin White Haplotype I Haplotype G Shamudrophena Sub clade 1.1.1 Kalo Beruin 718 G/A at 98 54 Kathali Beruin White Wx promoter 46 Pak Beruin Kalijira Kataribhog Chinigura Haplotype F 48 Haplotype P Haplotype J Haplotype H 64 Haplotype R

Known Glutinous haplotypes

Sub clade 2.1 CT8 (except haplotype E) Sub clade 2.2 CT10

Sub clade 2.3 CT 11-12

Clade 2 CT8-12 G at Intron 1 splice site

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C EP

200 bp 120 bp 100 bp

C

210 bp

1.6 kbp 1.1 kbp 1 kbp

SC

Push Beruin

Kathali Beruin Red

Modhu Beruin Red

Modhu Beruin White

Nipponbare

Marker

RI PT

Kathali Beruin Red

Push Beruin

Modhu Beruin Red

M AN U

Kathali Beruin Red

TE D

Push Beruin

Modhu Beruin Red

B Modhu Beruin White

Nipponbare

Marker

A

Modhu Beruin White

Nipponbare

Marker

ACCEPTED MANUSCRIPT

ACCEPTED MANUSCRIPT

SC

Raujan 2

Push Beruin C

Khara Beruin

M AN U

Modhu Beruin White

TE D

B , Raujan

K

R

Kathali Beruin Red

EP

Balam Red

Q

AC C

E

RI PT

A , Girishail, Lathial Red, Balam White Lathial White, Modhu Beruin Red

D

S

Pak Beruin, Kalo Beruin, Kathali Beruin White

Mou Beruin, Akia Beruin Red, Akia Beruin White

Chinigura, Kataribhog

Shamudrophena P

G F

M

Kalijira

J

I N

L

H

ACCEPTED MANUSCRIPT



Sequence polymorphism in Wx locus of 22 Bangladeshi rice landraces was analyzed



Most of the Bangladeshi landraces showed SNP patterns reported in South & Southeast Asian varieties Three Beruin cultivars lacked the known glutinous alleles but had waxy or low amylose



Novel Wx promoter/intronic mutations in these 3 Beruin cultivars may explain their

RI PT



AC C

EP

TE D

M AN U

SC

glutinous phenotype