Cas9 System

Cas9 System

Accepted Manuscript Efficient Targeted Genome Modification in Maize Using CRISPR/Cas9 System Chao Feng, Jing Yuan, Rui Wang, Yang Liu, James A. Birchl...

4MB Sizes 91 Downloads 90 Views

Accepted Manuscript Efficient Targeted Genome Modification in Maize Using CRISPR/Cas9 System Chao Feng, Jing Yuan, Rui Wang, Yang Liu, James A. Birchler, Fangpu Han PII:

S1673-8527(15)00179-4

DOI:

10.1016/j.jgg.2015.10.002

Reference:

JGG 408

To appear in:

Journal of Genetics and Genomics

Received Date: 16 August 2015 Revised Date:

14 October 2015

Accepted Date: 20 October 2015

Please cite this article as: Feng, C., Yuan, J., Wang, R., Liu, Y., Birchler, J.A., Han, F., Efficient Targeted Genome Modification in Maize Using CRISPR/Cas9 System, Journal of Genetics and Genomics (2015), doi: 10.1016/j.jgg.2015.10.002. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT 1

Efficient Targeted Genome Modification in Maize Using

2

CRISPR/Cas9 System

3

Chao Fenga,b,1, Jing Yuana,1, Rui Wanga , Yang Liua,b, James A. Birchlerc and Fangpu

5

Hana,*

RI PT

4

6 7

a

8

Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China

9

b

University of Chinese Academy of Sciences, Beijing 100049, China

10

c

Division of Biological Sciences, University of Missouri, Columbia, MO 65211-7400, USA

12

1

These authors contributed equally to this work.

13

*

14

Tel: +86 10 6480 7926, fax: +86 10 6485 4467

15

E-mail address: [email protected]

11

19 20 21 22 23 24 25 26 27 28 29

TE D

18

EP

17

AC C

16

Correspondending author:

M AN U

SC

State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and

ACCEPTED MANUSCRIPT 30

ABSTRACT

31 CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 system, which is a

33

newly developed technology for targeted genome modification, has been successfully used in a

34

number of species. In this study, we applied this technology to carry out targeted genome

35

modification in maize. A marker gene Zmzb7 was chosen for targeting. The sgRNA-Cas9

36

construct was transformed into maize protoplasts, and indel mutations could be detected. A mutant

37

seedling with an expected albino phenotype was obtained from screening 120 seedlings generated

38

from 10 callus events. Mutation efficiency in maize heterochromatic regions was also investigated.

39

Twelve sites with different expression levels in maize centromeres or pericentromere regions were

40

selected. The sgRNA-Cas9 constructs were transformed into protoplasts followed by sequencing

41

the transformed protoplast genomic DNA. The results show that the genes in heterochromatic

42

regions could be targeted by the CRISPR/Cas9 system efficiently, no matter whether they are

43

expressed or not. Meanwhile, off-target mutations were not found in the similar sites having no

44

PAM motif or having more than two mismatches. Together, our results show that the

45

CRISPR/Cas9 system is a robust and efficient tool for genome modification in both euchromatic

46

and heterochromatic regions in maize.

49 50 51 52 53 54 55 56 57 58 59

SC

M AN U

TE D

Keywords: CRISPR/Cas9; targeted genome modification; heterochromatic region; maize

EP

48

AC C

47

RI PT

32

ACCEPTED MANUSCRIPT 60 61 62

INTRODUCTION

63 As one of the most important crops in the world, maize (Zea mays) is also a model plant for

65

genetic research. Traditional strategies for genetic study of maize mostly depend on chemical

66

(EMS) treatment or transposon tagging to generate mutant alleles (Candela and Hake, 2008).

67

These methods have been proved to be effective, but because the mutations occur randomly, it

68

takes time and energy to perform a large screen.

SC

RI PT

64

“Targeted genome modification” is a new concept. The basic principle is to design a nuclease

70

that can specifically target a genomic locus, generate a DSB (double strand break) at the site

71

(Puchta and Fauser, 2013), and then use the endogenous cellular DSB repair system to modify the

72

targeted genomic locus. Two artificial nucleases, ZFN (zinc finger nuclease) and TALEN

73

(transcription activator like effector nuclease) were successfully used for targeted genome

74

modifications in the past, especially TALENs (Carroll, 2011; Joung and Sander, 2012;

75

Perez-Pinera et al., 2012). More recently, a new “targeted genome modification” system, CRISPR

76

(clustered regularly interspaced short palindromic repeats)/Cas9 system, was developed and

77

quickly spread in the scientific community due to its simple cloning procedure and relatively

78

higher efficiency (Jinek et al., 2012; Cong et al., 2013; Mali et al., 2013). For the CRISPR/Cas9

79

system, the endonuclease Cas9 is guided by a single guide RNA (sgRNA), which includes two

80

parts. One of them (usually 20 bp) recognizes the target sites through Watson-Crick base pairing

81

between DNA and RNA, and the other forms a structure that recruits the Cas9 endonuclease. Only

82

a PAM (NGG) sequence is necessary for target site design, which means almost all genes in a

83

genome could be targeted by this system (Hsu et al., 2014). The utilization of CRISPR/Cas9

84

system for genome editing has been reported in various species, from bacteria to mammals, which

85

supports the point that it hardly has species limits (Bassett an Liu, 2014; Cho et al., 2013; Hwang

86

et al., 2013; Jiang et al., 2013; Li et al., 2013a; Li et al., 2013b; Nekrasov et al., 2013; Shan et al.,

87

2013). In maize, genome modification by the CRISPR/Cas9 system has been reported previously

88

(Liang et al., 2014; Xing Hui-Li, 2014). Mutations mediated by the CRISPR/Cas9 system have

89

been detected in maize protoplasts and transgenic seedlings.

AC C

EP

TE D

M AN U

69

ACCEPTED MANUSCRIPT Although the CRISPR/Cas9 system shows high efficiency for targeted genome modification, it

91

did not work well constitutively. In some cases, the target sites meeting the requirements for

92

targeting cannot be targeted (Hwang et al., 2013). The reason is unknown, and the molecular

93

mechanisms for Cas9 nuclease to cut the target DNA sequence are to be explored. As the process

94

of gene targeting by CRIPSR/Cas9 system depends on the sgRNA to search and target to the

95

specific site, it is an interesting question of whether the chromatin environment would influence

96

the sgRNA binding ability and change the gene targeting efficiency in plants.

RI PT

90

Here, we report our results of using CRIPSR/Cas9 system to do targeted genome modification

98

in maize. First, a marker gene Zmzb7 was successfully mutated, and a mutant plant exhibiting the

99

expected phenotype was obtained. Second, we provide evidence that the CRISPR/Cas9 system

100

mediated gene targeting in heterochromatic regions has no apparent difference compared to that in

101

euchromatic regions.

M AN U

SC

97

102 103

RESULTS

104

106 107

Zmzb7 as a marker gene for gene targeting

TE D

105

To simplify the later mutant screening step, a marker gene Zmzb7 was chosen as our target. Zmzb7 encodes the IspH protein, which is essential for the methyl-D-erythritol-4-phosphate (MEP)

109

pathway. Loss of function of Zmzb7 will generate a completely albino plant (Lu et al., 2012). To

110

target Zmzb7, we designed a sgRNA for a site located in the 8th exon of the gene. The sgRNA was

111

driven by maize U3 promoter, and the Cas9 gene was controlled by a 2 x 35S promoter (Fig. 1A)

112

(Feng et al., 2013). The sgRNA-Cas9 expressing plasmid was transformed into maize protoplasts

113

to check the mutation efficiency. By RE-PCR-RE (restriction enzyme-PCR-restriction enzyme)

114

analysis of the protoplast genomic DNA, a mutated sequence band could be observed easily in the

115

agarose gel. The PCR products were then cloned and sent for sequencing. Indel mutations in the

116

target site were revealed by the sequencing results (Fig. 1B). Next we transformed the construct

117

into maize immature embryos mediated by Agrobacterium. Following the standard maize

118

transformation protocol (Frame et al., 2002), we obtained 10 putative transgenic callus events. By

119

PCR-RE analysis of the genomic DNA of these events, we determined that the mutation efficiency

AC C

EP

108

ACCEPTED MANUSCRIPT ranged from 19-31%. PCR products were also sequenced, and Indel mutations were revealed (Fig.

121

1C). The putative calli were then used for regeneration. In total about 150 harvested seedlings, one

122

plant showed an albino phenotype (Fig. 2B). The albino plant was analyzed by PCR-RE followed

123

by PCR product cloning and sequencing. Sequencing results indicated that it was probably a

124

mosaic mutant with two mutated alleles, and the ratio of mutational alleles in the whole plant is

125

86%. Of the two mutation alleles, one is a 1 bp insertion and the other is a 1 bp deletion, both of

126

which will cause frame shift of the protein sequence. Because of the extremely high mutation ratio,

127

the plant exhibits an albino phenotype and arrested growth. In addition, we obtained another 2

128

putative heterozygous mutant plants or mosaic plants with a mutant allele ratio of 49% and 69%,

129

respectively (Fig. 2A-C). These two plants show normal phenotypes (data not shown).

SC

RI PT

120

131

M AN U

130

Sites located in heterochromatic regions could be targeted by the CRISPR/Cas9 system

132

The CRISPR/Cas9 system has proved to be robust and efficient for a number of species. In

134

Drosophila, it was reported that heterochromatic regions have no effect on CRISPR/Cas9

135

mediated genome modification (Yu et al., 2013). In plants, especially maize, which have large

136

heterochromatic regions, there is no report of the efficiency of the CRISPR/Cas9 system mediated

137

gene targeting in heterochromatic regions. We were interested to address this question. As

138

centromere and pericentromere regions usually are thought to be heterochromatic, we decided to

139

select genes located in these regions (Dillon and Festenstein, 2002). According to the maize

140

genome and transcriptome sequencing data and our anti-CENH3 antibody ChIP-sequencing data

141

(Wolfgruber et al., 2009; Fu et al., 2013; He et al., 2013), 12 genes located in the maize

142

centromere and pericentromere regions of chromosomes 2 and 5 were selected for targeting

143

(Tables 1 and S1). Among these genes, half are actively expressed. On each gene we choose one

144

targeting site, and sgRNAs were then designed to target these sites. The plasmids for expressing

145

sgRNAs and Cas9 nuclease were mixed and used to transform maize protoplasts and followed by

146

PCR-RE analysis. Indel mutations were detected in 5 of these target sites, with an efficiency

147

ranging from 2.8% to 27%. Among these 5 target sites, 3 of them were located in active genes and

148

2 of them were located in silent genes (Fig. 3). For the other 7 target sites, no mutation was

149

detected in 5 sites, and 2 sites were undetermined. Based on these results, we suggest that in maize,

AC C

EP

TE D

133

ACCEPTED MANUSCRIPT 150

genes located in heterochromatic regions could be targeted efficiently and independently of

151

whether they are expressed.

152 153

Evaluation of off-target events of CRISPR/Cas9 in whole genome

RI PT

154 The application of CRISPR/Cas system in human cells showed a high-frequency of off-target

156

mutations (Fu et al., 2013; Pattanayak et al., 2013). To assess the off-target possibility of

157

CRISPR/Cas9 in maize, the sites similar to target sites were amplified by PCR using transformed

158

protoplast DNA as template to perform DNA sequencing. As shown in Table 2, there was no

159

mutation in a similar site if there was no NGG (PAM site) (in No. 2) or there were more than 3

160

different base pairs (in No. 1). However, if the site was almost identical to the target site

161

(including the PAM site) except a single nucleotide (in No. 3), the mutation of 1-bp substitution

162

happened at the similar site. In our study, the PAM site was necessary for sgRNA targeting and the

163

off-target effects were active at the sites with high similarity to target sites.

M AN U

SC

155

164

DISCUSSION

TE D

165 166

In this study, our results show that the CRISPR/Cas9 system is a robust and efficient tool for

168

targeted genome modification in maize. For targeting the Zmzb7 site, transformation experiments

169

generated 10 bar-resistant callus events. The mutation ratio in calli is variable, ranging from 19%

170

to 31%. We speculate that this may be due to the different T-DNA genomic insertion sites in

171

genome region. In some cases, the expression cassettes included in the T-DNA would be silenced.

172

From our screening, one seedling showed an expected albino phenotype. The plant also grew very

173

slowly. Sequencing results revealed that the plant had mutations with a ratio of 86%, and is most

174

probably mosaic mutant with two types of mutant alleles, both of which caused frame shift during

175

translation. As callus can be subcultured for a long time, the mutations would be continuously

176

occurring. It is possible that if more seedlings are regenerated, the probability to gain a

177

homozygous mutant would increase. We also obtained three putative heterozygous mutant plants,

178

which should produce homozygous mutants in the next generation. We also designed several

179

sgRNAs targeting other genes. By testing the mutation efficiency in protoplasts, we found that

AC C

EP

167

ACCEPTED MANUSCRIPT only about 50% of sgRNAs could generate mutations successfully (data not shown).This indicates

181

that there might be some factors regulating the mutation process. Moreover, the mutation

182

efficiency mediated by the CRIPSR/Cas9 system is variable in different plant species (Fauser et al.,

183

2014; Zhang et al., 2014). The efficiency is very high in rice, and relatively lower in Arabidopsis.

184

Some factors probably affect the efficiency of the CRISPR/Cas9 system, such as gene specificity

185

of the target sites, sgRNA sequence, the promoter for Cas9 and sgRNA expression, and T-DNA

186

insertion sites (Johnson et al., 2015). Because genome editing process is complicated and there is

187

no standard rule for choosing proper target sites, a mutation test in protoplasts or other ways prior

188

the transformation is strongly recommended for maize.

SC

RI PT

180

189

The frequency of obtaining mutant plants and mutation types possibly is related to the target

191

gene or target site. Genes not affecting vegetative growth, such as MADS-box genes AP1 (Feng et

192

al., 2014), and the gene encoding a noncoding RNA, such as OsPMS3 (Zhang et al., 2014), could

193

have less detrimental effects, which would facilitate the recovery of homozygous mutants. The

194

mutation types in these genes were mainly small insertions and deletions. Genes encoding key

195

enzymes, such as OsEPSPS, had rarely detected mutations of three or more base pairs and

196

homozygous mutants, because it would lead to plant death (Zhang et al., 2014). Selecting genes

197

with spatial- or temporal-specific expression should be a good strategy. Meiosis-specific genes are

198

good candidates. Plants are supposed to be healthy until they proceed to reproductive development

199

if the meiosis-specific genes were targeted.

TE D

EP

AC C

200

M AN U

190

201

Our work also indicates that genomic loci in heterochromatic regions in maize could be targeted

202

by the CRISPR/Cas9 system. Among the 12 selected sites located in centromere or pericentromere

203

regions, 5 sites could be targeted. The others could not or were undetected. The ratio of the sites

204

that could be targeted is about 50%. The efficiency shows no difference compared to the sites we

205

chose from euchromatic regions. Therefore, we suggest that gene targeting by the CRISPR/Cas9

206

system may be independent of the chromatin state of the genes. We also analyzed the expression

207

levels of the 12 genes utilizing the published maize transcriptome data, and found that half of

208

them are expressed, while the others are not. We checked whether the CRISPR/Cas9 mediated

209

gene targeting efficiency is related to the genes’ expression level, and found no correlation.

ACCEPTED MANUSCRIPT 210 In human cells, the CRISPR/Cas9 system can tolerate several nucleotide mismatches between

212

the target and similar sites; therefore, the off-target effects are common (Fu et al., 2013;

213

Pattanayak et al., 2013). However, off-target effects were reported to appear rarely in rice (Shan et

214

al., 2013; Xu et al., 2014; Zhang et al., 2014) and sweet orange (Jia and Wang, 2014). In our study,

215

we did not detect off-target effects if the similar site has no PAM motif or has more than two

216

mismatches. According to the results of ours and others, off-target mutagenesis is of low concern

217

for genetic studies in plants. The best strategy to avoid off-target effects is to identify

218

gene-specific sgRNA target sites.

SC

RI PT

211

219

In summary, our results indicate that the CRISPR/Cas9 system is an efficient tool for targeted

221

genome modification in maize in both enchromatic and heterochromatic regions. The fantastic

222

system would benefit maize functional genomics study, and ultimately maize breeding.

223 224

227 228

TE D

226

MATERIALS AND METHODS

Construction of the sgRNA-Cas9 expression vector

EP

225

M AN U

220

To target the marker gene Zmzb7 and other sites in maize, we generate plasmids as follows.

230

The Cas9 expression cassette in 35S-Cas9-SK was released by digestion with Xma I and Hind

231

III (New England Biolabs, UK) and then subcloned into pCambia3301 to produce3301-Cas9

232

(Feng et al., 2013). Maize U3 promoter was amplified from B73 genomic DNA using two

233

primers (ZmU3-F and ZmU3-R) (Leader, 1994). The sgRNA scaffold was amplified with

234

another two primers (sgRNA-F and sgRNA-R) (Feng et al., 2013). Two PCR fragments were

235

then used as template to do overlapping PCR to generate ZmU3-sgRNA with primers

236

(ZmU3-F and sgRNA-R). PCR products were then cloned into pEasy-Blunt simple vector

237

(TransGen Biotech, Beijing, China) to make pU3-sgRNA. Sequence such as 5′-G-N(20)-GG-3′

238

or 5′-B-N(21)-GG-3′ in the maize genome region could be chosen as target sites to design

239

sgRNA. To make the later mutation analysis simpler, sequences with a restriction enzyme site

AC C

229

ACCEPTED MANUSCRIPT 240

over the Cas9 cutting site would be preferred.

241 242

Maize protoplast transformation

243

After finishing the constructs, we first identified the mutation efficiencies in protoplast.

245

Maize protoplast transformation is carried out according to previously reported methods with

246

some modifications (Zhang et al., 2011). Maize HiII seeds were germinated in the dark at

247

30°C for 3 days and then moved to dark conditions at 25°C for another 7 days. Leaves of the

248

seedlings would be used to harvest mesophyll protoplasts.

SC

RI PT

244

Leaves were cut into 1mm pieces by new sharp razor blades, and then put into a 100 mL

250

triangular flask containing 20 mL enzyme solution (1.5% Cellulase, 0.1% Macerzyme, 0.4

251

mol/L Mannitol, 20 mmol/L KCl, 20 mmol/L MES at pH 5.7, 10 mmol/L CaCl2, 0.1% BSA

252

and 5 mmol/L β-mercaptoethanol) and vacuum infiltration applied for 30 min (15 Hg),

253

followed by 3-4 h shaking (40 r/min) at 25°C in the dark. After digestion, the solution was

254

passively flowed through a 40 mm nylon mesh (Millipore, Germany). The protoplasts were

255

subjected to centrifugation at 100 g for 3 min. The supernatant was discarded, and then the

256

protoplasts were washed with W5 buffer (154 mmol/L NaCl, 5 mmol/L KCl, 125 mmol/L

257

CaCl2, 2 mmol/L MES at pH 5.7) two times. The protoplasts were resuspend in Mmg buffer

258

(4 mmol/L MES, 0.4 mol/L Mannitol, 15 mmol/L MgCl2 at pH 5.7) at a concentration of

259

about 106 cells/mL.

EP

TE D

M AN U

249

For PEG mediated transformation, 190 µL protoplast suspension combined with 10 µg

261

plasmid (1µg/µL) was added into 2 mL centrifuge tubes. Then 200 µL 40% PEG solution (40%

262

PEG4000, 100 mmol/L CaCl2, 0.6 mol/L Mannitol) was added and mixed gently by pipetting.

263

The mixture was incubated at 25°C for 18 min. Then, the protoplasts were washed by W5

264

solution (1.6 mL) two times, resuspended in W5 solution and incubated in the dark at 25°C

265

for 48 h.

AC C

260

266 267

Agrobacterium mediated maize transformation

268 269

Maize HiII seeds were planted in the experimental field in Beijing during May to

ACCEPTED MANUSCRIPT September. The F2 immature zygotic embryos were harvested about 9 days after pollination

271

and used for Agrobacterium mediated transformation following previous protocols (Frame et

272

al., 2002) with a slight difference. In summary, after embryo dissection, the immature

273

embryos were infected by Agrobacterium for 5 min, and then placed on the co-cultivation

274

medium at 20°C in the dark for 3 days. After that, embryos were transferred to resting

275

medium at 28°C in the dark for 7 days and transferred to selection medium I for 2 weeks with

276

the same conditions as on the resting medium. Additionally, they were subcultured on the

277

selection medium II for 4-6 weeks. The putative transformed calli were harvested for

278

regeneration. The regeneration process includes three two-week sub-culture steps in

279

pre-regeneration medium (dark), regeneration medium I (dark) and regeneration medium II

280

(light) at 25°C, respectively.

M AN U

281 282

SC

RI PT

270

Mutation analysis of the transformed protoplasts, callus and seedlings

283

To detect the mutation efficiency in protoplasts, genomic DNA of the transformed

285

protoplasts was extracted using DNA extract kit (Tiangen, Beijing, China). The genomic DNA

286

was used as template to perform a PCR reaction. The PCR products were then digested by

287

restriction enzymes, and subjected to gel electrophoresis. The gel was imaged by UV and

288

photograghed by professional software. The target sites in heterochromatic regions were

289

identified by this way. For some target sites, such as Zmzb7 site, mutations cannot be detected

290

readily. The genomic DNA was first digested prior to PCR, followed by PCR amplification

291

and restriction enzyme digestion. Because the digestion efficiency could not reach 100%, the

292

digested wild type genomic DNA would also be amplified by adding 5-10 amplification cycles. To

293

detect the mutation efficiency in callus and regenerated seedlings, the protocols are similar to

294

those for protoplasts and the genomic DNA was used directly as template for PCR. The PCR

295

products were cloned into pMD19-T (Takara, Japan), and usually 30-50 single clones for one

296

target site were sent for Sanger sequencing (Ruibio Biotech, Beijing, China).

AC C

EP

TE D

284

297 298 299

ACKNOWLEDGEMENTS

ACCEPTED MANUSCRIPT We thank Prof. Jiankang Zhu for kindly providing the Cas9 expression constructs (Shanghai

301

Center for Plant Stress Biology of Chinese Academy of Sciences, China). Prof. Tianyu Wang

302

(Institute of Crop Science of Chinese Academy of Agricultural Sciences China) helped us with

303

maize transformation. This work was supported by the National Natural Science Foundation of

304

China (No. 31320103912).

305 306

SUPPLEMENTARY DATA

307 Table S1. Sequences and restriction enzymes for the target sites.

309

Table S2. Primers used in this study.

SC

308

REFERENCES

312

Bassett, A.R., and Liu, J.L., 2014. CRISPR/Cas9 and Genome Editing in Drosophila. J. Genet. Genomics 41: 7-19.

Candela, H., and Hake, S., 2008. The art and design of genetic screens: maize. Nat. Rev. Genet. 9:192-203.

TE D

Carroll, D., 2011. Genome engineering with zinc-finger nucleases. Genetics 188:773-782. Cho, S.W., Kim, S., Kim, J.M., and Kim, J.S., 2013. Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease. Nat. Biotechnol. 31:230-232. Cong, L., Ran, F.A., Cox, D., Lin, S., Barretto, R., Habib, N., Hsu, P.D., Wu, X., Jiang, W., Marraffini, L.A., and Zhang, F., 2013. Multiplex genome engineering using CRISPR/Cas systems.

EP

Science 339:819-823.

Dillon, N., and Festenstein, R., 2002. Unravelling heterochromatin: competition between positive and negative factors regulates accessibility. Trends Genet. 18:252-258. Feng, Z.Y., Mao, Y., Xu, N., Zhang, B., Wei, P., Yang, D., Wang, Z., Zhang, Z., Zheng, R., Yang, L.,

AC C

313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337

M AN U

310 311

RI PT

300

Zeng, L, Liu, X. and Zhu, J. (2014). Multigeneration analysis reveals the inheritance, specificity, and patterns of CRISPR/Cas-induced gene modifications in Arabidopsis. Proc. Natl. Acad. Sci. USA111: 4632-4637.

Feng, Z.Y., Zhang, B.T., Ding, W.N., Liu, X.D., Yang, D.L., Wei, P.L., Cao, F.Q., Zhu, S.H., Zhang, F., Mao, Y.F., and Zhu, J.K., 2013. Efficient genome editing in plants using a CRISPR/Cas system. Cell Res. 23:1229-1232. Fauser, F., Schiml, S., and Puchta, H., 2014. Both CRISPR/Cas-based nucleases and nickases can be used efficiently for genome engineering in Arabidopsis thaliana. Plant J. 79:348-359. Frame, B.R., Shou, H., Chikwamba, R.K., Zhang, Z., Xiang, C., Fonger, T.M., Pegg, S.E.K., Li, B., Nettleton, D.S., and Pei, D., 2002. Agrobacterium tumefaciens-mediated transformation of maize embryos using a standard binary vector system. Plant Physiol. 129:13-22. Fu, S., Lv, Z., Gao, Z., Wu, H., Pang, J., Zhang, B., Dong, Q., Guo, X., Wang, X.-J., and Birchler, J.A.,

ACCEPTED MANUSCRIPT 2013.De novo centromere formation on a chromosome fragment in maize. Proc. Natl. Acad. Sci. USA110:6033-6036. Fu, Y., Foden, J.A., Khayter, C., Maeder, M.L., Reyon, D., Joung, J.K., Sander, J.D. 2013. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat. Biotechnol. 31:822-826. He, G., Chen, B., Wang, X., Li, X., Li, J., He, H., Yang, M., Lu, L., Qi, Y., Wang, X., et al., 2013. Conservation and divergence of transcriptomic and epigenomic variation in maize hybrids.

RI PT

Genome Biol. 14:R57.

Hsu, P.D., Lander, E.S., and Zhang, F., 2014. Development and Applications of CRISPR-Cas9 for Genome Engineering. Cell 157:1262-1278.

Hwang, W.Y., Fu, Y.F., Reyon, D., Maeder, M.L., Tsai, S.Q., Sander, J.D., Peterson, R.T., Yeh, J.R.J., and Joung, J.K., 2013. Efficient genome editing in zebrafish using a CRISPR-Cas system. Nat.

SC

Biotechnol. 31:227-229.

Jia, H., and Wang, N. 2014. Targeted genome editing of sweet orange using Cas9/sgRNA. PLoS One 9:e93806.

Jiang, W.Y., Bikard, D., Cox, D., Zhang, F., and Marraffini, L.A., 2013. RNA-guided editing of

M AN U

bacterial genomes using CRISPR-Cas systems. Nat. Biotechnol. 31:233-239. Jinek, M., Chylinski, K., Fonfara, I., Hauer, M., Doudna, J.A., and Charpentier, E., 2012. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337:816-821.

Johnson, R.A., Gurevich, V., Filler, S., Samach, A., and Levy, A.A., 2015. Comparative assessments of CRISPR-Cas nucleases' cleavage efficiency in planta. Plant Mol. Biol. 87:143-156. Joung, J.K., and Sander, J.D., 2012. TALENs: a widely applicable technology for targeted genome

TE D

editing. Nat. Rev. Mol. Cell Biol. 14:49-55.

Leader DJ, C.S., Filipowicz W, Brown JW., 1994. Characterisation and expression of a maize U3 snRNA gene. Biochim. Biophys. Acta 1219:145-147. Li, D.L., Qiu, Z.W., Shao, Y.J., Chen, Y.T., Guan, Y.T., Liu, M.Z., Li, Y.M., Gao, N., Wang, L.R., Lu, X.L., Zhao, Y.X., and Liu, M.Y., 2013a. Heritable gene targeting in the mouse and rat using a

EP

CRISPR-Cas system. Nat. Biotechnol. 31:681-683. Li, J.F., Norville, J.E., Aach, J., McCormack, M., Zhang, D.D., Bush, J., Church, G.M., and Sheen, J., 2013b. Multiplex and homologous recombination-mediated genome editing in Arabidopsis and Nicotiana benthamiana using guide RNA and Cas9. Nat. Biotechnol. 31:688-691.

AC C

338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381

Li, X., Dong, L., wang, Z., Zhang, H., Han, C., Liu, B., Wang, X., and Chen, Q., 2014. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol. 14:327.

Liang, Z., Zhang, K., Chen, K., and Gao, C., 2014. Targeted mutagenesis in Zea mays using TALENs and the CRISPR/Cas system. J. Genet. genomics 41:63-68.

Lu, X.M., Hu, X.J., Zhao, Y.Z., Song, W.B., Zhang, M., Chen, Z.L., Chen, W., Dong, Y.B., Wang, Z.H., and Lai, J.S., 2012. Map-based cloning of zb7 encoding an IPP and DMAPP synthase in the MEP pathway of maize. Mol. Plant 5:1100-1112. Mali, P., Yang, L., Esvelt, K.M., Aach, J., Guell, M., DiCarlo, J.E., Norville, J.E., and Church, G.M., 2013. RNA-guided human genome engineering via Cas9. Science 339:823-826. Nekrasov, V., Staskawicz, B., Weigel, D., Jones, J.D., and Kamoun, S., 2013. Targeted mutagenesis in the model plant Nicotiana benthamiana using Cas9 RNA-guided endonuclease. Nat. Biotechnol. 31:691-693.

ACCEPTED MANUSCRIPT profiling

of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity.

Nat. Biotechnol. 31:839-843. Perez-Pinera, P., Ousterout, D.G., and Gersbach, C.A., 2012. Advances in targeted genome editing. Curr. Opin. Chem. Biol. 16:268-277. Puchta, H., and Fauser, F., 2013. Gene targeting in plants: 25 years later. Int. J. Dev. Biol. 57:629-637. Shan, Q.W., Wang, Y.P., Li, J., Zhang, Y., Chen, K.L., Liang, Z., Zhang, K., Liu, J.X., Xi, J.J., Qiu,

RI PT

J.L., and Gao, C.X., 2013. Targeted genome modification of crop plants using a CRISPR-Cas system. Nat. Biotechnol. 31:686-688.

Svitashev, S., Young, J., Schwartz, C., Gao, H., Falco, S.C., and Cigan, A.M., 2015. Targeted mutagenesis, precise gene editing and site-specific gene insertion in maize using Cas9 and guide RNA. Plant Physiol. pp. 00793.02015.

SC

Wolfgruber, T.K., Sharma, A., Schneider, K.L., Albert, P.S., Koo, D.H., Shi, J., Gao, Z., Han, F., Lee, H., and Xu, R., 2009. Maize centromere structure and evolution: sequence analysis of centromeres 2 and 5 reveals dynamic loci shaped primarily by retrotransposons. PLoS Genet. 5:e1000743.

M AN U

Xu, R., Li, H., Qin, R., Wang, L., Li, L., Wei, P.C., and Yang, J.B. 2014. Gene targeting using the Agrobacterium tumefaciens-mediated CRISPR-Cas system in rice. Rice 7: 5. Yu, Z., Ren, M., Wang, Z., Zhang, B., Rong, Y.S., Jiao, R., and Gao, G., 2013. Highly efficient genome modifications mediated by CRISPR/Cas9 in Drosophila. Genetics 195:289-291. Zhang, H., Zhang, J., Wei, P., Zhang, B., Gou F., Feng, Z., Mao, Y., Yang, L, Zhang, H., Xu, N., and Zhu, J. 2014. The CRISPR/Cas9 system produces specific and homozygous targete gene editing in rice in one gerneration. Plant Biotech. J. 12:797-807.

TE D

Zhang, Y., Su, J., Duan, S., Ao, Y., Dai, J., Liu, J., Wang, P., Li, Y., Liu, B., Feng, D., Wang, J., and Wang, H.B., 2011. A highly efficient rice green tissue protoplast system for transient gene expression and studying light/chloroplast-related processes. Plant Methods 7:30.

EP

409

Pattanayak, V., Lin, S., Guilinger, J.P., Ma, E., Doudna, J.A., Liu, D.R. 2013. High-throughput

AC C

382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408

ACCEPTED MANUSCRIPT 410

Figure legend

411 Fig. 1 Detection of CRISPR/Cas9 system mediated mutations on Zmzb7 target site in protoplast

413

and callus.

414

A: Schematic illustration of the target site and construct for gRNA-Cas9 expression and

415

transformation. Black box indicates exons of the Zmzb7 gene. The red arrow shows the target site

416

and the red DNA sequence is the target sequence. Underline indicates the Pvu II site. “H” and “X”

417

indicates restriction enzyme Hind III and Xma I, respectively. B: RE-PCR-RE results of

418

transformed protoplast genomic DNA. Lanes 1 and 2 show PCR products amplified from Pvu II

419

digested transformed protoplast genomic DNA and wild type protoplast genomic DNA followed

420

by Pvu II digestion, respectively. Lane “WT” shows PCR product amplified from Pvu II digested

421

wild type protoplast genomic DNA without the second Pvu II digestion. The right side shows the

422

sequencing results of the undigested fragment of lane 1. C: PCR-RE analysis of transgenic callus

423

genomic DNA. Lanes 1-10 show PCR amplified from 10 different transgenic callus genomic DNA

424

followed by Pvu II digestion. Lane “WT” shows PCR product amplified from wild type genomic

425

DNA followed by Pvu II digestion. Mutation efficiencies are shown below the gel picture. The

426

below shows the sequencing results which indicating the mutation types in callus. The sequence

427

with a red underline is PAM sequence.

SC

M AN U

TE D

EP

428

RI PT

412

Fig. 2 Detection of CRISPR/Cas9 mediated mutations in regenerated seedlings.

430

A: PCR-RE results of transgenic seedlings’ genomic DNA. Lane 1-15 show PCR product

431

amplified from genomic DNA of 15 selected transgenic seedlings followed by Pvu II digestion.

432

Lanes “WT” shows PCR product amplified from WT genomic DNA followed by Pvu II digestion.

433

Mutation efficiencies of partial lanes are shown just below the gel picture. B: Phenotype of

434

seedling corresponding to lane 6, which is the expected albino plant. C: Sequencing results of lane

435

6, lane 9 and lane 13. The sequence with a red underline is PAM sequence.

AC C

429

436 437

Fig. 3 Detection of CRISPR/Cas9 mediated mutations for target sites in heterochromatic regions.

438

Hsg3, Hsg4, Hsg6, Hsg7, Hsg12 were selected target sites with mutations mediated by

439

CRISPR/Cas9 system. For all five gel pictures, lanes 1 and 2 repeatedly show PCR products

ACCEPTED MANUSCRIPT amplified from transformed protoplast genomic DNA followed by restriction enzyme digestion.

441

Lanes 3 and 4 show PCR products amplified from wild type protoplast genomic DNA followed

442

with or without restriction enzyme digestion. The right side shows the sequencing results of the

443

undigested fragment of lanes 1 and 2. The sequence with a red underline is PAM sequence.

444

Mutation efficiencies are shown below the gel pictures.

RI PT

440

AC C

EP

TE D

M AN U

SC

445 446

ACCEPTED MANUSCRIPT Table 1. Summary of the target sites in heterochromatic regions selected for targeting by CRISPR/Cas9system Mutation detected by PCR-RE

Gene ID

Hsg1

GRMZM2G091313

Chr.2

5.64

No

Hsg2

GRMZM2G083935

Chr.2

15.99

No

Hsg3

GRMZM2G332562

Chr.5

9.26

Hsg4

GRMZM2G080129

Chr.5

26.24

Hsg5

GRMZM2G170577

Chr.5

4.23

Hsg12

GRMZM2G438243

Chr.2

59.93

Hsg6

GRMZM2G170586

Chr.2

0

Hsg7

GRMZM2G099580

Chr.2

0

Yes

Hsg8

GRMZM2G000411

Chr.5

0

No

Hsg9

GRMZM2G429781

Chr.5

0

No

Hsg10

GRMZM2G135228

Chr.5

0

Undetermined

Hsg11

GRMZM2G342426

Chr.5

0

No

450

AC C

EP

TE D

451

RI PT

Chromosome

Relative expression level

Yes Yes No

Yes

SC

Site Name

M AN U

447 448 449

Yes

ACCEPTED MANUSCRIPT 452

Table2. The off-target effects of CRISPR-Cas9 in maize protoplasts

453 No.

Sequence of target site

Sequence of similar site

Mutation

in

similar site ccgcgcgctgcaggcggccatgg

No

2

acaagatgctatgtatcagctgg

tgcagatgctatctatcagcttc

No

3

agaacctgcgaagtgaagatagg

agaacctgcgaagtgaggatagg

RI PT

acgagagctgcaggcggccatgg

Yes

EP

TE D

M AN U

SC

The underlined sequence is PAM. The letters in red are to emphasize the difference.

AC C

454

1

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT