Molecular characterization of a human matrix attachment region that improves transgene expression in CHO cells Qiu-li Sun, Chun-peng Zhao, Shao-nan Chen, Li Wang, Tian-yun Wang PII: DOI: Reference:
S0378-1119(16)30049-X doi: 10.1016/j.gene.2016.02.009 GENE 41166
To appear in:
Gene
Received date: Revised date: Accepted date:
23 December 2015 3 February 2016 4 February 2016
Please cite this article as: Sun, Qiu-li, Zhao, Chun-peng, Chen, Shao-nan, Wang, Li, Wang, Tian-yun, Molecular characterization of a human matrix attachment region that improves transgene expression in CHO cells, Gene (2016), doi: 10.1016/j.gene.2016.02.009
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT Molecular characterization of a human matrix attachment region that improves transgene expression in CHO cells
SC R
IP
T
Running title: Matrix attachment region and CHO cells
Qiu-li Sun1, Chun-peng Zhao1, Shao-nan Chen1, Li Wang1 and Tian-yun Wang1,2*
Department of Biochemistry and Molecular Biology, Xinxiang Medical University,
NU
1
2
MA
Henan 453003 China;
Henan Collaborative Innovation Center of Molecular Diagnosis and Laboratory
*
CE P
Correspondence to:
TE
D
Medicine, Xinxiang, 453003, Henan, China
Tianyun Wang,
AC
Department of Biochemistry and Molecular Biology, Xinxiang Medical University, Jinsui Road,
Xinxiang 453003, Henan P.R. China Tel: 86-373-3831899 E-mail:
[email protected]
ACCEPTED MANUSCRIPT ABSTRACT Chinese hamster ovary (CHO) cells offer many advantages for recombinant gene
IP
T
expression, including proper folding and post-translational modification of the
SC R
recombinant protein. However, due to positional effects resulting from the neighboring chromatin, transgenes are often expressed at low levels in these cells. While previous studies demonstrated that matrix attachment regions (MARs) can be
NU
utilized to increase transgene expression by buffering transgene silencing, the
MA
mechanism by which this occurs is poorly understood. We therefore performed a deletion analysis of the human β-globin MAR sequence to characterize the regions
D
that are necessary to enhance transgene expression in CHO cells. Our results indicate
TE
that of the six β-globin MAR fragments tested (MAR-1-6; nucleotides 1-540,
CE P
420-1020, 900-1500, 1380-1980, 1860-2460, and 2340-2999, respectively), MAR-2, followed by MAR-3, was the most effective region for promoting stable and elevated
AC
transgene expression. Meanwhile, bioinformatic analyses demonstrated that these fragments encode a MAR-like motif and several transcription factor binding sites, including special AT-rich binding protein 1 (SATB1), CCAAT-enhancer-binding proteins (C/EBP), CCCTC-binding factor (CTCF), and Glutathione (GSH) binding motifs, indicating that these elements may contribute to the MAR-mediated enhancement of transgene expression. In addition, we found that truncated MAR derivatives yield more stable transgene expression levels than transgenes lacking the MAR. We concluded that the MAR-mediated transcriptional activation of transgenes requires a specific AT-rich sequence, as well as specific transcription factor-binding
2
ACCEPTED MANUSCRIPT motifs.
IP
T
Keywords:
SC R
Chinese hamster ovary; Gene expression; Matrix attachment region; Molecular characterization; Transgene silencing
NU
1. Introduction
MA
Since the emergence of the use of recombinant proteins, the production of proteins with therapeutic applications has become an important part of the biopharmaceutical
D
industry. Indeed, the value of therapeutic recombinant proteins, including
TE
recombinant antibodies, has surpassed $1,00 billion per year worldwide. Notably,
CE P
nearly 70% of these proteins are produced using Chinese hamster ovary (CHO) cells [1-3], making CHO cells the preferred mammalian cell line for the production of
AC
recombinant therapeutic proteins. Xu et al.investigated the genomic sequence of the CHO-K1 cell , and found that the assembly comprises 2.45Gb genomic sequence with 24,383 predicted genes. In additon homologs for most human glycosylation-associated genes and many important viral entry genes are present, but some not expressed in the genome of CHO-K1 cells [4]. Upon introduction and integration into the host cell genome, efficient transgene expression is highly dependent on the site of integration [5-7]. Specifically, integration into heterochromatin, a transcriptionally inactive region, often results in silencing of the transgenic sequence. As such, developing methods to buffer this silencing effect and
3
ACCEPTED MANUSCRIPT to enhance transgene expression is of great interest from a biotechnology and gene therapy perspective.
IP
T
Matrix attachment regions (MARs), also referred to as scaffold attachment
SC R
regions (SARs), are genomic DNA sequences that serve as attachment points to facilitate the anchoring of the chromatin structure to the nuclear matrix during interphase [8]. The most consistent features of MARs include the prevalence of
NU
AT-rich sequences, origins of replication, binding sites for DNA topoisomerase II,
MA
special AT-rich binding protein 1 (SATB1) motifs, kinked DNA, and curved DNA [9]. Recently, the use of MARs for the production of recombinant proteins has raised
D
considerable interest. Several studies demonstrated that these sequences enable
TE
increased production of recombinant proteins of biotechnological interest in
CE P
mammalian cell lines [8, 10-15]. Furthermore, multiple studies demonstrated that a human MAR, termed β-globin MAR, increases transgene expression and decreases
AC
the probability of transgene silencing in CHO cells [12, 14, 16]. Although the function of MARs appears to be evolutionarily conserved, defining the discrete elements or consensus sequences that may mediate MAR activity has been unsuccessful. In this study, we therefore attempted to identify functional elements of MARs, and to assess their effects on chromatin structure. Our results imply that binding sites for transcriptional activating proteins contribute to the MAR-mediated increase in transgene expression. These results provide a mechanistic basis for the epigenetic insulator and transcriptional augmentation activities of MARs.
4
ACCEPTED MANUSCRIPT 2. Methods 2.1. Plasmids and constructs
IP
T
As shown in Figure 1A, the human β-globin MAR (No: L22754) sequence were
SC R
cloned upstream of the Simian virus 40 (SV40) early promoter, which drives CTA reporter gene expression, within the pCATG vector [9, 11], thereby producing pCAMM(Figure 1B). Six sub-fragments [from 5´-3´; nucleotides 1-540, 420-1020,
NU
900-1500, 1380-1980, 1860-2460, and 2340-2999] were amplified from the flanking
MA
regions of the β-globin MAR sequence using the primers listed in Table 1. The fragments were each designed to overlap with adjacent sub-fragments by 120
D
nucleotides. Bidirectional cloning was achieved by digesting the resulting PCR
TE
fragments with KpnI and BglII restriction enzymes, and subsequently inserting these
CE P
sequences into linearized pCATG plasmid treated with the same restriction enzymes
AC
to generate the following six plasmids: pCAM-1, 2, 3, 4, 5, and 6, respectively.
2.2. CHO cell culture and transfection CHO cells (Institute of Laboratory Animal Sciences, Beijing, China) were cultivated in Dulbecco’s modified Eagle’s (DMEM) medium (Gibco, Grand Island, NY, USA) supplemented with 10% heat-inactivated fetal bovine serum (FBS; Gibco) at 37°C in a humidified incubator with 5% CO2. The cells were then seeded into six-well plates at approximately 3 × 106 cells/well. Prior to transfection, the plasmid DNA was linearized by treating with ScaI restriction enzyme, which specifically targets a unique site within the ampicillin resistance gene encoded by the plasmid.
5
ACCEPTED MANUSCRIPT Cells were divided into the following eight treatment groups: pCAM-1–6, pCAMM, and pCATG, which were used as a negative control. Transfections were performed
IP
T
using Lipofectamine 2000 reagent (Invitrogen, Waltham, MA, USA), according to the
SC R
manufacturer’s instructions. Forty-eight hours post-transfection, 800 μg/ml G418 (Calbiochem, Billerica, MA, USA) was added to the culture medium, and cells were incubated for 14 days until single colonies appeared. Subsequently, cell populations
NU
exhibiting stable transgene integration were cultured in the presence of 500 μg/ml
MA
G418 for 10 to 14 days, and then subjected to enzyme-linked immunosorbent assay (ELISA) analysis for detection of chloramphenicol acetyltransferase (CAT)
CE P
2.3. CAT assays
TE
D
production.
CAT assay analysis was performed as in previous studies [12, 14, 15]. Briefly,
AC
G418-resistant transfected cells were collected, adjusted to a density of 1 × 106 cells/ml, and rinsed three times with precooled phosphate buffered saline (PBS). Cells were then treated with 1 ml of lysis buffer, mixed gently by shaking, incubated at room temperature for 30 min, and centrifuged at 12,000 rpm for 15 min at 4°C. The levels of CAT in the resulting supernatants were measured using a CAT ELISA Kit (Roche, Basel, Switzerland), according to the manufacturer’s instructions. Experiments were performed in triplicate.
2.4. Reverse transcription quantitative PCR (RT-qPCR)
6
ACCEPTED MANUSCRIPT Total RNA was harvested from approximately 5 × 106 transfected CHO cells using an RNeasy Kit (Qiagen, Venlo, Netherlands), according to the manufacturer’s
IP
T
instructions. One milligram of total RNA from each sample was then reverse
SC R
transcribed with oligo (dT) primers using the First Strand cDNA Synthesis Kit (GE Healthcare, Little Chalfont, United Kingdom), according to the manufacturer’s instructions, and RT-qPCR was performed using a Bio-Rad CFX Connect™ system
NU
(Bio-Rad, Hercules, CA, USA), a Power SYBR® Green PCR Master Mix (Cat:
MA
4367659; Applied Biosystems, Waltham, MA, USA), and the following primers: CAT, 5´-GGTGAGCTGGTGATATGGGA-3´ and 5´-AGGGATTGGCTGAGACGAAA-3´;
D
β-actin (internal control), 5´-CCTCTATGCCAACACAGTGC-3´ and 5´-CCT
TE
GCTTGCTGATCCACATC-3´. The reaction conditions were as follows: 95°C for 3
CE P
min followed by 40 cycles of 95°C for 10 s, 55°C for 20 s, and 72°C for 20 s, and a final extension at 60°C for 10 min. β-actin expression was used as the reference.
AC
Mean Ct values were calculated using the relative quantification method (ΔΔCt).
2.5. Bioinformatics analyses Allele-specific transcription factor binding sites were identified using MatInspector software (http://www.genom atix.de/products/index.html) [17]. CpG islands were analyzed with CPGPLOT [18]. Other motifs previously associated with MARs included unwinding motifs (AATATATT and AATATT), polyAs and polyTs, poly Gs and polyCs, topoisomerase II (GTNWAYATTNATTNATNNR), and a motif thought to relieve the superhelical stress of DNA [19-21]. Lastly, structural motifs,
7
ACCEPTED MANUSCRIPT including nucleosome-binding and nucleosome-disfavoring sites (green and red boxes,
IP
T
respectively), were identified using GeneExpress software [22].
SC R
3. Results 3.1. Characterization of MAR elements
We cloned the human β-globin MAR (No: L22754) and sequenced, the results
NU
showed that ten mutations occurred in the cloned sequence, including point mutation,
MA
deletion and addition (Supplementary Table S1). As described above, MAR elements are typically AT-rich and contain a MAR consensus sequence. They also often bear
D
topoisomerase-II-binding sites [23]. As shown in Table 2, the MAR-2, MAR-3, and
TE
MAR-4 sub-fragments each contained a MAR-like motif (AATATATTT; nucleotides
CE P
840-848, 1149-1157, and 1636-1644, respectively). Furthermore, both the MAR-3 and MAR-4 fragments encoded a topoisomerase-II-binding site (nucleotides
AC
1252-1266 and 1456-1470, respectively). Meanwhile, the 3´ end of the MAR-6 fragment contained both a beta-globin cluster and an Alu element (nucleotides 2687-2998 and 2804-2988, respectively). We next utilized bioinformatic approaches to identify putative transcription factor-binding motifs within the β-globin MAR coding sequence (Fig. 1C~D, Supplementary Table S2, S3). As shown in Table 3, we detected two SATB1 motifs and one CCCTC-binding factor (CTCF) motif within the (+) strand of the MAR-2 sub-fragment, and one SATB1 motif within the (+) strand of the MAR-3 sub-fragment. Meanwhile, the (−) strand of the MAR-2 and MAR-3 sub-fragments
8
ACCEPTED MANUSCRIPT contained two CCAAT-enhancer-binding protein (C/EBP) and two GSH motifs, and
IP
T
one C/EBP and two GSH motifs, respectively (Table 3).
SC R
3.2. Chloramphenicol acetyltransferase assays
After G418 selection, stably-transfected CHO cells were screened and RT-qPCR was utilized to assess CAT mRNA expression levels. The highest level of
NU
MAR-mediated CAT expression (5.20-fold greater than that of the control vector;
MA
P<0.05) was observed in the cells transfected with the construct encoding the full-length human β-globin MAR sequence (Fig. 2A). Of the deletion constructs,
D
MAR-2 and MAR-3 (nucleotides 420-1020 and 900-1500, respectively) yielded the
TE
highest levels of transgene expression, increasing CAT mRNA levels to 3.15- and
CE P
1.90-fold higher than that of the control construct, respectively. MAR-6 did not show the enhancing function on transgene expression. In contrast, the MAR-1, MAR-4, and
AC
MAR-5 (nucleotides 1-540, 1380-1980, and 1860-2460, respectively) constructs failed to enhance transgene expression. Similar results were obtained by ELISA. The highest levels of CAT protein production (2.37-fold increase compared to that of the control; P<0.05) were observed in the CHO cells containing the construct encoding the full-length β-globin sequence. Furthermore, while the MAR-2 and MAR-3 deletion constructs exhibited enhanced transgene expression compared to that of the control, the MAR-4, MAR-5 and MAR-6 sequences were not sufficient to promote transgene expression.
9
ACCEPTED MANUSCRIPT 3.3. Identification of MAR elements necessary for transgene expression To determine which MAR motifs are responsible for enhancing transgene
IP
T
expression, we first analyzed the relationship between specific MAR sequences and
SC R
the level of CAT expression. The MAR-2, -3, and -4 fragments each encoded a MAR-like motif. Meanwhile, MAR-2 and MAR-3 yielded the highest levels of transgene expression, indicating that the MAR-like motif may contribute to the
NU
observed MAR-mediated increase in CAT expression. Conversely, the topoisomerase
MA
II site, the β-globin cluster, and the Alu elements appeared to be dispensable for this effect, as the fragments that encoded these elements failed to promote CAT
D
expression. Furthermore, while transcription factor-binding motif analyses indicated
TE
that the SATB1, C/EBP, CTCF, and GSH sites may contribute to the MAR-mediated
CE P
effect on gene expression, they also indicated that sites such as the FAST-1 and GSH
AC
sites are not essential for this activity.
3.4. MAR fragments enable stable, elevated expression of CAT in CHO cells The stably-transfected CHO cells were cultured as cell suspensions in a fully synthetic medium in the absence of antibiotic selection, and CAT expression was estimated at regular intervals. We observed more stable CAT expression in the cells encoding the MAR constructs than in the control population during continuous culturing (>150 days; Fig. 3). Moreover, the MAR-2 and -3 fragments yielded CAT expression levels that were similar to that mediated the full-length MAR element, and which were significantly higher than those observed in the cell populations encoding
10
ACCEPTED MANUSCRIPT the other MAR fragments. These findings demonstrate that the MAR-2- and MAR-3-mediated increases in transgene expression can be stably maintained during
SC R
IP
T
prolonged culturing.
4. Discussion
The inconsistent transgene expression levels observed within CHO cells is a
NU
major drawback to using this cell line for the production of recombinant proteins. As a
MA
result, labor-intensive selection and screening approaches are necessary to generate cell lines that stably express the exogenous gene at high levels. Recently, the use of
D
MARs for enhancing transgene expression, particularly for the production of proteins
TE
of therapeutic value and for gene therapy, has been extensively investigated. Indeed
CE P
multiple MAR sequences, such as the human β-globin MAR, have been shown to exert anti-silencing and transcriptional augmentation effects [12, 14, 16, 24]; however,
AC
not all MARs exhibit these characteristics. While bioinformatic analyses have identified putative genetic elements within the human MAR coding sequence, including an AT-rich core sequence and a flanking region enriched in transcription factor binding motifs [10, 23], attempts to identify consensus sequences within MAR sequences or to determine which genetic elements mediate the anti-silencing activities have failed [25]. MARs consist of AT-rich regions that are several hundred bases in length, and often contain short AT-rich boxes that exhibit strong potential for causing base-unpairing strain. As such, previous studies have proposed that the secondary
11
ACCEPTED MANUSCRIPT structure of MARs is largely responsible for their functional activities [26, 27]. However, studies of the chicken lysozyme MAR found that binding of the MAR to
IP
T
the nuclear matrix was not sufficient to enhance transgene expression [10, 28]. In this
SC R
study, we performed a deletion analysis of the β-globin MAR to identify the core elements that are necessary for enhancing gene expression in vitro. Our results indicate that this activity is dependent, at least in part, on a combination of two types
NU
of DNA elements: AT-rich sequences and transcription factor-binding sites such as
MA
SATB1, C/EBP, and CTCF sites.
AT-rich sequences, which are essentially comprised of uninterrupted rows of
D
alternating adenine (A) and thymine (T) residues, were proposed to be essential for
TE
the MAR-mediated increase in transgene expression [29]. Notably, certain AT cores
CE P
function as binding sites for transcription factors that specifically target AT-rich sequences [29, 30]. Arope et al. found that the core region of human X-68 MAR
AC
mediates the full anti-silencing activity of the MAR, whereas it is unable to provide a high transcriptional augmentation effect. However, the mere presence of the AT dinucleotide repeats from MARs was insufficient for mediating the anti-silencing effect [31]. In this study, among the six sub-fragments tested, we found that the fragment encoding nucleotides 420-1020 (MAR-2), followed by that encoding nucleotides 900-1500 (MAR-3), of the β-globin MAR sequence yielded the highest levels of CAT mRNA and protein expression. Transcription factor binding motifs, such as nuclear matrix protein 4 (NMP4), SATB1, FAST-1, and C/EBP binding sites, are thought to play an important role in
12
ACCEPTED MANUSCRIPT promoting transgene expression [31]. Consistent with these findings, in this study, bioinformatic analyses detected a MAR-like motif, as well as SATB1, C/EBP, CTCF,
IP
T
and GSH transcription factor binding sites within the MAR-2 and -3 regions,
SC R
indicating that these elements may contribute to the observed enhancement of CAT expression mediated by the MAR sequence. Specifically, these results suggest that the mechanism by which MARs enhance transgene expression may be, at least partially,
NU
dependent on the positioning of certain transcription factor binding sites within the
MA
MAR. In addition, we found that these truncated MAR derivatives provided more stable transgene expression than the construct lacking an MAR sequence. MAR-2 and
D
-3 regions have some advantages and significance in theoretical research and practical
TE
value, the truncated DNA fragments are easily performed and highly transfection
CE P
efficiency,and the mechanism of MAR’s regulation function may be elaborated through the subfragments of MAR.
AC
During the process of heterochromatic spreading, certain silencing proteins, such as histone deacetylases (HDAC) or DNA methyltransferases, are recruited. As such, the integration of transgenes into heterochromatic regions results in gene silencing or low levels of expression. Meanwhile, MAR sequences enable the recruitment of transcription factors, such as histone acetyltransferases (HATs) and other remodeling proteins, which restrain heterochromatic spreading and modify the state of the chromatin, thereby promoting transgene expression. Notably, the permissive chromatic structure of MARs might also contribute to homologous recombination [32], which could explain the increased transgenic copy numbers observed in
13
ACCEPTED MANUSCRIPT recombinant host cells. Epigenetic regulation of gene expression influences all aspects of genetic transfer in
IP
T
eukaryotes [33]. MAR-binding proteins such as SAT B1 and CTCF are associated
SC R
with the formation of DNA loops, which can result in the migration of epigenetic regulatory sequences to within close proximity of the enhancer and promoter sequences that drive the expression of their target genes,[34,35].Otherwise,the MAR
NU
elements may bring the nuclear matrix,consisting mainly of transcription factors and
MA
RNA processing enzymes, such as C/EBP, GSH in the proximity of transcriptional initiation sites at promoters[36]. Indeed, a previous study demonstrated that the MAR
D
sequence can facilitate the migration of the nuclear matrix, which contains a variety of
TE
transcription factors and RNA processing enzymes, to within proximity of certain
CE P
promoter sequences, thereby enabling the binding of RNA polymerases and subsequent transcriptional expression of the downstream gene(s) [31]. The use of
AC
miRNAs to engineer Chinese hamster ovary (CHO) cells is an emerging strategy to improve recombinant protein production. Jadhav et al. provided valuable evidence for miR-17 as a cell engineering target to enhance CHO cell productivity[37]. In conclusion, our results indicate that MAR-mediated transcriptional activation requires specific AT-rich sequences and transcription factor binding motifs. However, a more extensive and systematic study of these elements is needed to fully understand the nature and organization of the motifs that are essential for this activity.
Conflict of interest
14
ACCEPTED MANUSCRIPT The authors confirm that this article content has no conflict of interest.
IP
T
Acknowledgement
SC R
This work was partly supported by the grants from the National Natural Science Foundation of China (No. 31371332) and and the Grant from Henan Collaborative Innovation Center of Molecular Diagnosis and Laboratory Medicine, China
MA
NU
(No.XTCX-2015-ZD1).
References
D
1. Top 30 biologics 2011. Available online: http://www.pipelinereview.com.
TE
2. Jayapal KP, Wlaschin KF, Hu WS, Yap M. Recombinant protein therapeutics
CE P
from CHO cells-20 years and counting. Chem Eng Prog 2007; 103(10):40-7. 3. Datta P, Linhardt RJ, Sharfstein ST. An omics approach towards CHO cell
AC
engineering. Biotechnol Bioeng 2013; 110(5):1255-71. 4. Xu X, Nagarajan H, Lewis NE, Pan S, Cai Z, Liu X, Chen W, Xie M, Wang W, Hammond S. et al. The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line. Nat Biotechnol 2011 ;29(8):735-41. 5. Grosveld F, van Assendelft GB, Greaves DR, Kollias G. Position-independent, high-level expression of the human beta-globin gene in transgenic mice. Cell 1987; 51(6):975-85. 6. Phi-Van L, von Kries JP, Ostertag W, Strätling WH. The chicken lysozyme 5' matrix attachment region increases transcription from a heterologous promoter in
15
ACCEPTED MANUSCRIPT heterologous cells and dampens position effects on the expression of transfected genes. Mol Cell Biol 1990; 10(5):2302-7.
IP
T
7. Chung JH, Whiteley M, Felsenfeld G. A 5' element of the chicken beta-globin
SC R
domain serves as an insulator in human erythroid cells and protects against position effects in Drosophila. Cell 1993; 74(3):505-14.
8. Mirkovitch J, Mirault ME, Laemmli UK. Organization of the higher-order
NU
chromatin loop: Specific DNA attachment sites on nuclear scaffold. Cell 1984;
MA
39(1):223-32.
9. Platts AE, Quayle AK, Krawetz SA. In-silico prediction and observations of
D
nuclear matrix attachment. Cell Mol Biol Lett 2006; 11(2):191-213.
TE
10. Girod PA, Nguyen DQ, Calabrese D, Puttini S, Grandjean M, Martinet D,
CE P
Regamey A, Saugy D, Beckmann JS, Bucher P, et al. Genome-wide prediction of matrix attachment regions that increase gene expression in mammalian cells. Nat
AC
Methods 2007; 4(9):747-53. 11. Harraghy N, Buceta M, Regamey A, Girod PA, Mermod N. Using Matrix Attachment Regions to Improve Recombinant Protein Production. Methods Mol Biol 2012; 801:93-110. 12. Wang TY, Yang R, Qin C, Wang L, Yang XJ. Enhanced expression of transgene in CHO cells using matrix attachment region. Cell Biol Int 2008; 32(10):1279-83. 13. Chang M, Liu R, Jin Q, Liu Y, Wang X. Scaffold/matrix attachment regions from CHO cell chromosome enhanced the stable transfection efficiency and the
16
ACCEPTED MANUSCRIPT expression of transgene in CHO cells. Biotechnol Appl Biochem 2014; 61(5):510-6.
IP
T
14. Wang TY, Zhang JH, Jing CQ, Yang XJ, Lin JT. Positional effects of the matrix
SC R
attachment region on transgene expression in stably transfected CHO cells. Cell Biol Int 2010; 34(2):141-5.
15. Wang F, Wang TY, Tang YY, Zhang JH, Yang XJ. Different matrix attachment
NU
regions flanking a transgene effectively enhance gene expression in stably
MA
transfected Chinese hamster ovary cells. Gene 2012; 500(1):59-62. 16. Kim JM, Kim JS, Park DH, Kang HS, Yoon J, Baek K, Yoon Y.
D
Improved recombinant gene expression in CHO cells using matrix attachment reg
TE
ions. J Biotechnol 2004; 107(2):95-105.
CE P
17. Quandt K, Frech K, Karas H, Wingender E, Werner T. MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide
AC
sequence data. Nucleic Acids Res 1995; 23(23):4878-84. 18. Larsen F, Gundersen G, Lopez R, Prydz H. CpG islands as gene markers in the human genome. Genomics 1992; 13(4):1095-107. 19. Boulikas T. Nature of DNA sequences at the attachment regions of genes to the nuclear matrix. J Cell Biochem 1993; 52(1):14-22. 20. Boulikas T. Chromatin domains and prediction of MAR sequences. Int Rev Cytol 1995; 162A:279-388. 21. van Drunen CM, Sewalt RG, Oosterling RW, Weisbeek PJ, Smeekens SC, van Driel R. A bipartite sequence element associated with matrix/scaffold attachment
17
ACCEPTED MANUSCRIPT regions. Nucleic Acids Res 1999; 27(14):2924-30. 22. Levitsky VG, Ponomarenko MP, Ponomarenko JV, Frolov AS, Kolchanov NA.
IP
T
Nucleosomal DNA property database. Bioinformatics 1999; 15(7-8):582-92.
SC R
23. van Drunen CM, Oosterling RW, Keultjes GM, Weisbeek PJ, van Driel R, Smeekens SC. Analysis of the chromatin domain organisation around the plastocyanin gene reveals an MAR-specific sequence element in Arabidopsis
NU
thaliana. Nucleic Acids Res 1997; 25(19):3904-11.
MA
24. Gorman C, Arope S, Grandjean M, Girod P, Mermod N. Use of MAR elements to increase the production of recombinant proteins. Cell Engineering 2009; 6:1-32.
D
25. Wang DM, Taylor S, Levy-Wilson B. Evaluation of the function of the human
TE
apolipoprotein B gene nuclear matrix association regions in transgenic mice. J
CE P
Lipid Res 1996; 37(10):2117-24. 26. Benham C, Kohwi-Shigematsu T, Bode J. Stress-induced duplex DNA
AC
destabilization in scaffold/matrix attachment regions. J Mol Biol 1997; 274(2):181-96. 27. von Kries JP, Phi-Van L, Diekmann S, Strätling WH. A non-curved chicken lysozyme 5’ matrix attachment site is 3’ followed by a strongly curved DNA sequence. Nucleic Acids Res 1990; 18(13):3881-5. 28. Zahn-Zabal M, Kobr M, Girod PA, Imhof M, Chatellard P, de Jesus M, Wurm F, Mermod N. Development of stable cell lines for production or regulated expression using matrix attachment regions. J Biotechnol 2001; 87(1):29-42. 29. Bode J, Kohwi Y, Dickinson L, Joh T, Klehr D, Mielke C, Kohwi-Shigematsu T.
18
ACCEPTED MANUSCRIPT Biological significance of unwinding capability of nuclear matrix-associating DNAs. Science 1992; 255(5041):195-7.
IP
T
30. Romig H, Fackelmayer FO, Renz A, Ramsperger U, Richter A. Characterization
SC R
of SAF-A, a novel nuclear DNA binding protein from HeLa cells with high affinity for nuclear matrix/scaffold attachment DNA elements. EMBO J 1992; 11(9):3431-40.
NU
31. Arope S, Harraghy N, Pjanic M, Mermod N. Molecular characterization of PLoS
One
2013;
MA
a human matrix attachment region epigenetic regulator. 8(11):e79262.
D
32. Raynard SJ, Baker MD. Cis-acting regulatory sequences promote highfrequency
TE
gene conversion between repeated sequences in mammalian cells. Nucleic Acids
CE P
Res 2004; 32(19):5916-27.
33. Kwaks TH, Otte AP. Employing epigenetics to augment the expression of
AC
therapeutic proteins in mammalian cells. Trends Biotechnol 2006; 24(3):137-42. 34. Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 2012; 485(7398):376-80. 35. Heng HH, Krawetz SA, Lu W, Bremer S, Liu G, Ye CJ. Re-defining the chromatin loop domain. Cytogenet Cell Genet 2001; 93(3-4):155-61. 36. Bode J, Benham C, Knopp A, Mielke C.Transcriptional augmentation:modulation of gene expression by scaffold/matrix-attached regions (S/MAR elements). Crit Rev Eukaryot Gene Expr 2000; 10: 73–90.
19
ACCEPTED MANUSCRIPT 37. Jadhav V, Hackl M, Klanert G, Hernandez Bort JA, Kunert R, Grillari J, Borth N.Stable overexpression of miR-17 enhances recombinant protein production of
AC
CE P
TE
D
MA
NU
SC R
IP
T
CHO cells. J Biotechnol 2014;175:38-44.
20
ACCEPTED MANUSCRIPT Figure legends Fig. 1 Schematic representation of β-globin MAR subfragment and illustration of
IP
T
its transcription factor binding motifs. A) Structures of deletion mutant constructs
SC R
of the human β-globin MAR element cloned in the forward direction. B) Schematic diagram representing the full-length human β-globin MAR cloned upstream of a minimal SV40 promoter and CAT reporter gene. C) Schematic illustration of various
NU
transcription factors binding motifs within the (+) strand of the full-length human
MA
β-globin MAR. D) Schematic illustration of various transcription factor binding
D
motifs within the (-) strand of the full-length human β-globin MAR.
TE
Fig. 2 Effects of the full length and the deletion constructs of the human human
CE P
β-globin MAR element on CAT expression in G418-resistant transfectants. A) The CAT expression level was detected by fluorescence quantitative RT-qPCR in
AC
G418-resistant transfectants with either the pCATG control vector or a MAR-containing vector were measured; B) The CAT expression level was detected by ELISA in G418-resistant transfectants with either the pCATG control vector or a MAR-containing vector. Lane 1 indicates the control, 2~8 represent the the CAT expression vector containing full length MAR, and deleted human β-globin MAR (1-6) of the human β-globin MAR element.
Fig. 3 Analysis of the stably of high-producing CHO cell lines. MAR-2, MAR-3, full length MAR and the control were transfected into CHO cells and the CAT
21
ACCEPTED MANUSCRIPT amount was determined by ELISA at regular time as described in methods. Lane 1~3 indicates the the CAT expression vector containing full length MAR, and deleted
IP
T
human β-globin MAR (1-2) of the human β-globin MAR element, respectively and
AC
CE P
TE
D
MA
NU
SC R
lane 4 represents the control.
22
NU
SC R
IP
T
ACCEPTED MANUSCRIPT
AC
CE P
TE
D
MA
Figure 1
23
SC R
IP
T
ACCEPTED MANUSCRIPT
AC
CE P
TE
D
MA
NU
Figure 2a
24
SC R
IP
T
ACCEPTED MANUSCRIPT
AC
CE P
TE
D
MA
NU
Figure 2b
25
SC R
IP
T
ACCEPTED MANUSCRIPT
AC
CE P
TE
D
MA
NU
Figure 3
26
ACCEPTED MANUSCRIPT Table 1
The PCR primers used in this study
Position
Primer sequence(5′-3′)
MAR-1
1-540
P1:ATCGGTACCAAGCTTCTGACAAATTATTC TTCCT
T
Name
420-1020
IP
P2:TGA AGATCTCACTCTTTCATCAGTAATCACTCAG P1: ATC GGTACC GGTTCACTCT GCTATAGCAA TTTCA
MAR-2 900-1500
SC R
P2: TGA AGATCT ACATTGAATC AAAGGTAATG TTGCC P1: ATC GGTACC TAGTTAGTAA GACATCACCT TGCAT
MAR-3
P2: TGA AGATCT GTAAGAATCC TTTCAATGTG TGTGT P1: ATC GGTACC ACTCAAATAA ATACCTGCTT CATAG
NU
1380-1980 MAR-4
P2: TGA AGATCT TCATCCACTT ATTTATACAT TTAAA P1: ATC GGTACC AGTAGAATTA TACTCCCACT TTAGT
MA
1860-2460 MAR-5
P2: TGA AGATCT GTCCTTGTCT ATTTTCCCTG AAGTT
2340-2999
P1: ATC GGTACC CAAAGGAGAA AAGTTTGTTG GCCTC
D
MAR-6
AC
CE P
TE
P2: ATC GGTACC CAAAGGAGAA AAGTTTGTTG GCCTC-
27
ACCEPTED MANUSCRIPT Table 2 Molecular sequence characteristic of six sub-fragments of β-globin MAR Motif
MAR-1
MAR-2
MAR-3
MAR-4
MAR-5
MAR-6
Position
0
0
1
1
1
0
Topo II
0
0
1
1
0
beta-globin cluster
0
0
0
0
0
1
840~848, 1149~1157, 1636~1644 1252~1266 1456~1470 2687~2998
Alu element
0
0
0
0
0
1
2804~2998
AT content (%)
61.85
60.33
67.50
66.83
62.83
56.15
IP
SC R
NU MA D TE CE P AC
28
T
MAR- like motif
0
ACCEPTED MANUSCRIPT
Table 3. Locations of various transcription factor binding motifs within the six sub-fragments of β-globin MAR MAR-1
MAR-2
MAR-3
MAR-4
MAR-5
CEBP
1 0 0 0 2 3 0
1 2 2 1 0 3 1
1 1 1 2 1 0 0
1 0 0 0 0 0 1
1 0 0 0 1 0 0
0
0
0
1 1 0 0
1 2 0 0
1 2 0 0
0 0 0 1 1 0 0
+ + + +
IP
SC R
NU 29
Strand
0
0
0
-
1 0 0 0
0 0 0 0
0 1 0 1
+ + -
MA
FAST
D
GSH
TE
CTCF
CE P
NFAT
AC
SATB1
MAR-6
T
Matrix Family
ACCEPTED MANUSCRIPT
Abbreviations Chinese hamster ovary(CHO); matrix attachment regions(MARs);AT-rich binding proteins(C/EBP);CCCTC-binding
T
1(SATB1);CCAAT-enhancer-binding
IP
protein
factor(CTCF);Glutathione(GSH);scaffold attachment regions(SARs);special AT-rich protein
1(SATB1);
Fluorescence
AC
CE P
TE
D
MA
NU
(RT-qPCR); CCCTC-binding factor (CTCF).
quantitative
SC R
binding
30
reverse
transcription
ACCEPTED MANUSCRIPT Highlights
IP
T
Characterize the regions of MAR sequence that are necessary to enhance transgene
SC R
expression.
MAR-like motif contribute to the MAR-mediated enhancement of transgene
MA
NU
expression.
Truncated MAR derivatives yield more stable transgene expression levels than
TE
D
lacking the MAR.
AC
sequence.
CE P
MAR-mediated transcriptional activation of transgenes requires a specific AT-rich
31