The interaction between MYB proteins and their target DNA binding sites

The interaction between MYB proteins and their target DNA binding sites

Biochimica et Biophysica Acta 1819 (2012) 67–77 Contents lists available at SciVerse ScienceDirect Biochimica et Biophysica Acta journal homepage: w...

779KB Sizes 3 Downloads 74 Views

Biochimica et Biophysica Acta 1819 (2012) 67–77

Contents lists available at SciVerse ScienceDirect

Biochimica et Biophysica Acta journal homepage: www.elsevier.com/locate/bbagrm

Review

The interaction between MYB proteins and their target DNA binding sites Michael B. Prouse a, Malcolm M. Campbell a, b,⁎ a b

Centre for the Analysis of Genome Evolution & Function, Department of Cell & Systems Biology, University of Toronto, 25 Willcocks St., Toronto, ON, Canada M5S 3B2 Department of Biological Sciences, University of Toronto Scarborough, 1265 Military Trail, Toronto ON, Canada M1C 1A4

a r t i c l e

i n f o

Article history: Received 20 June 2011 Received in revised form 17 October 2011 Accepted 18 October 2011 Available online 28 October 2011 Keywords: MYB Transcription factor DNA binding sites Transcriptional regulation Molecular modelling

a b s t r a c t Members of the MYB family of transcription factors are found in all eukaryotic lineages, where they function to regulate either fundamental cellular processes, or specific facets of metabolism or cellular differentiation. MYB transcription factors regulate these processes through modulation of transcription at target genes, to which they bind in a sequence-specific manner. Over the past decades, insights have been gained into the molecular interactions between MYB proteins and their cognate DNA targets. This review focuses on those insights, the emergence of common themes in DNA binding by diverse MYB family members. The review also considers gaps in the current knowledge of MYB–DNA interactions, particularly for plant MYB proteins, and how emerging techniques that examine protein–DNA interactions can fill these gaps. © 2011 Elsevier B.V. All rights reserved.

1. Introduction Growth and development are orchestrated by the activity of sequence-specific transcription factors, proteins that function to reconfigure gene expression in response to external and internal cues. Sequence-specific transcription factors can act as transcriptional activators, repressors, or both [1]. Sequence-specific transcription factors frequently have a modular structure — comprising a DNAbinding domain together with a transcriptional regulatory domain [2]. Based on the similarities of the DNA-binding domain, transcription factors have been categorised into families or superfamilies, with several such groups composed of one hundred or more

Abbreviations: R, MYB repeat; AMV, avian myeloblastosis virus; C1, COLORED1 locus; MBS, MYB binding site; EMSA, electrophoretic mobility-shift assay; AGRIS, Arabidopsis gene regulatory information server; PLACE, Plant cis-acting regulatory elements database; TRANSFAC, Transcription factor database; UTR, untranslated regions; ChIP-chip, chromatin immunoprecipitation on chip; ChIP-seq, chromatin immunoprecipitation followed by high throughput sequencing; PDB, Protein data bank; TRFL, TRF1/2Like genes; IBP, indicator binding protein group; SMH, single MYB histone group; WER, WEREWOLF; Kd, dissociation constant; MSA, M phase-specific activator element; TF, transcription factor; SNP, sodium nitroprusside; GSNO, S-nitrosoglutathione; CAST, cyclic amplification and selection of targets; SELEX, systematic evolution of ligands by exponential enrichment; FLP, FOUR LIPS; GL3, GLABRA3; GL1, GLABRA1; bHTH, basic helix–turn–helix; GR, glucocorticoid receptor; CPC, CAPRICE; WBS, WER-binding site; IFN-g, human interferon-g; LCR, locus control region ⁎ Corresponding author at: Department of Cell & Systems Biology, University of Toronto, 25 Willcocks St., Toronto, ON, Canada M5S 3B2. Tel.: + 1 416 946 0817; fax: + 1 416 978 5878. E-mail addresses: [email protected] (M.B. Prouse), [email protected] (M.M. Campbell). 1874-9399/$ – see front matter © 2011 Elsevier B.V. All rights reserved. doi:10.1016/j.bbagrm.2011.10.010

members [3,4]. The MYB superfamily is one of the largest and most diverse families of sequence-specific transcription factors [5,6]. In animals, the MYB superfamily is relatively small, generally comprising four or five proteins [6-9]. Animal MYB superfamily members regulate gene expression related to cell division or a discrete subset of cellular differentiation events [10-12]. By contrast, the MYB superfamily in plants has expanded dramatically, with 100–200 MYB family members commonly found in individual plant species [13]. In plants, MYB proteins regulate a vast array of metabolic, cellular and developmental processes [13-15]. Much is known about the specifics of the interaction between animal MYB proteins and their cognate DNA binding sites. By contrast, the knowledge of the details of MYB–DNA interactions in plants is rather incomplete. In this review we consider the current state of knowledge with respect to MYB–DNA interactions in animals, and contrast this with what is known in plants, suggesting means by which the gap in knowledge in plants can be addressed. 2. The nature of MYB proteins 2.1. The MYB transcription factor superfamily The MYB superfamily is found in all major eukaryotic lineages, and is thought to be more than 1 billion years old [6,8,16,17]. MYB proteins acquired their name from v-MYB, the oncogenic component of avian myeloblastosis virus (AMV), where the sequence-specific MYB domain was initially discovered [18]. The cellular counterpart of v-MYB is cMYB, a three repeat (R1R2R3) MYB protein that plays a critical role in controlling the proliferation and differentiation of hematopoietic cells [19]. c-MYB mutations that alter target gene expression drastically

68

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

reduce the proliferation of hematopoietic cells [20]. In keeping with this, homozygous c-MYB knock-out lines of mice die before reaching day 15 of the foetal lifecycle due to the inability to sustain hepatic erythropoiesis [19]. MYB superfamily members are characterised by a highly conserved DNA-binding domain, referred to as the MYB domain, which generally consists of up to three imperfect amino acid repeats (R1, R2, and R3) of 50–53 amino acids [6]; however, there are certain MYB proteins that do not follow this trend, possessing at least four MYB repeats [9]. Each of the MYB repeats, within the MYB domain, gives rise to a helix–turn–helix secondary structure. The MYB domain is predominantly found within the N-terminus of MYB-proteins [21]; however, MYB domains recently have also been discovered within the C-termini of MYB-proteins [22]. Each MYB repeat consists of several highly conserved tryptophan residues that are regularly spaced forming a hydrophobic core [23]. In contrast to MYB domain, the C-terminal region of MYB proteins is characteristically highly variable from one MYB protein to another, and usually functions as either an activation or repression domain [14,16,21,24]. This gives rise to a wide range of variability both structurally and functionally within the MYB superfamily. 2.2. Animal MYB proteins As is the case with c-MYB, animal MYB superfamily members possess three MYB-repeat proteins [23,25,26]; although, there are some notable exceptions that deviate from this, including human SNAPc 190 and TRF1 [7,9]. In all annotated vertebrate genomes, there are only three MYB proteins with three MYB-repeats: A-MYB, B-MYB, and c-MYB [6,8]. A-MYB and B-MYB proteins are R1R2R3–MYB nuclear transcription factors expressed in hematopoietic cells, epithelial cells, and fibroblasts [27]. A-MYB negatively regulates cellular proliferation [11], while B-MYB positively regulates cell growth control, differentiation, and cancer [28]. 2.3. Plant MYB proteins In comparison to animals, the MYB superfamily is greatly expanded in plants [17,21,24]. For example, of the over 1600 sequence-specific transcription factors identified in the genome of the model dicotyledenous plant, Arabidopsis thaliana, almost 10% are members of the MYB transcription factor family [5,13]. In contrast to animals, A. thaliana has 5 three-repeat MYB proteins, and 126 two-repeat (R2R3) MYB proteins, [4,5,13,15,21,29], while the monocotyledon plant rice (Oryza sativa) has over 110 predicted R2R3–MYB proteins [4] (http:// grassius.org/tf_family.html?KEYWORDS=MYB&SPECIES=2). In addition, single-repeat MYB proteins have been identified in plants and animals in increasing numbers [30-57]. Although, single-repeat MYB proteins have been identified in both animals and plants, the majority of single-repeat MYB proteins have been characterised in plants. As their name implies, R2R3–MYB proteins have two MYB repeats [21]. R2R3–MYB proteins comprise the largest group of MYB transcription factors in the MYB superfamily and appear to be specific to plants [13]. Plant R2R3–MYB proteins regulate a myriad of processes, including primary and secondary metabolism; regulation of cell fate and identity; regulation of plant development; and responses to biotic and abiotic stresses [13-15,24,58-64]. While analogous processes, such as regulation of cell fate and identity, can be found in animals, the precise functions associated with R2R3–MYB proteins appear to be plant specific [13-15]. 2.4. Single MYB-repeat proteins Single MYB-repeat proteins can be classified into the following two groups: 1) Indicator Binding Protein (IBP) group, and 2) Single MYB Histone (SMH) group. The IBP group of proteins includes

RTBP1 from rice, AtTRP1 and AtTBP1 from A. thaliana [7,33,38], as well as the highly characterised telomeric DNA-binding proteins TRF1, TRF2, RAP1 and Taz1. SMH proteins are a novel group of single MYBrepeat proteins that have only been identified in plants. SMH group of proteins include PcMYB1 from Petroselinum crispum, AtTRB1, AtTRB2, AtTRB3 from Arabidopsis, and Smh1 from Maize. AtTRB1, AtTRB2, AtTRB3 have been studied in detail, all sharing a single MYB-repeat more similar to R2 than R1 and R3 [48]. In A. thaliana, single-repeat MYB proteins CAPRICE (CPC), TRYPTICHON (TRY), ETC1 (ENHANCER OF TRY and CPC) and ETC2 have been identified [65,66]. 2.5. Expansion and diversification of the MYB family Two theories of how the MYB superfamily evolved have been constructed based on parsimony [8]. The first is formulated on the premise that three-repeat MYB proteins are closely related to vertebrate c-MYB and other similar three-repeat MYB proteins in other eukaryotic groups, such as ciliates and slime molds [67,68]. These primitive proteins are predicted to have existed before the divergence between animals and plants [68]. This theory proposes that R2R3–MYB proteins originated recently from three-repeat MYB proteins due to loss of R1-MYB repeat [67,69]. The second theory postulates that within an ancient R2R3 predecessor that there was a domain duplication and subsequent gain of R1, suggesting that R2R3 is a precursor of MYB3R [70]. Common to both theories, there was a vast expansion of R2R3–MYB proteins in plants via duplications of entire genes [8]; however, the expansion was restricted for the three-repeat MYB proteins in both animals and plants. Comparisons of DNA-binding specificities and functional roles between MYB proteins with different repeats could help elucidate the nature of the evolutionary pathway for MYB proteins. 3. DNA targets of MYB family members 3.1. Animal MYB DNA-binding sites The DNA target of animal three-repeat MYB transcription factors was first determined by isolation of chicken genomic DNA fragments bound by v-MYB on filters [10] and by comparison of putative MYB binding sites within the SV40 enhancer region [71]. Binding-site selection methods with c-MYB protein resulted in added minor extensions to the c-MYB consensus sequence. The c-MYB consensus sequence was found to be ((T/C)AAC(G/T)G(A/C/T)(A/C/T)) and was termed MYB binding site I (MBSI) [25,72]. Mutational assays validated by NMR structural data revealed that the MBSI sequence was bipartite. The first halfsite ((T/C)AAC)) has the majority of specific contacts with R3, and the second half-site ((G/T)G(A/C/T)(A/C/T)) had specific contacts with R2 [23,73,74]. Following identification of the c-MYB DNA-binding site, mammalian A-MYB and B-MYB, were subsequently shown to bind MBSI [14,75-77]. 3.2. Plant MYB DNA-binding sites Although R1R2R3–MYB proteins in plants share the same functionality as animal R1R2R3–MYB family members, their DNAbinding specificities are different [12,72,78]. All three characterised animal three-repeat MYB proteins bind to the same sequence MBSI ((T/C)AAC(G/T)G(A/C/T)(A/C/T)) and have similar functions in cellcycle control [10-12]. In comparison, plant three-repeat MYB proteins, such as tobacco MYBA1, MYBA2, and MYBB have an important role at the G2/M phase of the cell-cycle, by regulating transcription of cyclin B and other cell-cycle genes that are expressed at a similar time in the cell-cycle [79]. Through a yeast one-hybrid screen, NtMYBA1, NtMYBA2, and NtMYBB were found to bind to AACGG. This consensus sequence is known as the M phase-specific activator (MSA) element, and was identified previously in tobacco. The other two three-repeat MYB proteins found in tobacco, NtMYBC1 and

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

NtMYBC2, are also predicted to bind to this motif; however, this assumption has not been validated experimentally. Relatively few of the possible plant R2R3–MYB DNA targets have been characterised; but some common elements of plant MYB–DNA interactions have emerged (Fig. 1, Supplemental Table S1). Recognition of plant MYB DNA targets was first determined with studies conducted on the Maize P protein, a R2R3–MYB protein involved in flavonoid biosynthesis [80]. Through binding-site selection assays and EMSAs, P was shown to bind to ACC(A/T)ACC(A/C/T). This contrasted with the animal MYB DNA consensus sequence of ((T/C)AAC (G/T)G(A/C/T)(A/C/T)), but was a harbinger for the majority of plant MYB proteins, which recognise MBSI ((T/C)AAC(G/T)G(A/C/T) (A/C/T)), MBSII (AGTTAGTTA), and MBSIIG ((C/T)ACC(A/T)A(A/C)C). Nevertheless, it is important to note that not all plant MYB proteins, especially within the R2R3–MYB family, recognise these motifs [81]. Many R2R3–MYB transcription factors recognise AC elements, DNA motifs that are enriched in adenosine and cytosine residues [37,62,63,80,82-90]. Some R2R3–MYB proteins function as transcriptional activators at these sites [62,63], while others function as transcriptional repressors [82]. Compendia of plant MYB DNA-binding sites can be found in databases such as The Arabidopsis Gene Regulatory Information Server (AGRIS) (http://arabidopsis.med.ohio-state.edu/) and the transcription factor database (TRANSFAC) (http://www.gene-regulation. com/pub/databases.html). These databases contain many of the MYB DNA-binding sites reported in the literature, most of which have been experimentally validated, and all of which are reported here (Fig. 1, Supplemental Table S1). Plant MYB DNA-binding sites were determined on a protein-by-protein basis [26,31,33,40,45,62,63,80,81,83,85,86,88-107], and generally reside approximately 500 bp upstream of the transcriptional start site (Fig. 1, Supplemental Table S1).

3.3. The DNA targets of single MYB-repeat proteins In contrast to two and three-repeat MYB proteins, single-repeat MYB proteins bind predominantly to the telomeric sequence TTAGGG and display similar sequence identity; however, not all single-repeat MYB proteins bind this sequence and moreover, they do not share the same functional roles (Supplemental Table S1). Single-repeat MYB proteins are involved with telomere binding and circadian clock regulation [15]. These functionalities have been conserved during the evolution of yeast, animals and plants [8,108]. Both C-terminal and N-terminal single-MYB repeat proteins bind to double-stranded DNA of telomeric repeats TTTAGGG. AtTRB1, AtTRB2, AtTRB3 all bind the telomeric DNA sequence containing a minimum of two repeats (TTTAGGG)2. In this regard, these single-MYB repeat proteins are divergent from R2R3–MYB and R1R2R3–MYB proteins both in terms of the primary sequence of the MYB domains, and also, consistent with their divergent DNA-binding domain, in terms of their cognate DNA target binding sites. By contrast, some single-repeat MYB proteins seem to bind to DNA targets that are coincident with R2R3– MYB proteins [36,45,47]. In this regard, they function as competitors for the same DNA targets. The rice single-MYB repeat proteins OsMYBS1, OsMYBS2, and OsMYBS3 can form dimers to bind to the sequence TATCCA with different binding affinities, as determined by EMSAs with competition [47]. Mutational assays showed that nucleotides CCA are more important for OsMYBS1 and OsMYBS3 binding than the TAT nucleotides. In contrast, sequence TAT seems to be more important for OsMYBS2 binding. Moreover, all three of these MYB proteins alter alpha-amylase gene expression. OsMYBS1 had a higher transactivation ability than OsMYBS2 and OsMYBS3. OsMYBS3 acted as a transcriptional repressor in both yeast and barley aleurone cells. These results demonstrate differential binding affinities and transactivation ability of three MYB proteins for the same target DNA sequence, showing the mechanism behind the phenotype. Competing with each other, single-repeat MYB proteins

69

and R2R3–MYB proteins provide a means by which to fine-tune gene expression of genes that contain the gene regulatory regions that are the sites of competition [42,45,47]. 4. The nature of DNA-binding by MYB proteins 4.1. Relationship between the MYB DNA-binding domain and DNA-binding specificity The MYB superfamily has been categorised based both on the number of MYB repeats and on the amino acid sequence of the MYB domain [21,24]. In other families of transcription factors, overall sequence conservation is low and variability in DNA-binding specificity is high [109,110]. Contrary to this, members of the plant R2R3–MYB family share higher amino acid sequence similarity, especially in their recognition helices, and display similar DNA-recognition patterns [81]. These similarities in recognition specificity are heightened between members of the same phylogenetic group. R2R3–MYB family members from different species have been previously classified into different phylogenetic clades (groups A, B, and C) based on sequence similarities [81]. These clades were then analysed for DNA-binding specificities [81]. The R2R3–MYB DNA-binding data was mainly a result of in vitro assays; however, in vivo assays were used as well to determine R2R3–MYB DNA-binding specificities. It was shown that members from group A, bind MBS type I sequence (C(A/C/G/T)GTT(A/G)), members of group B bind equally to both MBS type I and type II (G(G/T)T(A/T)GTT(A/G)), and most members of group C bind MBS type IIG ((C/T)ACC(A/T)A(A/C)C). For example, AtMYB6 and AtMYB7 are both members of group C and share 90% amino acid sequence identity [81]. AtMYB6 and AtMYB7 both bind to the MBS type IIG sequence [97] (Fig. 1, Supplemental Table S1). Well-characterised DNA-binding sites can be extracted from the literature for 87 proteins from the MYB superfamily (Fig. 1, Supplemental Table S1). Characterisation of DNA targets was derived from both in vivo and in vitro protein–DNA-binding assays (see captions for Fig. 1 and Supplemental Table S1). DNA binding sites for these proteins can be categorised into seven groups, and each of the 87 MYB proteins can be grouped based on DNA-binding specificities. Examinaton of the protein-sequence-similarity-derived phylogenetic relationships between these 87 MYB proteins reveals that, in general, MYB proteins that share protein sequences bind to similar DNA sequences (Fig. 1, Supplemental Table S1). That is, similar protein structure implies similar DNAbinding sequences recognized by MYB proteins; however, there are instances in the phylogenetic tree and in other studies where this is not the case [97] (Fig. 1, Supplemental Table S1). Members of the MYB superfamily do not always share similar DNAbinding sites based on similar structure. Although Romero et al. had shown a correlation between MYB protein structure and DNA-binding specificity, there were MYB family members that did not fit into his predictions [81]. For example, Group C MYB family members prefer in general type IIG sequence; however, the two Group C MYB proteins, AtMYB2 and AtMYBGL1, bound DNA with different patterns. AtMYB2 bound to type I sequences [105], and AtMYBGL1 bound only to type II sequence [81]. These examples show that MYB proteins, although similar in structure and function, can bind to different DNA cognate target sites. Moreover, it highlights the importance of conducting DNAbinding site experiments for individual MYB proteins because it is extremely difficult to predict a MYB DNA-binding site based solely on homology. 4.2. Involvement of MYB repeats in DNA binding Both R2 and R3-MYB repeats are necessary for DNA binding, either by R1R2R3–MYB or R2R3–MYB proteins [23]. Neither R2 nor R3 can alone bind DNA specifically [23]. This implies that both the R2 and R3 repeats bind cooperatively to its cognate DNA target sequence.

70

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

Fig. 1. Phylogenetic relationships and subgroup designations for 87 MYB superfamily members using the neighbour-joining method. The unrooted phylogenetic tree was generated using the amino acid sequences of the MYB proteins in Supplemental Table S1. Whole MYB protein sequences were downloaded from The Arabidopsis Information Resource (TAIR; http://Arabidopsis.org) and from the National Center for Biotechnology Information protein database (NCBI Entrez; http://www.ncvi.nlm.nih.gov/sites/entrez). The phylogenetic analysis included 9 three-repeat MYB proteins (R1R2R3–MYB proteins), 50 two-repeat MYB proteins (R2R3–MYB proteins) and 28 one-repeat MYB proteins (R1-MYB proteins). The full-length amino acid sequences were aligned using Multiple Alignment using Fast Fourier Transform (MAFFT) using the G-INS-I algorithm [179]. A neighbour-joining tree was constructed using Molecular Evolutionary Genetics Analysis 4 (MEGA 4) [180] with the parameters for the Jones-Taylor-Thornton substitution model and a Gamma parameter of 1.0 to account for the uneven rates of substitution across the length of the MYB proteins. Pairwise gap deletion was used, along with a bootstrap value of 1000. DNA-binding sites for MYB proteins were obtained from the literature. MYB proteins are annotated by colour based on DNA sequence recognition. Red, blue, green, orange, purple and grey represent MYB proteins that bind CNGTT(A/G), ACC(A/T)A(A/C), TTAGGG, AAAATATCT, GATA and TATCCA respectively. Black represents MYB proteins that do not bind to an assigned group. N indicates adenine, guanine, cytosine or thymine. * indicates that the MYB protein DNA-binding specificity differs slightly from the consensus sequence of its group. Refer to Supplemental Table S1 for specific details on DNA sequences bound by the MYB proteins.

The resolved structure of the R2R3–DNA complex has displayed that both R2 and R3 recognition helices contact directly with each other prior to sequence-specific binding [23,111-113]. Furthermore, the phosphate backbone interacts simultaneously with the amino acids in both the R2 and R3 repeats to aid in DNA-binding. As R1 is not necessary for the specific recognition of DNA target sequences, both R1R2R3 and R2R3–MYB proteins bind DNA in a similar manner. By contrast, single-repeat MYB proteins, which only possess one MYB DNA-binding repeat, bind DNA in a different manner than R1R2R3 and R2R3–MYB proteins [38]. This first became clear when S. cerevisiae Rap1 was found to contain two MYB repeats in its MYB–DNAbinding domain and its orthologous MYB counterpart, Homo sapien RAP1, only possessed one MYB repeat [42]. It was subsequently found that S. cerevisiae Rap1 binds DNA as a monomer because it contains two MYB repeats. In contrast, H. sapien RAP1 contains only one MYB repeat and does not bind DNA directly; however, it tethers itself to shelterin via TRF2 to bind DNA in a sequence specific manner.

4.3. The nature of DNA binding by animal MYB proteins The nature of DNA binding by any MYB protein has been most extensively examined using c-MYB and its cognate target, MBSI. Mutational studies on c-MYB have shown that R1 can be deleted without significant loss of DNA-binding ability, and that both R2 and R3 are essential for MYB–DNA recognition and binding [114-116]. Although R1 is not involved in the direct recognition of DNA sequence motifs, it does enhance the stability of DNA binding by the R2R3 repeats without significantly altering the DNA–R2R3 conformation. The R1 repeat is hypothesised to achieve this by fluctuating between bound and unbound states at a rate faster than the chemical shift time scale [23,74]. c-MYB–DNA interactions were validated structurally with the resolution of the NMR solution structures and X-ray crystal structures of c-MYB DNA-binding domain in the free and DNA-bound states [23,111-113]. Each third helix (C-terminal helix) of R2 and R3 were subsequently found to act as the recognition helix [23]. In keeping

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

with this, the recognition helix of R3 interacts with the core of the DNA consensus sequence ((T/C)AAC)); while the recognition helix of R2 interacts less specifically with nucleotides surrounding the core recognition motif ((G/T)G(A/C/T)(A/C/T)) [111]. The binding of R2 and R3 to its consensus sequence ((T/C)AAC(G/T)G(A/C/T)(A/C/T)) widens the major groove and causes a bend of local helical axis [23]. Several interhelical interactions occur between the helices of R2 and R3, stabilising the MYB–DNA interaction. Moreover, R2 and R3 bound the major groove continuously, similar to transcription factor IIIA (TFIIIA)-type Zn fingers [117-119]. Contrary to TFIIIA-type Zn fingers, the recognition helices of c-MYB R2 and R3 are more closely packed together in the major groove. This type of direct interaction between the recognition helices from different DNA-binding units is unique among other DNA-binding domain complexes [23]. Not all amino acid residues within the DNA-binding site of transcription factors partake in DNA recognition and binding. Within the MYB protein family, certain key residues are critical for these tasks [23,103]. For example, for c-MYB, the three key base contacts are governed by residues Lys128 (R2), Lys182 (R3), and Asn183 (R3), which are found to be fully conserved in all know animal and plant MYB proteins [23,111]. Each MYB DNA-binding domain contains several conserved regularly spaced tryptophan residues that participate in a hydrophobic cluster [114,120]. Through mutational and structural studies on cMYB, this hydrophobic cluster was determined to be essential for both the stability of MYB-protein interaction and for sequencespecific binding to its consensus sequence ((T/C)AAC(G/T)G(A/C/T) (A/C/T)). Mutational and structural studies on animal c-MYB have aided in providing critical knowledge on the molecular mechanisms behind MYB–DNA interactions. Moreover, these studies allow one to generate testable hypotheses on MYB–DNA interactions in other organisms where orthologous MYB proteins reside. A cysteine residue located in the DNA recognition helix of R2 has remained completely conserved in animals, fungi, and plants during the evolution of MYB domains [121]. R1R2R3–MYB domains have a cysteine residue (Cys130) that is included in the hydrophobic core. Cys130 of c-MYB needs to be reduced to allow for sequence-specific DNA-binding. When reduced, Cys130 accomplishes this by structurally stabilising the three helices of the R2-MYB repeat during sequence-specific DNA-binding [122-124]. In contrast, most R2R3– MYB domains possess two cysteine residues (Cys49 and Cys53) with the equivalent position as Cys130 in R1R2R3 MYB [121]. c-MYB has been extensively studied with regards to dynamics of DNA binding [23,74,111,125]. The c-MYB R2R3-domain was shown to bind tightly to the MYB binding site ((T/C)AAC(G/T)G(A/C/T)(A/C/T)) with a binding constant of 1.5E− 09 M ± 28% [74,125]. Mutational analyses have shown that specific residues within the R2R3–MYB repeats of c-MYB bound to specific nucleotides with different affinities [74]. Specific interactions within the R2R3–MYB repeats of c-MYB are disproportionately localised in the AACTGAC region of the MYBbinding site. The first adenine, the third cytosine, and the fifth guanine are involved in extremely specific interactions with c-MYB, in which any base substitutions reduce the binding affinity by more than 500fold. In contrast to this, the interaction with the second adenine is less specific, with an affinity reduction in the range of 6 to 15-fold when subjected to base change. The seventh cytosine shows an interesting interaction in that only guanine substitution abolishes the specific binding. All together, these affinity data show that the second and third MYBrepeats cover the AACTGAC region from the major groove of DNA in an orientation that allows the third MYB-repeat to cover the core AAC sequence. Moreover, the results show that the third MYB-repeat recognises the core AAC sequence very specifically; however, the second repeat recognises the GAC sequence in a redundant manner. MYB–DNA kinetic studies also found that mutating the R1 repeat does not affect the DNA recognition of c-MYB but does effect the stability of the MYB–DNA complex. Furthermore, the N-terminal

71

acidic activation region upstream of the first MYB repeat was found to reduce the binding affinity by interfering with R1 binding to DNA. NMR, X-ray crystallography, and surface plasmon resonance studies have validated these c-MYB–DNA binding kinetic results [23,111,126]. Further studies on c-MYB DNA affinity indicated that when c-MYB binds DNA, the orientation of R2 and R3 are immobilised by sequence-specific binding and their conformations are slightly changed. No significant conformational changes occur in R1 during MYB DNA-binding, further emphasising that R1 is not involved in DNA-binding site recognition [111]. In a comparison between the binding kinetics of the three vertebrate MYB proteins (A-, B- and c-MYB), both A- and c-MYB bound the MYB recognition site with similar binding constants and specificity; however, B-MYB formed DNA-protein complexes of lower stability, rapidly dissociating under competitive conditions and showed less tolerance to DNA-binding site variations [127]. These studies on animal MYB proteins have granted insight into the molecular mechanisms behind MYB–DNA interactions in general because R2R3–MYB proteins bind DNA in a similar fashion [23,103]. Kinetics on single-repeat animal MYB proteins binding to their DNA cognate sequences have also been examined. For example, the human single-repeat MYB protein TRF1 shows that TRF1 can bind to the telomeric sequence TTAGGG with high affinity (Kd = 3.2 ± 0.5 × 10 − 9 M) and specificity as a monomer [7]. The recorded DNA binding affinity lies in the range of various homeodomains that also bind specifically to DNA as monomers [128-131]. Although the interaction of TRF1 and the telomeric sequence is specific, the specificity and affinity is significantly increased as a homodimer [132]. 4.4. The nature of DNA binding by plant MYB proteins To date, some of the specifics of plant MYB interaction with target DNA have relied on model-building based on the c-MYB binding to DNA, as no crystal structure has been generated yet for any plant multi-repeat MYB protein. For example, PAP1/AtMYB75, the R2R3domain was modelled according to the known structural data of c-MYB [133]. A conserved amino acid signature ([DE]Lx2[RK] x3Lx6Lx3R) found within several MYB proteins was hypothesised to predict new MYB/BHLH interactions for A. thaliana proteins. Consistent with this hypothesis, analysis of the predicted 3D model of PAP1/AtMYB75 showed that the amino acids of the conserved motif are surface-exposed on helices 1 and 2 of the R3 repeat, forming hydrophobic and charged residue patterns [133]. These surface-exposed amino acids are thought to stabilise the protein–protein interactions [133]. This model was validated by mutational assays [133]. The Petunia MYB Ph3 structure was also modelled after c-MYB. MYB. Ph3, a plant R2R3–MYB transcription factor involved in the regulation of flavonoid biosynthetic pathway in petunia flowers [85,134,135], shows divergence in binding specificity compared to c-MYB [103]. MYB.Ph3 can bind two types of sites: MBSI ((T/C)AAC(G/T)G(A/C/T)(A/C/T)) and MBSII (AGTTAGTTA) [81,103]. Modelling predicted that a single residue substitution in the R2 repeat of MYB.Ph3 (Leu71►Glu) would change its DNA recognition to that of c-MYB, and the reciprocal substitution in c-MYB, Glu132►Leu would change c-MYB specificity to that of MYB.Ph3 [103]. This model was experimentally validated via mutational assays. Even though it was previously found that these residues do not directly bind DNA [23], the MYB.Ph3 Leu71 and c-MYB Glu132 residues interact with residues that do interact with DNA, enabling them to impact DNA-specificity indirectly [103]. By contrast, Williams and Grotewold [136] found that P and v-MYB DNAbinding domains, which are conserved among animal and plant MYB domains, are necessary for the high affinity DNA-binding activity of these proteins to their respective DNA target sites but are not sufficient for their unique DNA-binding site recognition of P and vMYB. Furthermore, Williams and Grotewold [136] found that chimeric MYB domains have novel DNA-binding specificities. Resolution of

72

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

these differences will require crystal or solution structures for plant MYB proteins. As is the case with c-MYB, both Cys49 and Cys53 are thought to be essential for the DNA-binding or transcriptional activity of plant MYB proteins, forming an intramolecular disulfide bond with each other under non-reducing conditions. This disulfide bond has been hypothesised to impair DNA binding under non-reducing conditions, causing R2R3–MYB proteins to be functionally active only under reducing conditions. Toward this end, the same two cysteines are conserved in the R2-MYB repeat of the R2R3–MYB protein WEREWOLF (WER) [95]. WER cannot bind to its DNA-binding sites within its downstream target promoter regions without the addition of dithiothreitol (a reducing agent). The dithiothreitol is thought to abolish the disulfide bond, leading to the sequence specific binding of WER to its downstream targets. Nitric oxide (NO) was shown to modifiy the DNA-binding activity of AtMYB2 by a posttranslational modification of its conserved Cys53 [137]. AtMYB2 bound to the core binding site AAACCA in an EMSA assay; however, the addition of NO donors, such as SNP (sodium nitroprusside) and GSNO (S-nitrosoglutathione), inhibited sequence specific binding of AtMYB2. The NO-mediated inhibitory effect was reversed by DTT, demonstrating that sequence specific DNA-binding of AtMYB2 is inhibited by S-nitrosylation of Cys53 as a result of NO action. The role of cysteine residues in MYB proteins displays the divergence of DNA binding mechanisms between both animal and plant MYB proteins. Despite some similarities in DNA binding, given the divergence of target DNA-binding sites of R2R3–MYB proteins relative to R1R2R3–MYB proteins, it follows that the residues critical for DNA recognition and binding within the binding site of many plant MYB proteins differ from those of animal MYB proteins. These examples display why there is inherent flexibility of DNA recognition by the MYB superfamily of transcription factors because merely one change in residue can alter the DNA recognition by a particular MYB protein. Despite the vast knowledge of plant MYB transcription factor function at the gross morphological level, little is known about the dynamics of MYB protein–DNA interactions. Nevertheless, some general themes regarding plant MYB–DNA binding kinetics are emerging [45,47,103]. Most plant MYB proteins display considerable inherent flexibility in their ability to recognise target sites (Fig. 1, Supplemental Table S1). For example, Petunia protein MYB.Ph3 bound to both MBSI and MBSII sites with the same affinity, inducing similar DNA-bending/ distortions in both cases [103]. Affinities for these two plant MYB binding sites vary among other plant MYB proteins; however, certain MYB proteins have been shown to only bind one of these sequences [80,83,85,92,97,105]. The maize R2R3–MYB C1 protein bound to its target sequences in the a1 (dihydroflavonol reductase) promoter [86]. Determined by EMSA assays, the affinity of binding was reduced by mutations in the C1 DNA-binding domain or in the a1 sequences recognised and bound by C1. Maize transient assays determined that C1 directly activated the a1 gene. Towards this end, the two C1 binding sites were also bound by the maize P protein. One of the sites (ACC(A/T)ACC) were bound with higher affinity by P (Kd = 52 ± 4 × 10− 9 M) relative to C1 (Kd = 330 ± 50 × 10− 9 M). In contrast, the other site (AACTACCGG) is bound with similar low affinities by P (Kd = 860 ±150 × 10− 9 M) and C1 (Kd = 780 ± 70 × 10− 9 M). These results allow a greater understanding of the mechanism behind the anthocyanin biosynthetic pathway in maize. In another example, all three Soy-MYB proteins, GmMYB76, GmMYB92, and GmMYB177, bound to the MBSI sequence [45]. GmMYB92 could also bind sequences MRE4 (TCTCACCTACC) and mMRE1 (CCGGAAAAAAGGAT). Unlike GmMYB92, GmMYB76 and GmMYB177 bound to the mMRE1 sequence with weak affinity. It is important to note that, while the aforementioned studies have provided profoundly useful insights into plant R2R3–MYB interactions with DNA sequences, they also provide a rather incomplete picture of the specific interactions that are possible. Given the sheer number of plant MYB proteins, the correspondingly large number of downstream

DNA targets for these proteins, and the breadth of processes controlled by the MYB family members in plants, the complexity of plant MYB–DNA interactions characterised to date is the tip of the proverbial iceberg. Clearly, there is a need for more extensive analysis of these important interactions. One might expect considerable inroads to be made in future, with the emergence of new technologies to probe DNA-protein interactions. 5. Future of plant MYB–DNA interaction studies 5.1. Determining the breadth of MYB DNA targets in vitro The identification of in vitro MYB DNA-binding sequences in a rapid and high-throughput manner is required in the future to identify all variants of their DNA targets. Transcription factors, including MYB proteins, are promiscuous in terms that they can interact and initiate transcription from multiple target sequences [63,103,138]. Well-established protocols based on recombinant MYB transcription factor (TF) DNA-binding domains have been used to enrich for target sequences from libraries of random DNA sequences [12,72,80,139]. These experiments include cyclic amplification and selection of targets (CASTing) [140] and systematic evolution of ligands by exponential enrichment (SELEX) [141]. Both of these procedures have determined numerous MYB in vitro DNA binding motifs for several MYB transcription factors, and their underlying principles can now be scaled to accommodate high-throughput approaches. Microarray based technologies, such as protein-binding microarrays, have been developed to identify transcription factor sequence specificities [142-145]. Binding sites identified by this technology have correlated with in vivo transcription factorbound DNA sequences identified by ChIP experiments [144,146,147]. Two types of protein-binding microarrays have emerged: doublestranded DNA microarrays and transcription factor microarrays. Double-stranded DNA microarrays possess all possible doublestranded 11 bp sequences (approximately 4.2 million sequences) in roughly 240,000 oligonucleotides [148]. Recombinant protein from a transcription factor of interest is flowed over the double-stranded DNA microarray and washed with increasing concentrations of salts. This technology allows the accurate quantification of binding affinities to all possible DNA-binding sites recognised by the transcription factor of interest in just one hybridization step. Transcription factor DNA-binding enrichment, based on a protein array, allows for the capture multiple transcription factors and then discovery of their binding sites [149]. A library of random oligonucleotides is flowed over captured proteins to identify the transcription factors’ DNA-binding sites. The array is then washed with increasing salt concentrations to allow for the identification of relative binding affinities. This specific protein-binding microarray has a slight advantage over the double-stranded DNA microarray because multiple transcription factors can hybridise onto a chip, allowing for the identification of binding preferences for transcription factor families [150]. Both these techniques are powerful means to identify in vitro DNA-binding sites of proteins of interest in a time efficient manner [149,150]; however, there are limitations to these experiments. One limitation is that protein–DNA complexes that have weak affinity for each other will be washed away with low concentrations of salts, biassing the results. Another limitation is that the whole structure of the protein is not available to bind to its preferred DNA targets because a portion of the protein is hybridised to the array. Not knowing the DNA binding domain of the protein of interest could lead to misleading results. 5.2. Emerging approaches for plant MYB target discovery and analysis in vivo Crucially, MYB DNA-binding sites in vitro might differ from those preferred in vivo [151,152]. These differences are a result of in vivo protein–protein interactions and post-translational modifications

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

altering DNA binding specificity, as well as conformational differences between in vitro recombinant DNA-binding domain and in vivo native conformations of these domains. The in vivo availability of transcription factor DNA-binding sites is also controlled by the packaging of genomic DNA in chromatin. Therefore, alternative in vivo approaches are necessary to map MYB–DNA binding sites in the genome accurately. Transient expression assays and yeast one-hybrid assays are now a staple in identifying that a particular MYB binds to a specific DNA target in vivo [62,63,153]. These procedures involve the expression of a transcription factor of interest within organisms, such as plants or yeasts, to see if it is sufficient to enable the transactivation of an artificial gene comprising a tandem repeat of its putative DNAbinding site fused to a minimal promoter, upstream of a reporter gene. These experiments, with the right controls, ensure that a specific transcription factor of interest interacts and activates transcription from its putative DNA binding site in vivo. For example, the R2R3–MYB transcription factors AtMYB11, AtMYB12 and AtMYB111 were shown, through transient expression assays in Arabidopsis protoplasts, that they were functionally similar to its structurally similar maize P protein [154,155]. AtMYB11, AtMYB12, AtMYB111, and P protein had similar target gene specificity, regulating a myriad of flavonoid biosynthetic genes. Furthermore, all activated target gene promoters in vivo in the presence of a MYB recognition element. Transient expression assays and yeast one-hybrid assays are well established experiments to validate if a particular protein activates transcription from a particular motif; however, chromatin immunoprecipitation (ChIP) followed by either whole-genome tiled microarray analysis (ChIP-chip) or high-throughput signature sequencing (ChIP-seq) can identify novel in vivo DNA targets of proteins of interest, resulting in more biologically significant results. ChIP-chip or ChIP-seq has proven to be powerful tools by which to identify in vivo binding sites of sequence-specific transcription factors in the context of chromatin [156], which avoids many caveats of the aforementioned techniques [157]. Recently, ChIP identified in vivo DNA-binding target sites for a select group of MYB proteins [153,158-163]. In plants, ChIP-chip identified in vivo binding sites for two Arabidopsis two-MYB-repeat proteins, FOUR LIPS (FLP; AtMYB124) and AtMYB88 [153]. FLP and MYB88 were shown to directly bind promoters of cell cycle genes, including CDKA:1 (At3g48750), CELL DIVISION CYCLE6a and 6b (CDC6a At2g29680 and 6b At1g07270), Cyclind4:1 (At5g65420), a cyclin-like gene, CYCLINT:1 (CYCT:1, At1g35440), CDKD1:3 (At1g8040) and CYCB1:3 (At3g11520). These results were consistent with FLP/ MYB88 in suppressing DNA replication and cell cycle progression within the stomata. Systematic evolution of ligands by exponential enrichment (SELEX) and EMSA, along with ChIP-chip, helped identified that this group bound to the core consensus sequence of (A/T/G)(A/T/ G)C(C/G)(C/G). Similarily, Zea mays MYB31 was shown by SELEX and ChIP to bind to the sequence ACC(T/A)ACC within the two lignin promoters XmCOMT and ZmF5H, resulting in the repression of lignin biosynthetic gene expression [159]. Furthermore, ChIP-chip was performed on the trichome developmental selectors GLABRA3 (GL3) and GLABRA1 (GL1), encoding basic helix-loop-helix (bHLH) and MYB transcription factors respectively. ChIP-chip identified 20 novel in vivo GL3/GL1 direct targets such as SCL8 and MYC1 (involved in the control of gene expression), SIM (a cyclin-dependent kinase inhibitor), and RBR1 (a negative regulator of the cell cycle transcription factor E2F) [162]. ChIP-chip and ChIP-seq are difficult procedures to conduct because these procedures require an antibody that specifically recognises a transcription factor of interest. Although these procedures are the most effective way in determining true in vivo DNA-binding targets and sites, they have not been used to study the majority of the MYB superfamily. Epitope tagging is the process of making the product of a gene of interest immunoreactive to an already synthesised antibody [156].

73

This can be done by inserting a polynucleotide encoding an epitope into a gene of interest and expressing the gene in an appropriate host. This protein from the gene of interest can now be located via an antibody that has already been generated. This method could be used as an alternative to generating novel antibodies before conducting a ChIP-chip. When an antibody cannot be generated to a protein of interest, this method is best used to determine in vivo DNAprotein binding data on the protein of interest. Other methods can also identify in vivo MYB DNA-binding sites and downstream targets. A glucocorticoid receptor (GR)-mediated inducible system has successfully been used to define direct target genes of several putative transcription factors [164-166]. In a GR-mediated inducible system a fusion protein between a protein of interest and the rat glucocorticoid receptor hormone binding domain is engineered. This fusion protein is retained in the cytoplasm in absence of the synthetically made steroid hormone dexamethasone. Upon addition of dexamethasone the protein of interest-GR fusion protein enters the nucleus and binds to the protein of interest's downstream target genes. The addition of translational inhibitors such as cycloheximide will inhibit further downstream effects of your protein of interest. By assaying genome-wide expression changes on microarrays, one can determine direct target genes of a protein of interest. The GR inducible system was used to show the single-repeat MYB protein CAPRICE (CPC) transcription is regulated directly by WER (a R2R3– MYB transcription factor). Using EMSAs, two WER-binding sites (WBSs; WBSI and WBSII) were verified in the CPC promoter. WERWBSI binding was further validated in vivo using yeast one-hybrid assays. In another example, AtMYB80 involvement in tapetal and pollen development was examined [167]. Using the GR system, it was determined that 79 genes were changed when the R2R3–MYB transcription factor AtMYB80 function was restored in the myb80 mutant following dexamethasone induction [167]. Thirty-two of these genes were analysed using ChIP, and three were identified as direct targets of AtMYB80. These genes were shown to encode a glyoxal oxidase (GLOX1), a pectin methylesterase (VANGUARD1), and an A1 aspartic protease (UNDEAD) and corresponded with in vitro binding data. This procedure is a powerful way of identifying direct targets of transcription factors. When the GR-inducible system is coupled with in silico processes, such as promoter analyses, and DNA-binding experiments, such as EMSAs, yeast-one hybrid assays and protoplasts assays, it has proven quite useful in identifying in vivo DNA binding sites. The use of both expression data and DNA-binding site experiments are required to reduce the amount of MYB false positive targets. The forkheadboxA homolog, PHA-4 regulates organogenesis of the pharynx in Caenorhabditis elegans [168]. Expression of PHA-4 targets correlated with its binding sites in promoter regions, and that the timing of target expression correlated with binding affinity between PHA-4 and its target sequence [169]. The data suggested that PHA-4 regulates pharyngeal organ development by combining PHA-4 binding affinity and cooperating factors to regulate gene expression temporally. ChIP-seq data for PHA-4 validated this assessment — 87% of the associated genes were expressed when PHA-4 binding was present, and this expression was reduced to 60% when PHA-4 binding was not present [170]. Towards this end, using both expression data and DNA-binding site experiments is a powerful means of validating that the binding of a factor activates the expression of its putative target genes. The use of both expression data and MYB–DNA binding site assays in tandem will aid in generating a more biologically significant MYB network. Another problem with the identification of MYB binding sites is that some MYB binding sites are not proximal to the predicted target genes. In an early ChIP-seq study, there were a large number of bound sites observed for the human interferon-g (IFN-g) responsive transcription factor STAT1 [171]. Before stimulating the cells with IFN-g, 10,000 binding sites were identified. Binding sites for STAT1

74

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

increased fourfold after stimulating the cells with IFN-g. In both conditions, approximately 50% of the total sites were intragenic, 25% of the total sites were intergenic. Most binding sites were not located near STAT1-regulated genes, which suggested that bound sites were not directly regulating nearby genes. This can be explained by chromatin looping - a mechanism for transcriptional control that involves bringing regulatory elements into proximity of target genes [172]. The use of chromosome conformation capture studies in the future will determine if the distant locus control region (LCR) with MYB target genes are required for high-level transcription. 6. Conclusion The past decades have seen remarkable inroads made into our understanding of the molecular interactions between sequence-specific transcription factors and their DNA targets in general, and MYB proteins and their binding sites more specifically. This is particularly true for animal R1R2R3–MYB transcription factors. Insights into the specificities of R1R2R3–MYB interactions with DNA target sequences is, in turn, providing greater understanding of the molecular mechanisms that control cellular processes [12,62,72,80,103] (Fig. 1, Supplemental Table S1), and is aiding in the development of diagnostics and therapeutics for when those mechanisms go awry [173,174]. By contrast, comparable understanding of the molecular mechanisms that proceed from the interaction of plant R2R3–MYB proteins with their cognate DNA targets is much less complete, with only a handful of MYB–DNA interactions characterised at the molecular level for any given plant species. Consequently, the precise means by which plant R2R3–MYB proteins coordinate gene expression to give rise to plant phenotype is still at a relatively nascent stage. This said, with the emergence of new techniques that enable the dissection of protein– DNA interactions more rapidly [175,176], and/or with higher resolution [156,177], and/or for a larger number of proteins [145,148,178], the characterisation of new MYB–DNA interactions, particularly those that occur in plant species, should proceed apace. Given this, the coming decade promises to provide great insights into the means by which members of this remarkable family of proteins convert molecular information into whole organism responses. Supplementary materials related to this article can be found online at doi:10.1016/j.bbagrm.2011.10.010. Acknowledgements We are very grateful to Dr. Katharina Bräutigam, Joseph Skaf, and Heather Wheeler for fruitful discussions and extensive assistance with earlier drafts of this manuscript. This work was generously supported by a Natural Science and Engineering Research Council of Canada (NSERC) Canadian Graduate Scholarship (CGSD) awarded to MP, and by funding from the University of Toronto and NSERC to MMC. References [1] T. Maniatis, S. Goodbourn, J.A. Fischer, Regulation of inducible and tissuespecific gene expression, Science 236 (1987) 1237–1245. [2] J. Colladovides, B. Magasanik, J.D. Gralla, Control site location and transcriptional regulation in Escherichia coli, Microbiol. Rev. 55 (1991) 371–394. [3] C.O. Pabo, R.T. Sauer, Transcription factors: structural families and principles of DNA recognition, Annu. Rev. Biochem. 61 (1992) 1053–1095. [4] C. Yanhui, Y. Xiaoyuan, H. Kun, L. Meihua, L. Jigang, G. Zhaofeng, L. Zhiqiang, Z. Yunfei, W. Xiaoxiao, Q. Xiaoming, S. Yunping, Z. Li, D. Xiaohui, L. Jingchu, D. Xing-Wang, C. Zhangliang, G. Hongya, Q. Li-Jia, The MYB transcription factor superfamily of Arabidopsis: expression analysis and phylogenetic comparison with the rice MYB family, Plant Mol. Biol. 60 (2006) 107–124. [5] J.L. Riechmann, J. Heard, G. Martin, L. Reuber, C.Z. Jiang, J. Keddie, L. Adam, O. Pineda, O.J. Ratcliffe, R.R. Samaha, R. Creelman, M. Pilgrim, P. Broun, J.Z. Zhang, D. Ghandehari, B.K. Sherman, C.L. Yu, Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes, Science 290 (2000) 2105–2110.

[6] J.A. Rosinski, W.R. Atchley, Molecular evolution of the Myb family of transcription factors: evidence for polyphyletic origin, J. Mol. Evol. 46 (1998) 74–83. [7] P. Konig, L. Fairall, D. Rhodes, Sequence-specific DNA recognition by the Myb-like domain of the human telomere binding protein TRF1: a model for the protein– DNA complex, Nucleic Acids Res. 26 (1998) 1731–1740. [8] J.S. Lipsick, One billion years of Myb, Oncogene 13 (1996) 223–235. [9] M.W. Wong, R.W. Henry, B.C. Ma, R. Kobayashi, N. Klages, P. Matthias, M. Strubin, N. Hernandez, The large subunit of basal transcription factor SNAP(C) is a Myb domain protein that interacts with Oct-1, Mol. Cell. Biol. 18 (1998) 368–377. [10] H. Biedenkapp, U. Borgmeyer, A.E. Sippel, K.H. Klempnauer, Viral myb oncogene encodes a sequence-specific DNA-binding activity, Nature 335 (1988) 835–837. [11] J. Golay, A. Capucci, M. Arsura, M. Castellano, V. Rizzo, M. Introna, Expression of c-myb and B-myb, but not A-myb, correlates with proliferation in human hematopoietic cells, Blood 77 (1991) 149–158. [12] K.M. Howe, R.J. Watson, Nucleotide preferences in sequence-specific recognition of DNA by c-myb protein, Nucleic Acids Res. 19 (1991) 3913–3919. [13] C. Dubos, R. Stracke, E. Grotewold, B. Weisshaar, C. Martin, L. Lepiniec, MYB transcription factors in Arabidopsis, Trends Plant Sci. 15 (2010) 573–581. [14] H.L. Jin, C. Martin, Multifunctionality and diversity within the plant MYB-gene family, Plant Mol. Biol. 41 (1999) 577–585. [15] C. Martin, J. PazAres, MYB transcription factors in plants, Trends Genet. 13 (1997) 67–73. [16] H. Kranz, K. Scholz, B. Weisshaar, c-MYB oncogene-like genes encoding three MYB repeats occur in all major plant lineages, Plant J. 21 (2000) 231–235. [17] O. Wilkins, H. Nahal, J. Foong, N.J. Provart, M.M. Campbell, Expansion and diversification of the populus R2R3–MYB family of transcription factors, Plant Physiol. 149 (2009) 981–993. [18] C.W.B. Peters, A.E. Sippel, M. Vingron, K.H. Klempnauer, Drosophila and vertebrate myb proteins share two conserved regions, one of which functions as a DNAbinding domain, EMBO J. 6 (1987) 3085–3090. [19] M.L. Mucenski, K. McLain, A.B. Kier, S.H. Swerdlow, C.M. Schreiner, T.A. Miller, D.W. Pietryga, W.J. Scott, S.S. Potter, A functional c-myb gene is required for normal murine fetal hepatic hematopoiesis, Cell 65 (1991) 677–689. [20] A.M. Gewirtz, B. Calabretta, A c-myb antisense oligodeoxynucleotide inhibits normal human hematopoiesis in vitro, Science 242 (1988) 1303–1306. [21] R. Stracke, M. Werber, B. Weisshaar, The R2R3–MYB gene family in Arabidopsis thaliana, Curr. Opin. Plant Biol. 4 (2001) 447–456. [22] B.R. Linger, C.M. Price, Conservation of telomere protein complexes: shuffling through evolution, Crit. Rev. Biochem. Mol. Biol. 44 (2009) 434–446. [23] K. Ogata, S. Morikawa, H. Nakamura, A. Sekikawa, T. Inoue, H. Kanai, A. Sarai, S. Ishii, Y. Nishimura, Solution structure of a specific DNA complex of the Myb DNA-binding domain with cooperative recognition helices, Cell 79 (1994) 639–648. [24] L. Jia, M.T. Clegg, T. Jiang, Evolutionary dynamics of the DNA-binding domains in putative R2R3–MYB genes identified from rice subspecies indica and japonica genomes, Plant Physiol. 134 (2004) 575–585. [25] K.M. Howe, C.F.L. Reakes, R.J. Watson, Characterization of the sequence-specific interaction of mouse c-myb protein with DNA, EMBO J. 9 (1990) 161–169. [26] B. Luscher, R.N. Eisenman, New light on Myc and Myb. Part I. Myc, Genes Dev. 4 (1990) 2025–2035. [27] N. Nomura, M. Takahashi, M. Matsui, S. Ishii, T. Date, S. Sasamoto, R. Ishizaki, Isolation of human cDNA clones of myb-related genes, A-myb and B-myb, Nucleic Acids Res. 16 (1988) 11075–11089. [28] A. Sala, R. Watson, B-Myb protein in cellular proliferation, transcription control, and cancer: latest developments, J. Cell. Physiol. 179 (1999) 245–250. [29] I. Arabidopsis Genome, Analysis of the genome sequence of the flowering plant Arabidopsis thaliana, Nature (London) 408 (2000) 796–815. [30] D. Alabadi, T. Oyama, M.J. Yanovsky, F.G. Harmon, P. Mas, S.A. Kay, Reciprocal regulation between TOC1 and LHY/CCA1 within the Arabidopsis circadian clock, Science 293 (2001) 880–883. [31] N. Baranowskij, C. Frohberg, S. Prat, L. Willmitzer, A novel DNA binding protein with homology to Myb oncoproteins containing only one repeat can function as a transcriptional activator, EMBO J. 13 (1994) 5383–5392. [32] I.A. Carre, S.A. Kay, Multiple DNA-protein complexes at a circadian-regulated promoter element, Plant Cell 7 (1995) 2039–2051. [33] C.M. Chen, C.T. Wang, C.H. Ho, A plant gene encoding a Myb-like protein that binds telomeric GGTTTAG repeats in vitro, J. Biol. Chem. 276 (2001) 16511–16519. [34] R. Court, L. Chapman, L. Fairall, D. Rhodes, How the human telomeric proteins TRF1 and TRF2 recognize telomeric DNA: a view from high-resolution crystal structures, EMBO Rep. 6 (2005) 39–45. [35] G.M. Ehrenkaufer, J.A. Hackney, U. Singh, A developmentally regulated Myb domain protein regulates expression of a subset of stage-specific genes in Entamoeba histolytica, Cell. Microbiol. 11 (2009) 898–910. [36] M. Feldbrugge, M. Sprenger, K. Hahlbrock, B. Weisshaar, PcMYB1, a novel plant protein containing a DNA-binding domain with one MYB repeat, interacts in vivo with a light-regulatory promoter unit, Plant J. 11 (1997) 1079–1093. [37] M. Fukuzawa, N.V. Zhukovskaya, Y. Yamada, T. Araki, J.G. Williams, Regulation of Dictyostelium prestalk-specific gene expression by a SHAQKY family MYB transcription factor, Development 133 (2006) 1715–1724. [38] M.G. Hwang, I.K. Chung, B.G. Kang, M.H. Cho, Sequence-specific binding property of Arabidopsis thaliana telomeric DNA binding protein 1 (AtTBP1), FEBS Lett. 503 (2001) 35–40. [39] S. Ko, S.H. Jun, H. Bae, J.S. Byun, W. Han, H. Park, S.W. Yang, S.Y. Park, Y.H. Jeon, C. Cheong, W.T. Kim, W. Lee, H.S. Cho, Structure of the DNA-binding domain of NgTRF1 reveals unique features of plant telomere-binding proteins, Nucleic Acids Res. 36 (2008) 2739–2755.

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77 [40] S. Ko, E.Y. Yu, J. Shin, H.H. Yoo, T. Tanaka, W.T. Kim, H.S. Cho, W. Lee, I.K. Chung, Solution structure of the DNA binding domain of rice telomere binding protein RTBP1, Biochemistry 48 (2009) 827–838. [41] C.E. Koering, G. Fourel, E. Binet-Brasselet, T. Laroche, F. Klein, E. Gilson, Identification of high affinity Tbf1p-binding sites within the budding yeast genome, Nucleic Acids Res. 28 (2000) 2519–2526. [42] P. Konig, D. Rhodes, Recognition of telomeric DNA, Trends Biochem. Sci. 22 (1997) 43–47. [43] M. Lang, E. Juan, Binding site number variation and high-affinity binding consensus of Myb-SANT-like transcription factor Adf-1 in Drosophilidae, Nucleic Acids Res. 38 (2010) 6404–6417. [44] B.B. Li, T. de Lange, Rap1 affects the length and heterogeneity of human telomeres, Mol. Biol. Cell 14 (2003) 5060–5068. [45] Y. Liao, H.F. Zou, H.W. Wang, W.K. Zhang, B. Ma, J.S. Zhang, S.Y. Chen, Soybean GmMYB76, GmMYB92, and GmMYB177 genes confer stress tolerance in transgenic Arabidopsis plants, Cell Res. 18 (2008) 1047–1060. [46] C.B.B. Lira, J.L.D. Neto, L. Khater, T.C. Cagliari, L.A. Peroni, J.R.R. dos Reis, C.H.I. Ramos, M.I.N. Cano, LaTBP1: A Leishmania amazonensis DNA-binding protein that associates in vivo with telomeres and GT-rich DNA using a Myb-like domain, Arch. Biochem. Biophys. 465 (2007) 399–409. [47] C.A. Lu, T.H.D. Ho, S.L. Ho, S.M. Yu, Three novel MYB proteins with one DNA binding repeat mediate sugar and hormone regulation of alpha-amylase gene expression, Plant Cell 14 (2002) 1963–1980. [48] C.O. Marian, S.J. Bordoli, M. Goltz, R.A. Santarella, L.P. Jackson, O. Danilevskaya, M. Beckstette, R. Meeley, H.W. Bass, The maize Single myb histone 1 gene, Smh1, belongs to a novel gene family and encodes a protein that binds telomere DNA repeats in vitro, Plant Physiol. 133 (2003) 1336–1350. [49] B.B. Maxwell, C.R. Andersson, D.S. Poole, S.A. Kay, J. Chory, HY5, Circadian ClockAssociated 1, and a cis-element, DET1 dark response element, mediate DET1 regulation of chlorophyll a/b-binding protein 2 expression, Plant Physiol. 133 (2003) 1565–1577. [50] L. Mohrmann, A.J. Kal, C.P. Verrijzer, Characterization of the extended Myb-like DNA-binding domain of trithorax group protein zeste, J. Biol. Chem. 277 (2002) 47385–47392. [51] T. Nishikawa, H. Okamura, A. Nagadoi, P. Konig, D. Rhodes, Y. Nishimura, Solution structure of a telomeric DNA complex of human TRF1, Structure 9 (2001) 1237–1251. [52] C.W. Pitt, L.P. Valente, D. Rhodes, T. Simonsson, Identification and characterization of an essential telomeric repeat binding factor in fission yeast, J. Biol. Chem. 283 (2008) 2693–2701. [53] R. Rawat, J. Schwartz, M.A. Jones, I. Sairanen, Y.F. Cheng, C.R. Andersson, Y.D. Zhao, K. Ljung, S.L. Harmer, REVEILLE1, a Myb-like transcription factor, integrates the circadian clock and auxin pathways, Proc. Natl. Acad. Sci. U. S. A. 106 (2009) 16883–16888. [54] R. Schaffer, N. Ramsay, A. Samach, S. Corden, J. Putterill, I.A. Carre, G. Coupland, The late elongated hypocotyl mutation of Arabidopsis disrupts circadian rhythms and the photoperiodic control of flowering, Cell 93 (1998) 1219–1229. [55] G.P. Xue, A CELD-fusion method for rapid determination of the DNA-binding sequence specificity of novel plant DNA-binding proteins, Plant J. 41 (2005) 638–649. [56] J.X. Yi, M.R. Derynck, X.Y. Li, P. Telmer, F. Marsolais, S. Dhaubhadel, A singlerepeat MYB transcription factor, GmMYB176, regulates CHS8 gene expression and affects isoflavonoid biosynthesis in soybean, Plant J. 62 (2010) 1019–1034. [57] E.Y. Yu, W.F. Yen, O. Steinberg-Neifach, N.F. Lue, Rap1 in Candida albicans: an unusual structural organization and a critical function in suppressing telomere recombination, Mol. Cell. Biol. 30 (2010) 1254–1268. [58] B.J. Glover, M. Perez-Rodriguez, C. Martin, Development of several epidermal cell types can be specified by the same MYB-related plant transcription factor, Development 125 (1998) 3497–3508. [59] J. Gomez-Maldonado, C. Avila, F. de la Torre, R. Canas, F.M. Canovas, M.M. Campbell, Functional interactions between a glutamine synthetase promoter and MYB proteins, Plant J. 39 (2004) 513–526. [60] Y.K. Liang, C. Dubos, I.C. Dodd, G.H. Holroyd, A.M. Hetherington, M.M. Campbell, AtMYB61, an R2R3–MYB transcription factor controlling stomatal aperture in Arabidopsis thaliana, Curr. Biol. 15 (2005) 1201–1206. [61] C. Martin, K. Bhatt, K. Baumann, H. Jin, S. Zachgo, K. Roberts, Z. Schwarz-Sommer, B. Glover, M. Perez-Rodrigues, The mechanics of cell fate determination in petals, Philos. Trans. R. Soc. London, Ser. B-Biol. Sci. 357 (2002) 809–813. [62] A. Patzlaff, S. McInnis, A. Courtenay, C. Surman, L.J. Newman, C. Smith, M.W. Bevan, S. Mansfield, R.W. Whetten, R.R. Sederoff, M.M. Campbell, Characterisation of a pine MYB that regulates lignification, Plant J. 36 (2003) 743–754. [63] A. Patzlaff, L.J. Newman, C. Dubos, R. Whetten, C. Smith, S. McInnis, M.W. Bevan, R.R. Sederoff, M.M. Campbell, Characterisation of PtMYB1, an R2R3–MYB from pine xylem, Plant Mol. Biol. 53 (2003) 597–608. [64] J. Pazares, D. Ghosal, U. Wienand, P.A. Peterson, H. Saedler, The regulatory c1 locus of Zea mays encodes a protein with homology to myb proto-oncogene products and with structural similarities to transcriptional activators, EMBO J. 6 (1987) 3553–3558. [65] V. Kirik, M. Simon, M. Huelskamp, J. Schiefelbein, The ENHANCER OF TRY AND CPCl gene acts redundantly with TRIPTYCHON and CAPRICE in trichome and root hair cell patterning in Arabidopsis, Dev. Biol. 268 (2004) 506–513. [66] S. Schellmann, A. Schnittger, V. Kirik, T. Wada, K. Okada, A. Beermann, J. Thumfahrt, G. Jurgens, M. Hulskamp, TRIPTYCHON and CAPRICE mediate lateral inhibition during trichome and root hair patterning in Arabidopsis, EMBO J. 21 (2002) 5036–5046.

75

[67] E.L. Braun, E. Grotewold, Newly discovered plant c-myb-like genes rewrite the evolution of the plant myb gene family, Plant Physiol. 121 (1999) 21–24. [68] T. Yang, R. Perasso, A. Baroin-Tourancheau, Myb genes in ciliates: a common origin with the myb protooncogene? Protist 154 (2003) 229–238. [69] A.P. Dias, E.L. Braun, M.D. McMullen, E. Grotewold, Recently duplicated maize R2R3 Myb genes provide evidence for distinct mechanisms of evolutionary divergence after duplication, Plant Physiol. 131 (2003) 610–620. [70] C.H. Jiang, J.Y. Gu, S. Chopra, X. Gu, T. Peterson, Ordered origin of the typical twoand three-repeat Myb genes, Gene 326 (2004) 13–22. [71] H. Nakagoshi, T. Nagase, C. Kaneiishii, Y. Ueno, S. Ishii, Binding of the c-myb protooncogene product to the simian virus 40 enhancer stimulates transcription, J. Biol. Chem. 265 (1990) 3479–3483. [72] K. Weston, Extension of the DNA binding consensus of the chicken c-Myb and v-Myb proteins, Nucleic Acids Res. 20 (1992) 3042–3049. [73] E. Ording, W. Kvavik, A. Bostad, O.S. Gabrielsen, Two functionally distinct half sites in the DNA-recognition sequence of the Myb oncoprotein, Eur. J. Biochem. 222 (1994) 113–120. [74] J. Tanikawa, T. Yasukawa, M. Enari, K. Ogata, Y. Nishimura, S. Ishii, A. Sarai, Recognition of specific DNA sequences by the c-myb protooncogene product: role of three repeat units in the DNA-binding domain, Proc. Natl. Acad. Sci. U. S. A. 90 (1993) 9320–9324. [75] X.P. Ma, B. Calabretta, DNA binding and transactivation activity of A-myb, a c-mybrelated gene, Cancer Res. 54 (1994) 6512–6516. [76] G. Mizuguchi, H. Nakagoshi, T. Nagase, N. Nomura, T. Date, Y. Ueno, S. Ishii, DNA binding activity and transcriptional activator function of the human B-myb protein compared with c-MYB, J. Biol. Chem. 265 (1990) 9280–9284. [77] R.J. Watson, C. Robinson, E.W.F. Lam, Transcription regulation by murine B-myb is distinct from that by c-myb, Nucleic Acids Res. 21 (1993) 267–272. [78] M. Ito, Conservation and diversification of three-repeat Myb transcription factors in plants, J. Plant Res. 118 (2005) 61–69. [79] M. Ito, M. Iwase, H. Kodama, P. Lavisse, A. Komamine, R. Nishihama, Y. Machida, A. Watanabe, A novel cis-acting element in promoters of plant B-type cyclin genes activates M phase-specific transcription, Plant Cell 10 (1998) 331–341. [80] E. Grotewold, B.J. Drummond, B. Bowen, T. Peterson, The myb-homologous P gene controls phlobaphene pigmentation in maize floral organs by directly activating a flavonoid biosynthetic gene subset, Cell 76 (1994) 543–553. [81] I. Romero, A. Fuertes, M.J. Benito, J.M. Malpica, A. Leyva, J. Paz-Ares, More than 80R2R3–MYB regulatory genes in the genome of Arabidopsis thaliana, Plant J. 14 (1998) 273–284. [82] H.L. Jin, E. Cominelli, P. Bailey, A. Parr, F. Mehrtens, J. Jones, C. Tonelli, B. Weisshaar, C. Martin, Transcriptional repression by AtMYB4 controls production of UVprotecting sunscreens in Arabidopsis, EMBO J. 19 (2000) 6150–6161. [83] E. Moyano, J.F. MartinezGarcia, C. Martin, Apparent redundancy in Myb gene function provides gearing for the control of flavonoid biosynthesis in Antirrhinum flowers, Plant Cell 8 (1996) 1519–1532. [84] R.W.M. Sablowski, D.C. Baulcombe, M. Bevan, Expression of a flower-specific Myb protein in leaf cells using a viral vector causes ectopic activation of a target promoter, Proc. Natl. Acad. Sci. U. S. A. 92 (1995) 6901–6905. [85] R.W.M. Sablowski, E. Moyano, F.A. Culianezmacia, W. Schuch, C. Martin, M. Bevan, A flower-specific Myb protein activates transcription of phenylpropanoid biosynthetic genes, EMBO J. 13 (1994) 128–137. [86] M.B. Sainz, E. Grotewold, V.L. Chandler, Evidence for direct activation of an anthocyanin promoter by the maize C1 protein and comparison of DNA binding by related Myb domain proteins, Plant Cell 9 (1997) 611–625. [87] K. Sugimoto, S. Takeda, H. Hirochika, MYB-related transcription factor NtMYB2 induced by wounding and elicitors is a regulator of the tobacco retrotransposon Tto1 and defense-related genes, Plant Cell 12 (2000) 2511–2527. [88] L. Tamagnone, A. Merida, A. Parr, S. Mackay, F.A. Culianez-Macia, K. Roberts, C. Martin, The AmMYB308 and AmMYB330 transcription factors from antirrhinum regulate phenylpropanoid and lignin biosynthesis in transgenic tobacco, Plant Cell 10 (1998) 135–154. [89] A. Uimari, J. Strommer, Myb26: a MYB-like protein of pea flowers with affinity for promoters of phenylpropanoid genes, Plant J. 12 (1997) 1273–1284. [90] S.C. Yang, J.P. Sweetman, S. Amirsadeghi, M. Barghchi, A.K. Huttly, W.I. Chung, D. Twell, Novel anther-specific myb genes from tobacco as putative regulators of phenylalanine ammonia-lyase expression, Plant Physiol. 126 (2001) 1738–1753. [91] K. Aya, M. Ueguchi-Tanaka, M. Kondo, K. Hamada, K. Yano, M. Nishimura, M. Matsuoka, Gibberellin modulates anther development in rice via the transcriptional regulation of GAMYB, Plant Cell 21 (2009) 1453–1472. [92] F. Gubler, R. Kalla, J.K. Roberts, J.V. Jacobsen, Gibberellin-regulated expression of a myb gene in barley aleurone cells: evidence for Myb transactivation of a high-pI alpha-amylase gene promoter, Plant Cell 7 (1995) 1879–1891. [93] G.F. Heine, V. Malik, A.P. Dias, E. Grotewold, Expression and molecular characterization of ZmMYB-IF35 and related R2R3–MYB transcription factors, Mol. Biotechnol. 37 (2007) 155–164. [94] M. Ito, S. Araki, S. Matsunaga, T. Itoh, R. Nishihama, Y. Machida, J.H. Doonan, A. Watanabe, G2/M-phase-specific transcription during the plant cell cycle is mediated by c-Myb-like transcription factors, Plant Cell 13 (2001) 1891–1905. [95] Y. Koshino-Kimura, T. Wada, T. Tachibana, R. Tsugeki, S. Ishiguro, K. Okada, Regulation of CAPRICE transcription by MYB proteins for root epidermis differentiation in Arabidopsis, Plant Cell Physiol. 46 (2005) 817–826. [96] S. Legay, E. Lacombe, M. Goicoechea, C. Briere, A. Seguin, J. Mackay, J. Grima-Pettenati, Molecular characterization of EgMYB1, a putative transcriptional repressor of the lignin biosynthetic pathway, Plant Sci. 173 (2007) 542–549.

76

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77

[97] S.F. Li, R.W. Parish, Isolation of two novel myb-like genes from Arabidopsis and studies on the DNA-binding properties of their products, Plant J. 8 (1995) 963–972. [98] G.Y. Liu, G. Ren, A. Guirgis, R.W. Thornburg, The MYB305 transcription factor regulates expression of nectarin genes in the ornamental tobacco floral nectary, Plant Cell 21 (2009) 2672–2687. [99] R.D. Mellway, L.T. Tran, M.B. Prouse, M.M. Campbell, C.P. Constabel, The wound-, pathogen-, and ultraviolet B-responsive MYB134 gene encodes an R2R3 MYB transcription factor that regulates proanthocyanidin synthesis in poplar, Plant Physiol. 150 (2009) 924–941. [100] J.A. Punwani, D.S. Rabiger, G.N. Drews, MYB98 positively regulates a battery of synergid-expressed genes encoding filiform apparatus-localized proteins, Plant Cell 19 (2007) 2557–2568. [101] R.G. Ramsay, S. Ishii, T.J. Gonda, Increase in specific DNA binding by carboxyl truncation suggests a mechanism for activation of Myb, Oncogene 6 (1991) 1875–1879. [102] A. Rose, I. Meier, U. Wienand, The tomato I-box binding factor LeMYBI is a member of a novel class of Myb-like proteins, Plant J. 20 (1999) 641–652. [103] R. Solano, A. Fuertes, L. SanchezPulido, A. Valencia, J. PazAres, A single residue substitution causes a switch from the dual DNA binding specificity of plant transcription factor MYB.Ph3 to the animal c-MYB specificity, J. Biol. Chem. 272 (1997) 2889–2895. [104] A. Suzuki, C.Y. Wu, H. Washida, F. Takaiwa, Rice MYB protein OSMYB5 specifically binds to the AACA motif conserved among promoters of genes for storage protein glutelin, Plant Cell Physiol. 39 (1998) 555–559. [105] T. Urao, K. Yamaguchishinozaki, S. Urao, K. Shinozaki, An Arabidopsis myb homolog is induced by dehydration stress and its gene product binds to the conserved MYB recognition sequence, Plant Cell 5 (1993) 1529–1539. [106] Z.Y. Wang, E.M. Tobin, Constitutive expression of the CIRCADIAN CLOCK ASSOCIATED 1 (CCA1) gene disrupts circadian rhythms and suppresses its own expression, Cell 93 (1998) 1207–1217. [107] Y.O. Yang, D.F. Klessig, Isolation and characterization of a tobacco mosaic virusinducible myb oncogene homolog from tobacco, Proc. Natl. Acad. Sci. U. S. A. 93 (1996) 14972–14977. [108] T. Bilaud, C.E. Koering, E. BinetBrasselet, K. Ancelin, A. Pollice, S.M. Gasser, E. Gilson, The telobox, a Myb-related telomeric DNA binding motif found in proteins from yeast, plants and human, Nucleic Acids Res. 24 (1996) 1294–1303. [109] A. Klug, J.W.R. Schwabe, Protein motifs 5. Zinc fingers, FASEB J. 9 (1995) 597–604. [110] R. Treisman, R. Marais, J. Wynne, Spatial flexibility in ternary complexes between SRF and its accessory proteins, EMBO J. 11 (1992) 4631–4640. [111] K. Ogata, S. Morikawa, H. Nakamura, H. Hojo, S. Yoshimura, R.H. Zhang, S. Aimoto, Y. Ametani, Z. Hirata, A. Sarai, S. Ishii, Y. Nishimura, Comparison of the free and DNA-complexed forms of the DNA-binding domain from c-Myb, Nat. Struct. Biol. 2 (1995) 309–320. [112] T.H. Tahirov, M. Sasaki, T. Inoue-Bungo, A. Fujikawa, K. Sato, T. Kumasaka, M. Yamamoto, K. Ogata, Crystals of ternary protein–DNA complexes composed of DNA-binding domains of c-Myb or v-Myb, C/EBP alpha or C/EBP beta and tom1A promoter fragment, Acta Crystallogr., Sect. D: Biol. Crystallogr. 57 (2001) 1655–1658. [113] T.H. Tahirov, K. Sato, E. Ichikawa-Iwata, M. Sasaki, T. Inoue-Bungo, M. Shiina, K. Kimura, S. Takata, A. Fujikawa, H. Morii, T. Kumasaka, M. Yamamoto, S. Ishii, K. Ogata, Mechanism of c-Myb-C/EBP beta cooperation from separated sites on a promoter, Cell 108 (2002) 57–70. [114] I.A. Anton, J. Frampton, Tryptophans in myb proteins, Nature 336 (1988) 719-719. [115] J. Frampton, T.J. Gibson, S.A. Ness, G. Doderlein, T. Graf, Proposed structure for the DNA-binding domain of the Myb oncoprotein based on model building and mutational analysis, Protein Eng. 4 (1991) 891–901. [116] H. Sakura, K.I. Chie, T. Nagase, H. Nakagoshi, T.J. Gonda, S. Ishii, Delineation of three functional domains of the transcriptional activator encoded by the c-myb protooncogene, Proc. Natl. Acad. Sci. U. S. A. 86 (1989) 5758–5762. [117] L. Fairall, J.W.R. Schwabe, L. Chapman, J.T. Finch, D. Rhodes, The crystal structure of a two zinc-finger peptide reveals an extension to the rules for zinc-finger/DNA recognition, Nature 366 (1993) 483–487. [118] N.P. Pavletich, C.O. Pabo, Zinc finger-DNA recognition: crystal structure of a Zif268-DNA complex at 2.1 A, Science 252 (1991) 809–817. [119] N.P. Pavletich, C.O. Pabo, Crystal structure of a five-finger GLI-DNA complex: new perspectives on zinc fingers, Science 261 (1993) 1701–1707. [120] P. Saikumar, R. Murali, E.P. Reddy, Role of tryptophan repeats and flanking amino acids in MYB–DNA interactions, Proc. Natl. Acad. Sci. U. S. A. 87 (1990) 8452–8456. [121] G.F. Heine, J.M. Hernandez, E. Grotewold, Two cysteines in plant R2R3 MYB domains participate in REDOX-dependent DNA binding, J. Biol. Chem. 279 (2004) 37878–37885. [122] F.A. Graesser, K. Lamontagne, L. Whittaker, S. Stohr, J.S. Lipsick, A highly conserved cysteine in the v-Myb DNA-binding domain is essential for transformation and transcriptional trans-activation, Oncogene 7 (1992) 1005–1009. [123] S. Guehmann, G. Vorbrueggen, F. Kalkbrenner, K. Moelling, Reduction of a conserved Cys is essential for Myb DNA-binding, Nucleic Acids Res. 20 (1992) 2279–2286. [124] K. Melcher, A modular set of prokaryotic and eukaryotic expression vectors, Anal. Biochem. 277 (2000) 109–120. [125] A. Ebneth, O. Schweers, H. Thole, U. Fagin, C. Urbanke, G. Maass, H. Wolfes, Biophysical characterization of the c-Myb DNA-binding domain, Biochemistry 33 (1994) 14586–14593. [126] M. Oda, K. Furukawa, A. Sarai, H. Nakamura, Kinetic analysis of DNA binding by the c-Myb DNA-binding domain using surface plasmon resonance, FEBS Lett. 454 (1999) 288–292.

[127] S. Bergholtz, T.O. Andersen, K.B. Andersson, J. Borrebaek, B. Luscher, O.S. Gabrielsen, The highly conserved DNA-binding domains of A-, B- and c-Myb differ with respect to DNA-binding, phosphorylation and redox properties, Nucleic Acids Res. 29 (2001) 3546–3556. [128] S.E. Ades, R.T. Sauer, Differential DNA-binding specificity of the engrailed homeodomain: the role of residue 50, Biochemistry 33 (1994) 9187–9194. [129] M. Affolter, A. Percivalsmith, M. Muller, W. Leupin, W.J. Gehring, DNA binding properties of the purified Antennapedia homeodomain, Proc. Natl. Acad. Sci. U. S. A. 87 (1990) 4093–4097. [130] J.H. Carra, P.L. Privalov, Energetics of folding and DNA binding of the MAT alpha 2 homeodomain, Biochemistry 36 (1997) 526–535. [131] B. Florence, R. Handrow, A. Laughon, DNA-binding specificity of the fushi tarazu homeodomain, Mol. Cell. Biol. 11 (1991) 3613–3623. [132] A. Bianchi, S. Smith, L. Chong, P. Elias, T. deLange, TRF1 is a dimer and bends telomeric DNA, EMBO J. 16 (1997) 1785–1794. [133] I.M. Zimmermann, M.A. Heim, B. Weisshaar, J.F. Uhrig, Comprehensive identification of Arabidopsis thaliana MYB transcription factors interacting with R/B-like BHLH proteins, Plant J. 40 (2004) 22–34. [134] J. Avila, C. Nieto, L. Canas, M.J. Benito, J. Pazares, Petunia hybrida genes related to the maize regulatory C1 gene and to animal myb proto-oncogenes, Plant J. 3 (1993) 553–562. [135] R. Solano, C. Nieto, J. Pazares, MYB. Ph3 transcription factor from Petunia hybrida induces similar DNA-bending/distortions on its two types of binding site, Plant J. 8 (1995) 673–682. [136] C.E. Williams, E. Grotewold, Differences between plant and animal myb domains are fundamental for DNA binding activity, and chimeric Myb domains have novel DNA binding specificities, J. Biol. Chem. 272 (1997) 563–571. [137] V. Serpa, J. Vernal, L. Lamattina, E. Grotewold, R. Cassia, H. Terenzi, Inhibition of AtMYB2 DNA-binding by nitric oxide involves cysteine S-nitrosylation, Biochem. Biophys. Res. Commun. 361 (2007) 1048–1053. [138] S.H. Meijsing, M.A. Pufall, A.Y. So, D.L. Bates, L. Chen, K.R. Yamamoto, DNA binding site sequence directs glucocorticoid receptor structure and activity, Science 324 (2009) 407–410. [139] J. Jackson, G. Ramsay, N.V. Sharkov, E. Lium, A.L. Katzen, The role of transcriptional activation in the function of the Drosophila myb gene, Blood Cells Mol. Dis. 27 (2001) 446–455. [140] W.E. Wright, M. Binder, W. Funk, Cyclic amplification and selection of targets (CASTing) for the myogenin consensus binding site, Mol. Cell. Biol. 11 (1991) 4104–4110. [141] P.J. Roche, S.A. Hoare, M.G. Parker, A consensus DNA-binding site for the androgen receptor, Mol. Endocrinol. 6 (1992) 2229–2235. [142] M.F. Berger, A.A. Philippakis, A.M. Qureshi, F.X.S. He, P.W. Estep, M.L. Bulyk, Compact, universal DNA microarrays to comprehensively determine transcriptionfactor binding site specificities, Nat. Biotechnol. 24 (2006) 1429–1435. [143] M.J. Kim, T.H. Lee, Y.M. Pahk, Y.H. Kim, H.M. Park, Y.D. Choi, B.H. Nahm, Y.K. Kim, Quadruple 9-mer-based protein binding microarray with DsRed fusion protein, BMC Mol. Biol. 10 (2009) 11. [144] S. Mukherjee, M.F. Berger, G. Jona, X.S. Wang, D. Muzzey, M. Snyder, R.A. Young, M.L. Bulyk, Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays, Nat. Genet. 36 (2004) 1331–1339. [145] S.Y. Seong, C.Y. Choi, Current status of protein chip development in terms of fabrication and application, Proteomics 3 (2003) 2176–2189. [146] G. Badis, M.F. Berger, A.A. Philippakis, S. Talukder, A.R. Gehrke, S.A. Jaeger, E.T. Chan, G. Metzler, A. Vedenko, X.Y. Chen, H. Kuznetsov, C.F. Wang, D. Coburn, D.E. Newburger, Q. Morris, T.R. Hughes, M.L. Bulyk, Diversity and complexity in DNA recognition by transcription factors, Science 324 (2009) 1720–1723. [147] C.A. Grove, F. De Masi, M.I. Barrasa, D.E. Newburger, M.J. Alkema, M.L. Bulyk, A.J.M. Walhout, A multiparameter network reveals extensive divergence between C. elegans bHLH transcription factors, Cell 138 (2009) 314–327. [148] M. Godoy, J.M. Franco-Zorrilla, J. Pérez-Pérez, J.C. Oliveros, Ó. Lorenzo, R. Solano, Improved protein-binding microarrays for the identification of DNA-binding specificities of transcription factors, Plant J. 66 (2011) 700–711. [149] J. Linnell, R. Mott, S. Field, D.P. Kwiatkowski, J. Ragoussis, I.A. Udalova, Quantitative high-throughput analysis of transcription factor binding specificities, Nucleic Acids Res. 32 (2004) 7. [150] W. Gong, K. He, M. Covington, S.P. Dinesh-Kumar, M. Snyder, S.L. Harmer, Y.X. Zhu, X.W. Deng, The development of protein microarrays and their applications in DNA-protein and protein–protein interaction analyses of Arabidopsis transcription factors, Mol. Plant. 1 (2008) 27–41. [151] K. Barbulescu, C. Geserick, I. Schuttke, W.D. Schleuning, B. Haendler, New androgen response elements in the murine Pem promoter mediate selective transactivation, Mol. Endocrinol. 15 (2001) 1803–1816. [152] G. Verrijdt, A. Haelens, F. Claessens, Selective DNA recognition by the androgen receptor as a mechanism for hormone-specific regulation of gene expression, Mol. Genet. Metab. 78 (2003) 175–185. [153] Z.D. Xie, E. Lee, J.R. Lucas, K. Morohashi, D.M. Li, J.A.H. Murray, F.D. Sack, E. Grotewold, Regulation of cell proliferation in the stomatal lineage by the Arabidopsis MYB FOUR LIPS via direct targeting of core cell cycle genes, Plant Cell 22 (2010) 2306–2321. [154] F. Mehrtens, H. Kranz, P. Bednarek, B. Weisshaar, The Arabidopsis transcription factor MYB12 is a flavonol-specific regulator of phenylpropanoid biosynthesis, Plant Physiol. 138 (2005) 1083–1096. [155] R. Stracke, H. Ishihara, G.H.A. Barsch, F. Mehrtens, K. Niehaus, B. Weisshaar, Differential regulation of closely related R2R3–MYB transcription factors controls flavonol accumulation in different parts of the Arabidopsis thaliana seedling, Plant J. 50 (2007) 660–677.

M.B. Prouse, M.M. Campbell / Biochimica et Biophysica Acta 1819 (2012) 67–77 [156] C.E. Massie, I.G. Mills, ChIPping away at gene regulation, EMBO Rep. 9 (2008) 337–343. [157] M.J. Solomon, P.L. Larsen, A. Varshavsky, Mapping protein–DNA interactions in vivo with formaldehyde: evidence that histone H4 is retained on a highly transcribed gene, Cell 53 (1988) 937–947. [158] T. Berge, V. Matre, E.M. Brendeford, T. Saether, B. Luscher, O.S. Gabrielsen, Revisiting a selection of target genes for the hematopoietic transcription factor c-Myb using chromatin immunoprecipitation and c-Myb knockdown, Blood Cells Mol. Dis. 39 (2007) 278–286. [159] S. Fornale, X.H. Shi, C.L. Chai, A. Encina, S. Irar, M. Capellades, E. Fuguet, J.L. Torres, P. Rovira, P. Puigdomenech, J. Rigau, E. Grotewold, J. Gray, D. CaparrosRuiz, ZmMYB31 directly represses maize lignin genes and redirects the phenylpropanoid metabolic flux, Plant J. 64 (2010) 633–644. [160] D. Georlette, S. Ahn, D.M. MacAlpine, E. Cheung, P.W. Lewis, E.L. Beall, S.P. Bell, T. Speed, J.R. Manak, M.R. Botchan, Genomic profiling and expression studies reveal both positive and negative activities for the Drosophila Myb-MuvB/dREAM complex in proliferating cells, Genes Dev. 21 (2007) 2880–2896. [161] Y. Hara, Y. Onishi, K. Oishi, K. Miyazaki, A. Fukamizu, N. Ishida, Molecular characterization of Mybbp1a as a co-repressor on the Period2 promoter, Nucleic Acids Res. 37 (2009) 1115–1126. [162] K. Morohashi, E. Grotewold, A systems approach reveals regulatory circuitry for Arabidopsis trichome initiation by the GL3 and GL1 selectors, PLoS Genet. 5 (2009) 17. [163] Q.F. Wang, J. Lauring, M.S. Schlissel, c-Myb binds to a sequence in the proximal region of the RAG-2 promoter and is essential for promoter activity in T-lineage cells, Mol. Cell. Biol. 20 (2000) 9203–9211. [164] R.W.M. Sablowski, E.M. Meyerowitz, A homolog of NO APICAL MERISTEM is an immediate target of the floral homeotic genes APETALA3/PISTILLATA, Cell 92 (1998) 93–103. [165] A. Samach, H. Onouchi, S.E. Gold, G.S. Ditta, Z. Schwarz-Sommer, M.F. Yanofsky, G. Coupland, Distinct roles of CONSTANS target genes in reproductive development of Arabidopsis, Science 288 (2000) 1613–1616. [166] D. Wagner, R.W.M. Sablowski, E.M. Meyerowitz, Transcriptional activation of APETALA1 by LEAFY, Science 285 (1999) 582–584. [167] H.A. Phan, S. Iacuone, S.F. Li, R.W. Parish, The MYB80 transcription factor is required for pollen development and the regulation of tapetal programmed cell death in Arabidopsis thaliana, Plant Cell 23 (2011) 2209–2224. [168] J. Gaudet, S.E. Mango, Regulation of organogenesis by the Caenorhabditis elegans, FoxA protein PHA-41, Science 295 (2002) 821–825. [169] J. Gaudet, S. Muttumu, M. Horner, S.E. Mango, Whole-genome analysis of temporal gene expression during foregut development, PLoS Biol. 2 (2004) 1828–1842. [170] M. Zhong, W. Niu, Z.J. Lu, M. Sarov, J.I. Murray, J. Janette, D. Raha, K.L. Sheaffer, H.Y.K. Lam, E. Preston, C. Slightham, L.W. Hillier, T. Brock, A. Agarwal, R. Auerbach, A.A. Hyman, M. Gerstein, S.E. Mango, S.K. Kim, R.H. Waterston, V. Reinke, M. Snyder, Genome-wide identification of binding sites defines distinct functions for Caenorhabditis elegans PHA-4/FOXA in development and environmental Response, PLoS Genet. 6 (2010) 13. [171] G. Robertson, M. Hirst, M. Bainbridge, M. Bilenky, Y.J. Zhao, T. Zeng, G. Euskirchen, B. Bernier, R. Varhol, A. Delaney, N. Thiessen, O.L. Griffith, A. He, M. Marra, M. Snyder, S. Jones, Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing, Nat. Methods 4 (2007) 651–657. [172] C.R. Vakoc, D.L. Letting, N. Gheldof, T. Sawado, M.A. Bender, M. Groudine, M.J. Weiss, J. Dekker, G.A. Blobel, Proximity amona distant reaulatory elements at the beta-globin locus requires GATA-1 and FOG-1, Mol. Cell 17 (2005) 453–462. [173] G. Stenman, M.K. Andersson, Y. Andren, New tricks from an old oncogene Gene fusion and copy number alterations of MYB in human cancer, Cell Cycle 9 (2010) 2986–2995. [174] C. Vicente, A. Conchillo, D. Pauwels, I. Vazquez, L. Garcia-Orti, M.J. Calasanz, I. Lahortiga, J. Cools, M.D. Odero, MYB overexpression is directly involved in acute myeloid leukemia pathogenesis and could constitute a new therapeutic target for patients with aberrant expression of this gene, Blood 114 (2009) 948-948. [175] J. Gertz, L. Riles, P. Turnbaugh, S.W. Ho, B.A. Cohen, Discovery, validation, and genetic dissection of transcription factor binding sites by comparative and functional genomics, Genome Res. 15 (2005) 1145–1152. [176] T. Vavouri, G. Elgar, Prediction of cis-regulatory elements using binding site matrices — the successes, the failures and the reasons for both, Curr. Opin. Genet. Dev. 15 (2005) 395–402. [177] E.R. Mardis, ChIP-seq: welcome to the new frontier, Nat. Methods 4 (2007) 613–614. [178] R.P. Huang, Protein arrays, an excellent tool in biomedical research, Front. Biosci. 8 (2003) D559–D576. [179] K. Katoh, K. Kuma, H. Toh, T. Miyata, MAFFT version 5: improvement in accuracy of multiple sequence alignment, Nucleic Acids Res. 33 (2005) 511–518. [180] K. Tamura, J. Dudley, M. Nei, S. Kumar, MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0, Mol. Biol. Evol. 24 (2007) 1596–1599. [181] E.L. Beall, J.R. Manak, S. Zhou, M. Bell, J.S. Lipsick, M.R. Botchan, Role for a Drosophila Myb-containing protein complex in site-specific DNA replication, Nature 420 (2002) 833–837.

77

[182] M. Borg, L. Brownfield, H. Khatab, A. Sidorova, M. Lingaya, D. Twell, The R2R3 MYB transcription factor DUO1 activates a male germline-specific regulon essential for sperm cell differentiation in Arabidopsis, Plant Cell 23 (2011) 534–549. [183] P.W. Chen, C.M. Chiang, T.H. Tseng, S.M. Yu, Interaction between rice MYBGA and the gibberellin response element controls tissue-specific sugar sensitivity of alpha-amylase genes, Plant Cell 18 (2006) 2326–2340. [184] I. Galis, P. Simek, T. Narisawa, M. Sasaki, T. Horiguchi, H. Fukuda, K. Matsuoka, A novel R2R3 MYB transcription factor NtMYBJS1 is a methyl jasmonate-dependent regulator of phenylpropanoid-conjugate biosynthesis in tobacco, Plant J. 46 (2006) 573–592. [185] M. Goicoechea, E. Lacombe, S. Legay, S. Mihaljevic, P. Rech, A. Jauneau, C. Lapierre, B. Pollet, D. Verhaegen, N. Chaubet-Gigot, J. Grima-Pettenati, EgMYB2, a new transcriptional activator from Eucalyptus xylem, regulates secondary cell wall formation and lignin biosynthesis, Plant J. 43 (2005) 553–567. [186] T. Hirayama, K. Shinozaki, A cdc5(+) homolog of a higher plant, Arabidopsis thaliana, Proc. Natl. Acad. Sci. U. S. A. 93 (1996) 13371–13376. [187] F.U. Hoeren, R. Dolferus, Y.R. Wu, W.J. Peacock, E.S. Dennis, Evidence for a role for AtMYB2 in the induction of the Arabidopsis alcohol dehydrogenase gene (ADH1) by low oxygen, Genetics 149 (1998) 479–490. [188] Y.C. Huang, L.H. Su, G.A. Lee, P.W. Chiu, C.C. Cho, J.Y. Wu, C.H. Sun, Regulation of cyst wall protein promoters by Myb2 in Giardia lamblia, J. Biol. Chem. 283 (2008) 31021–31029. [189] R.F. Lascaris, W.H. Mager, R.J. Planta, DNA-binding requirements of the yeast protein Rap1p as selected in silico from ribosomal protein gene promoter sequences, Bioinformatics 15 (1999) 267–277. [190] J.D. Lieb, X.L. Liu, D. Botstein, P.O. Brown, Promoter-specific binding of Rap1 revealed by genome-wide maps of protein–DNA association, Nat. Genet. 28 (2001) 327–334. [191] K. Maeda, S. Kimura, T. Demura, J. Takeda, Y. Ozeki, DcMYB1 acts as a transcriptional activator of the carrot phenylalanine ammonia-lyase gene (DcPAL1) in response to elicitor treatment, UV-B irradiation and the dilution effect, Plant Mol. Biol. 59 (2005) 739–752. [192] E. Meneses, H. Cardenas, S. Zarate, L.G. Brieba, E. Orozco, C. Lopez-Camarillo, E. Azuara-Liceaga, The R2R3 Myb protein family in Entamoeba histolytica, Gene 455 (2010) 32–42. [193] S.J. Ong, H.M. Hsu, H.W. Liu, C.H. Chu, J.H. Tai, Multifarious transcriptional regulation of adhesion protein gene ap65-1 by a novel Myb1 protein in the protozoan parasite Trichomonas vaginalis, Eukaryot. Cell 5 (2006) 391–399. [194] S.J. Ong, H.M. Hsu, H.W. Liu, C.H. Chu, J.H. Tai, Activation of multifarious transcription of an adhesion protein ap65-1 gene by a novel Myb2 protein in the protozoan parasite Trichomonas vaginalis, J. Biol. Chem. 282 (2007) 6716–6725. [195] D.G. Oppenheimer, P.L. Herman, S. Sivakumaran, J. Esch, M.D. Marks, A myb gene required for leaf trichome differentiation in Arabidopsis is expressed in stipules, Cell 67 (1991) 483–493. [196] J.A. Punwani, D.S. Rabiger, A. Lloyd, G.N. Drews, The MYB98 subcircuit of the synergid gene regulatory network includes genes directly and indirectly regulated by MYB98, Plant J. 55 (2008) 406–414. [197] V. Ramirez, A. Agorio, A. Coego, J. Garcia-Andrade, M.J. Hernandez, B. Balaguer, P.B.F. Ouwerkerk, I. Zarra, P. Vera, MYB46 modulates disease susceptibility to Botrytis cinerea in Arabidopsis, Plant Physiol. 155 (2011) 1920–1935. [198] K.H. Ryu, Y.H. Kang, Y.H. Park, D. Hwang, J. Schiefelbein, M.M. Lee, The WEREWOLF MYB protein directly regulates CAPRICE transcription during cell fate specification in the Arabidopsis root epidermis, Development 132 (2005) 4765–4775. [199] U. Stobergrasser, B. Brydolf, X. Bin, F. Grasser, R.A. Firtel, J.S. Lipsick, The Myb DNA-binding domain is highly conserved in Dictyostelium discoideum, Oncogene 7 (1992) 589–596. [200] C.H. Sun, D. Palm, A.G. McArthur, S.G. Svard, F.D. Gillin, A novel Myb-related protein involved in transcriptional activation of encystation genes in Giardia lamblia, Mol. Microbiol. 46 (2002) 971–984. [201] A. Telfer, K.M. Bollman, R.S. Poethig, Phase change and the regulation of trichome distribution in Arabidopsis thaliana, Development 124 (1997) 645–654. [202] Q.J. Xiang, H.S. Judelson, Myb transcription factors in the oomycete Phytophthora with novel diversified DNA-binding domains and developmental stage-specific expression, Gene 453 (2010) 1–8. [203] H. Yang, H.J. Chung, T. Yong, B.H. Lee, S. Park, Identification of an encystationspecific transcription factor, Myb protein in Giardia lamblia, Mol. Biochem. Parasitol. 128 (2003) 167–174. [204] R.Q. Zhong, C.H. Lee, J.L. Zhou, R.L. McCarthy, Z.H. Ye, A battery of transcription factors involved in the regulation of secondary cell wall biosynthesis in Arabidopsis, Plant Cell 20 (2008) 2763–2782. [205] J.L. Zhou, C.H. Lee, R.Q. Zhong, Z.H. Ye, MYB58 and MYB63 are transcriptional activators of the lignin biosynthetic pathway during secondary cell wall formation in Arabidopsis, Plant Cell 21 (2009) 248–266.