STATs Shape the Active Enhancer Landscape of T Cell Populations

STATs Shape the Active Enhancer Landscape of T Cell Populations

STATs Shape the Active Enhancer Landscape of T Cell Populations Golnaz Vahedi,1 Hayato Takahashi,1 Shingo Nakayamada,1 Hong-wei Sun,2 Vittorio Sartore...

2MB Sizes 8 Downloads 61 Views

STATs Shape the Active Enhancer Landscape of T Cell Populations Golnaz Vahedi,1 Hayato Takahashi,1 Shingo Nakayamada,1 Hong-wei Sun,2 Vittorio Sartorelli,3 Yuka Kanno,1,4,* and John J. O’Shea1,4 1Lymphocyte

Cell Biology Section, Molecular Immunology and Inflammation Branch Mining and Discovery Section 3Laboratory of Muscle Stem Cells and Gene Regulation National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD 20892-1930, USA 4These authors contributed equally to this work *Correspondence: [email protected] http://dx.doi.org/10.1016/j.cell.2012.09.044 2Biodata

SUMMARY

Signaling pathways are intimately involved in cellular differentiation, allowing cells to respond to their environment by regulating gene expression. Although enhancers are recognized as key elements that regulate selective gene expression, the interplay between signaling pathways and actively used enhancer elements is not clear. Here, we use CD4+ T cells as a model of differentiation, mapping the activity of cell-type-specific enhancer elements in T helper 1 (Th1) and Th2 cells. Our data establish that STAT proteins have a major impact on the activation of lineage-specific enhancers and the suppression of enhancers associated with alternative cell fates. Transcriptome analysis further supports a functional role for enhancers regulated by STATs. Importantly, expression of lineage-defining master regulators in STAT-deficient cells fails to fully recover the chromatin signature of STAT-dependent enhancers. Thus, these findings point to a critical role of STATs as environmental sensors in dynamically molding the specialized enhancer architecture of differentiating cells.

INTRODUCTION How the extracellular environment coordinates gene transcription remains a central and largely unanswered question in biology. In bacteria, coordination of gene expression is resolved by the linear organization of the operon, a genetic entity in which adjacent units are transcribed by a single regulatory region (Jacob and Monod, 1961). In metazoans, genes are regulated by the juxtaposition of promoters with enhancer regulatory regions. The latter can be located at remote distances from the transcribed units with the interactions being achieved through dynamic, long-range physical interactions. Such enhancer elements are likely to be a primary determinant of cell type specificity (Bulger and Groudine, 2011). In spite of their functional

relevance, it has proven difficult to unambiguously locate enhancers. Only recently, chromatin signatures have been identified that allow genome-wide enumeration of cis-regulatory regions with enhancer properties. Specifically, the monomethylation of histone H3 lysine 4 (H3K4me1) signature is considered as the permissive enhancer signature (Heintzman et al., 2009). Other marks in combination with H4K4me1 signature have been subsequently used to differentiate active enhancer elements. These include the binding of acetyltransferase p300 (Visel et al., 2009) or deposition of H3K27ac (Creyghton et al., 2010; Rada-Iglesias et al., 2011). The predictive ability of p300-based active enhancer signature has been tested by using a large series of reporter transgenic mice. In almost all cases, reproducible enhancer activity correlated with the tissue-specific p300 binding (Visel et al., 2009). In a more recent study, human cardiac enhancers have been identified by using a similar approach (May et al., 2011). Mapping of p300 binding allows a refinement of the enhancer landscapes as p300 peaks offer a more discrete definition than other histone modifications, leading to more precise localization of enhancers (Smale, 2010). Although p300 binding constitutes a substantial portion of histone acetyltransferase activity found in the cells, other factors may also contribute to active enhancer landscapes (Krebs et al., 2011). The ability to profile active enhancers on a genome-wide scale raises a number of questions. Studies on differentiation of biological structures such as mammalian nervous system or immune processes explored how signaling pathways allow cells to respond to environment by regulating global gene expression patterns (Miller and Gauthier, 2007; O’Shea and Paul, 2010). However, the interplay between environment and active enhancer landscapes remains poorly understood. In particular, the contribution of exogenous inductive signals that sense the environment and endogenous tissue-specific transcription factors to the establishment of active enhancer repertoire is not clear. Here, we chose CD4+ T cells as a model of differentiation and investigated the formation and maintenance of genome-wide enhancer signatures by using H3K4me1 and p300 in two distinct T helper cell populations, Th1 and Th2 cells. T cell differentiation is a multistep process, which, through a series of progressive Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 981

63kbp

A

CNS Il4

Il13

Rad50

110

p300 Th1

2

H3K4me1 Th1

110

p300 Th2

2

H3K4me1 Th2 Previously known enhancers

B

C

p300 Th1

promoter intergenic intragenic

p300 Th1

p300 Th2

7,089 (35%)

9,180 (45%)

p300 Th2

11,258

11,988 (46%)

3,001 ((11%))

10,590 (46%)

10,565 (42%)

2,675 ((12%))

1,498 89

9,269 (41%)

8,157 (67%)

8,903 (74%)

p300 ES

E

F

300 200 100

100 50

Th1SpecGenes

HouseKeepingGenes

Th2SpecGenes

HouseKeepingGenes

Th1SpecGenes

**

p<1e-9

0

100

200

300

400

Tag-per-million 20 40 60

NS

HouseKeepingGenes

p300 in Th2 500

H3K4me3 in Th2

0

4

6

RPKM 8 10

12

Expression in Th2

** p<1e-16

0

0

HouseKeepingGenes

**

p<1e-14

400

250 200 150

8 6 4

Th1SpecGenes

p300 in Th1 500

NS

Tag-per-million

300

H3K4me3 in Th1 Tag-per-million

** p<1e-16

RPKM

10

Expression in Th1

Tag-per-million

D

p300 MØ

Th2SpecGenes

HouseKeepingGenes

Th2SpecGenes

HouseKeepingGenes

Figure 1. Active Enhancer Landscapes in Th1 and Th2 Cells Are Distinct (A) Chromatin signatures as defined by p300 binding and H3K4me1 map recognized and other putative enhancers in the Il4-Il13 locus. The Il4-Il13 gene track represents 13 p300 binding sites within H3K4me1 domains in Th2 cells, including eight known elements (orange triangles) (Table S1B). ‘‘CNS’’ lane shows conserved noncoding sequences. (B) Genomic distribution of p300-bound elements in Th1 (total 25,554) and Th2 (total 22,534) cells at promoter ( 4 kbp to +500 bp of transcriptional start site [TSS]), intergenic (>4 kbp TSS), and intragenic regions (+500 bp of TSS to transcription end site [TES]). (C) T helper subsets have thousands of unique p300 binding sites, but almost none are shared among T cells, macrophages, and ES cells. Venn diagram depicts the number and percentages of shared and unique p300 binding sites in each cell type. p300 binding in ES cells and macrophages is from Creyghton et al. (2010) and Ghisletti et al. (2010).

982 Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc.

and alternative choices, generates different cell populations dedicated to specific aspects of host defense. This process represents the integration of extrinsic cues sensed by signal transducer and activator of transcription (STATs) proteins and the induction of intrinsic master regulator transcription factors. Our genome-wide profiling of enhancers reveals that STAT proteins are pervasive effectors of active enhancer landscapes. Importantly, expression of endogenous master regulators in STAT-deficient cells fails to fully re-establish landscapes of active enhancers. In this manner, the ability of STATs to sense environmental signals is indispensable in the control of active enhancer elements and, consequently, alternative gene expression programs essential for specialized cells. RESULTS Chromatin Signature Maps Previously Recognized and Putative Enhancers in T Cells To begin to understand the breadth of the active enhancer repertoires in differentiating CD4+ T cells, we mapped global binding of p300, H3K4me1, and H3K4me3 histone modifications in Th1 and Th2 cells. On a genome-wide scale, we identified 25,554 statistically significant p300 peaks in Th1 and 22,534 peaks in Th2 cells. Biological repeat experiments showed the reproducibility of p300 peaks in Th1 and Th2 cells (r = 0.91 and 0.82, respectively) (Figure S1A available online). To verify the utility of this approach, we first asked whether this method identified known regulatory elements in CD4+ T cells. We found that the well-characterized Cd4 enhancer was marked by p300 in both Th1 and Th2 cells (Chong et al., 2010). In contrast, the CD8a and Foxp3 genes, whose products are not expressed in these cells, were devoid of p300 binding (Figure S1B). Furthermore, our genome-wide p300 profiling identified known enhancers of Ifng and Il4-Il13 genes, the signature cytokines of Th1 and Th2 cells, respectively (Figures S1B and S1A and Table S1). In addition, numerous other putative elements were identified (Figures S1B–S1D). Active Enhancer Landscapes in Th1 and Th2 Cells Are Distinct Although specialized cells are functionally distinct, they also share many key cellular processes. This raises the fundamental question of how discrete the active enhancer landscapes are in distinct cell populations. We therefore sought to evaluate the differences in the genome-wide profile of active enhancer signatures in Th1 and Th2 cells, cells that are closely related yet functionally distinct. The p300 binding sites in promoter regions, enriched for H3K4me3, constituted 12% of the total binding sites and were excluded from further analysis (Figure 1B). Overall, there were 22,553 and 19,859 putative distal enhancers in Th1 and

Th2 cells, respectively. Of these, 12,845 putative enhancers, 55% and 64% of Th1 and Th2 identified elements, were shared by these two T helper subsets (Figure 1C). Conversely, 9,180 (45%) and 7,089 (35%) distal p300 peaks were specific to Th1 and Th2 cells, respectively. Given the many similarities in function in Th1 and Th2 cells, we were surprised by how different these two subsets were in their active enhancer landscapes. We therefore compared Th1 and Th2 cells to more distantly related cells, namely macrophages and embryonic stem (ES) cells (Figures 1C and S1E) (Creyghton et al., 2010; Ghisletti et al., 2010). Macrophages and T cells are both of hematopoietic origin; however, only 1,498 regions were shared among T helper cells and macrophages (10% of the total elements). Intriguingly, the conservation of enhancer repertoire, as defined by p300 binding, did not extend to include ES cells. Only 89 DNA elements bound by p300 were shared among all four data sets (<1% of enhancer elements of each cell type) (Figure 1C). Together, each T helper subset had many unique enhancer elements, and T helper cells, macrophages, and ES cells had essentially no active enhancers in common. Housekeeping Genes Have Little or No p300 Binding Given that many functions are shared among cells, we were struck by the uniqueness of the global enhancer signatures. Therefore, we wondered whether genes with a high degree of tissue-specific expression would be relatively enriched for p300 binding compared to genes that were widely expressed (e.g., ‘‘housekeeping’’ genes). Therefore, we next performed genome-wide transcriptional profiling in Th1 and Th2 cells by using RNA sequencing (RNA-seq) (Mortazavi et al., 2008) and identified the top 100 differentially expressed genes in each subset (Figures S1F and S1G). In addition, we chose 100 housekeeping genes based on an earlier study (Eisenberg and Levanon, 2003) (Figure S1H). The data showed that the housekeeping genes exhibit high levels of expression and enrichment of H3K4me3 at their promoters (Figures 1D and 1E). In fact, the levels of expression of housekeeping genes were significantly higher compared to T-helper-specific genes. However, the pattern of p300 enrichment for T-helper-specific and housekeeping genes was very different; genes selectively expressed in Th1 or Th2 cells showed significantly higher p300 binding in their extended loci (±20 kbp) compared to housekeeping genes (Figure 1F). In contrast to p300, however, the distribution of H3K4me1 modifications did not distinguish between preferentially expressed and housekeeping genes to the same degree (Figure S1I). Collectively, our data demonstrate that the majority of housekeeping genes have little or no p300 binding, suggesting their distinct modes of regulation compared to tissue-specific genes.

(D–F) In contrast to differentially expressed genes in Th1 and Th2 cells, housekeeping genes have little proximal p300 binding. Box plots show median and quartiles of (D) normalized mRNA expression levels (RPKM) measured by RNA-seq, (E) normalized H3K4me3 (tag per million), and (F) normalized p300 binding (tag per million) for top 100 Th-specific genes versus 100 housekeeping genes selected from Eisenberg and Levanon (2003). The intensity of p300 binding was computed 20 kbp to 20 kbp from the TSS to capture potential enhancers. The intensity of H3K4me3 was computed 4 kbp to 1 kbp from the TSS to capture active promoters (p values for Wilcoxon rank-sum test). See also Figure S1.

Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 983



Th1

ES

Th2



ES

Th1

Th2

Th1-specific p300 (9,180)

25

80000

0

Distance from Th1-specific p300

40000 30000 20000

0

0

0

0

0

40kbp 50kbp

E

500

10kbp

20kbp

500 100 0

p300 Th1

30kbp 40kbp 50kbp

*

p<1e-4

Tag-per-million 200 300

300 100

0

Th2SpecGenes

p300 Th2

p300 Th1

p300 Th2

distance from TSS of Th2SpecGenes

Th2-specific p300 (7,089)

30kbp

Th1-Th2 common p300 (12,845)

20kbp

Th1-specific p300 (9,180)

10kbp

distance from TSS of Th1SpecGenes

*

p<1e-8

0

0.000

0

Th1SpecGenes

200

0.004

0.006

Tag-per-million

0.008

400

0.010

0.012

Th2-specific p300-bound sites randomly selected sites

0.002

0.015 0.010 0.005 0.000

Normalized number of p300 sites

D Normalized number of p300 sites

C Th1-specific p300-bound sites randomly selected sites

0

Distance fromTh2-specific p300

400

0

Distance from p300-bound sites

+5kbp

-5kbp

+5kbp

-5kbp

0

+5kbp

0

-5kbp

0

+2.5kbp

0

-2.5kbp

0

Th2-specific p300 (7,089)

p-value = 1e-11

H3K4me1 Th1 H3K4me1 Th2

10000

Tag−per-million

50000

Th1-Th2 common p300 (12,845)

+5kbp

-5kbp

0

0

p-value < 2.2e-16

60000

Th2

50

H3K4me1 Th1 H3K4me1 Th2

40000

Th1

B

H3K4me3

Tag−per-million

H3K4me1

p300

20000

A

AP-1 RUNX

0.0

0.5

NFAT Tbox p65 (RelA) GAS-3 (STAT1,4) GATA GAS-4 (STAT6)

Figure 2. Properties of T-Helper-Specific p300-Bound Elements (A) T-helper-specific p300 elements are marked by high H3K4me1 and low H3K4me3 in both Th cells but lack p300 binding and H3K4me1 in macrophages and ES cells. Each column depicts p300 binding, H3K4me1, or H3K4me3 within a window centered on the p300-bound sites (indicated as position ‘‘0’’ by red triangle). Three patterns of p300 binding are shown: Th1-specific (9,180), Th1-Th2-common (12,845), and Th2-specific (7,089). Color map corresponds to binding intensities where ‘‘black’’ represents no binding. (B) H3K4me1 at Th-specific p300 sites shows enrichment in the respective lineage and relative reduction in the opposite lineage. Plots show the normalized distribution of H3K4me1 at Th1 (Th2)-specific p300 elements in Th1 and Th2 cells (±5 kbp) (Kolmogorov-Smirnov test). (C) Th-specific p300 binding sites are enriched in proximity to genes selectively expressed in T helper cells. Plots depict number of Th-specific p300 binding sites within a given distance to promoters of Th-specific genes (Th1 blue, Th2 black) versus randomly generated sites (red) (Wilcoxon rank-sum test p value < 2.2 3 10 16).

984 Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc.

Cell-Type-Specific p300 Peaks Colocalize with H3K4me1 and Correlate with Cell-Type-Specific Gene Expression Because the chromatin signature of active enhancers is reported as p300 positive, H3K4me1 high, H3K4me3 low, and H3K27me3 low, we characterized patterns of histone modifications around common and unique p300-bound regions in Th1 and Th2 cells. On a genome-wide scale, elements uniquely marked by p300 in Th1 or Th2 cells resided within domains of high H3K4me1, low H3K4me3, and low H3K27me3 in the corresponding cell type (Figures 2A and S2A). Of note, these T-helper-specific p300 elements were highly conserved among mammals and were enriched for CpG islands (Figures S2B and S2C). H3K4 monomethylation in regions that were differentially bound by p300 in one lineage showed relative reduction in cells of the opposite lineage (Figure 2B). Macrophages and ES cells lacked H3K4me1 across T helper p300 elements (Figure 2A). Collectively, although Th-specific p300 elements are devoid of p300 and H3K4me1 in macrophage and ES cells, these elements are marked by H3K4me1 in both T helper subsets. We next evaluated whether differential gene expression correlated with the presence of Th-specific p300 elements in the appropriate subset. In fact, we found that genes that were differentially expressed exhibited significant enrichment of p300 binding in the corresponding cell type, with significantly less p300 recruitment in the opposite lineage (Figures 2C and 2D). Overall, Th1- and Th2-specific p300-bound regions had the chromatin characteristics of active enhancers and strongly correlated with cell-type-specific expression of proximate genes. Enrichment of Lineage-Specific Transcription Factor Binding Sites at Lineage-Specific Enhancers The finding that the lineage-specific p300 elements correlate with differential gene expression led us to next assess whether these elements exhibited enrichment of binding sites for transcription factors that promote a lineage-specific gene expression program. We first characterized the enrichment of consensus motifs identified from publically available ChIP-seq data sets. AP-1 and NFAT, factors activated by T cell receptor engagement, and Runx are prevalent in Th cell enhancer elements but do not discriminate between Th1- and Th2-selective elements (Figure 2E). In contrast, motifs for factors that promote Th1 differentiation, such as RelA (p65) (Balasubramani et al., 2010) and T-bet (Szabo et al., 2000), were enriched in Th1-specific, but not Th2-specific, enhancers. Th1-specific enhancers were enriched for gamma-interferon activation site with three base pair spacer (GAS-3), which is the consensus motifs for STAT1 and STAT4, but were relatively devoid of the

STAT6 binding motif (GAS-4) (Wei et al., 2010). In contrast, Th2 elements exhibited enrichment of STAT6 and GATA3 motifs. To more rigorously assess binding of relevant transcription factors to helper cell enhancers, we utilized available ChIP-seq data in CD4+ T cells (Table S2) (Nakayamada et al., 2011; Wei et al., 2010; Wei et al., 2011). Our analysis revealed that 48% of Th1-specific p300 elements were bound by STAT4 or STAT1, and 36% were bound by T-bet. Similarly, 31% and 11% of Th2-specific enhancers were bound by STAT6 and GATA3, respectively. Taken together, our findings reveal that T-helper-specific p300 elements are enriched for lineagespecific transcription factors and correlate with lineage-specific gene expression programs these transcription factors promote. STAT6 Has a Major Role in Generating Active Enhancers of Th2 Cells Although the functional importance of enhancer elements in gene regulation is well recognized, the factors that shape nascent enhancer landscapes of highly specialized cells are mostly unknown. Because cell-type-specific p300 elements were enriched for STAT binding sites, we asked whether chromatin signatures of active enhancers were also STAT dependent. The ability of T cells to remain viable and retain their developmental potential in the absence of STAT proteins allowed us to assess the consequence of genetic deletion of these proteins on the enhancer repertoire of T cells. Activated by IL-4, STAT6 is a key player in Th2 cell specification (Goenka and Kaplan, 2011; Zhu and Paul, 2008). To evaluate the contribution of STAT6 in shaping the active enhancer structure, we generated p300 and H3K4me1 profiles in wild-type and STAT6-deficient cells (Figures 3A and S3A). Focusing first on the Il4 extended locus (Ansel et al., 2006), we observed that STAT6 bound to more than half of the regulatory regions marked by p300, and p300 binding was abrogated in STAT6-deficient T cells (Figure S3B). Globally, the impact of STAT6 deficiency on the chromatin signatures of Th2-specific active enhancers was striking; 77% of the Th2-specific p300 sites (5,451) were STAT6 dependent. Further analysis revealed that the magnitude of H3K4me1 marks was also significantly dependent on STAT6 at STAT6-dependent p300 elements (Figure 3B). To link the effects of STAT6 on transcriptome and the active enhancer landscape, we measured global gene expression in wild-type and STAT6-deficient cells by using RNA-seq. Indeed, p300 binding at the extended loci of STAT6-regulated genes showed STAT6 dependency (Figures 3C and S3C). Collectively, our data revealed a major role for STAT6 in p300 binding and H3K4me1 marks at active enhancers of Th2 cells.

(D) Th-specific genes exhibit enrichment of p300 binding in the corresponding lineage and relative p300 depletion in the opposite lineage. Box plots show median and quartiles of p300 binding in Th1 and Th2 cells around Th1- or Th2-specific genes (±20 kbp from the TSS) (Wilcoxon rank-sum test). (E) Th-specific p300 elements are enriched for consensus motifs of lineage-appropriate transcription factors. Consensus motifs for T-cell-related transcription factors were computed based on the de novo motif analysis by using ChIP-seq data for each factor. A Gibbs sampling method was used to search for a motif by using the genome as the background (likelihood ratio r > 1,000). Consensus motifs GATA and GAS-4 (STAT6) were preferentially enriched in Th2, whereas T-box, GAS-3 (STAT1,4), and p65 were enriched in Th1-specific p300 elements. See also Figure S2.

Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 985

p300 Th2

A

H3K4me1 Th2 -/-

WT

Stat6

WT

Stat6

-/-

1,638 (23%)

+5kbp

-5kbp

0

+2.5kbp

0

0

-2.5kbp

5,451(77%)

0

Distance from Th2-specific p300-bound sites

** p<1e-8

300

4

300

400

500

600

Tag−per−million 150 200 250

800

** p<1e-16 8

p-value < 2.2e-16

p300 at (+/-20kbp) of STAT6-Dependent Genes

(A) STAT6 is critical for the global chromatin signature of Th2-specific enhancers. Globally, p300 binding and H3K4me1 at 77% of Th2specific p300 sites (5,451) were STAT6 dependent. The plot in each column represents the pattern of p300 binding and H3K4me1 in wild-type or Stat6 / cells centered on the Th2-specific p300-bound sites (as indicated by position ‘‘0’’). Color map corresponds to binding intensities where ‘‘black’’ represents no binding. (B) H3K4me1 at Th2-specific p300 sites is STAT6 dependent. Plot shows the normalized distribution of H3K4me1 at 5,451 STAT6-dependent p300 elements (Kolmogorov-Smirnov test). (C) STAT6 positively regulated genes are enriched with STAT6-dependent p300 binding sites. By using RNA-seq data from wild-type Th2 and STAT6-deficient cells, we identified positively regulated genes by STAT6 (>2-fold change). Accumulation of p300 binding at these genes in wild-type and STAT6-deficient cells was computed (±20 kbp from the TSS). Box plots show median and quartiles of gene expression levels in RPKM (left) and p300 binding in tag per million (right) at STAT6-dependent genes in wild-type and STAT6-deficient cells (Wilcoxon rank-sum test). See also Figure S3.

Distance from STAT6-dependent p300 sites

50

IL-12 and IFNs acting via STAT4 and STAT1 induce the expression of Tbx21, which encodes the master regulator tranStat6-/WT scription factor T-bet (Szabo et al., 2000). Given its role in Th1 differentiation, we next asked whether T-bet was also an important driver of the genomic enhancer signature. To our surprise, T-bet had a modest effect on the genomic enhancer repertoire (p value = 0.06). Although elements in proximity to some genes like the Ifng locus were regulated by both STATs and T-bet (Figures S4E–S4G), 83% of Th1-specific p300-bound elements were independent of T-bet (Figure 4A). Transcriptional profiling in T-bet-deficient cells revealed that p300 binding sites in proximity to genes positively regulated by T-bet were not dependent on this transcription factor (Figure 4C). Taken together, our findings indicate that STAT1 and STAT4 play major roles in generating the active enhancer landscape of Th1 cells, whereas T-bet has a modest impact. 0

150

0

100

+5kbp

50

0

-5kbp

100

2

100

200

Tag−per−million

700

WT Stat6-/-

STAT6-Dependent Genes (mRNA) 350

C

H3K4me1 Th2

RPKM6

B

Figure 3. STAT6 Has a Major Role in Generating Active Enhancers of Th2 Cells

WT

Stat6-/-

STAT4 and STAT1, but Not T-bet, Are Global Regulators of Active Enhancers in Th1 Cells STAT4 and STAT1 work in concert to drive Th1 differentiation, being activated by IL-12 and IFNg, respectively (Lighvani et al., 2001; Thieu et al., 2008). To assess the contribution of these transcription factors in shaping global enhancer structures, we generated p300 and H3K4me1 profiles by using wild-type cells and cells lacking STAT4 or STAT1 (Figure 4A). Duplicate experiments showed the reproducibility of p300 peaks in these genotypes (Figure S4A). Because these factors are important for the regulation of Ifng expression, we first assessed their roles in the generation of enhancers of this gene (Figure S4B). More than half of the regulatory elements in Ifng gene extended locus were dependent on STAT4 or STAT1 (Figures S4C–S4G). Globally, deficiency of STAT4 and STAT1 resulted in a significant reduction in p300 marks; 60% of Th1-specific elements were dependent on either STAT1 or STAT4 (Figure 4A). More specifically, 15% and 13% were uniquely dependent on STAT1 and STAT4, respectively, whereas 30% of p300 binding disappeared in the absence of either STAT1 or STAT4. Interestingly, STAT-dependent p300 binding sites were enriched in proximity to genes whose expression levels were positively regulated by STATs (Figure 4B). In contrast, H3K4me1 at Th1-specific p300 elements were largely independent of STAT1 or STAT4 (Figure S4H). 986 Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc.

STATs Exert Positive and Negative Effects on p300 Recruitment Thus far, our findings indicate that STAT proteins bind to many T-helper-specific p300 elements and are responsible for p300 deposition in T helper cells. We next assessed the extent to which STAT binding and recruitment of p300 were related. The integration of STAT and p300 ChIP-seq data revealed that around one-third of STAT-dependent p300 elements were also bound by the cognate STAT (Figure S5A), arguing that STAT proteins likely shape the enhancer landscape of T helper cells both directly and through deployment of other factors.

Figure 4. STAT4 and STAT1, but Not T-bet, Are Critical for Active Enhancers of Th1 Cells

p300 Th1 A

Stat4-/-

WT

Stat1-/-

T-bet-/-

3,681(40%)

1,433 (15%)

1,249 (13%)

1,214 (13%)

0

0

+2.5kbp

0

-2.5kbp

1,603 (17%)

0

Distance from Th1-specific p300-bound sites

B

p300 at (+/- 20kbp) of STAT4-Dependent Genes

500

** p<1e-16

C

p300 at (+/- 20kbp) of T-bet-Dependent Genes

Tbet−Dependent Genes (mRNA)

** p<1e-16

* p=0.03

p=0.5

Stat4-/-

WT

Stat4-/-

300 200

6

RPKM

4

100

this cell type (Figures 5A and 5B). Indeed, the expression of Ifng gene increased in the absence of STAT6 (Figure 5B). In general, transcriptome analysis in WT T-bet-/STAT6-deficient cells revealed that the effect of STAT6 on p300 recruitment correlated well with its role on gene expression patterns (Figure S5B). Overall, given the multiplicity of factors that influence T cell activation and differentiation, the extent of STAT6 binding sites with an effect on p300 deposition was notable. Similar to STAT6, the binding of STAT4 was associated with gain (13%, 2,646) or loss (9%, 1,723) of p300 binding (Figure 5C). STAT1 binding also demonstrated a direct effect on p300 deposition (Figure S5C). In contrast, T-bet binding positively correlated with very few p300 binding sites (6%, 1,138 of 19,152) (Figure 5D). Intriguingly, at 2,352 sites (13%), T-bet binding was associated with the inhibition of p300 recruitment, and these elements were proximal to genes negatively regulated by T-bet (Figure S5B). Examples of genes with proximal T-bet binding for which p300 was negatively regulated by T-bet included Eomes and Il4-Il13 (Intlekofer et al., 2005; Thieu et al., 2008). Consistent with our earlier findings, T-bet exerts a modest role on the acquisition of Th1-specific enhancers 0

2 0

100 0

2 0

WT

Tag−per−million

400 300

Tag−per−million

200

6 4

RPKM

8

8

10

STAT4−Dependent Genes (mRNA)

(A) STAT1 and STAT4, but not T-bet, play major roles in generating the active enhancer landscape of Th1 cells. Globally, 60% of Th1-specific enhancers were STAT dependent, whereas 17% were T-bet dependent. Each column represents the pattern of p300 binding in wild-type, Stat4 / , Stat1 / , or T-bet / cells centered on the Th1specific p300-bound sites. Color map corresponds to binding intensities where ‘‘black’’ represents no binding. (B) Genes positively regulated by STAT4 are enriched with STAT4-dependent p300 binding sites. Using RNA-seq data in wild-type Th1 and STAT4deficient cells, we identified genes that were positively regulated by STAT4 (>2-fold change). Accumulation of p300 binding at these genes in wild-type and STAT4-deficient cells was computed (±20 kbp). Box plots show normalized gene expression levels in RPKM (left) and p300 binding in tag per million (right) at STAT4-dependent genes in wild-type and STAT4-deficient cells (Wilcoxon rank-sum test). (C) p300 binding at the extended loci of genes positively regulated by T-bet is not T-bet dependent. By using RNA-seq data in wild-type Th1 and T-bet-deficient cells, we selected positively regulated genes by T-bet (>2-fold change). Box plots show normalized gene expression levels in RPKM (left) and p300 binding in tag per million (right) at T-bet-dependent genes in wild-type and T-betdeficient cells. See also Figure S4.

WT

T-bet-/-

To further characterize the direct effect of STATs, we asked whether we could quantitate the role of STAT binding on cognate p300 recruitment. In particular, we explored the extent to which the accumulation of STAT binding associated with the acquisition of lineage-appropriate enhancer elements and the suppression of lineage-inappropriate marks. Our analysis revealed that the enrichment of STAT6 binding positively correlated with p300 recruitment at 22% of binding sites of this protein (3,523 of 16,079) (Figure 5A). Examples included multiple genes that contribute to Th2 differentiation such as Gata3, Nfil3, and Il24 (Figure 5A) (Kashiwada et al., 2011; Wei et al., 2010). Alternatively, the intensity of p300 binding increased more than 4-fold in STAT6-deficient cells at 10% of STAT6-bound sites (1,606 of 16,079), arguing for a substantial role of STAT6 in limiting p300 recruitment. One intriguing example was the Ifng locus where intergenic STAT6 binding sites in Th2 cells correlated with the loss of p300 binding and H3K4me1 modification in

Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 987

A

Nfil3

p300 binding (WT/Stat6-/-) (log2)

Il24

Gata3

Figure 5. Quantification of Direct Contribution of STATs to p300 Binding

B Ifng

22%

14 (RPKM)

+5

mRNA Th2

135 (RPKM)

mRNA Stat6

CNS 300

p300

1.5

0

H3K4me1

50

10%

100

p300 WT

100

p300 Stat6-/-

Ifng

2

H3K4me1 WT

1.5

H3K4me1 Stat6-/-

1.5

H3K27me3 WT

Xcl1

Il18r1

4

6

8

H3K27me3 Stat6-/-

1.5

10

STAT6 binding (log2)

C Nfatc2

Ifng

13%

+5

0

-5 9% 0

2

Il2 Il4ra

4

6

D p300 binding (WT/T-bet-/-) (log2)

Il18r1

8

10

STAT4 binding (log2)

Th1

STAT6

1.5

-5 0

p300 binding (WT/Stat4-/-) (log2)

-/-

Ifng

Xcl1

6%

+5

0

-5 13% 0

2

Eomes

4

Il4-Il13

6

8

T-bet binding (log2)

10

Th2

(A) Global binding of STAT6 leads to both gain and loss of cognate p300 binding. Two-dimensional histogram depicts STAT6 binding, resulting in a change in p300 recruitment in wild-type versus Stat6 / cells. Percentages of STAT6-bound sites with positive or negative effect on p300 are represented in the marked area (>4-fold change). The x axis corresponds to intensity of STAT6 binding (log2). The y axis measures the fold change of p300 binding in wild-type versus Stat6 / cells (log2). Color map corresponds to the number of binding events. Examples of genes with proximal STAT6 binding include Nfil3, Il24, and Gata3 (for positive effect) and Ifng, Xcl1, and Il18r1 (for negative effect). (B) STAT6 has direct negative effects on Ifng enhancers in Th2 cells. Gene track shows that STAT6 binding (dotted box) in Th2 cells leads to loss of p300 binding and H3K4me1 at Ifng enhancers. RNA-seq lanes depict the expression of Ifng gene increased in the absence of STAT6 (14–135 RPKM). (C) STAT4 binding correlates with gain and loss of p300 binding. Examples of genes with proximal STAT4 binding include Ifng, Nfatc2, and Il18r1 (for positive effect) and Il2 and Il4ra (for negative effect). (D) Contrasting effect of T-bet on p300 binding. T-bet has a dominant role as a repressor rather than an activator based on p300 binding. Examples of genes with proximal T-bet binding include Ifng and Xcl1 (for positive effect) and Eomes and Il4-Il13 (for negative effect). See also Figure S5.

and exerts a more dominant role in limiting lineage-inappropriate p300 deposition. In contrast to the effect on p300 binding, the direct effects of STAT4, STAT1, or T-bet on H3K4me1 modifications were not significant (Figure S5D). However, at 7% of its binding sites (1,152 of 16,079), STAT6 binding was associated with an increase in H3K4me1 marks (Figure S5D). The major impact of STAT6 on p300 and H3K4me1 deposition suggests its crucial role in recruiting ‘‘writers’’’ complex of ‘‘histone code’’ to enhancer cohorts of specialized cells.

At the Il4-Il13 locus, expression of GATA3 restored many p300 binding sites; however, the most conserved noncoding region in the locus lacked p300 binding in the absence of STAT6 (Figure S6B) (Table S3). On the genome-wide scale, 51% of Th2specific STAT6-dependent sites (2,639 of 5,041) were recovered after GATA3 overexpression (Figures 6A and S6C). However, 2,402 sites (49% of STAT6-dependent enhancers) could not be reconstituted. Of note, 26% (690) of the recovered elements were bound by GATA3. In contrast, only 7% (157) of nonrecovered elements exhibited GATA3 binding (Wei et al., 2011).

GATA3 Expression Fails to Re-establish the STAT6Dependent Active Enhancer Landscape Although deletion of STATs significantly altered the active enhancer landscape of T helper cells, the argument could be made that, without these factors, differentiated T helper cells were not efficiently generated due to the suboptimal expression of endogenous master regulators. To overcome this problem, we next asked whether overexpression of master regulators could reconstitute the chromatin signature of T-helper-specific enhancers in STAT-deficient cells. Starting with Th2 cells, we found that the overexpression of GATA3 in STAT6-deficient cells was sufficient to restore IL-4 production (Figure S6A), as previously reported (Lee et al., 2001). We next assessed the genome-wide binding of p300 in cells that lacked STAT6 but expressed GATA3 and IL-4, comparing binding patterns in wildtype and STAT6-deficient cells.

GATA3-Mediated Transcriptome Changes Correlate with Changes in p300 Binding To evaluate the degree to which gene expression correlates with establishment of the p300 binding repertoire, we measured global transcription in wild-type and STAT6-deficient cells in the absence or presence of GATA3 overexpression. Of 249 genes whose expression levels were reduced in STAT6-deficient cells, 42% (99) were induced by GATA3 overexpression (Figures 6B, 6C, and S6D). To assess how transcription might correlate with the appearance of enhancer landscape, we quantitated the p300 binding in the vicinity of genes (±20 kbp) whose expression was recovered by GATA3 (Figure 6B). For these genes, p300 binding was significantly increased upon GATA3 expression. In contrast, genes that were not induced by GATA3 exhibited no change in proximal p300 binding (Figure 6C). Overall, the integration of transcriptome changes and p300 binding revealed that

988 Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc.

changes in gene expression patterns mediated by GATA3 correlate with changes in p300 binding. T-bet Expression in STAT4-Deficient Cells Fails to Recover Active Enhancers in Th1 Cells Next, we asked whether T-bet would function similarly to GATA3. Consistent with previous work, we found that the overexpression of T-bet in STAT4-deficient cells was sufficient to restore IFNg production (Figure S6E) (Afkarian et al., 2002; Mullen et al., 2001). In addition, we found that p300 binding to many sites within the Ifng locus was restored after T-bet overexpression. However, the majority of STAT4-dependent p300 binding sites were not recovered (Figure S6F and Table S3). On the genome-wide scale, only 23% of Th1-specific STAT4dependent p300 sites (1,614 of 6,820) were recovered after T-bet overexpression (Figure 6D). To assess whether changes in transcription are related to recovery in p300 binding, we measured global gene expression levels in wild-type and STAT4-deficient cells in the absence or presence of T-bet. Consistent with our earlier observations, T-bet expression had limited effects on the p300 binding of 37 genes where their expression levels were reconstituted by this transcription factor (Figures 6E, 6F, and S6G). Overall, our results highlight the importance of STAT signaling in molding the active enhancer landscape and generating alternative gene expression programs of T helper cells, independent of endogenous factors such as T-bet and GATA3. DISCUSSION The ability of a cell to sense and interpret environmental stimuli and appropriately modify gene expression is a fundamental tenet of evolutionary adaptation. Irrespective of the nature of the instigating stimulus and elicited signaling pathways, the final decoding of the message occurs at the genome level with a binary decision, resulting in either activation or repression of transcription. Taking part in these binary decisions are chromatin structures, including regulatory elements such as enhancers, which are now recognized as having major contributions to cell-typespecific gene expression programs. Enhancer elements either adopt a poised structure or, upon receiving appropriate signals, transition to transcriptional competency. Here, we set out to interrogate the roles of environment-sensing transcription factors and phenotype-defining master transcription factors in shaping the active enhancer landscape during cell fate specification. Our study argues for the pervasive involvement of cytokine-regulated STATs in conferring enhancer specificity and an indispensable role of environmental signals in creating the active enhancer repertoire. These are functions that are not replaced by the enforced action of phenotype-defining master regulators. Distinct Chromatin Signatures Identify Active Enhancers in T Cell Populations We profiled the repertoires of H3K4me1-high, p300-high regions (operationally defined as active enhancers) in Th1 and Th2 cells. Our data establish that closely related T helper cells have distinct active enhancer landscapes. The ability of p300 mapping to discriminate cell type specificity becomes more evident when

our analysis included macrophages and ES cells. In this respect, it is worth pointing out that a precise understanding of what constitutes an active enhancer has not been firmly established (Natoli, 2010). Although p300 binding successfully identified known enhancers of key genes in T cell populations, it is likely that p300 binding reports only a cross-section of active enhancer repertoire, and the entire view of active enhancer landscape also includes mapping of other HAT complexes such as CBP (May et al., 2011) or SAGA (Krebs et al., 2011). Nevertheless, we have established that unbiased mapping of p300 binding is a powerful way to broadly interrogate enhancer activity with fine resolution and sufficient coverage in closely related cell populations. The resulting annotations of active enhancers have implications for the interpretation of genome-wide association studies. Top-scoring disease-linked single-nucleotide polymorphisms are frequently positioned within enhancer elements specifically active in relevant cell types (Ernst et al., 2011). Global profiling of enhancers in various cell types thus provides a knowledge base for the systematic investigation of such elements in health and disease. Environment-Directed STATs Are Major Drivers of Active Enhancer Landscape of T Cells Soluble secreted factors in the environment play key roles in cellular specification. For T cells, cytokines are the major factors that determine fate commitment, mainly through the activation and recruitment of STATs to chromatin. By comparing wildtype and STAT-deficient T cells, we observed an unexpectedly large contribution of these factors to the active enhancer landscape. Clearly, transcriptomic changes mediated by STATs correlated well with STAT-dependent changes in p300 binding. Our data suggest the direct role of STATs on p300 recruitment. Although STATs may directly associate with p300 (Paulson et al., 1999), the extent to which these proteins interact on a large scale will require further validation. In contrast to the major effect of STATs on p300-bound enhancers, the impact of STATs on H3K4me1-positive enhancers was variable. The absence of STAT6 reduced but did not abrogate H3K4me1 marks, whereas the lack of STAT1 or STAT4 minimally influenced H3K4me1 distribution. The modest effect of STAT1 or STAT4 on H3K4me1 is not unexpected as both can contribute to Th1 specification, and mice lacking both these factors have not been generated (Lighvani et al., 2001; Thieu et al., 2008). Considering that H3K4me1 broadly maps regions that may include both inactive and poised elements as well as active regulatory sites, ‘‘pioneering factors’’ other than STATs are likely responsible for deposition of this mark. Possibly, the appearance of such marks occurs at an earlier stage of T helper differentiation. In contrast, STATs are the major drivers of p300-bound, active enhancer landscape. This suggests a stepwise process of the enhancer firing in which the establishment of H3K4me1 modifications may precede STAT binding and HAT recruitment. In a sense, the process of enhancer formation can be seen as a volleyball game; some factors ‘‘set’’ the play by forming a permissive enhancer landscape on which environment-sensing factors ‘‘spike’’ to create the productive enhancer elements. Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 989

p300 Th2 A

Stat6 -/Control

WT GATA3

p300 Th1 Stat6 -/GATA3

D

Stat4 -/T-bet

0

p300

mRNA p=0.5

Tag-per-million

1500

10 8

2000

1000

6

Genes not Recovered by Gata3 in Th2 p300 mRNA

* Tag-per-million

8 6

RPKM -/Th2 C S6KO Gata3 S6KO Gata3 WT Stat6 Stat6C-/- Th2WT Cont Cont GATA3 GATA3

p=3e-7

*

500 0

0

0

2

4

2000 1500 500

1000

Tag-per-million

8 6 4

RPKM

2 0

p=1e-9

*

T-bet

p300

mRNA

p=1e-6

-/S4KO Stat4Tbet

Genes not Recovered by T-bet in Th1

F

p=2e-8

-/Th2 C S6KO Gata3 S6KO Gata3 WT Stat6 Stat6C-/- Th2WT GATA3 GATA3 Cont Cont

Th1 C S4KO-/C Th1 Stat4 WT WTTbet Cont Cont T-bet

T-bet

2500

*

S4KO Tbet-/Stat4

2000

C

Th1 C S4KO -/C Th1 Stat4 WTTbet WT T-bet Cont Cont

-/Th2 C S6KO Gata3 S6KO Gata3 Stat6 WT WT Stat6C-/- Th2 Cont Cont GATA3 GATA3

GATA3

1500

-/S6KO Stat6Gata3

1000

-/- Th2WT S6KO C Gata3 Stat6 Cont Cont GATA3

Th2 WTC

0

2 0

0

0

2

500

500

4

RPKM

*

p=0.04

1500 1000

Tag-per-million

8

+2.5kbp

1,614 (11%)

+2.5kbp

0

6

-2.5kbp

5,206 (37%)

0

10

7,486 (51%)

2,402 (36%) 2,639 (39%)

0

-2.5kbp

0

4

0

Genes Recovered by T-bet in Th1

p=0.4

RPKM

0

E

p300

p=0.8

0

Distance from Th1-specific p300-bound sites

Genes Recovered by Gata3 in Th2 mRNA

0

Distance from Th2-specific p300-bound sites

B

Stat4 -/Control

WT T-bet

WT Control

20

1,687 (25%)

WT Control

Tbet WT S4KO Stat4C -/-Th1WT Cont Cont T-bet

Th1 C

S4KO Tbet-/-

Stat4 T-bet

WT Stat4 -/- WT Cont Cont T-bet

Stat4 -/T-bet

Figure 6. Overexpression of T-bet or GATA3 in STAT-Deficient Cells Fails to Reconstitute STAT-Dependent Active Repertoires (A) GATA3 expression recovers half of STAT6-dependent elements in Th2 cells. Of 5,041 Th2-specific-STAT6-dependent p300 sites, 2,639 (50%) regulatory elements are recovered in STAT6-deficient cells in which GATA3 was reconstituted. Overall, 36% of Th2-specific enhancers are STAT6 dependent and GATA3

990 Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc.

Our study also identified potential candidates for pioneering or ‘‘setter’’ factors that might contribute to the poised enhancer landscape. These include transcription factors that globally determine T cell commitment or factors like AP-1 and NFAT, which sense T cell receptor engagement. Indeed, our analysis revealed that both Th1- and Th2-specific enhancers are enriched for binding sites of these transcription factors. It has been shown that transcription factor AP-1 can condition chromatin to be a basal permissive state, which then facilitates the ligand-dependent recruitment of glucocorticoid receptors (Biddie et al., 2011). Redundant and Unique Enhancer-Shaping Properties of Master Regulators and STATs T-bet and GATA-3 are referred to as helper T cell master regulators because they are sufficient to induce characteristic cytokine expression in Th1 and Th2 cells in the absence of STAT4 or STAT6 (Lee et al., 2001; Mullen et al., 2001). Our global enhancer profiling revealed that T-bet and GATA3 differed in their capacity to affect p300 binding in STAT-deficient cells; however, neither was sufficient to re-establish the normal active enhancer landscape. Our data revealed that GATA3 overexpression could restore roughly half of STAT6-dependent enhancer elements. For genes induced by GATA3, proximate p300 binding correlated with gene expression. In an interesting contrast, T-bet had limited effects on global p300 maps. This was true both in T-bet-deficient cells and upon enforced T-bet expression in STAT4-deficient cells. The differential effects of T-bet and GATA3 argue that ‘‘master regulators’’ may have very distinct modes of action in cell specification. In the future, it will be of interest to compare and contrast the effects of these lineage-defining transcription factors with other classic master regulators. In this regard, an additional unexpected observation on T-bet was its preferential repressive, rather than enabling, role in establishing enhancer competence; whether this is a general property of T-box transcription factors remains to be determined. Although it is believed that silencing of genes expressed in other cell fates is the most relevant regulatory decision in lineage commitment (Zhang et al., 2012), the direct negative role of key regulatory factors on global enhancers of opposite lineages has not been shown before. We speculate that context-dependent proteinprotein interaction involving transcription factors may impose alternative accretion of coactivator or corepressor complexes and, in doing so, may dictate enhancer competence and transcriptional outcome.

The overlapping yet specialized contribution of STATs and master regulators to shape enhancer signature points to an interesting possibility of the formation of coherent feed-forward loops (FFL) (Shen-Orr et al., 2002). Coherent FFL have several dynamic and functional properties, one of the most relevant being the ability to protect the system from undesired responses to fluctuating inputs. For T cells, which migrate to diverse sites within the body and continually survey infectious challenges, such a scenario is likely to be advantageous. Such a perspective might also be relevant in other cells for which plastic versus static patterns of gene expression are functionally important. An emerging view from our study is that, although broad potentials exist in the poised enhancer landscape, STATs sense environmental stimuli and act upon the available repertoire of enhancers to trigger specific transcriptional responses. Beyond T cells and host defense, STATs have broad functions in programming gene expression in embryonic development, cell growth, and cancer in various organisms (Horvath, 2000). The fact that STATs are involved in synaptic plasticity in the brain (Miller and Gauthier, 2007; Nicolas et al., 2012) is reminiscent of STAT action as environment sensors in T cells. Similarly, in mammary tissue, STATs control different phases of cell development and involution (Watson and Neoh, 2008). It is tempting to speculate that the creation of active enhancer landscapes may be dependent upon STATs in these and other cells. In a broader perspective, there are a wide variety of transcription factors like STATs that sense environmental cues and regulate cellular differentiation. The extent to which nuclear hormones, retinoid receptors, SMAD family transcription factors, and Wnt pathway act analogously to STATs will be interesting to ascertain. Clearly though, the present work establishes that environmental sensors can have profound effects on active enhancer landscapes in the process of cellular differentiation and, in this manner, directly link signal transduction with epigenetic regulation. EXPERIMENTAL PROCEDURES Mice, Isolation of Cells, and Cell Culture C57BL/6J, T-bet-, and STAT6-deficient mice were purchased from Jackson Laboratory. STAT4- and STAT1-deficient mice were provided by Dr. Mark Kaplan (Indiana University) and Dr. Joan Durbin (NYU), respectively. Animals were handled and housed in accordance with the guidelines of the NIH Animal Care and User Committee. Splenic and lymph node T cells were obtained by disrupting organs of 8- to 10-week-old mice. All cell cultures were performed in RPMI supplemented with 10% fetal calf serum, 2 mM glutamine, 100 IU/ml penicillin, 0.1 mg/ml streptomycin, and 2.5 mM b-mercaptoethanol. T cells

independent. Each column represents p300 binding in wild-type (Stat6 / ) cells transduced with control or GATA3-expressing retrovirus centered on the Th2specific p300 sites. Color map corresponds to binding intensities where ‘‘black’’ represents no binding. (B) GATA3 recovers p300 binding at genes whose expression levels are recovered by GATA3. Box plots show median and quartiles of expression levels in RPKM (left) and normalized p300 binding in tag per million (right) in wild-type (Stat6 / ) cells transduced with control or GATA3-expressing retrovirus at genes recovered by GATA3. (C) GATA3 has no effect on p300 binding at genes whose expression levels are not affected by GATA3. (D) T-bet overexpression fails to recover the chromatin signature of STAT4-dependent enhancers. Of 6,820 Th1-specific STAT4-dependent sites, 1,614 (23%) regulatory elements are recovered in STAT4-deficient T-bet-expressing cells. Each column represents p300 binding in wild-type (Stat4 / ) cells infected with control or T-bet-expressing retrovirus centered on the Th1-specific p300 sites. (E) T-bet fails to recover p300 binding at genes whose expression levels are recovered by T-bet. Box plots show median and quartiles of gene expression levels in RPKM (left) and normalized p300 binding in tag per million (right) in wild-type (Stat4 / ) cells transduced with control or T-bet-expressing retrovirus. (F) T-bet has no effect on p300 binding at genes whose expression levels are not recovered by T-bet. See also Figure S6 and Table S3.

Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 991

were enriched by using a CD4+ T Cell Kit and AutoMacs isolator (Miltenyi Biotec, Auburn, CA). Naive CD4+ T cells were isolated by flow cytometry, staining with anti-CD4, anti-CD62L, anti-CD44, and anti-CD25 antibodies. Naive CD4+ T cells were first cultured in the presence of plate-bound anti-CD3 and antiCD28 (10 mg/ml each), IL-12 (10 ng/ml), and anti-IL-4 (10 mg/ml) for 3 days, followed by IL-2 (50 U/ml) and IL-12 (10 ng/ml) for 4 days (Th1) or anti-CD3 and anti-CD28, IL-4 (10 ng/ml) and anti-IFNg (10 mg/ml) for 3 days, followed by IL-2 (50 U/ml) and IL-4 (10 ng/ml) for 4 days (Th2). Before harvesting, cells were restimulated with plate-bound anti-CD3 and anti-CD28 and cytokines for 2 hr. Cytokines were from R&D Systems (Minneapolis, MN), and antibodies were from BD Phamingen (San Jose, CA) and eBiosciences. Chromatin Immunoprecipitation For histone modification H3K4me1 (H3K4me1: ab8895, AbCam), T cells (2 3 107) were treated with MNase to generate mononucleosome fraction. For STAT1 and p300, we chemically crosslinked and sonicated cells to generate fractionated genomic DNA. Chromatin immunoprecipitation was performed by using anti-STAT1 (sc-592, Santa Cruz Biotechnology) and anti-p300 (sc585, Santa Cruz Biotechnology). The DNA fragments were blunt-end ligated to the Illumina adaptors, amplified, and sequenced by using the Illumina Genome Analyzer II (Illumina, San Diego, CA). Sequence reads of 25 or 36 bps were obtained by using the Illumina Analysis Pipeline. All reads were mapped to the mouse genome (mm9), and only uniquely matching reads were retained. More details on H3K4me3, H3K27me3, STAT4, STAT6, and T-bet ChIP-seq data can be found in Wei et al. (2010) and Nakayamada et al. (2011).

sequenced for 100 cycles (single read) with HiSeq 2000 (Illumina). Sequence reads from each cDNA library were mapped onto the mouse genome build mm9 by using tophat, and the mappable data were then processed by Cufflinks (Trapnell et al., 2010). The obtained data were normalized based on RPKM (reads per kilobase exon model per million mapped reads). To find differentially regulated genes, we used a 1.5- to 4-fold change difference between different cell types or genotypes. To define recovered genes after the overexpression of master regulators, we used 1.5-fold change difference. ChIP-Seq Peak Calling The unique tags for histone modifications were mapped into nonoverlapping 200 bp windows of the mouse genome. Significant islands were identified based on window tag-count threshold determined from a p value = 0.05 defined by Poisson background model using SICER, a method appropriate for broad peaks (Zang et al., 2009). For p300, STATs, T-bet, and Gata3, which have discrete binding sites, CisGenome v2.0, an extension of the earlier version (Ji et al., 2008), was utilized with a reference control of the normal rabbit serum IP. ACCESSION NUMBERS The ChIP-seq and RNA-seq data are deposited in GEO under accession number GSE40463. SUPPLEMENTAL INFORMATION

Retroviral Transduction To overexpress T-bet, we first made pMY-IRES-hNGFR vector as a control vector by replacing EGFP of pMYs-IRES-GFP vector (Cell Biolabs) with hNGFR lacking intracellular domain. Then T-bet complementary DNA (cDNA) was appropriately subcloned into pMY-IRES-hNGFR vector to generate pMY-T-bet-IRES-hNGFR vector (RV-T-bet) for T-bet overexpression. Retroviral vector was transfected into PlatE cells (Cell Biolabs) to generate recombinant retrovirus. To perform retroviral transduction of CD4+ T cells, sorted naive CD4+ T cells from WT or Stat4 / mice were cultured in the presence of platebound anti-CD3 and anti-CD28 (10 mg/ml each) with anti-IL-4 (10 mg/ml) for 16 hr. Culture medium was replaced with retroviral soup and 4 mg/ml polybrene, followed by centrifugation at 2,500 rpm for 2 hr. After 4 hr incubation at 37 C, viral supernatant was replaced with Th1 cell culture medium containing IL-12 (10 ng/ml) and anti-IL-4 for 2 days. After that, cells were released from TCR stimulation and were cultured further in IL-2 (50 U/ml) and IL-12 and grown for an additional 3 days. In a similar manner, Gata3 overexpression vector was made in Th2 cells. To perform retroviral transduction of CD4+ T cells, sorted naive CD4+ T cells from WT or Stat6 / mice were cultured in the presence of plate-bound anti-CD3 and anti-CD28 (10 mg/ml each) with anti-IFNG (10 mg/ml) for 16 hr. Culture medium was replaced with retroviral soup and 4 mg/ml polybrene, followed by centrifugation at 2,500 rpm for 2 hr. After 4 hr incubation at 37 C, viral supernatant was replaced with Th2 cell culture medium containing IL-4 (10 ng/ml) and anti-IFNG for 2 days. After that, cells were released from TCR stimulation and were cultured further in IL-2 (50 U/ml) and IL-4 and grown for an additional 3 days. For intracellular staining, cells were restimulated for 2 hr with 50 ng/ml PMA and 1 mg/ml ionomycin with the addition of brefeldin A (GolgiPlug; BD) and then fixed and permeabilized with Cytofix/Cytoperm solution (BD). Intracellular staining was performed by using APC anti-IFN-g, PerCP-Cy5.5 anti-CD4, Alexa Fluor-488 anti-GATA3, PE anti-IL-4 (BD), or Alexa Fluor-647 anti-T-bet (eBioscience) on ice for 30 min. Stained cells were analyzed on a flow cytometer (FACSVerse; BD). Events were collected and analyzed with FlowJo software (Tree Star).

Supplemental Information includes Extended Experimental Procedures, six figures, and three tables and can be found with this article online at http:// dx.doi.org/10.1016/j.cell.2012.09.044.

RNA Sequencing Total RNA was prepared from 2–5 million cells by using mirVana miRNA Isolation Kit (AM1560, ABI). One microgram of total RNA was subsequently used to prepare RNA-seq library by using TruSeq SRRNA sample prep kit (FC-1221001, Illumina) by following manufacturer’s protocol. The libraries were

Bulger, M., and Groudine, M. (2011). Functional and mechanistic diversity of distal transcription enhancers. Cell 144, 327–339.

992 Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc.

ACKNOWLEDGMENTS The authors thank Drs. K. Zhao, A. Poholek, and K. Ghoreschi for critically reading this manuscript. We also thank G. Gutierrez-Cruz (Biodata Mining Core Facility, NIAMS), J. Simone, and J. Lay (Flow Cytometry Section, NIAMS) for their excellent technical support. This study utilized the high-performance computational capabilities of the Biowulf Linux cluster at the NIH. This work was supported by the Intramural Research Programs of NIAMS. Received: April 20, 2012 Revised: July 23, 2012 Accepted: September 28, 2012 Published: November 20, 2012 REFERENCES Afkarian, M., Sedy, J.R., Yang, J., Jacobson, N.G., Cereb, N., Yang, S.Y., Murphy, T.L., and Murphy, K.M. (2002). T-bet is a STAT1-induced regulator of IL-12R expression in naı¨ve CD4+ T cells. Nat. Immunol. 3, 549–557. Ansel, K.M., Djuretic, I., Tanasa, B., and Rao, A. (2006). Regulation of Th2 differentiation and Il4 locus accessibility. Annu. Rev. Immunol. 24, 607–656. Balasubramani, A., Shibata, Y., Crawford, G.E., Baldwin, A.S., Hatton, R.D., and Weaver, C.T. (2010). Modular utilization of distal cis-regulatory elements controls Ifng gene expression in T cells activated by distinct stimuli. Immunity 33, 35–47. Biddie, S.C., John, S., Sabo, P.J., Thurman, R.E., Johnson, T.A., Schiltz, R.L., Miranda, T.B., Sung, M.H., Trump, S., Lightman, S.L., et al. (2011). Transcription factor AP1 potentiates chromatin accessibility and glucocorticoid receptor binding. Mol. Cell 43, 145–155.

Chong, M.M.W., Simpson, N., Ciofani, M., Chen, G., Collins, A., and Littman, D.R. (2010). Epigenetic propagation of CD4 expression is established by the Cd4 proximal enhancer in helper T cells. Genes Dev. 24, 659–669.

Creyghton, M.P., Cheng, A.W., Welstead, G.G., Kooistra, T., Carey, B.W., Steine, E.J., Hanna, J., Lodato, M.A., Frampton, G.M., Sharp, P.A., et al. (2010). Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl. Acad. Sci. USA 107, 21931–21936. Eisenberg, E., and Levanon, E.Y. (2003). Human housekeeping genes are compact. Trends Genet. 19, 362–365. Ernst, J., Kheradpour, P., Mikkelsen, T.S., Shoresh, N., Ward, L.D., Epstein, C.B., Zhang, X., Wang, L., Issner, R., Coyne, M., et al. (2011). Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49. Ghisletti, S., Barozzi, I., Mietton, F., Polletti, S., De Santa, F., Venturini, E., Gregory, L., Lonie, L., Chew, A., Wei, C.L., et al. (2010). Identification and characterization of enhancers controlling the inflammatory gene expression program in macrophages. Immunity 32, 317–328. Goenka, S., and Kaplan, M.H. (2011). Transcriptional regulation by STAT6. Immunol. Res. 50, 87–96. Heintzman, N.D., Hon, G.C., Hawkins, R.D., Kheradpour, P., Stark, A., Harp, L.F., Ye, Z., Lee, L.K., Stuart, R.K., Ching, C.W., et al. (2009). Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature 459, 108–112. Horvath, C.M. (2000). STAT proteins and transcriptional responses to extracellular signals. Trends Biochem. Sci. 25, 496–502. Intlekofer, A.M., Takemoto, N., Wherry, E.J., Longworth, S.A., Northrup, J.T., Palanivel, V.R., Mullen, A.C., Gasink, C.R., Kaech, S.M., Miller, J.D., et al. (2005). Effector and memory CD8+ T cell fate coupled by T-bet and eomesodermin. Nat. Immunol. 6, 1236–1244. Jacob, F., and Monod, J. (1961). Genetic regulatory mechanisms in the synthesis of proteins. J. Mol. Biol. 3, 318–356. Ji, H., Jiang, H., Ma, W., Johnson, D.S., Myers, R.M., and Wong, W.H. (2008). An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nat. Biotechnol. 26, 1293–1300. Kashiwada, M., Cassel, S.L., Colgan, J.D., and Rothman, P.B. (2011). NFIL3/ E4BP4 controls type 2 T helper cell cytokine expression. EMBO J. 30, 2071– 2082. Krebs, A.R., Karmodiya, K., Lindahl-Allen, M., Struhl, K., and Tora, L. (2011). SAGA and ATAC histone acetyl transferase complexes regulate distinct sets of genes and ATAC defines a class of p300-independent enhancers. Mol. Cell 44, 410–423. Lee, G.R., Fields, P.E., and Flavell, R.A. (2001). Regulation of IL-4 gene expression by distal regulatory elements and GATA-3 at the chromatin level. Immunity 14, 447–459. Lighvani, A.A., Frucht, D.M., Jankovic, D., Yamane, H., Aliberti, J., Hissong, B.D., Nguyen, B.V., Gadina, M., Sher, A., Paul, W.E., and O’Shea, J.J. (2001). T-bet is rapidly induced by interferon-gamma in lymphoid and myeloid cells. Proc. Natl. Acad. Sci. USA 98, 15137–15142. May, D., Blow, M.J., Kaplan, T., McCulley, D.J., Jensen, B.C., Akiyama, J.A., Holt, A., Plajzer-Frick, I., Shoukry, M., Wright, C., et al. (2011). Large-scale discovery of enhancers from human heart tissue. Nat. Genet. 44, 89–93.

Role of T-bet in commitment of TH1 cells before IL-12-dependent selection. Science 292, 1907–1910. Nakayamada, S., Kanno, Y., Takahashi, H., Jankovic, D., Lu, K.T., Johnson, T.A., Sun, H.W., Vahedi, G., Hakim, O., Handon, R., et al. (2011). Early Th1 cell differentiation is marked by a Tfh cell-like transition. Immunity 35, 919–931. Natoli, G. (2010). Maintaining cell identity through global control of genomic organization. Immunity 33, 12–24. Nicolas, C.S., Peineau, S., Amici, M., Csaba, Z., Fafouri, A., Javalet, C., Collett, V.J., Hildebrandt, L., Seaton, G., Choi, S.L., et al. (2012). The Jak/STAT pathway is involved in synaptic plasticity. Neuron 73, 374–390. O’Shea, J.J., and Paul, W.E. (2010). Mechanisms underlying lineage commitment and plasticity of helper CD4+ T cells. Science 327, 1098–1102. Paulson, M., Pisharody, S., Pan, L., Guadagno, S., Mui, A.L., and Levy, D.E. (1999). Stat protein transactivation domains recruit p300/CBP through widely divergent sequences. J. Biol. Chem. 274, 25343–25349. Rada-Iglesias, A., Bajpai, R., Swigut, T., Brugmann, S.A., Flynn, R.A., and Wysocka, J. (2011). A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279–283. Shen-Orr, S.S., Milo, R., Mangan, S., and Alon, U. (2002). Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68. Smale, S.T. (2010). Seq-ing LPS-induced enhancers. Immunity 32, 296–298. Szabo, S.J., Kim, S.T., Costa, G.L., Zhang, X., Fathman, C.G., and Glimcher, L.H. (2000). A novel transcription factor, T-bet, directs Th1 lineage commitment. Cell 100, 655–669. Thieu, V.T., Yu, Q., Chang, H.C., Yeh, N., Nguyen, E.T., Sehra, S., and Kaplan, M.H. (2008). Signal transducer and activator of transcription 4 is required for the transcription factor T-bet to promote T helper 1 cell-fate determination. Immunity 29, 679–690. Trapnell, C., Williams, B.A., Pertea, G., Mortazavi, A., Kwan, G., van Baren, M.J., Salzberg, S.L., Wold, B.J., and Pachter, L. (2010). Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28, 511–515. Visel, A., Blow, M.J., Li, Z., Zhang, T., Akiyama, J.A., Holt, A., Plajzer-Frick, I., Shoukry, M., Wright, C., Chen, F., et al. (2009). ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature 457, 854–858. Watson, C.J., and Neoh, K. (2008). The Stat family of transcription factors have diverse roles in mammary gland development. Semin. Cell Dev. Biol. 19, 401–406. Wei, L., Vahedi, G., Sun, H.W., Watford, W.T., Takatori, H., Ramos, H.L., Takahashi, H., Liang, J., Gutierrez-Cruz, G., Zang, C.Z., et al. (2010). Discrete roles of STAT4 and STAT6 transcription factors in tuning epigenetic modifications and transcription during T helper cell differentiation. Immunity 32, 840–851. Wei, G., Abraham, B.J., Yagi, R., Jothi, R., Cui, K., Sharma, S., Narlikar, L., Northrup, D.L., Tang, Q., Paul, W.E., et al. (2011). Genome-wide analyses of transcription factor GATA3-mediated gene regulation in distinct T cell types. Immunity 35, 299–311.

Miller, F.D., and Gauthier, A.S. (2007). Timing is everything: making neurons versus glia in the developing cortex. Neuron 54, 357–369.

Zang, C., Schones, D.E., Zeng, C., Cui, K., Zhao, K., and Peng, W. (2009). A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics 25, 1952–1958.

Mortazavi, A., Williams, B.A., McCue, K., Schaeffer, L., and Wold, B. (2008). Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628.

Zhang, J.A., Mortazavi, A., Williams, B.A., Wold, B.J., and Rothenberg, E.V. (2012). Dynamic transformations of genome-wide epigenetic marking and transcriptional control establish T cell identity. Cell 149, 467–482.

Mullen, A.C., High, F.A., Hutchins, A.S., Lee, H.W., Villarino, A.V., Livingston, D.M., Kung, A.L., Cereb, N., Yao, T.P., Yang, S.Y., and Reiner, S.L. (2001).

Zhu, J., and Paul, W.E. (2008). CD4 T cells: fates, functions, and faults. Blood 112, 1557–1569.

Cell 151, 981–993, November 21, 2012 ª2012 Elsevier Inc. 993