Accepted Manuscript Title: Identification of novel biomarkers for MLL translocated acute myeloid leukemia Author: Karine Lagacé, Fréderic Barabé, Josée Hebert, Sonia Cellot, Brian T. Wilhelm PII: DOI: Reference:
S0301-472X(17)30758-0 http://dx.doi.org/doi: 10.1016/j.exphem.2017.08.006 EXPHEM 3571
To appear in:
Experimental Hematology
Received date: Revised date: Accepted date:
19-7-2017 25-8-2017 28-8-2017
Please cite this article as: Karine Lagacé, Fréderic Barabé, Josée Hebert, Sonia Cellot, Brian T. Wilhelm, Identification of novel biomarkers for MLL translocated acute myeloid leukemia, Experimental Hematology (2017), http://dx.doi.org/doi: 10.1016/j.exphem.2017.08.006. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
1 2 3
Identification of novel biomarkers for MLL translocated acute myeloid leukemia
4
Karine Lagacé , Fréderic Barabé
1
2,3,4
5,6
7
, Josée Hebert , Sonia Cellot , Brian T Wilhelm
1,6
5 6 7
1
8 9
2
10
3
CHU de Québec Hôpital Enfant-Jésus; Quebec City, Quebec, Canada;
11
4
Department of Medicine, Université Laval, Quebec City, Quebec, Canada,
12 13
5
14
6
15 16
7
17
Category: Malignant Hematopoiesis
18
Word count : 1929
Laboratory for High throughput biology, Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, QC, Canada Centre de recherche en infectiologie du CHUL, Centre de recherche du CHU de Québec, Quebec City, Quebec, Canada,
Division of Hematology-Oncology and Leukemia Cell Bank of Quebec, Maisonneuve-Rosemont Hospital, Montréal, QC, Canada, Department of Medicine, Université de Montréal, Montréal, QC, Canada.
Department of pediatrics, division of Hematology, Ste-Justine Hospital, Montréal, QC, Canada, Université de Montréal, Montréal, QC, Canada,
19 20 21 22 23 24 25 26 27 28
*Correspondence to: Brian Wilhelm Institute for Research in Immunology and Cancer Université de Montréal P.O. Box 6128, Station Centre-Ville Montreal, Québec, Canada H3C 3J7 Phone : +1 514 343.6111 ext 0923 Fax : +1 514 343.7780 E-mail :
[email protected]
Page 1 of 15
29
Highlights
30
1) Human model AMLs has allowed the identification of new biomarkers for MLL-AML
31
2) qRT-PCR tests validate MLL-specific and pan-AML expression biomarkers
32
3) The novel biomarkers provide a more sensitive assay than current tests
33 34
Abstract
35
Acute myeloid leukemias (AML) with translocations of the Mixed Lineage Leukemia
36
(MLL/KMT2A) gene are common in young patients and are generally associated with
37
poor clinical outcomes. The molecular biology of MLL fusion genes remains incompletely
38
characterized and is complicated by the fact that over 100 different partner genes have
39
been identified in fusions with MLL. The continually growing list of MLL fusions also
40
represents a clinical challenge with respect to identification of novel fusions and tracking
41
the fusions to monitor progression of the disease after treatment. We have recently
42
developed a novel single donor model leukemia system that permits the development of
43
human AML from normal cord blood cells. Gene expression analysis of this model and
44
MLL-AML patient samples has identified a number of candidate biomarker genes with
45
highly biased expression on leukemic cells. Here we present the data demonstrating the
46
potential clinical utility of several of these candidate genes for identifying known and
47
novel MLL fusions.
48 49 50
Page 2 of 15
51
Introduction
52
Chromosomal translocations involving the MLL (KMT2A) gene (11q23) are
53
among the most common genetic aberrations in pediatric acute myeloid leukemia (AML)
54
(~13%) [1] and are associated with intermediate to poor prognosis depending on the
55
specific fusion partner gene [2]. Screening tools to detect MLL-rearranged AML are
56
therefore extremely valuable in the clinical setting to guide molecular classification. Most
57
clinical laboratories perform MLL break apart FISH (fluorescent in situ hybridization)
58
techniques [3] to screen for MLL-AML, while karyotype or RT-PCR using specific primers
59
allow for the identification of the specific MLL fusion partner, which is critical for risk
60
stratification and for monitoring residual disease after treatment (minimal residual
61
disease, MRD). The diagnosis of MLL translocations represents a clinical challenge
62
since more than 100 different partner genes have been found fused to MLL [4]. In
63
addition, some of the resulting fusions can be cryptic, complicating the development of
64
fusion-specific molecular assays by standard cytogenetic techniques. In this context, the
65
development of small panels of genes that are specifically overexpressed in MLL-AML
66
would contribute to streamline the detection of MLL-AML cases at diagnosis and
67
potentially to track molecular MRD when tumor burden is low, where sensitive and
68
specific assays are required. MRD tracking by flow cytometry in AML is currently
69
associated with a 20-40% rate of false negative results [5], prompting the elaboration of
70
improved strategies to follow treatment response. Through the development of single
71
donor human AML models [6], and their comparison to MLL-AML patients, we have
72
identified a set of genes with high, and leukemia biased, expression relative to normal
73
blood cells. Here we report on the use of 7 of these genes to define a molecular
74
signature that has potential clinical utility for the diagnosis of MLL-translocated AMLs.
75
Page 3 of 15
76
Methods
77
Patient and normal samples
78
All pediatric and adult AML patient samples and additional clinical information
79
used in this study were collected by the Banque de cellules leucémiques du Québec
80
(BCLQ) with informed consent and project approval from the Research Ethics Boards of
81
CHU Ste-Justine, Maisonneuve-Rosemont Hospital and Université de Montréal.
82
Peripheral blood and bone marrow samples were collected from healthy donors under
83
informed consent.
84
Identification of candidate genes
85
A list of 39 MLL-AF9 AML specific genes was previously identified using a human
86
leukemia model, as described in Barabé et al. [6]. Briefly, cord blood stem cells from
87
single healthy donors were transduced with a retrovirus expressing the MLL-AF9 (MA9)
88
fusion gene. Transduced cells were expanded in vitro and injected into NSG
89
immunodeficient mice to generate multiple human leukemias, arising from the same
90
donor genetic background. In order to identify changes in gene expression level taking
91
place during leukemogenesis, RNA-Seq was performed (as described [6]) on samples
92
from each stage of leukemia development (healthy CD34+, CD34++ MLL-AF9 and model
93
AMLs). A data filtering strategy was developed to identify genes expressed specifically in
94
MA9-AML (RPKM >3) but not expressed (RPKM < 1) in normal tissues and other AML
95
samples which have no MLL translocations. From the 39 candidate genes identified, 7
96
were selected for the testing as potential diagnostic tools based on their overall
97
expression level, and expression pattern in leukemia and normal tissues.
98
RNA isolation, cDNA synthesis and qRT-PCR
Page 4 of 15
99
Total RNA was extracted from AML and normal samples using TRIzol reagent
100
(Invitrogen). RNA was treated with DNase prior to being reverse transcribed using a mix
101
of random primers and oligo d(T) using the QuantiTect Reverse Transcription kit
102
(QIAGEN) as described by the manufacturer. Samples were diluted 1:3 prior to use for
103
qRT-PCR and gene expression was determined using custom TaqMan qPCR assays
104
(IDT) (Supplementary Tables 3, 4). The amplification efficiency of each qRT-PCR assay
105
was validated in triplicate using tenfold serial dilutions of each target gene over 5 orders
106
of magnitude in parallel with serial dilutions of the control gene (ABL1). qRT-PCR
107
reactions were performed using PerfeCTa® qPCR FastMix II (Quanta Biosciences), 0.5
108
µM of each primer and 0.25 µM of each probe on a Viia7 instrument (Life Technologies)
109
at the IRIC genomics platform. Amplification was programmed with an initial step of 20
110
sec at 95˚C, followed by 40 cycles of: 1 sec at 95˚C and 20 sec at 60˚C. Relative
111
expression was calculated using the comparative CT method (2-CT) and normalization
112
was done using ABL1 reference gene with each PCR reaction performed in duplicate.
113
MRD assay sensitivity
114
The MRD assay sensitivity was tested by serially diluting cDNA from MA9
115
patients in healthy bone marrow cDNA over 6 orders of magnitude (from 10 0 down to 10-
116
5
). qPCR experiments were performed in duplicates as described above.
117
Page 5 of 15
118
Results and discussion
119
We have previously developed a xenograft model system, using single human
120
donors, to generate acute myeloid leukemias [6]. Using RNA-seq data from this model,
121
combined with AML patients harbouring an MLL-AF9 (MA9) translocation, we sought to
122
identify potential novel biomarkers for this subgroup of AMLs. The selection of genes
123
consistently expressed in MA9-AMLs but not expressed in normal CD34+ cells,
124
mononuclear cells and B-cell acute lymphoblastic leukemias (B-ALL) generated by our
125
model, revealed 39 potential biomarker genes [6] (Fig. 1A). While these genes were
126
initially identified using MA9-AMLs, in fact many of these candidate genes are also
127
expressed in other AMLs with or without MLL fusions. Most of these genes have been
128
studied in other contexts (Supplementary Table 1), but to date, none have been
129
associated with a pathophysiological role in acute myeloid leukemia. From these 39
130
genes, 7 were selected based on their differential expression patterns in leukemic cells
131
and normal tissues (RET is shown as an example in Fig 1b) to design a qRT-PCR
132
diagnostic assay.
133
To evaluate the potential of these candidate genes as diagnostic targets, their
134
expression level was first assayed by qRT-PCR using a cohort of 87 AML patients and
135
14 healthy donors (Supplementary Table 2). Relative expression values were calculated
136
and compared between defined cytogenetic groups including MLL-rearranged, normal
137
karyotype, Monosomy 5/5q- and 7/7q-, intermediate abnormal karyotype and complex
138
karyotype. To examine the influence of the source of normal blood cells on the
139
expression level of our genes, we also analyzed 3 samples of peripheral blood (PB), 3
140
fresh bone marrow (BM) and 8 frozen bone marrow from different healthy donors.
Page 6 of 15
141
Our results confirmed that all 7 candidate genes were highly upregulated in
142
patients with MLL rearrangement compared to other cytogenetic subgroups and normal
143
bone marrow (Fig. 1C, Supplementary Fig. 1). Because of its high expression level,
144
PPP1R27 was the most promising target, being expressed around ~100 times higher on
145
average in MLL-rearranged leukemias than in control BM (Fig. 1C). When focusing on
146
MLL translocations organized by fusion partner, we found that our potential biomarkers
147
were upregulated in the vast majority of MLL fusion subgroups, except for MLL-GAS7,
148
for which all candidate genes but NRG4 were expressed at a lower or similar level to BM
149
(Supplementary Fig. 2). These results suggest that our candidate genes are suitable
150
biomarkers for clinical diagnostic of most MLL leukemias, with some rare exceptions.
151
Importantly, these results also clearly demonstrate that the downstream transcriptional
152
targets of MLL translocations, whether direct or indirect, have the potential to simplify the
153
diagnostic tests for a broad range of known or novel fusions. Moreover, the candidate
154
gene NRG4 was upregulated in all AML patients, regardless of the cytogenetic subgroup
155
(Fig. 1C), suggesting that this target could be suitable for a “pan-AML” diagnostic test.
156
With respect to the expression of the candidate genes in the normal control cells,
157
although a slight difference was observed between fresh and frozen BM samples, the
158
expression values were generally comparable. As expected, due to the higher frequency
159
of normal progenitor cells, we found that the expression level of the candidate genes
160
was generally higher in BM than in PB. The abundance of leukemia-specific transcripts
161
is thus generally higher in bone marrow than blood, particularly in the context of low
162
tumor burden following chemotherapy [7]. For this reason, the mean value of fresh BM
163
samples was used as the reference level of expression for our experiments, although
164
detection levels in PB were clearly more sensitive.
Page 7 of 15
165
To further validate our findings with an external cohort, we used the RNA-Seq
166
data from the TARGET pediatric AML cohort [8]. When grouped by FAB classification,
167
there was a significantly higher expression of our candidate genes in M5 AMLs,
168
characteristic of patients with MLL translocation (Supplemental Fig. 3A). Samples with
169
the reported presence of MLL translocation showed also an enrichment of the
170
biomarkers (Supplemental Fig. 3B). In addition, analysis of patients with high expression
171
of both of these markers identified ~76% of the MLL-positive patients in the cohort
172
(Supplemental Fig. 4). These results support our observation that these genes highlight
173
a large majority of MLL-translocated samples.
174
Given the specificity of our candidate genes, we next wanted to investigate their
175
performance relative to the current clinical standard tracking the expression level of the
176
fusion gene. We therefore compared the expression level of the candidate genes to the
177
fusion transcripts in several MLL-rearranged subgroups using two approaches, RNA-seq
178
and qRT-PCR. Previous RNA-Seq data of 13 AML patients expressing the MLL-AF9
179
translocation [10] suggested that most of our candidate genes are generally expressed
180
at higher levels (~20-150x) than the fusion gene (Fig. 2A). To confirm these data by
181
qRT-PCR, primers spanning the fusion junction were designed to measure the
182
expression level of the fusion gene in two or three samples of the most frequent MLL
183
translocation subgroups (Fig. 2B). For patients with MLL-AF6, MLL-ENL and MLL-ELL
184
fusions, we found that the expression level of the four tested candidate genes was
185
similar to the corresponding fusion. However, in MLL-AF9 cases, the four candidates
186
were expressed 4-80 times higher than the MLL-AF9 fusion transcript. Similar results
187
were observed in patients harbouring the MLL-AF10 translocation, except for SCUBE1,
188
which had comparable expression to the fusion. For diagnostic purposes, targeting the
189
candidate genes in these cases would provide a more sensitive assay than tracking the
Page 8 of 15
190
expression of the fusion alone. Importantly, the expression of our biomarker genes is
191
stable between diagnostic and relapse samples (Supplementary Fig. 5), indicating that
192
they have utility throughout the course of treatment. Additionally, because MLL-AF9 and
193
MLL-AF10 fusions represent together ~50% of all MLL translocations found in the
194
pediatric AML population and 37% of adult MLL-AML patients [11], use of these genes
195
as diagnostic targets could potentially benefit a large proportion of patients. Specifically,
196
qRT-PCR performed using the 2 most sensitive genes (PPP1R27, CCL23) would allow
197
the rapid identification of samples with potential MLL fusions, including potentially
198
novel/cryptic fusions, or in samples with complex karyotypes [8].
199
The monitoring of minimal residual disease (MRD) to assess treatment response
200
is of significant importance for risk stratification and early detection of relapse [12]. In
201
AMLs harboring chromosomal translocations such as MLL rearrangements, the fusion
202
transcripts are currently used in the clinic for MRD detection, mostly because fusion-
203
based MRD measurements are among the most sensitive methods available. The
204
sensitivity varies depending on the number of fusion transcripts per leukemic cell, but
205
typically reaches 1×10−5 (1 leukemic cell in 100,000 normal cells) [13]. To test whether
206
any of our candidate genes might be useful in the context of MRD testing, we performed
207
qRT-PCR assays using tenfold serial dilutions of AML cDNA added to healthy bone
208
marrow cDNA. The test results did not demonstrate greater sensitivity compared to
209
standard clinical MRD detection assays used, with the sensitivity of our assays ranging
210
between 10-1 and 10-3. These results likely reflect that our candidate genes, while biased
211
towards significantly higher expression in AMLs, are nevertheless expressed at low
212
levels in normal bone marrow and blood.
213
Conclusion
Page 9 of 15
214
While FISH and RT-PCR diagnostic methods currently in place to screen for MLL
215
rearrangements have good sensitivity, their technical limitations mean that they are not
216
as useful for patients who have less frequent or novel fusion breakpoints or some cryptic
217
translocations. We have shown in this study that measuring the expression level of MLL-
218
AML biomarkers could provide a simple and reliable diagnostic method for AML patients
219
with MLL rearrangements. We have also shown that in many cases our candidate genes
220
have generally higher expression levels than the fusion genes, allowing a more sensitive
221
detection. The method described in this study could provide a quick and cost-effective
222
screen of MLL status in routine referrals.
Page 10 of 15
223
Acknowledgments:
224
This work was supported by a Terry Fox New Investigator grant (1042-BTW and 1043-
225
SC) including funding from the Cole Foundation (BTW, SC), the Fonds de Recherche du
226
Québec en Santé (BTW), La Fondation du Centre de Cancérologie Charles Bruneau
227
and La Fondation CHU Sainte-Justine (SC). Human leukemia specimens were collected
228
and analyzed by the Banque de cellules leucémiques du Québec (BCLQ), supported by
229
the Cancer Research Network of the Fonds de Recherche du Québec en Santé (FRQS).
230 231
Page 11 of 15
232
References
233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265
1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13.
Grimwade, D. and S.D. Freeman, Defining minimal residual disease in acute myeloid leukemia: which platforms are ready for "prime time"? Blood, 2014. 124(23): p. 3345-55. Slany, R.K., The molecular mechanics of mixed lineage leukemia. Oncogene, 2016. 35(40): p. 5215-5223. Wolff, D.J., et al., Guidance for fluorescence in situ hybridization testing in hematologic disorders. J Mol Diagn, 2007. 9(2): p. 134-43. Meyer, C., et al., The MLL recombinome of acute leukemias in 2013. Leukemia, 2013. 27(11): p. 2165-76. Paietta, E., Minimal Residual Disease in AML: Why Has It Lagged Behind Pediatric ALL? Clin Lymphoma Myeloma Leuk, 2015. 15 Suppl: p. S2-6. Barabe, F., et al., Modeling human MLL-AF9 translocated acute myeloid leukemia from single donors reveals RET as a potential therapeutic target. Leukemia, 2017. 31(5): p. 1166-1176. Hills, R.K., et al., Assessment of Minimal Residual Disease in Standard-Risk AML. N Engl J Med, 2016. 375(6): p. e9. Farrar, J.E., et al., Genomic Profiling of Pediatric Acute Myeloid Leukemia Reveals a Changing Mutational Landscape from Disease Diagnosis to Relapse. Cancer Res, 2016. 76(8): p. 2197-205. Lemieux, S., et al., MiSTIC, an integrated platform for the analysis of heterogeneity in large tumour transcriptome datasets. Nucleic Acids Res, 2017. 45(13): p. e122. Lavallee, V.P., et al., The transcriptomic landscape and directed chemical interrogation of MLL-rearranged acute myeloid leukemias. Nat Genet, 2015. 47(9): p. 1030-7. Krivtsov, A.V. and S.A. Armstrong, MLL translocations, histone modifications and leukaemia stem-cell development. Nat Rev Cancer, 2007. 7(11): p. 823-33. Ommen, H.B., Monitoring minimal residual disease in acute myeloid leukaemia: a review of the current evolving strategies. Ther Adv Hematol, 2016. 7(1): p. 3-16. Gabert, J., et al., Standardization and quality control studies of 'real-time' quantitative reverse transcriptase polymerase chain reaction of fusion gene transcripts for residual disease detection in leukemia - a Europe Against Cancer program. Leukemia, 2003. 17(12): p. 2318-57.
266 267
Page 12 of 15
268
Figure legends
269
Figure 1 - Identification and validation of candidate genes by leukemia subgroup.
270
(A) Schematic representation of the data filtering strategy used to identify candidate
271
MLL-AF9 biomarker genes. (B) Boxplots of gene expression data (RPKM) for RET are
272
shown from NK-AMLs (pink), CD34+ cells, and MLL translocated patient or model AMLs;
273
red) and from a variety of normal tissues (GTEx; green) (C) The expression level of
274
candidate genes (log10 of qRT-PCR values relative to ABL1) that are either specific for
275
MLL translocated AML (PPP1R27) or that are expressed in most AMLs (NRG4) are
276
shown. AML subgroups (MLL= MLL translocated, NK-AML=normal karyotype,
277
Mono7=monosomy 7, Int=Intermediate cytogenetic risk, Complex=complex karyotype)
278
are shown along the x-axis. Normal controls from peripheral blood (PB), fresh bone
279
marrow (BM), or previously frozen bone marrow (BM-Frozen) are shown in green. The
280
mean value of the BM group is shown as a reference level of expression (green
281
horizontal dotted line). The bottom panel shows expression level of all MLL translocated
282
is shown, organized by the fusion partner genes involved (x-axis).
283
Figure 2 – RNA-seq and qRT-PCR of candidate genes vs gene fusion. (A) RNA-seq
284
expression data is shown for four candidate genes (red dots) and for the two genes
285
involved in the fusion gene in those specific patients (dark/light green dots). (B) qRT-
286
PCR was performed in AML patients with different MLL fusions using primers specific for
287
candidate genes (coloured forms) or spanning the fusion gene in each patient (black *
288
shape).
289
Supplemental figure 1 – Identification and validation of additional candidate genes
290
by leukemia subgroup. The expression level of seven candidate genes (log10 of qRT-
291
PCR values relative to ABL1) are shown, grouped by cytogenetic classification. Patient
292
sample colouring, AML subgroups and dotted line are as shown in Figure 1, top panel.
Page 13 of 15
293 294
Supplemental figure 2 – Expression of candidate genes by MLL fusion genes. The
295
expression level of seven candidate genes (log10 of qRT-PCR values relative to ABL1)
296
are shown, grouped by MLL fusion genes present. Patient sample colouring, AML
297
subgroups and dotted line are as shown in Figure 1, bottom panel.
298 299
Supplemental figure 3 – External validation using TARGET pediatric AML data.
300
The expression level of two of our candidate genes in the TARGET pediatric AML cohort
301
[8] (n=358) are shown with the data grouped by FAB classification (A) or by the reported
302
presence/absence of MLL translocations (B). The mean expression of the candidate
303
genes is significantly different between patients with/without MLL fusions.
304 305
Supplemental figure 4 – Paired gene expression in MISTiC. TARGET gene
306
expression data and clinical annotation was loaded into a local instance of the MISTiC
307
tool [9] and correlation between gene expression was examined for two candidate genes
308
(PPP1R27 and PHACTR3). Selection of patient samples with high expression of both of
309
these genes (~23% of the cohort) results in the identification of 39/51 (~76%) of samples
310
with an MLL fusion which represents a statistically significant enrichment (p = 3.5e-18).
311 312
Supplemental figure 5 – Expression of candidate genes in paired diagnostic and
313
relapse samples. RNA-seq data from diagnostic and relapse samples from two adult
314
AML patients with MLL-AF9 fusions was analyzed. Expression levels of four candidate
315
genes are shown on the left while the two partners of the gene fusion are shown on the
Page 14 of 15
316
right. Solid line shown the expression of the diagnostic (D) samples while dotted lines
317
shown the relapse (R) samples.
Page 15 of 15