A daunting challenge: Human Papillomavirus assays and cytology in primary cervical screening of women below age 30 years

A daunting challenge: Human Papillomavirus assays and cytology in primary cervical screening of women below age 30 years

European Journal of Cancer (2015) 51, 1456– 1466 Available at www.sciencedirect.com ScienceDirect journal homepage: www.ejcancer.com A daunting cha...

447KB Sizes 3 Downloads 38 Views

European Journal of Cancer (2015) 51, 1456– 1466

Available at www.sciencedirect.com

ScienceDirect journal homepage: www.ejcancer.com

A daunting challenge: Human Papillomavirus assays and cytology in primary cervical screening of women below age 30 years Matejka Rebolj a,⇑, Jesper Bonde b,c, Ditte Ejegod b, Sarah Preisler c, Carsten Rygaard b, Elsebeth Lynge a a

Department of Public Health, University of Copenhagen, Øster Farimagsgade 5, P. O. Box 2014, 1014 Copenhagen, Denmark Department of Pathology, Copenhagen University Hospital, Hvidovre, Kettega˚rd Alle´ 30, 2650 Hvidovre, Denmark c Clinical Research Centre, Copenhagen University Hospital, Hvidovre, Kettega˚rd Alle´ 30, 2650 Hvidovre, Denmark b

Received 21 January 2015; received in revised form 14 April 2015; accepted 18 April 2015 Available online 12 May 2015

KEYWORDS Cervical cancer Mass screening Human Papillomavirus Cytology Hybrid Capture 2 Cobas CLART APTIMA

Abstract We compared cytology with Hybrid Capture 2 (HC2), cobas, CLART and APTIMA Human Papillomavirus (HPV) assays in primary cervical screening at age 23– 29 years based on data from the Danish Horizon study. SurePath samples were collected from 1278 women undergoing routine cytology-based screening. Abnormal cytology was managed according to the routine recommendations, and women with cytology-normal/HPV-positive samples were invited for repeated cytology and HPV testing in 1.5 years. Loss to follow-up was similar between HPV assays. PCIN3 was detected in 44 women. The sensitivity of HC2 for PCIN3 was 95% (95% confidence interval (CI): 85–99), of cobas 98% (95% CI: 88–100), of CLART 100% (95% CI: 92–100), of APTIMA 82% (95% CI: 67–92), and of cytology 59% (95% CI: 43–74). Specificity for PCIN3 varied between 61% (95% CI: 59–64) for cobas and 75% (95% CI: 73–78) for APTIMA, and was 94% (95% CI: 93–96) for cytology. Similar results were observed for PCIN2 (N = 68). HPV screening with cytological triage doubled the number of colposcopies compared to cytology screening, and increased the frequency of repeated testing by four (APTIMA) to seven (cobas) times. The positive predictive value of a referral for colposcopy was relatively high for all screening tests (P30% for PCIN3, and P50% for PCIN2). CIN1 was detected by cytology in 1% of women, and in 2% by any of the four HPV assays. Although highly sensitive, HPV-based screening of

⇑ Corresponding author at: Clinical Research Centre/Department of Pathology, Copenhagen University Hospital, Hvidovre, Kettega˚rd Alle´ 30, 2650 Hvidovre, Denmark. Fax: +45 38 62 33 54. E-mail addresses: [email protected] (M. Rebolj), [email protected] (J. Bonde), [email protected] (D. Ejegod), [email protected] (S. Preisler), [email protected] (C. Rygaard), [email protected] (E. Lynge).

http://dx.doi.org/10.1016/j.ejca.2015.04.012 0959-8049/Ó 2015 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

1457

young Danish women should be approached cautiously, as it resulted in large reductions in specificity, and increased the demand for additional testing. Ó 2015 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BYNC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

1. Introduction Human Papillomavirus (HPV) infections are frequent in young women, but the majority clears spontaneously. To avoid false-positive test results, HPV-based primary cervical screening has been considered primarily for women aged P30 years [1–4]. Nevertheless, some studies suggested that certain HPV assays, particularly those based on detection of HPV mRNA rather than DNA, might be suitable for screening at younger ages. In the French FASE (French APTIMA Screening Evaluation) study, for example, 1109 women aged 20–29 years attending routine cervical screening were tested with ThinPrep liquid-based cytology (LBC), Hybrid Capture 2 (HC2) HPV DNA assay, and APTIMA HPV mRNA assay [5]. In that study, APTIMA detected as many cases of high-grade cervical intraepithelial neoplasia (CIN) as HC2, whereas its specificity for high-grade CIN was similar to that of LBC. However encouraging, these findings should ideally be confirmed by data from other studies. Unfortunately, other studies comparing APTIMA with HPV DNA assays in screening populations did not present data specifically for this age group [6] or did not ascertain histological results in women with positive HPV tests and normal cytology [7]. Here, we presented data from the Danish Horizon study using samples from 1278 women aged 23–29 years attending primary cervical screening. All samples were tested with SurePath LBC and four HPV assays (APTIMA HPV Test (APTIMA; Hologic, San Diego, CA), HC2 (QIAGEN, Gaithersburg, MD), cobas HPV Test (cobas; Roche Diagnostics, Pleasanton, CA), and CLART HPV2 Assay (CLART; Genomica, Madrid, Spain)). We used these data to study the impact of the five tests on screening sensitivity, specificity, proportions of women with false-positive tests, and on colposcopy referral rates in primary screening of young women.

2. Materials and methods

primary screening samples were defined as those without a: previous cervical cancer, CIN in 63 years, atypical squamous cells of undetermined significance (ASCUS) or non-CIN cervical biopsy in 615 months, or a more severe cytological abnormality, inadequate cytology, or a positive HPV test in 612 months. Approximately 10% of women aged 23–29 years living in the thencatchment area of the laboratory were vaccinated against HPV [14]. Danish women are recommended for routine cervical screening every 3 years from age 23 onwards. Women included in the Horizon study were managed in line with their cytology and HPV test results, so that this setup where cytology was the basis for routine clinical management also mimicked primary screening with HPV testing and cytology triage. Women with abnormal cytology, regardless of their HPV status, were managed according to the routine Danish guidelines (repeated cytology if ASCUS or low-grade squamous intraepithelial lesions (LSIL), referral for colposcopy otherwise). Women with a positive test result on at least one of the four HPV assays at baseline and normal cytology (i.e. triage-negative) were invited, for study purposes, for repeated cytology and HPV testing in November 2012, approximately 1.5 years after the baseline. A reminder was sent in March 2013. Women who responded to the study follow-up invitation had two SurePath samples taken. Those with abnormal cytology or a positive HC2 test result (corresponding to the routine HPV testing in the laboratory at the time of the study) were recommended for colposcopy. Histology in women with follow-up outside of the study was included in the analysis. All followup outcomes were retrieved from the Patobank in December 2013. This means that all histology was ascertained in approximately 2.5 years after the baseline testing, i.e. throughout most of the recommended 3-year screening interval. Colposcopies were performed following routine protocols recommending biopsies from all suspicious areas, or random biopsies from the four quadrants if lesions were not visible.

2.1. Study design 2.2. Cytology The design of the Horizon study was described in detail previously [8–12]. Consecutive SurePath samples from 5034 women arriving for routine LBC analysis at the Department of Pathology of Copenhagen University Hospital, Hvidovre, in June–August 2011 were tested with the four HPV assays. By linkage to the national Pathology Data Bank (Patobank) [13],

Routine cytological evaluation was undertaken first by FocalPoint Slide Profiler (BD, Burlington, NC). Blinded to the outcomes of HPV testing, samples were thereafter evaluated by cytoscreeners using FocalPoint GS Imaging System (BD), and abnormal findings were adjudicated by pathologists.

1458

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

2.3. HPV testing All assay testing was undertaken in strict accordance with the protocols agreed upon with all manufacturers prior to the study, described in detail previously [8–12]. HC2 testing was undertaken on the post-quot LBC material; cobas, CLART and APTIMA testing were undertaken on the original residual material, diluted approximately 1:1 in SurePath. HC2 detects 13 high-risk HPV genotypes [15] collectively. It is based on hybridisation of HPV DNA to a high-risk HPV RNA probe cocktail. No re-test range was used. Cobas is a real-time PCR analysis detecting the 13 high-risk HPV genotypes and HPV66. The assay separately identifies HPV16 and HPV18, while the remaining 12 genotypes are detected collectively. CLART is a PCR-based low density array assay individually reporting 35 defined genotypes including the 13 high-risk. APTIMA detects E6/E7 mRNA expression of the 13 high-risk HPV types and HPV66 collectively. 2.4. Ethical considerations Baseline testing of residual material was undertaken as a quality development study. In Denmark, such studies do not require ethical approval. Invitation to follow-up of HPV-positive/cytology-normal women was approved by the Ethics Committee of the Danish Capital Region (H-4-2012-120). It was not permitted to reveal the results of the baseline testing in the invitation letter, but women could obtain them from their general practitioner (GP). The study was notified to the Danish Data Inspection Agency (notification number 2010-41-5594). 2.5. Statistical analysis A positive HPV test result was defined as relative light units per cut-off value P1.0 for HC2; cycle threshold values 640.5, 640.0 and 640.0 for cobas’s channels 16, 18 and other high-risk, respectively; signal to cut off P0.5 for APTIMA; and detection of P1 of the 13 highrisk genotypes by CLART. Abnormal cytology was defined as PASCUS. A screening test’s clinical sensitivity was calculated as the proportion of women with high-grade CIN (detected at any point during the study) having a positive screening test result at baseline, and clinical specificity as the proportion of women who did not test positive among those without high-grade CIN. A false-positive test result was defined as a positive test result at baseline not followed by a diagnosis of high-grade CIN. We calculated the relative sensitivity, specificity and the proportion of false-positive tests by comparing the values to those of the reference screening test (HC2 or cytology). The 95% confidence intervals (CI) for sensitivity and

specificity were calculated assuming binomial distribution; those for the relative indicators were calculated assuming lognormal distribution. Persistent infection was defined as a (pooled) high-risk positive HPV test result at baseline and at follow-up on the same HPV assay. We calculated the positive predictive value (PPV) of a persistent infection as the proportion of women with a persistent infection having high-grade CIN. 3. Results 3.1. Study population At baseline, 1278 women had screening samples at age 23–29 years (Table 1, Fig. 1). Ninety-five (7%) had abnormal cytology. Whereas 681 (53%) had normal cytology and negative test results on all four HPV assays, 495 (39%) had normal cytology with a positive test result on at least one assay. Test results were positive on HC2 for 406 (32%) of women, on cobas for 519 (41%), on CLART for 481 (38%) and on APTIMA for 342 (27%). Of the 95 women with abnormal baseline cytology, six (6%) were lost to follow-up (Table 2). For 495 cytology-normal/HPV-positive women, this was 247 (50%); 158 (32%) had study follow-up (Table 3) and 90 (18%) had other follow-up. The completeness of follow-up did not differ substantially by HPV assay. Of the 158 women with repeated testing in 1.5 year within the study (mean = 18.0 months, SD = 1.4 month), 15 (9%) developed abnormal cytology. Persistent infections, i.e. a positive (pooled) high-risk HPV test result followed by another (pooled) high-risk positive HPV test result in 1.5 years, were observed in 69 (62%) of 112 women with a baseline positive HC2 result, 85 (61%) of 140 women positive on cobas, 63 (48%) of 130 positive on CLART and 45 (48%) of 94 women positive on APTIMA. PPV for PCIN2 of persistent HPV infections were 30–40%, and 50% for incident abnormal cytology. 3.2. Sensitivity and specificity for high-grade CIN Of the 44 PCIN3, HC2 detected 42 (sensitivity: 95%, 95% CI: 85–99; Table 4). Cobas detected 43, sensitivity: 98% (95% CI: 88–100), CLART 44 (100%, 95% CI: 92– 100), APTIMA 36 (82%, 95% CI: 67–92) and cytology 26 (59%, 95% CI: 43–74). APTIMA and cytology detected statistically significantly fewer PCIN3 than HC2, cobas and CLART. Specificity for PCIN3 varied between 61% (95% CI: 59–64) for cobas and 75% (95% CI: 73–78) for APTIMA; it was 94% (95% CI: 93–96) for cytology. The differences were significant, with lower specificity for cobas and CLART, and higher for APTIMA, compared to HC2. The results for PCIN2 (N = 68) were similar.

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466 Table 1 Description of the 1278 women aged 23–29 years with screening samples at baseline. Characteristics at baseline

N (%)

Total

1278 (100%)

Age (years) 23 24 25 26 27 28 29

339 (27%) 92 (7%) 123 (10%) 246 (19%) 138 (11%) 135 (11%) 205 (16%)

Cytological diagnosis Inadequate Normal ASCUS LSIL AGC, ASC-H, AIS, HSIL, or carcinoma

6 (<1%) 1177 (92%) 16 (1%) 49 (4%) 30 (2%)

HPV test results Positive on HC2 Positive on cobas Positive on CLART Positive on APTIMA

406 519 481 342

Combined cytology and HPV results Cytology normal and all 4 HPV test results negative Cytology normal and at least one HPV test result positivea Cytology abnormal, regardless of HPV test results Otherb

(32%) (41%) (38%) (27%)

681 (53%) 495 (39%) 95 (7%) 7 (1%)

Abbreviations: AGC, atypical glandular cells; AIS, adenocarcinoma in situ; ASCUS, atypical squamous cells of undetermined significance; ASC-H, atypical squamous cells cannot exclude HSIL; HC2, Hybrid Capture 2; HPV, Human Papillomavirus; HSIL, high-grade squamous intraepithelial lesions; LSIL, low-grade squamous intraepithelial lesions. a Invited for repeated testing at 18 months according to the study protocol. b Inadequate cytology, or normal cytology with at least one invalid HPV test result and no positive HPV test result.

3.3. HPV testing compared with cytology In total, 7% of women were recommended for followup because of abnormal cytology (Table 5). With standalone HPV testing, this proportion was between 27% with APTIMA and 41% with cobas, i.e. about four to six times higher than with cytology. With cytology-based screening, 5% of women were referred for colposcopy and 2% needed only repeated testing. With HPV assays combined with cytological triage, about twice as many had colposcopy. The PPV of a referral for colposcopy was relatively high for all five screening tests (30% for PCIN3, 50% for PCIN2, Table 6). The frequency of repeated testing was four times higher with APTIMA and HC2 compared to cytology, six times higher with CLART and seven times higher with cobas. CIN1 was detected by cytology in 1% of women, and by any of the four HPV assays in 2%; HC2, cobas

1459

and CLART led to a statistically significant increase in CIN1 detection compared to cytology. Detection of PCIN3 was between 60% and 70% higher and of PCIN2 around 90% higher for the three HPV DNA assays compared to cytology. APTIMA detected more high-grade CIN than cytology (38%), but the difference was not significant. As 2.0% of women screened with cytology had PCIN3 detected, 5% had a false-positive cytology test. Using any of the four HPV assays, this proportion was statistically significantly higher, and ranged from 24% with APTIMA to 37% with cobas. 4. Discussion 4.1. General findings Follow-up of young, mostly unvaccinated, Danish women with normal cytology and positive HPV test results on HC2, cobas, CLART or APTIMA substantially increased the overall detection of both PCIN2 (by 70–90%) and PCIN3 (by 40–70%) compared to cytology alone. Not surprisingly, the increased sensitivity was mirrored by a substantial decrease in screening specificity. Even with cytological triage of women with positive HPV tests, about twice as many were referred for colposcopy, and between four and seven times as many had a false-positive test compared to cytology alone. To this end, the lowest increases in the detection of CIN, false-positive tests and referrals to colposcopy were observed for the HPV mRNA assay APTIMA. 4.2. Strengths and weaknesses Horizon was one of the few studies comparing several HPV assays in the same women while also ascertaining the clinical outcomes after cytology-normal/HPV-positive test results. The studied women were representatives for the routine screening population [10], and the samples were tested while fresh [9,11]. Of women with cytology-normal/HPV-positive test results at baseline, 50% had follow-up. Had the follow-up been complete, more high-grade CIN lesions would have been detected on account of HPV testing in cytology-normal women. This would increase the calculated differences in the sensitivity of HPV assays and cytology. However, the gap between the proportion of women with positive screening test results and the detection of high-grade CIN was so large that the incomplete follow-up cannot explain the high proportions of false-positive tests (data not reported). 4.3. Comparison with the literature The substantially higher sensitivity for high-grade CIN in young women for HPV testing compared to

1460

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

Fig. 1. Description of the study. *Women with normal cytology. **Other: Human Papillomavirus (HPV) negative on all four tests or with inadequate cytology. There were six women with inadequate cytology, of whom four had normal cytology in follow-up, and two had normal histology. One woman with normal cytology and four negative HPV test results had CIN2 in follow-up.

cytology was demonstrated previously [16]. In randomised controlled trials comparing primary HPV-based screening (using HC2) with cytology-based screening, the proportions of women with false-positive test results for PCIN3 varied between 12–17% at 25–34 years in Italy and Finland, and 31% at 20–29 years in the United Kingdom (UK) [17]. Proportions of women referred for colposcopy were generally not reported, except in the Italian NTCC Phase 1 trial [4,18]. There, the management at age 25–34 years after a positive HC2 test result was similar as in our study, with repeated testing in 12 months. Women were about twice as often referred for colposcopy with HC2 than with cytology. They also had three times as many false-positive tests with HC2, and were 11 times more likely referred for repeated testing [19]. These NTCC data were consistent with ours in pointing to a higher average number of health care contacts per woman screened with HPV testing than with cytology, at least in the initial HPV screening round. One of the most frequently reported indicators of specificity from the trials was the PPV of a colposcopy referral. Like in our study, the PPV of HPV-based screening with cytological triage tended to be similar to that of cytology-based screening [20]. Yet, about twice as many women without high-grade CIN had colposcopy following HPV testing than following cytology screening alone. Detection of HPV mRNA instead of DNA has been considered a viable and most likely a safe strategy to

reduce the high numbers of false-positive HPV screening tests [21]. The use of APTIMA in primary screening was evaluated in several studies [5–7,22,23]. As mentioned earlier, APTIMA was compared to LBC and HC2 in 1109 women aged 20–29 years in the FASE study [5]. Contrary to our findings, in French women APTIMA was as sensitive for high-grade CIN as HC2, but as specific as cytology. Among French women, 13.5% had abnormal cytology, whereas 23.5% tested positive on HC2, and 15.6% on APTIMA (relative proportion of positive test results for APTIMA versus HC2: 0.66). This difference between APTIMA and HC2 was less pronounced in our study, where the relative proportion of positive test results was 0.84. The FASE study used ThinPrep samples, and women with inadequate cytology (10.5%) were excluded from the analysis. Moreover, any difference in the background risk for cervical cancer, however unmeasured, between the French study and ours may have played a role, and suggested that the trade-offs between the detection and the burden by the different screening technologies may be populationspecific. In our study, APTIMA tended to detect fewer highgrade CIN than the three DNA assays. As the lower detection was confined to young women [24], a plausible interpretation might be a lower degree of over-diagnosis compared to HPV DNA assays [4,25]. mRNA assays are in principle designed to detect transcriptionally active HPV infections. By measuring only transcriptionally active/integrated HPV infections, transient, recent

Table 2 Follow-up outcomes for women with positive screening test results at baseline. Baseline screening test Cytology test result

Normal or abnormal (total)

HPV test result

Total N at baseline (column%)

Worst outcome during follow-up (row%) No follow-up

No histology

Histology

Normal cytology and/or negative HPV testing

Abnormal cytology and/or positive HPV testing

Inadequate histology

No CIN (CIN 0)

CIN1a

CIN2

CIN3b

1278 (100%)

804 (63%)

279 (22%)

32 (3%)

6 (<1%)

61 (5%)

28 (2%)

24 (2%)

44 (3%)

Cytology Abnormal Normal

Any P1 Positive

95 (7%) 495 (39%)

6 (6%) 247 (50%)

22 (23%) 137 (28%)

4 (4%) 23 (5%)

2 (2%) 4 (1%)

16 (17%) 34 (7%)

11 (12%) 17 (3%)

8 (8%) 15 (3%)

26 (27%) 18 (4%)

HC2 Normal or abnormal Normal Abnormal

Positive Positive Positive

406 (32%) 314 (25%) 91 (7%)

152 (37%) 146 (46%) 6 (7%)

101 (25%) 80 (25%) 21 (23%)

22 (5%) 18 (6%) 4 (4%)

6 (1%) 4 (1%) 2 (2%)

36 (9%) 20 (6%) 15 (16%)

25 (6%) 14 (4%) 11 (12%)

22 (5%) 14 (4%) 8 (9%)

42 (10%) 18 (6%) 24 (26%)

Cobas Normal or abnormal Normal Abnormal

Positive Positive Positive

519 (41%) 432 (34%) 87 (7%)

215 (41%) 211 (49%) 4 (5%)

138 (27%) 117 (27%) 21 (24%)

27 (5%) 23 (5%) 4 (5%)

6 (1%) 4 (1%) 2 (2%)

41 (8%) 29 (7%) 12 (14%)

27 (5%) 16 (4%) 11 (13%)

22 (4%) 14 (3%) 8 (9%)

43 (8%) 18 (4%) 25 (29%)

CLART Normal or abnormal Normal Abnormal

Positive Positive Positive

481 (38%) 396 (31%) 85 (7%)

199 (41%) 194 (49%) 5 (6%)

127 (26%) 109 (28%) 18 (21%)

24 (5%) 20 (5%) 4 (5%)

6 (1%) 4 (2%) 2 (2%)

35 (7%) 23 (6%) 12 (14%)

23 (5%) 13 (3%) 10 (12%)

23 (5%) 15 (4%) 8 (9%)

44 (9%) 18 (5%) 26 (31%)

APTIMA Normal or abnormal Normal Abnormal

Positive Positive Positive

342 (27%) 261 (20%) 81 (6%)

128 (37%) 123 (47%) 5 (6%)

82 (24%) 64 (25%) 18 (22%)

19 (6%) 15 (6%) 4 (5%)

6 (2%) 4 (2%) 2 (2%)

27 (8%) 15 (6%) 12 (15%)

22 (6%) 12 (5%) 10 (12%)

22 (6%) 14 (5%) 8 (10%)

36 (11%) 14 (5%) 22 (27%)

Abbreviations: CIN, cervical intraepithelial neoplasia; HC2, Hybrid Capture 2; HPV, Human Papillomavirus. a Includes atypia on histology and CIN not otherwise specified. b No woman had cervical cancer.

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

Negative or positive (total)

1461

1462

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

Table 3 Women with normal cytology and at least one positive HPV test result at baseline (n = 495): 1.5-year persistence of infection, and final histology in 158 women with repeated cytology and HPV testing, by HPV assay. Screening test

Cytology HC2 Cobas CLART APTIMA

Cytology normal and HPV positive at baseline

Women with persistent infections/ incident abnormal cytology

Total

With 18-month follow-up (%)

N (% of those with follow-up)

PCIN2 (% of those with persistent infection)

PCIN3 (% of those with persistent infection)

495a (100%) 314 (100%) 432 (100%) 396 (100%) 261 (100%)

158 (32%) 112 (36%) 140 (32%) 130 (33%) 94 (36%)

15 69 85 63 45

8 (53%) 26 (38%) P25 (P29%)b P25 (P40%)b P20 (P44%)b

4 (27%) 13 (19%) P13 (P16%)b P12 (P19%)b P11 (P19%)b

(9%) (62%) (61%) (48%) (48%)

Abbreviations: CIN, cervical intraepithelial neoplasia; HC2, Hybrid Capture 2; HPV, Human Papillomavirus. a A positive test result on any of the four HPV assays. b Only women with abnormal cytology and/or a positive HC2 test result were referred for colposcopy.

Table 4 Sensitivity and specificity of HPV assays and cytology, and relative sensitivity and specificity compared to Hybrid Capture 2, for detection of PCIN3 and PCIN2, in women aged 23–29 years undergoing primary cervical screening. Sensitivity (95% CI)

Relative sensitivity (95% CI)

Specificity (95% CI)

Relative specificity (95% CI)

Endpoint: PCIN3 HC2 Cobas CLART APTIMA Cytology

95% (85–99) 98% (88–100) 100% (92–100) 82% (67–92) 59% (43–74)

1.00 1.02 1.05 0.86 0.62

(Ref) (0.95–1.11) (0.98–1.12) (0.74–1.00) (0.48–0.80)

71% 61% 65% 75% 94%

(68–73) (59–64) (62–67) (73–78) (93–96)

1.00 0.87 0.92 1.07 1.34

(Ref) (0.82–0.92) (0.87–0.97) (1.02–1.12) (1.29–1.39)

Endpoint: PCIN2 HC2 Cobas CLART APTIMA Cytology

94% 96% 99% 85% 50%

1.00 1.02 1.05 0.91 0.53

(Ref) (0.94–1.10) (0.98–1.12) (0.81–1.02) (0.42–0.68)

72% 62% 66% 77% 95%

(69–74) (60–65) (63–68) (74–79) (94–96)

1.00 0.87 0.92 1.07 1.32

(Ref) (0.82–0.92) (0.87–0.97) (1.02–1.12) (1.27–1.37)

(86–98) (88–99) (92–100) (75–93) (38–62)

Abbreviations: CI, confidence interval; CIN, cervical intraepithelial neoplasia; HC2, Hybrid Capture 2.

and cytoplasmatic and/or transcriptionally inactive HPV infections are theoretically avoided. It can be hypothesised that CIN missed by APTIMA but detected by an HPV DNA assay tended to be those that would have spontaneously regressed if left untreated, which is often the case in younger women [26]; however, this hypothesis is hard to prove. For now, the relative progression potential of HPV mRNA-negative/HPV DNA-positive high-grade CIN is unknown, and leaving these lesions untreated would be unethical. 4.4. Clinical implications Despite screening since the 1960’s, the incidence rate in Danish women aged 25–29 years remains around 12/ 100,000 women [27]. Although highly sensitive HPV assays would probably improve the protection from cervical cancer, they would unlikely eradicate the disease at young age [28]. Furthermore, there is a downside to detecting CIN lesions particularly in women of childbearing age, as their treatment may be associated with an increased risk of poorer pregnancy outcomes [29– 31]. This is particularly problematic given that many high-grade CIN detected at young age do not tend to

progress to cervical cancer [26,32]. Furthermore, a positive HPV test result induces a psychological burden [33,34], and may reduce sexual satisfaction [35]. The challenge, therefore, is to detect and triage to colposcopy only those HPV infections that have a real chance of progressing to cervical cancer. Although a careful selection of women for a referral for colposcopy can on its own not resolve the problem of low specificity of the screening test, colposcopy referral should be optimised with respect to its frequency and CIN detection. Based on the data from the Horizon study, we made some basic post hoc comparisons of the proportions of referred women and CIN detection at baseline for likely triage strategies. In interpreting the outcomes of this post hoc analysis, it should be taken into account that HPV-positive women were in reality managed through cytology triage, and also that in a true routine HPVbased screening setting the performance of cytology as a triage test might change. With cytology triage of HPV-positive women, 6–7% of all screened women would be referred for colposcopy at baseline (Table 7), and the remainder with positive HPV test results for follow-up with repeated testing (which would include a referral in case of abnormal findings). This strategy

Table 5 Consequences of replacing cytology with either of the four HPV assays combined with cytology triage in primary cervical cancer screening of women below age 30 years (N = 1278). Follow-up proceduresa

Abnormal/positive test results

b

Detection of CIN c

False-positive test results

Repeated testing only

CIN1

PCIN2

PCIN3

Endpoint: PCIN2

Endpoint: PCIN3

Absolute numbers, N (%) Cytology 95 (7%) HC2 406 (32%) Cobas 519 (41%) CLART 481 (38%) APTIMA 342 (27%)

63 (5%) 131 (10%) 139 (11%) 131 (10%) 113 (9%)

26 (2%) 123 (10%) 165 (13%) 151 (12%) 101 (8%)

11 25 27 23 22

34 64 65 67 58

26 42 43 44 36

61 (5%) 342 (27%) 454 (36%) 414 (32%) 284 (22%)

69 (5%) 364 (28%) 476 (37%) 437 (34%) 306 (24%)

Relative risk (95% CI) Cytology 1 (Ref) HC2 4.27 (3.47–5.27) Cobas 5.46 (4.45–6.70) CLART 5.06 (4.12–6.22) APTIMA 3.60 (2.91–4.46)

1 (Ref) 2.08 (1.56–2.78) 2.21 (1.66–2.94) 2.08 (1.56–2.78) 1.79 (1.33–2.42)

1 (Ref) 4.73 (3.12–7.17) 6.35 (4.23–9.53) 5.81 (3.86–8.74) 3.88 (2.54–5.94)

1 (Ref) 2.27 (1.12–4.60) 2.45 (1.22–4.93) 2.09 (1.02–4.27) 2.00 (0.97–4.11)

1 (Ref) 5.61 (4.32–7.28) 7.44 (5.76–9.61) 6.79 (5.25–8.78) 4.66 (3.57–6.07)

1 (Ref) 5.28 (4.13–6.74) 6.90 (5.43–8.77) 6.33 (4.97–8.07) 4.43 (3.46–5.69)

(0.9%) (2.0%) (2.1%) (1.8%) (1.7%)

(2.7%) (5.0%) (5.1%) (5.2%) (4.5%)

1 (Ref) 1.88 (1.25–2.83) 1.91 (1.27–2.87) 1.97 (1.31–2.96) 1.71 (1.13–2.59)

(2.0%) (3.3%) (3.4%) (3.4%) (2.8%)

1 (Ref) 1.62 (1.00–2.62) 1.65 (1.02–2.67) 1.69 (1.05–2.73) 1.38 (0.84–2.28)

Abbreviations: CI, confidence interval; CIN, cervical intraepithelial neoplasia; HC2, Hybrid Capture 2. a The difference between the total number of women with positive screening test results and women with either colposcopy or repeated testing only were women who had no follow-up. b Measured as registered with a biopsy in the Patobank. c Includes CIN1, histological atypia and CIN NOS.

Table 6 Positive predictive value of colposcopy. Screening test

Cytology HC2 Cobas CLART APTIMA

Colposcopy (%)

63 (100%) 131 (100%) 139 (100%) 131 (100%) 113 (100%)

Endpoint: PCIN2

Endpoint: PCIN3

N detected (PPV of colposcopy)

Relative PPV (95% CI)

N FP colposcopy (% of all screened women)

RR of FP colposcopy (95% CI)

N detected (PPV of colposcopy)

Relative PPV (95% CI)

N FP colposcopy (% of all screened women)

RR of FP colposcopy (95% CI)

34 64 65 67 58

1 (ref) 0.91 (0.68–1.21) 0.87 (0.65–1.16) 0.95 (0.71–1.26) 0.95 (0.71–1.27)

29 67 74 64 55

1 (ref) 2.3 (1.5–3.5) 2.6 (1.7–3.9) 2.2 (1.4–3.4) 1.9 (1.2–3.0)

26 42 43 44 36

1 (ref) 0.78 (0.53–1.14) 0.75 (0.51–1.10) 0.81 (0.56–1.19) 0.77 (0.52–1.15)

37 89 96 87 77

1 (ref) 2.4 (1.7–3.5) 2.6 (1.8–3.8) 2.4 (1.6–3.4) 2.1 (1.4–3.1)

(54%) (49%) (47%) (51%) (51%)

(2%) (5%) (6%) (5%) (4%)

(41%) (32%) (31%) (34%) (32%)

(3%) (7%) (8%) (7%) (6%)

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

Colposcopy

Abbreviations: CI, confidence interval; CIN, cervical intraepithelial neoplasia; FP, false-positive; PPV, positive predictive value; RR, relative risk.

1463

58 (87%) 313 (24%) HPV positive

HPV positive

HPV 16 and/or 18, and PASCUS HPV 16, 18, 31 and/or 33, and PASCUS HPV positivec Genotyping and cytology Genotyping and cytology HPV test HPV positive

HPV 16, 18, 31 and/or 33 Genotyping HPV positive

Abbreviations: ASCUS, atypical squamous cells of undetermined significance; CIN, cervical intraepithelial neoplasia; HPV, Human Papillomavirus. a Number of women referred for colposcopy at baseline (proportion of all screened women). b Number of women with PCIN2 detectable at baseline (proportion of all PCIN2 detectable by the screening assay). c For HC2, cobas and CLART as primary screening assays: triage with APTIMA. For APTIMA as primary screening assay: triage with HC2.

29 (43%) 60 (5%)

Not relevant 57 (88%)

18 (27%) 36 (3%)

Not relevant 323 (25%)

60 (90%)

Not relevant 18 (28%) Not relevant 34 (3%)

297 (23%)

342 (27%) 81 (6%) Not relevant Not relevant Not relevant Not relevant 312 (24%) 67 (100%) 34 (51%) 38 (57%) 481 (38%) 85 (7%) 182 (14%) 65 (100%) 33 (51%) 37 (57%)

64 (100%) 32 (50%) Not relevant Not relevant Not relevant Not relevant 56 (88%) – PASCUS HPV 16 and/or 18 HPV positive HPV positive HPV positive

None Cytology Genotyping

406 (32%) 91 (7%) Not relevant Not relevant Not relevant Not relevant 312 (24%)

519 (41%) 87 (7%) 201 (16%)

N (%)a

APTIMA

PCIN2 (%)b N (%)a

CLART

N (%)a N (%)a

PCIN2 (%)b Cobas

PCIN2 (%)b HC2 Triage test cut-off Triage test Screening test result

Screening test Screening strategy

Table 7 Number of women referred for colposcopy at baseline depending on the results of cytology or genotyping, and detected PCIN2 at baseline.

58 (100%) 30 (52%) Not relevant Not relevant Not relevant Not relevant 56 (97%)

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

PCIN2 (%)b

1464

would detect just over half of all PCIN2 already at baseline, leaving the remaining half to be detected through repeated testing. If, instead, triage were undertaken using limited genotyping for HPV16 and 18, 16% of all young screened women would be referred for colposcopy at baseline. This triage strategy would detect slightly more high-grade CIN at baseline than cytological triage. In young Danish women with normal cytology, progression to PCIN3 was most frequent for women persistently infected with HPV16 (cumulative 12-year PCIN3 incidence: ca. 27%), HPV18, HPV31, or HPV33 (15–20%) [36]. If triage was undertaken by genotyping for these four genotypes on CLART, 23% of screened women would be referred, however, approximately 90% of all high-grade CIN could be detected already at baseline. Combining genotyping with cytology as triage tests would refer to colposcopy relatively few women at baseline, but leave relatively large proportions of CIN to be detected at follow-up. Finally, triaging positive HPV DNA test results with a positive HPV mRNA test result or vice versa would refer to colposcopy about a quarter of all young screened women but would detect close to 90% of all high-grade CIN at baseline. A more detailed analysis of optimal triage strategies is on-going, and will be reported separately. In conclusion, HPV-based screening of young Danish women with DNA or mRNA assays should be approached with caution. The gains in the detection of high-grade CIN tended to be obtained through large reductions in screening specificity, and increases in the demand for additional testing. Role of the funding source Genomica, the manufacturer of the CLART HPV2 Assay; Hologic, the manufacturer of the APTIMA HPV Assay; Roche, the manufacturer of the cobas HPV Test; and QIAGEN, the manufacturer of the Hybrid Capture 2 Assay, provided tests, instrumentation, and limited co-funding for laboratory material. According to the contract between the manufacturers, Hvidovre University Hospital, and University of Copenhagen, all four manufacturers had the right to comment on a draft version of this manuscript, but had no editorial rights. MR and SP were funded by the Danish Strategic Research Council (Grant No.: 10092793). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The researchers worked independently of the funders. Contributors Design of the study: MR, JB, EL. Laboratory work: SP, DE, JB. Analysis of the data: MR.

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466

Drafting of the manuscript: MR, JB. Critical revisions of the manuscript: all authors. Decision to submit: all authors. All authors had full access to all of the data in the study. Conflict of interest statement All authors have attended meetings with manufacturers of HPV assays. Matejka Rebolj and her former employer received honoraria from QIAGEN for lectures on her behalf. Jesper Bonde used to serve as a paid advisor to Roche and Genomica, and received honoraria from Hologic, Roche, QIAGEN, Genomica, and BD Diagnostics for lectures. He is principal investigator on studies funded by BD Diagnostics. Ditte Møller Ejegod received honoraria from Genomica and QIAGEN for lectures, and is project manager on studies funded by BD Diagnostics. Sarah Preisler received honoraria from Hologic for lectures. Carsten Rygaard served as an unpaid advisor to Roche. Elsebeth Lynge served as an unpaid advisor to Hologic and Norchip. Hvidovre Hospital holds a recompense agreement with Genomica on a KRAS/BRAF diagnostic system. None of the authors was compensated for their work on this project, holds stock, or received bonuses from any of the manufacturers. Acknowledgements The authors would like to thank women for the participation in the follow-up study. The authors would, furthermore, like to acknowledge the Danish Genomica, Hologic, QIAGEN and Roche representatives for enabling the collaboration between their parent companies, Hvidovre University Hospital, and University of Copenhagen. References [1] Castle PE. Screening: HPV testing for cervical cancer: the good, the bad, and the ugly. Nat Rev Clin Oncol 2010;7(7):364–5. [2] Cuzick J, Arbyn M, Sankaranarayanan R, et al. Overview of human papillomavirus-based and other novel options for cervical cancer screening in developed and developing countries. Vaccine 2008;26(Suppl. 10):K29–41. [3] Leinonen M, Nieminen P, Kotaniemi-Talonen L, et al. Agespecific evaluation of primary human papillomavirus screening vs conventional cytology in a randomized setting. J Natl Cancer Inst 2009;101(23):1612–23. [4] Ronco G, Giorgi-Rossi P, Carozzi F, et al. Efficacy of human papillomavirus testing for the detection of invasive cervical cancers and cervical intraepithelial neoplasia: a randomised controlled trial. Lancet Oncol 2010;11(3):249–57.

1465

[5] Monsonego J, Hudgens MG, Zerat L, et al. Evaluation of oncogenic human papillomavirus RNA and DNA tests with liquid-based cytology in primary cervical cancer screening: the FASE study. Int J Cancer 2011;129(3):691–701. [6] Heideman DA, Hesselink AT, van Kemenade FJ, et al. The Aptima HPV assay fulfills the cross-sectional clinical and reproducibility criteria of international guidelines for human papillomavirus test requirements for cervical screening. J Clin Microbiol 2013;51(11):3653–7. [7] Cuzick J, Cadman L, Mesher D, et al. Comparing the performance of six human papillomavirus tests in a screening population. Br J Cancer 2013;108(4):908–13. [8] Bonde J, Rebolj M, Ejegod DM, Preisler S, Lynge E, Rygaard C. HPV prevalence and genotype distribution in a population-based split-sample study of well-screened women using CLART HPV2 Human Papillomavirus genotype microarray system. BMC Infect Dis 2014;14:413. [9] Rebolj M, Preisler S, Ejegod DM, Bonde J, Rygaard C, Lynge E. Prevalence of Human Papillomavirus Infection in Unselected SurePath Samples Using the APTIMA HPV mRNA Assay. J Mol Diagn 2013;15(5):670–7. [10] Rebolj M, Preisler S, Ejegod DM, Rygaard C, Lynge E, Bonde J. Disagreement between Human Papillomavirus assays: an unexpected challenge for the choice of an assay in primary cervical screening. PLOS ONE 2014;9(1):e86835. [11] Preisler S, Rebolj M, Untermann A, et al. Prevalence of human papillomavirus in 5,072 consecutive cervical SurePath samples evaluated with the Roche cobas HPV real-time PCR assay. PLOS ONE 2013;8(3):e59765. [12] Goldman B, Rebolj M, Rygaard C, et al. Patterns of cervical coinfection with multiple human papilloma virus types in a screening population in Denmark. Vaccine 2013;31(12):1604–9. [13] Bjerregaard B, Larsen OB. The Danish Pathology Register. Scand J Public Health 2011;39(7 Suppl.):72–4. [14] Danish Cancer Society. Strk social slagside i HPV-vaccinationer. URL: http://www.cancer.dk/Nyheder/nyhedsartikler/ 2011kv3/socialulighedihpvvacciner.htm>; 2011 [last accessed: 9 September 2011]. [15] Bouvard V, Baan R, Straif K, et al. A review of human carcinogens–Part B: biological agents. Lancet Oncol 2009;10(4):321–2. [16] Wright TC, Stoler MH, Behrens CM, Sharma A, Zhang G, Wright TL. Primary cervical cancer screening with human papillomavirus: end of study results from the ATHENA study using HPV as the first-line screening test. Gynecol Oncol 2015 [Epub ahead of print]. [17] Rebolj M, Njor SH, Lynge E. Restriction of human papillomavirus DNA testing in primary cervical screening to women above age 30: systematic review. Eur J Cancer Prev 2012;21(1):73–81. [18] Ronco G, Giorgi-Rossi P, Carozzi F, et al. Human papillomavirus testing and liquid-based cytology in primary screening of women younger than 35 years: results at recruitment for a randomised controlled trial. Lancet Oncol 2006;7(7):547–55. [19] Rebolj M, Pribac I, Frederiksen ME, Lynge E. The Problem of False-Positive Human Papillomavirus DNA Tests in Cervical Screening. Curr Pharm Des 2013;19(8):1439–49. [20] Rebolj M, Pribac I, Lynge E. False-positive Human Papillomavirus DNA tests in cervical screening: it is all in a definition. Eur J Cancer 2011;47(2):255–61. [21] Cuschieri K, Wentzensen N. Human papillomavirus mRNA and p16 detection as biomarkers for the improved diagnosis of cervical neoplasia. Cancer Epidemiol Biomarkers Prev 2008;17(10):2536–45. [22] Wu R, Belinson SE, Du H, et al. Human papillomavirus messenger RNA assay for cervical cancer screening: the

1466

[23]

[24]

[25] [26]

[27]

[28]

[29]

M. Rebolj et al. / European Journal of Cancer 51 (2015) 1456–1466 Shenzhen Cervical Cancer Screening Trial I. Int J Gynecol Cancer 2010;20(8):1411–4. Ratnam S, Coutlee F, Fontaine D, et al. Aptima HPV E6/E7 mRNA test is as sensitive as Hybrid Capture 2 Assay but more specific at detecting cervical precancer and cancer. J Clin Microbiol 2011;49(2):557–64. Rebolj M, Lynge E, Ejegod D, Preisler S, Rygaard C, Bonde J. Comparison of three human papillomavirus DNA assays and one mRNA assay in women with abnormal cytology. Gynecol Oncol 2014;135(3):474–80. Castle PE, Katki HA. Benefits and risks of HPV testing in cervical cancer screening. Lancet Oncol 2010;11(3):214–5. van Oortmarssen GJ, Habbema JD. Epidemiological evidence for age-dependent regression of pre-invasive cervical cancer. Br J Cancer 1991;64(3):559–65. Engholm G, Ferlay J, Christensen N, et al. NORDCAN: cancer incidence, mortality, prevalence and prediction in the Nordic countries, Version 3.5. Association of the Nordic Cancer Registries. Danish Cancer Society (http://www.ancr.nu); 2014 [last accessed: 25 November 2014]. de Kok IM, van Rosmalen J, Rozemeijer K, Penning C, van Ballegooijen M. How many cervical cancer cases can potentially be prevented using a more sensitive screening test at young age? Int J Cancer 2014;134(2):460–6. Arbyn M, Kyrgiou M, Simoens C, et al. Perinatal mortality and other severe adverse pregnancy outcomes associated with treatment of cervical intraepithelial neoplasia: meta-analysis. BMJ 2008;337:a1284.

[30] Castanon A, Landy R, Brocklehurst P, et al. Risk of preterm delivery with increasing depth of excision for cervical intraepithelial neoplasia in England: nested case-control study. BMJ 2014;349:g6223. [31] Castanon A, Brocklehurst P, Evans H, et al. Risk of preterm birth after treatment for cervical intraepithelial neoplasia among women attending colposcopy in England: retrospective-prospective cohort study. BMJ 2012;345:e5174. [32] McCredie MR, Sharples KJ, Paul C, et al. Natural history of cervical neoplasia and risk of invasive cancer in women with cervical intraepithelial neoplasia 3: a retrospective cohort study. Lancet Oncol 2008;9(5):425–34. [33] Szarewski A, Sasieni P. Cervical screening in adolescents–at least do no harm. Lancet 2004;364(9446):1642–4. [34] Hendry M, Pasterfield D, Lewis R, et al. Are women ready for the new cervical screening protocol in England? A systematic review and qualitative synthesis of views about human papillomavirus testing. Br J Cancer 2012;107(2):243–54. [35] Kitchener HC, Fletcher I, Roberts C, Wheeler P, Almonte M, Maguire P. The psychosocial impact of human papillomavirus testing in primary cervical screening-a study within a randomized trial. Int J Gynecol Cancer 2008;18(4):743–8. [36] Kjaer SK, Frederiksen K, Munk C, Iftner T. Long-term absolute risk of cervical intraepithelial neoplasia grade 3 or worse following human papillomavirus infection: role of persistence. J Natl Cancer Inst 2010;102(19):1478–88.