Human papillomavirus (HPV) 18 genetic variants and cervical cancer risk in Taizhou area, China

Human papillomavirus (HPV) 18 genetic variants and cervical cancer risk in Taizhou area, China

Accepted Manuscript Human papillomavirus (HPV) 18 genetic variants and cervical cancer risk in Taizhou area, China Hui-Hui Xu, Ling-Zhi Zheng, Ai-Fen...

3MB Sizes 0 Downloads 22 Views

Accepted Manuscript Human papillomavirus (HPV) 18 genetic variants and cervical cancer risk in Taizhou area, China

Hui-Hui Xu, Ling-Zhi Zheng, Ai-Fen Lin, Shan-Shan Dong, ZeYing Chai, Wei-Hua Yan PII: DOI: Reference:

S0378-1119(18)30045-3 https://doi.org/10.1016/j.gene.2018.01.037 GENE 42487

To appear in:

Gene

Received date: Revised date: Accepted date:

28 September 2017 20 December 2017 9 January 2018

Please cite this article as: Hui-Hui Xu, Ling-Zhi Zheng, Ai-Fen Lin, Shan-Shan Dong, ZeYing Chai, Wei-Hua Yan , Human papillomavirus (HPV) 18 genetic variants and cervical cancer risk in Taizhou area, China. The address for the corresponding author was captured as affiliation for all authors. Please check if appropriate. Gene(2017), https://doi.org/ 10.1016/j.gene.2018.01.037

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Human papillomavirus (HPV) 18 genetic variants and cervical cancer risk in Taizhou area, China Hui-Hui Xu1, Ling-Zhi Zheng2, Ai-Fen Lin3, Shan-Shan Dong3, Ze-Ying Chai2,*, Wei-Hua Yan1,*

RI

Province, Wenzhou Medical University, Linhai, Zhejiang, China

PT

1 Laboratory of Gynecologic Oncology, Medical Research Center, Taizhou Hospital of Zhejiang

SC

2 Department of gynecology and obstetrics, Taizhou Hospital of Zhejiang Province, Wenzhou Medical University, Linhai, Zhejiang, China

NU

3 Human Tissue Bank, Taizhou Hospital of Zhejiang Province, Wenzhou Medical University,

PT E

D

MA

Linhai, Zhejiang, China

**

Correspondence: Ze-Ying Chai, MD, Department of Gynecology and Obstetrics, E-mail:

CE

[email protected]; Wei-Hua Yan, PhD, MD, Medical Research Center, Taizhou Hospital of

AC

Zhejiang Province, Wenzhou Medical University, Linhai, Zhejiang 317000, People’s Republic of China, E-mail: [email protected]

1

ACCEPTED MANUSCRIPT

ABSTRACT Human papillomavirus (HPV) type 18 is predominantly associated with the development of cervical adenocarcinomas, whereas data on HPV18 genetic variability in China are limited. HPV18 genetic variants were formed phylogenetic tree, including lineages A, B, and C. We aimed

PT

to evaluate the diversity of HPV18 genetic variants by sequencing the entire E6, E7 and L1 genes.

RI

Between 2012 and 2015, a total of 138 (0.8%, 138/17669) women with single HPV18 infection

SC

were selected in this study. Finally, we observed 122 HPV18 isolates of the complete E6-E7-L1 sequences, and obtained 36 distinct variation patterns which the accession GenBank numbers as

NU

KY457805-KY457840. Except KY457805, KY457813, KY457819, KY457827, KY457829, the

MA

rest of HPV18 isolates (81.1%, 31/36) are novel variants. All of HPV18 variants belong to lineage A, while no lineage B, and C was found in our population of Taizhou region, Southeast China.

D

Sublineage A1 was the most common variants (85.2%, 104/122), followed by sublineage A4, A3

PT E

and A5, while no sublineage A2 was obtained. Based on the tree topologies, there were three newly identified candidates’ sublineages A6-A8. Out of 122 women, 67 (54.9%) had diagnosed by

CE

biopsy, including 49 women who diagnosed with cervicitis, 12 with cervical intraepithelial

AC

neoplasia (CIN)1, 4 with CIN2/3, and 2 with adenocarcinomas, respectively. Nevertheless, there was no association between HPV18 (sub) lineages and CIN1 or worse (CIN1+) lesions comparing with normal biopsies

(P=0.469).

In conclusion, knowledge of the distribution of

geographic/ethnical HPV18 genetic diversity provides critical information for developing diagnostic probes, epidemiologic correlate of cervical cancer risk and design of HPV vaccines for targeted populations. Keywords: HPV18, Genetic variants, Cervical lesion, Gynecological cancer

2

ACCEPTED MANUSCRIPT

1. Introduction

Cervical cancer is the second most common female malignancy worldwide. In China, approximately 98,900 new cases have been reported in 2015 which accounts for 18.7% of the

PT

global incidence (Chen et al., 2016). Presence of carcinogenic genotypes of papillomavirus HPV was recognized as a necessary factor in the development of cervical cancer. HPV16 and HPV18

RI

are well-known as “high-risk” because of their strong carcinogenic potentials (Crosbie et al.,

SC

2013). HPV18, which was first described in 1984 (Boshart et al., 1984), is widely accepted as the

NU

second most carcinogenic genotype after HPV16 since they are responsible for approximately 10-15% and 55-65% of all invasive cervical cancers (ICC) worldwide, respectively (Guan et al.,

MA

2012). Moreover, HPV18 is significantly over-represented in glandular lesions or in adenocarcinomas (ADC) in comparison to squamous cell carcinomas (SCC) (Burk et al., 2003;

PT E

D

Clifford et al., 2008). These findings suggest HPV18 share a phylogenetic trait that denotes a greater tendency to cause ADC in comparison to other HPV genotypes. Previous study revealed

CE

that different variants of HPV18 coevolved with the three main phylogenetic human branches of Africans, Whites, and Asians which were clustered into three distinct groups as African (Af),

AC

European (E), and Asian-American (As/Ai) lineages (Ong et al., 1993). Currently, based on the phylogenetic tree the new nomenclature for HPV18 genetic variants, three lineages have been described: lineages A (that includes previously called As/Ai and E lineages), B and C (these two included previous Af lineage) (Chen et al., 2009). HPV18, which belongs to the alpha genus and the A7 species, is a small double-stranded DNA virus with 8 kb genomes containing early expressed genes (E1, E2, E4, E5, E6, and E7), late genes (L1, L2) and a long control region (Bernard et al., 2013). E6 and E7 are major oncogenes, which 3

ACCEPTED MANUSCRIPT are highly expressed in tumors and related to induce cellular immortalization, malignant transformation, and carcinogenesis. The L1 open reading frame (ORF) encodes the major capsid protein that assemble into virus-like particles (VLPs) are the component of the HPV prophylactic vaccines (Doorbar et al., 2012).

PT

In our previous study, we found that the natural prevalence of HPV18 was 1.5% among the

RI

general population of Taizhou area, which is located in the southeast China (Xu et al., 2016).

SC

Among women diagnosed with cervical intraepithelial neoplasia (CIN) grade 2 or worse (CIN2+), the prevalence of HPV18 was increased to 5.6% (Xu et al., 2016), but far lower than that of 13.0%

NU

reported in a global meta-analysis (Guan et al., 2012). HPV18 genetic variants have been reported

MA

to differ in their virus assembly, pathogenic potential, and host immune responses (Xi et al., 2006; Xi et al., 2007; de Araujo Souza et al., 2009), but data on its evolutionary history and genetic

D

variability in China are limited. The aim of the present study was: 1) to analyze the nucleotide

PT E

variants in the E6, E7, and L1 viral genes of HPV18, 2) to assess the prevalence of HPV18 variants in our region, and 3) to explore the relationship between HPV18 genetic variants and the

AC

CE

risk for cervical cancer.

2. Materials and Methods 2.1 Ethics Statement

This study was approved by the Institutional Medical Ethics Review Board of Taizhou Hospital of Zhejiang Province (approval # MERB-2016-047). Informed consent was obtained from all of the participants. Confidentiality was ensured during the data collection process, which was completed by Taizhou Hospital. Data were analyzed anonymously.

4

ACCEPTED MANUSCRIPT

2.2 Study population Between December 2012 and April 2015, a total of 17669 consecutive women (median age 41.3 years; range 16-89) were underwent cervical cancer screening in gynecological clinic at Taizhou

PT

Hospital of Zhejiang Province, and 3923 (22.2%) of them were HPV positive (Xu et al., 2016). All

RI

cervical exfoliated cell samples for HPV genotyping were performed with a standard protocol as

SC

previously described (Xu et al., 2016). Of the 264 (1.5%) women with HPV18 infection, a single infection was detected in 138 (0.8%) women (median age 39.7 years; range 22-70) who were

NU

selected in the present study. Genomic DNA was extracted using the DNeasy Blood and Tissue kit

MA

(QIAGEN) according to the manufacturer’s instructions. DNA was extracted from 200μl of each sample and collected in 50μl elution buffer and stored at -20°C until amplification. According to

D

the 2012 ASCCP consensus guidelines for the abnormal cervical cancer screening, women with

PT E

HPV18 positive should be referred directly to colposcopy (Massad et al., 2013). Histological diagnoses were adjudicated by two pathologists and classified as normal, CIN grade 1, 2, 3 or

AC

CE

ICC.

2.3 PCR amplification and sequencing PCR amplifications were performed with 20ng genomic DNA by the following conditions: 1×PCR buffer, 2mM MgCl2, 0.2mM 2′-deoxyribonucleotide 5′-triphosphate mix, 1mM of each primer and 2.0U Taq DNA polymerase (QIAGEN) in a final volume of 50μl. PCR reactions were pre-heated for 5min at 95°C, followed by 35 repeated cycles of 94°C for 30s, 55/60°C for 30s, 72°C for 30s, and a final extension step at 72°C for 10min (Thermo Hybaid, Waltham, MA).

5

ACCEPTED MANUSCRIPT Amplification of HPV18 E6, E7 and L1 genes were performed using type-specific primers, which were shown in Table 1. All data were confirmed twice at least by repeat PCR amplification and sequence analysis. PCR products were visualized by 1.5% agarose gel, and then sequenced (BigDye™ Terminator v3.1

PT

Cycle Sequencing Kit) by Beijing Genomics Institute and Shanghai Genomics Institute (ABI 3730,

RI

Applied Biosystems, Foster City, CA, USA). The same forward and reverse primers were used

NU

2.4 Variant analysis and phylogenetic tree construction

SC

separately in cycle sequencing in order to sequence both sense and antisense strands.

MA

HPV18 prototype reference sequence (GenBank accession number NC001357), which belongs to the A1 sublineage, was used as the standard for comparisons and nucleotide position numbering

D

in this study. The nucleotide sequences that contained complete E6, E7 and L1 genes were aligned

PT E

and compared to the HPV 18 prototype sequence, using NCBI Blast, BioEdit Sequence Alignment Editor, and Clustal X 1.83. The nucleotide sequences were translated into proteins using MEGA

CE

version 5.05 for determination of amino acid changes caused by nucleotide changes.

AC

Other entire E6-E7-L1 sequences were downloaded from GenBank (A1 sublineage: EF202143EF202145, A2 sublineage: EF202146, A3 sublineage: EF202147-EF202149, A4 sublineage: EF202150-EF202151, A5 sublineage: GQ180787; B1 sublineage: EF202153-EF202155, B2 sublineage: KC470224-KC470225, B3 sublineage: EF202152; and C lineage: KC470229KC470230). Maximum likelihood tree was constructed using the MEGA version 5.05 with 1000 bootstrapped replicates and values above 50% were considered significant.

6

ACCEPTED MANUSCRIPT

3. Results 3.1 Characteristics of the Population 138 women with HPV18 single-infection were obtained from our previous TZHPV Study (Xu et al., 2016). 131 (94.9%) E6-E7 sequences and 124 (89.9%) L1 sequences from HPV18 isolates

PT

were successfully amplified by PCR. Other sequences were excluded because of virus DNA

RI

integration or PCR failure. In total, the complete E6-E7-L1 genes of 122 (88.4%) HPV18 isolates

SC

were sequenced for variant analysis. Out of 122 women, 67 (54.9%) had diagnosed by biopsy, including 49 women who diagnosed with cervicitis, 12 with CIN1, 4 with CIN2/3, and 2 with

MA

NU

adenocarcinomas, respectively.

3.2 Nucleotide variants

D

All sequences used in this analysis were submitted to GenBank, and the GenBank accession

PT E

numbers were KY457805-KY457840. All of HPV18 variants belong to lineage A, while no lineage B, and C was found in our population of Taizhou region, Southeast China. Compared to

CE

the HPV18 prototype sequence (NC001357), we detected 64 nucleotides substitutions, with 37/64

AC

(57.8%) being novel substitution and 30/64 (46.9%) being non-synonymous substitution. A summary of nucleotide and amino acid sequence variation throughout the HPV18 E6-E7-L1 region is shown in Figure 1. In the E6 gene, eleven nucleotide variants were detected. Five of them were specific to the A1 sublineage: two synonymous substitutions T149C, A482C and three non-synonymous substitutions G189C (E29Q), G222A (E40K), T382G (L93R). Four synonymous substitution C153T, A377G, T485C, C549A and one non-synonymous substitution T314G (D70E) was found

7

ACCEPTED MANUSCRIPT to be specific to the A3-A8 sublineages, while T485C and C549A was only detected among the European lineage (except A6 sublineage). In our study, synonymous substitution C287G was observed in all HPV18 isolates. E7 gene genetic variability analysis revealed four nucleotides substitutions. One synonymous

PT

substitution C860T was detected in A1 sublineage. One synonymous substitution C751T and two

RI

non-synonymous substitutions T734G (L49V), A770C (M61L) was found to be specific for the

SC

A3-A5, and A8 sublineages. To our knowledge, the base substitutions of G189C, G222A, T314G, T382G in E6 and T734G, A770C, C860T in E7 had never been reported in previous studies.

NU

L1 sequence analysis revealed forty-nine nucleotides substitutions and demonstrated the

MA

presence of substitutions G5503A (R25Q), C5701G (P91R), C6460G (P344R), C6625G (P399R), and C6842G in all our isolates. One non-synonymous substitution C5920T (A164V) was detected

D

in lineage A. Seventeen substitutions were specific to the A1 sublineage: seven synonymous

PT E

substitutions C5478T, T5801C, A5832C, A5924C, G6071A, C6170A, G6400A and ten non-synonymous substitutions C5496T (H23Y), T5570G (I47M), A5580G (R51G), G5671A

CE

(R81K), G5952A (G175S), G5991C (D188H), A6943C (K505T), A7045C (K539T), T7092G

AC

(S555A), G7129C (R567T). Seventeen synonymous substitutions A5741G, T5810G, A5825G, A5828C, A5852C, T5942G, T5966C, C6014T, A6059G, G6089A, G6143C, T6182G, A6638G, G6929A, T6983A, G7022A, G7130A and nine non-synonymous substitutions G5446A (R6Q), A5497C (H23P), T5619A (L64M), A5796G (I123V), C5875A (T149N), A6401G (R324K), C6427A (P333H), A6430C (Q334P), A6923C (E498D) was found to be specific for the European lineage. To our knowledge, most of them are novel nucleotide substitutions detected in our study except for C5496T, A5497C, G5503A, T5619A, C5701G, C5875A, C5920T, T5942G, G6089A,

8

ACCEPTED MANUSCRIPT G6400A, A6401G, A6430C, C6460G, C6625G, C6842G, A6943C, G7022A, A7045C, and G7130A in L1.

3.3 HPV18 variants

PT

We obtained 36 distinct variation patterns denoted as 18CNTZ01-18CNTZ36, and the accession

RI

GenBank numbers was KY457805-KY457840. 31 (81.1%, 31/36) novel HPV18 variants were

SC

detected in our study (Figure 1). 18CNTZ01, which was reported in 2009 (GenBank accession number EF202143) (Chen et al., 2009), is the most common variants (60.7%, 74/122) in our

NU

population. Four variants 18CNTZ09, 18CNTZ15, 18CNTZ23, and 18CNTZ25 were reported in

MA

previous studies (KU721790 (unpublished), EF202145 (Chen et al., 2009), KU707802 (King et al. 2016), KU707808 (King et al. 2016), respectively). The novel variant 18CNTZ18 was the second

D

most common (6.6%, 8/122) in our population. Followed by 18CNTZ16 (2.5%, 3/122),

PT E

18CNTZ25 (2.5%, 3/122), 18CNTZ11 (1.6%, 2/122), 18CNTZ23 (1.6%, 2/122), and the remaining HPV18 variants (0.8%, 1/122).

CE

In addition to previously described lineages and sublineages (A1 to A5, B1 to B3, and C) (Burk

AC

et al., 2013; Chen et al., 2013), we saw evidence of three candidate A6-A8 sublineages according to the phylogenetic tree of our study (Figure 2). Lineage A was further divided into eight sublineages A1-A8 based on tree topologies, of which A1, A2 belong to the Asian-American branch and A3-A8 belong to the European branch. Isolates of all Asian-American variants from our population mapped to the A1 sublineage contains 21 variants (58.3%) including 18 novel variants (50.0%). HPV18 prototype was also assigned to the A1 sublineage. There was no HPV18 variant found in A2 sublineage. The A3 and A4 sublineages contain 15 variants (41.7%) including

9

ACCEPTED MANUSCRIPT 13 novel variants (36.1%). One variant 18CNTZ28 represented A5 sublineage, which appeared to be uniquely defined by the substitutions A5741G, A5796G (I123V), A6059G, G6089A, G6143C, A6923C (E498D), and G7130A in L1. However, 18CNTZ28 variant sequence is not consistent with the previously reported A5 whole-genome sequence GQ180787 from Thailand (Chen et al.,

PT

2013; Lurchachaiwong et al., 2010), which has 149C, 153C, 770A, 5790A, 5914T, 6053A, and

RI

6917A. In addition, we observed nine variants representing ten samples that appeared to form

SC

three new candidates, A6-A8 sublineages. A6 sublineage appeared to be uniquely defined by the C6014T in L1 and A7 to be uniquely defined by the T5810G, in L1 which were first detected in

NU

our study. 18CNTZ32 variant was found to be recombinant, which classified as European branch

MA

but in E6 region belonged to the Asian-American branch. Furthermore, 18CNTZ22 and 18CNTZ23 represented A8 sublineage appeared to be uniquely defined by the T5942G, G7022A in

PT E

D

L1, but differ with the A3-A5 sublineages, which has G6401A (R324K).

3.4 Genetic variants and cervical lesion

CE

Out of 122 women, 67 (54.9%) had diagnosed by biopsy, including 49 women with cervicitis,

AC

12 with CIN1, 4 with CIN2/3, and 2 with adenocarcinomas, respectively (Table 2). We detected the presence of the 18CNTZ01 variant in both cases of adenocarcinomas. 18CNTZ01, which was reported in previous study in 2009 (GenBank accession number EF202143) (Chen et al., 2009), was the most common variant (60.7%, 74/122) in our population. Four cases of CIN2/3 were detected three A1 sublineage (two 18CNTZ01 and one 18CNTZ18) and one European variant 18CNTZ22, which nucleotides substitutions of T314G (D70E) in E6 and T734G (L49V) in E7. Our results suggest that the two sites may affect E6 and E7 function. Twelve cases of CIN1 were

10

ACCEPTED MANUSCRIPT detected nine A1 sublineage (five 18CNTZ01, one 18CNTZ11, one 18CNTZ14, one 18CNTZ18, one 18CNTZ19) and three European lineages (one 18CNTZ23, one 18CNTZ31, one 18CNTZ35). Nevertheless, there was no association between HPV18 (sub) lineages and CIN1+ lesions comparing with the normal biopsies (P=0.469).

PT

Previous studies confirmed that different HPV 18 variants were associated with progression of

RI

cervical neoplasia dependent on geography and ethnicity (Villa et al., 2000; Sichero et al., 2007).

SC

Therefore, nucleotides substitutions may play an important role in HPV 18 variants obtained from our population. Substitutions C287G in E6, and G5503A, C5701G, C6460G, C6625G, C6842G in

NU

L1 were found in all HPV18 isolates. Furthermore, seven novel nucleotides substitutions G189C,

MA

G222A, T314G, T382G in E6 and T734G, A770C, C860T in E7 have never been reported in previous studies and six of them were non-synonymous and lead to amino acid changes, which

PT E

D

may interfere with the viral oncogenic potential.

4. Discussion

CE

Human papillomavirus (HPV) variants differ in geographic origins, evolutionary dynamics, and

AC

pathogenicity. The HPV18 is associated with a higher proportion of ADC in comparison with other HPV (Burk et al., 2003; Clifford et al., 2008), whereas data on HPV18 genetic variability in China are limited. Moreover, identification of HPV genetic diversity in specific clinical settings may be important for the rational design of diagnostic, and vaccine strategies (Li et al., 2011). In this study, we analyzed the genetic variants in HPV18 E6, E7, and L1 viral genes, constructed phylogenetic trees of all HPV18 variants, and explored the relationship between HPV18 genetic variants and the risk for cervical cancer in our population.

11

ACCEPTED MANUSCRIPT In this study, we observed 122 cases of HPV18 variants in the complete E6-E7-L1 genes, which is the first comprehensive study to explore HPV18 evolutionary history and genetic variants in Southeast China. The changes of the amino acid sequence may influence the transforming activity of the E6, and E7 genes, which integrate into host genome, resulting in inhibit Rb and p53

PT

functions and induce cellular immortalization and transformation (Bernard et al., 2013; Doorbar et

RI

al., 2012). L1 gene encodes the viral major capsid protein which is the primary target for

SC

prophylactic vaccination (Erickson et al., 2014). Substitution C287G, G5503A, C5701G, C6460G, C6625G, and C6842G were found in all HPV18 isolates in our population of Taizhou region,

NU

Southeast China. The synonymous substitution C287G may be a common nucleotides substitution

MA

worldwide (Arroyo et al., 2012; Sun et al., 2012; Shen et al., 2013). In China, the substitution C287G were found in all HPV18 isolates in Northeast (Sun et al., 2012) and Southeast region, but

D

the frequent in Southwest region was 48.2% (Shen et al., 2013). The frequent of G5503A, C5701G,

PT E

C6460G, C6625G, and C6842G were found in all HPV18 isolates in Southeast region, but was about 40% in Southwest region (Shen et al., 2013). Thus, it is worthy to evaluate the L1 gene in

CE

geographical region and racial characteristics of the study population. The AS04-adjuvanted

AC

HPV16/18 vaccine (Cervarix®, GlaxoSmithKline Biologicals, Belgium), which has been licensed in China in 2015, is a prophylactic vaccine containing a mixture of virus-like particles derived from the L1 proteins of HPV16 and 18 (McKeage et al., 2011; Zhu et al., 2017). In addition, our data about the genetic diversity of HPV18 variants in China may be helpful to design the second-generation prophylactic HPV vaccine strategies and implement a feasible nationwide vaccination programs. The distribution of HPV18 variants (sub)lineages in Southeast China were confirmed to be

12

ACCEPTED MANUSCRIPT geographically/ethnically specific (Xi et al., 2006; de Araujo Souza et al., 2009; Chen et al., 2013). Consistent with previous nomenclature of A1/A2 branch as Asian-American lineage (Burk et al., 2013), our results shown the A1 variants account for the majority (85.2%) but no A2 variants in Southeast China. Additionally, our results also show that there are no B and C branches, which are

PT

specific for Africa (Ong et al., 1993; Chen et al., 2013; Schlecht et al., 2005). In our study, the

RI

18CNTZ28 variant was classified as A5 branch; its sequence is not consistent with the previously

SC

reported A5 whole-genome sequence GQ180787 from Thailand (Chen et al., 2013; Lurchachaiwong et al., 2010), which has 149C, 153C, 770A, 5790A, 5914T, 6053A, and 6917A.

NU

Of note, the A5 branch was found in a higher proportion of HPV18 isolates from northern Africa

MA

than elsewhere (Chen et al., 2015). Moreover, the 18CNTZ32 variant was found to be recombinant, which classified as European (A6) branch but in E6 region belonged to the Asian-American (A1)

D

branch. Our data provided new evidence of recombination in HPV, which is an important

PT E

molecular evolutionary mechanism that could impact pharmacogenomics and vaccine development (Angulo et al., 2007).

CE

Previously studies reported the risk of developing high-grade CIN to be significantly higher in

AC

non-European variants (Villa et al., 2000; Arroyo et al., 2012). Xi et al (Xi et al., 2007) reported that Asian-American/European variants were found to be statistically signigicant more associated with pre-invasive lesions than African variants. In contrast, other studies showed that no significant difference in pre-invasive lesions risk was observed between the variant lineages (A, B, and C) (Chen et al., 2015; Schiffman et al., 2010; Arias-Pulido et al., 2005). Furthermore, there was not a statistically significant association between HPV18 (sub)lineages level and cervical pre-invasive lesions risk in Southeast China(P=0.469). This result may have influenced by the

13

ACCEPTED MANUSCRIPT limited sample size of case-matched HPV18-related cervical lesions in this study. A1 variants account for the majority in eastern Asia (Chen et al., 2015; Sun et al., 2012), but significantly underrepresented in South/Central Asia (Chen et al., 2015). Such regional differences should be taken into account in assessing pre-invasive lesions risk as described above. Whatever the

PT

underlying cause, such apparent regional difference reveals the inherent complexity of HPV

RI

variants and cervical cancer risk studies, and gives warnings about the extent to which data can be

SC

pooled across countries/regions.

Our study was limited by the number of HPV18-poistive samples, due to the low natural

NU

incidence of HPV18 in our region. However, the number of HPV18 positive samples obtained

MA

from our previous TZHPV study (2012-2015) (Xu et al., 2016) was by far the largest studied in China. In this study, we reported 31 novel HPV18 variants that will ensure sequencing of the

D

whole genomes in order to strengthen the full picture of HPV18 genetic evolution in our ensuing

PT E

studies.

CE

5. Conclusion

AC

In summary, our present study provides a practical approach for phylogenetic classification for use in epidemiological studies of the natural history and carcinogenicity of HPV18 genetic variants in China. Data about the geographic/ethnical HPV18 genetic diversity distribution may be helpful to design of diagnostic probes, epidemiologic correlate of cervical cancer risk and design of HPV vaccines for targeted populations in China.

Competing interests

14

ACCEPTED MANUSCRIPT The authors declare that they have no competing interests.

Acknowledgments This work was supported by grants from the National Natural Science Foundation of China

PT

(31370920, 81372247), the Science and Technology Bureau of Zhejiang Province (2016C33231),

RI

the Health Bureau of Zhejiang Province (2015KYB438) and the Zhejiang Provincial program for

SC

the cultivation of high-level innovative health talents.

NU

Authors’ contributions

MA

HHX, AL, and WHY designed the experiments, performed analysis and drafted the manuscript. LZZ, ZYC, and SSD carried out the sample collection, performed laboratory testing. All authors

PT E

D

read and approved the final manuscript.

References

CE

Angulo, M., Carvajal-Rodríguez, A., 2007. Evidence of recombination within human alpha-

AC

papillomavirus. Virol. J. 4, 33. Arias-Pulido, H., Peyton, C.L., Torrez-Martínez, N., Anderson, D.N., Wheeler, C.M., 2005. Human papillomavirus type 18 variant lineages in United States populations characterized by sequence analysis of LCR-E6, E2, and L1 regions. Virology. 338(1), 22-34. Arroyo, S.L., Basaras, M., Arrese, E., Hernáez, S., Andía, D., Esteban, V., Garcia-Etxebarria, K., Jugo, B.M., Cisterna, R., 2012. Human papillomavirus (HPV) genotype 18 variants in patients with clinical manifestations of HPV related infections in Bilbao, Spain. Virol. J. 9,

15

ACCEPTED MANUSCRIPT 258. Bernard, H.U., 2013. Taxonomy and phylogeny of papillomaviruses: an overview and recent developments. Infect. Genet. Evol. 18, 357-361. Boshart, M., Gissmann, L., Ikenberg, H., Kleinheinz, A., Scheurlen, W., zur Hausen, H.,

RI

lines derived from cervical cancer. EMBO. J. 3(5), 1151-1157.

PT

1984. A new type of papillomavirus DNA, its presence in genital cancer biopsies and in cell

SC

Burk, R.D., Terai, M., Gravitt, P.E., Brinton, L.A., Kurman, R.J., Barnes, W.A., Greenberg, M.D., Hadjimichael, O.C., Fu, L., McGowan, L., Mortel, R., Schwartz, P.E., Hildesheim, A.,

NU

2003. Distribution of human papillomavirus types 16 and 18 variants in squamous cell

MA

carcinomas and adenocarcinomas of the cervix. Cancer Res. 63(21), 7215-7220. Burk, R.D., Harari, A., Chen, Z., 2013. Human papillomavirus genome variants. Virology.

D

445(1-2), 232-243.

PT E

Chen, A.A., Gheit, T., Franceschi, S., Tommasino, M., Clifford, G.M.; IARC HPV Variant Study Group, 2015. Human Papillomavirus 18 Genetic Variation and Cervical Cancer Risk

CE

Worldwide. J. Virol. 89(20), 10680-10687.

AC

Chen, W., Zheng, R., Baade, P.D., Zhang, S., Zeng, H., Bray, F., Jemal, A., Yu, X.Q., He, J., 2016. Cancer statistics in China, 2015. CA Cancer J. Clin. 66(2), 115-132. Chen, Z., DeSalle, R., Schiffman, M., Herrero, R., Burk, R.D., 2009. Evolutionary dynamics of variant genomes of human papillomavirus types 18, 45, and 97. J. Virol. 83(3), 1443-1455. Chen, Z., Schiffman, M., Herrero, R., DeSalle, R., Anastos, K., Segondy, M., Sahasrabuddhe, V.V., Gravitt, P.E., Hsing, A.W., Burk, R.D., 2013. Evolution and Taxonomic Classification of Alphapapillomavirus 7 Complete Genomes: HPV18, HPV39, HPV45, HPV59, HPV68 and

16

ACCEPTED MANUSCRIPT HPV70. PLoS One. 8(8), e72565. Clifford, G., Franceschi, S., 2008. Members of the human papillomavirus type 18 family (alpha-7 species) share a common association with adenocarcinoma of the cervix. Int. J. Cancer. 122 (7), 1684-1685.

PT

Crosbie, E.J., Einstein, M.H., Franceschi, S., Kitchener, H.C., 2013. Human papillomavirus and

RI

cervical cancer. Lancet. 382(9895), 889-899.

SC

de Araujo Souza, P.S., Sichero, L., Maciag, P.C., 2009. HPV variants and HLA polymorphisms: the role of variability on the risk of cervical cancer. Future Oncol. 5(3), 359-370.

NU

Doorbar, J., Quint, W., Banks, L., Bravo, I.G., Stoler, M., Broker, T.R., Stanley, M.A., 2012. The

MA

biology and life-cycle of human papillomaviruses. Vaccine. 30(suppl 5), F55-70. Erickson, B.K., Landers, E.E., Huh, W.K., 2014. Update on vaccination clinical trials for

D

HPV-related disease. Clin. Ther. 36(1), 8-16.

PT E

Guan, P., Howell-Jones, R., Li, N., Bruni, L., de Sanjosé, S., Franceschi, S., Clifford, G.M., 2012. Human papillomavirus types in 115,789 HPV-positive women: a meta-analysis from

CE

cervical infection to cancer. Int. J. Cancer. 131(10), 2349-2359.

AC

King, A.J., Sonsma, J.A., Vriend, H.J., van der Sande, M.A., Feltkamp, M.C., Boot, H.J., Koopmans, M.P.; Medical Microbiological Laboratories and Municipal Health Services., 2016. Genetic Diversity in the Major Capsid L1 Protein of HPV-16 and HPV-18 in the Netherlands. PLoS One. 11(4), e0152782. Li, N., Franceschi, S., Howell-Jones, R., Snijders, P.J., Clifford, G.M., 2011. Human papillomavirus type distribution in 30,848 invasive cervical cancers worldwide: Variation by geographical region, histological type and year of publication. Int. J. Cancer. 128(8), 927-935.

17

ACCEPTED MANUSCRIPT Lurchachaiwong,W., Junyangdikul,

P., Termrungruanglert,W., Payungporn,

S., Sampatanukul,

P., Tresukosol, D., Niruthisard, S., Trivijitsilp, P., Karalak, A., Swangvaree, S., Poovorawan, Y., 2010. Whole-genome sequence analysis of human papillomavirus type 18 from infected Thai women. Intervirology. 53(3), 161-166.

PT

Massad, LS., Einstein, M.H., Huh, W.K., Katki, H.A., Kinney, W.K., Schiffman, M., Solomon, D.,

RI

Wentzensen, N., Lawson, H.W.; 2012 ASCCP Consensus Guidelines Conference. , 2013. 2012

SC

updated consensus guidelines for the management of abnormal cervical cancer screening tests and cancer precursors. Obstet. Gynecol. 121(4), 829-846.

NU

McKeage, K., Romanowski, B., 2011. AS04-adjuvanted human papillomavirus (HPV) types 16

MA

and 18 vaccine (Cervarix®): a review of its use in the prevention of premalignant cervical lesions and cervical cancer causally related to certain oncogenic HPV types. Drugs. 71(4),

D

465-488.

PT E

Ong, C.K., Chan, S.Y., Campo, M.S., Fujinaga, K., Mavromara-Nazos, P., Labropoulou, V., Pfister, H., Tay, S.K., ter Meulen, J., Villa, L.L., 1993. Evolution of human papillomavirus

CE

type 18: an ancient phylogenetic root in Africa and intratype diversity reflect coevolution with

AC

human ethnic groups. J. Virol. Nov. 67 (11), 6424-6431. Schiffman, M., Rodriguez, A.C., Chen, Z., Wacholder, S., Herrero, R., Hildesheim, A., Desalle, R., Befano, B., Yu, K., Safaeian, M., Sherman, M.E., Morales, J., Guillen, D., Alfaro, M., Hutchinson, M., Solomon, D., Castle, P.E., Burk, R.D., 2010. A population-based prospective study of carcinogenic human papillomavirus variant lineages, viral persistence, and cervical neoplasia. Cancer Res. 70(8), 3159-3169. Schlecht, N.F., Burk, R.D., Palefsky, J.M., Minkoff, H., Xue, X., Massad, L.S., Bacon,

18

ACCEPTED MANUSCRIPT M., Levine, A.M., Anastos, K., Gange, S.J., Watts, D.H., Da Costa, M.M., Chen, Z., Bang, J.Y., Fazzari, M., Hall, C., Strickler, H.D., 2005. Variants of human papillomaviruses 16 and 18 and their natural history in human immunodeficiency virus-positive women. J. Gen. Virol. 86(Pt 10), 2709-2720.

PT

Shen, M., Ding, X., Li, T., Chen, G., Zhou, X., 2013. Sequence variation analysis of HPV-18

RI

isolates in southwest China. PLoS. One. 8(2), e56614.

SC

Sichero, L., Ferreira, S., Trottier, H., Duarte-Franco, E., Ferenczy, A., Franco, E.L., Villa, LL., 2007. High grade cervical lesions are caused preferentially by non-European variants of HPVs

NU

16 and 18. Int. J. Cancer. 120(8), 1763-1768.

MA

Sun, Z., Liu, J., Wang, G., Zhou, W., Liu, C., Ruan, Q., 2012. Variant lineages of human papillomavirus type 18 in Northeast China populations characterized by sequence analysis of

D

E6, E7, and L1 regions. Int. J. Gynecol. Cancer. 22(6), 930-936.

PT E

Villa, L.L., Sichero, L., Rahal, P. , Caballero, O., Ferenczy, A., Rohan, T. , Franco, E.L., 2000. Molecular variants of human papillomavirus types 16 and 18 preferentially associated with

CE

cervical neoplasia. J. Gen. Virol. 81(Pt 12), 2959-2968.

AC

Xi, L.F., Kiviat, N.B., Hildesheim, A., Galloway, D.A., Wheeler, C.M., Ho, J., Koutsky, L.A., 2006. Human papillomavirus type 16 and 18 variants: race-related distribution and persistence. J. Natl. Cancer Inst. 98(15), 1045-1052. Xi, L.F., Koutsky, L.A., Hildesheim, A., Galloway, D.A., Wheeler, C.M., Winer, R.L., Ho, J., Kiviat, N.B., 2007. Risk for high-grade cervical intraepithelial neoplasia associated with variants of human papillomavirus types16 and 18. Cancer Epidemiol. Biomarkers Prev. 16(1), 4-10.

19

ACCEPTED MANUSCRIPT Xu, H., Lin, A., Shao, X., Shi, W., Zhang, Y., Yan, W., 2016. Diagnostic accuracy of high-risk HPV genotyping in women with high-grade cervical lesions: evidence for improving the cervical cancer screening strategy in China. Oncotarget. 7(50), 83775-83783. Zhu,F.C.,Hu,S.Y., Hong,Y., Hu,Y.M., Zhang,X.,Zhang,Y.J., Pan,Q.J., Zhang,W.H., Zhao,F.H., Zha

PT

ng, C.F., Yang, X., Yu, J.X., Zhu, J., Zhu, Y.,Chen, F., Zhang, Q., Wang, H., Wang, C., Bi,

RI

J., Xue, S., Shen, L., Zhang, Y.S., He, Y., Tang, H., Karkada,N., Suryakiran, P., Bi, D., Struyf,

SC

F., 2017. Efficacy, immunogenicity, and safety of the HPV-16/18 AS04-adjuvanted vaccine in Chinese women aged 18-25 years: event-triggered analysis of a randomized controlled trial.

AC

CE

PT E

D

MA

NU

Cancer Med. 6(1), 12-25.

20

ACCEPTED MANUSCRIPT Legends: Figure 1. Genetic variability of HPV18 E6-E7-L1 sequences in Taizhou area, Zhejiang province. Numbering refers to the first nucleotide of the HPV18 prototype reference sequence (GenBank accession number NC001357). Each row indicates the isolate identification and the

PT

nucleotide sequence alignment compared to the reference. Isolates EF202143-EF202155 are HPV

RI

18 known variant sequences which belong to lineage A, B and C. Novel HPV18 variants are

SC

highlighted in bold and novel nucleotide substitutions are highlights in gray.

Figure 2. Phylogenetic tree of the HPV18 variants. Maximum likelihood analysis (with MEGA

NU

5.05 program) of E6-E7-L1 nucleotide sequences. Numbers below branches indicate bootstrap

MA

values.

Table 1. Polymerase chain reaction characteristics for Human papillomavirus 18.

AC

CE

PT E

D

Table 2. Distribution of HPV18 (sub) lineages in grade of cervical lesion.

21

ACCEPTED MANUSCRIPT

Table 1. Polymerase chain reaction characteristics for Human papillomavirus 18 Site**

Annealing Temp/Cycles

Amplicon size

GGAGTRACCRAAAACGGTYG CCTTCTGRATCAGCCATTGTT GTMTCTGCYACGGRGGACAA CACAGCTGCCAGGTGAAGC CTGGATATGGTGCCAYGGRC CTCACYAGGGCGCAACCACAT

36-55 909-930 5258-5277 6462-6480 6220-6239 7308-7328

55°C /35 cycles

895 bp

60°C /35 cycles

1223bp

60°C /35 cycles

1109bp

PT

18E6/7 F 18E6/7 R 18L1 F1 18L1 R1 18L1 F2 18L1 R2

Primer Sequence(5′ - 3′) *

EF202143-EF202155, KU298886, KC470208-KC470230, GQ180784-GQ180792);

RI

* The degenerate primers were matched with complete genomes of the HPV18 from the Nucleotide Database (accession number

AC

CE

PT E

D

MA

NU

SC

** Position numbering refers to the first nucleotide of the HPV18 reference genome (accession number NC001357).

22

ACCEPTED MANUSCRIPT

Table 2. Distribution of HPV18 (sub)lineages in grade of cervical lesion CIN2/3

Adenocarcinomas

38.6 ± 8.8 42 0 0 3 1 1 2 0 49

40.1 ± 11.3 9 0 0 0 0 1 1 1 12

40.1 ± 8.5 3 0 0 0 0 0 0 1 4

59.0 ± 5.7 2 0 0 0 0 0 0 0 2

RI

PT

CIN1

AC

CE

PT E

D

MA

NU

SC

Ages A1 A2 A3 A4 A5 A6 A7 A8 Total

Cervicitis

23

ACCEPTED MANUSCRIPT Highlights  The data on HPV18 genetic variability in Southeast China are limited.  We obtained 36 distinct variation patterns denoted as 18CNTZ01-18CNTZ36 (GenBank accession numbers were KY457805-KY457840),and 31 (86.1%) are novel.  All of HPV18 variants belong to lineage A, while no lineage B, and C was found in our

PT

population of Taizhou region, Southeast China.  There was no association between HPV18 (sub) lineages and CIN1+ lesions comparing with

AC

CE

PT E

D

MA

NU

SC

RI

the normal biopsies.

24

ACCEPTED MANUSCRIPT Abbreviations

AC

CE

PT E

D

MA

NU

SC

RI

PT

ADC:adenocarcinomas Af:African As/Ai:Asian-American CIN: cervical intraepithelial neoplasia E:European HPV:Human papillomavirus ICC:invasive cervical cancers ORF:open reading frame SCC:squamous cell carcinomas VLPs:virus-like particles

25

Figure 1

Figure 2