A study of old Serbian skeletal remains using ForenSeq DNA Signature™ kit

A study of old Serbian skeletal remains using ForenSeq DNA Signature™ kit

Journal Pre-proof A Study of old Serbian Skeletal Remains using ForenSeq DNA SignatureTM Kit Eida Almohammed PII: S1875-1768(19)30266-5 DOI: https...

1MB Sizes 0 Downloads 34 Views

Journal Pre-proof A Study of old Serbian Skeletal Remains using ForenSeq DNA SignatureTM Kit Eida Almohammed

PII:

S1875-1768(19)30266-5

DOI:

https://doi.org/10.1016/j.fsigss.2019.11.010

Reference:

FSIGSS 1847

To appear in:

Forensic Science International: Genetics Supplement Series

Received Date:

19 August 2019

Revised Date:

11 November 2019

Accepted Date:

11 November 2019

Please cite this article as: Almohammed E, A Study of old Serbian Skeletal Remains using ForenSeq DNA SignatureTM Kit, Forensic Science International: Genetics Supplement Series (2019), doi: https://doi.org/10.1016/j.fsigss.2019.11.010

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2019 Published by Elsevier.

Page 1 of 9 A Study of old Serbian Skeletal Remains using ForenSeq DNA Signature™ Kit Eida Almohammed Ministry of Interior- Qatar State [email protected]

Abstract Recent advances in massively parallel sequencing (MPS) has become a very promising technology for massive genetic sequencing [1]. In this study Illumina ForenSeq™ DNA Signature Prep kit was tested to determine if MPS offers a more comprehensive evaluation of degraded samples than the traditional fragment analysis/capillary electrophoresis based method [2]. The use of MPS would therefore reduce the analysis time and augment the identification of human remains. In this context we aimed to analyse the hard tissue degraded samples using ForenSeq™ DNA Signature kit. These samples had given partial profiles with dropout at several loci with GlobalFiler™ kit previously. The MPS kit showed that it is highly sensitive, aids in higher allele recovery for STR loci and provides valuable information about biogeographic ancestry, identity and phenotypic features from a single analysis. The work resulted in highly successful amplification and sequencing of 30 degraded bone/teeth samples using MPS method.

lP

re

-p

ro of

1.Introduction Recent advances in massively parallel sequencing (MPS), provide advantages to analyse DNA from skeletal human remains, particularly old ones due to enhanced sensitivity and as many more markers can be analysed simultaneously. The ForenSeq™ DNA Signature kit, includes autosomal STRs, Y STRs, X STRs, Identity SNPs and Ancestry SNPs [1]. In the forensic context, the major advantage of these markers is their possible usage with highly degraded DNA as in disaster victim identification and forensic samples [3], [9]. The greatest challenge faced for the analysis of degraded samples, especially for ancient specimens, is the potential for any extracted DNA to have come from a source other than the target [7], [12]. The larger loci (>200bp) are therefore the first to be affected, and are commonly lost [1], [11], [12]. Recent advances in (MPS), provides some advantages to analyse DNA from human remains [7], [14]. An advantage of MPS technology is that shorter STR markers suitable for profiling degraded DNA and SNPs can be multiplexed together. One of the significant advantages to using MPS for forensic analysis is that rather than a profile based on fragment size analysis, the actual sequence of the STR is obtained. Numerous isoalleles can provide the capability to distinguish between two individuals with the same allele call for a locus, thus increasing the discriminatory value of the allele and also aiding the deconvolution of mixtures [5], [13], [14], [20]. In this work we have investigated the performance of the ForenSeq™ DNA Signature kit, on a set of degraded skeletal DNA samples from Serbia.

na

2.Materials and Methods 2.1 Typing Success on (Bones samples)

ur

A number of 30 challenged samples were chosen based on data generated for these samples with standard CE methodology. These samples included DNA from aged bone samples and poor STR results and 30 bone DNA samples that yielded a varying number of typable alleles with CE technology. 2.2 ForenSeq DNA Signature Prep Kit (Beta version)

Jo

The beta version of the ForenSeq DNA Signature Prep kit provides PCR primer mixes for the targeted amplification of 58 STRs, 94 identity informative SNPs (iSNPs), 56 ancestry informative SNPs (aSNPs) and 22 phenotypic informative SNPs (pSNPs) [2]. A primer mix containing a pair of tagged oligos for each target sequence was mixed with the DNA sample. PCR cycles linked the tags to the copies of each target to form DNA templates consisting of the regions of interest flanked by universal primer sequences. The tags were used to attach indexed adapters, which were then amplified using PCR, purified, pooled into a single tube, and then sequenced. This process was done with integrated indexing to support sequencing of 30 samples in a single run including the positive and negative controls. The samples were sequenced on Illumina Miseq FGx [19]. 3.Results

Page 2 of 9 3.1 ForenSeq™ UAS Results

ro of

The ForenSeq™ UAS provided quality and coverage data for all samples and was used to determine STR and SNP genotypes as well as estimate ancestry and phenotype. The analytical threshold was set at 10 reads while the stochastic threshold was set at 30 reads. The ForenSeq™ UAS calculated 1/RMPs from the STR length-genotypes, STR sequence-genotypes, and SNP genotypes. Allele frequencies from the (ForenSeq™) universal software population datasets (African, admixed American, Asian, and European) were used. The ForenSeq™ DNA Signature kit results were quite robust given the nature of the samples. A total of 88.8% of the loci was observed to have amplified, of which 84.3% were <150 bp in fragment size. Of all the markers which amplified the highest proportion present were pSNPs (100%), auSTRs and iSNPs (86.5%) and AISNPs (96.4%). Each sample produced concordant results with the GlobalFiler™ kit. MPS results showed a higher number of allele recovery for autosomal STRs apart from sample No 22 (1406/12). This sample had a DNA quantity of (0.018 ng) and a DI of 14. Sample 11 (90/12) had a high DI 7 and GlobalFiler™ alleles recovered of 40. Sample No 9 had DNA quantity of 0.001ng; DI of 1133 and 33 GlobalFiler™ alleles were recovered. For all the bone samples, the auSTR allele recovery was much higher than the GlobalFiler™ kit despite low quant values and inhibited nature of these samples. Of all the markers amplified the highest proportion present were (100%) phenotypic SNPs, (98.6%) for autosomal STRs, X-STRs, (98.6%) identity SNPs and ancestry SNPs (Fig 1 A&B).

na

lP

re

-p

The discriminatory power for each analysis of degraded DNA from bones, random match probabilities were calculated. The inverse of RMPs (1/RMPs) were reported to give the likelihood of obtaining the same DNA profile from an unrelated individual of the US Caucasian/European population group (Fig 3). The GlobalFilerTM kit was able to consistently achieve the highest discriminating power, having an average 1/RMP value of one in (10 27). The STR sequence-genotype was more discriminating compared to the STR length-genotype because the underlying sequence variation of the STRs increased allelic diversity and therefore power. In addition, the small number of reported loci combined with the low discriminating power of bi-allelic SNPs suppressed the 1/RMP values for the MiSeq® iSNP which had an average 1/RMP value of one in (1012). Looking at the unknown bone samples, the results generated the highest 1/RMP values, an average of one in (1036). Predications were made using the ForenSeq™ DNA Signature kit examined 56 SNPs that estimated biogeographical ancestry of the samples based on principle component analysis (PCA). The ForenSeq™ UAS created PCA plots that reflected the best fit population estimate of each sample’s biogeographical ancestry using 1000 Genomes data: Admixed American, African, East Asian, European. Results showed that the major population of biogeographical ancestry for the skeletal bones was European, which was consistent with antemortem records (Fig 3). The MiSeq® FGx™ platform also had to the potential to estimate hair and eye colour with 22 phenotype informative SNPs [16], [18]. The ForenSeq™ UAS could generate individual probabilities for four hair color categories (black, brown, blonde, and red) and three eye color categories (brown, blue, and intermediate). In order to make these estimates, all phenotype SNP loci had to be detected [6].

ur

4.Discussion

Jo

For the majority of sequencing work, depth of coverage refers to the average coverage per base across a run, whereas targeted sequencing for forensic applications relies on the actual amplicon or base count for comparison between alleles/loci and for the accurate identification of the present variants. In our study, the depth of coverage for the degraded bone samples found at 95.6% and the range of read depth was 20000 to 4000000 reads, which means that the MPS based analysis were highly sensitive. In this study, for all 30 bone samples, the STR allele recovery using MPS was higher than the GlobalFilerTM kit. Positive and negative controls performed as expected. The results were highly informative and the study demonstrates the potential of MPS to analyze unidentified human skeletal remains and to provide more genetic information from the same initial quantities of DNA sample as that of CE-based analyses. Combined results for all MPS panels included genetic data for 24/24 Y-STRs, 56/56 ancestry informative SNPs, 24/24 phenotype-informative SNPs, 27/27 autosomal STRs (plus amelogenin), and 8/8 X-STRs. The average number of STRs loci detected is (2069) and SNPs alleles (4546). Fig 2B shows that the number of loci which is detected in ForenSeqTM DNA signature kit is more than the normal amplification GlobalFilerTM kit (Fig 2A). An example of old sample which showed a highly degraded sample in Case 627-91, 28 years old gave a partial degraded in GlobalFilerTM EPG (Fig 1B), however, the ForenSeqTM DNA signature kit. It showed a

Page 3 of 9 successful full profile for 27 STRs, 21 Y-STRs belongs to (I2a1b3) Haplogroup of Western Europe predicted in NevGen online tool.

ro of

Ancestry-informative SNP results were obtained for all ancestry SNP markers for 20 samples, 10 samples had a range of 31 to 54 ancestry SNPS respectively. Using the ancestry-informative SNP data, the major population bioancestry bone samples was determined to be Western European Ancestry (86% of samples), followed by Admixed American (7%), and East Asia (7%) since the bone cases were from Belgrade Serbia. The phenotypic SNPs provided blue eye colour predictions and the most likely hair colour was blond or dark blond/brown for majority of the samples in line with the expected phenotypic features [4], [17]. The Nevgen provided Y-Haplogroup predictions that were used. The bone samples from Serbia showed that the samples ancestry predicted to be Western European (86%), Northern and Western European (86%), followed by Admixed American and Europe (7%). The data show that haplogroup I2a was the most common Y-haplogroup in the Europeans studied here. This was reflected in this study where 44% of the bone samples were predicted to belong to haplogroup I2a (44%) followed by E1b1b (17%), G2a (11%), R1b (11%), J2a (6%), C2b1a (5%) and N2 (6%). These Haplogroups are presented in high percentages within European populations, especially in Serbia. A study by Reguiero et al. (2012) indicated that about 58% of Serbian Y-haplogroups (I1, I2a and R1a1a) belong to ancestries and assumed to be preNeolithic [15]. Haplogroup I2a seems to have come out of the Neolithic period and strongly linked to Neolithic cultures in South-East and North-Western Europe.

-p

It has been recommended to generate data using the limited 150 base single read sequences as it is available now on the MiSeq platform [8], [10], [20]. In this study, 30 Bone samples DNA yielded positive results after sequencing for approximately 230 markers. There are 30 bone samples, which give positive results produced STR profiles that were 98.6% percentage complete (a total of 98.6% loci per full profile) and an of these produced full SNP profiles. The results produced STR loci recovery of alleles generated in MiSeq FGX is higher than the GlobalFilerTM. Of all the markers amplified the highest proportion present were (100%) phenotypic SNPs, (98.6%) for autosomal STRs, X-STRs, (98.6%) identity SNPs, and ancestry SNP.

Jo

ur

na

lP

re

Ancestry-informative SNP results were obtained for 54 of the 54 SNP markers amplified via the ForenSeq™ DNA Signature Prep Kit for 20 samples, 10 samples have a range of 31 to 54 ancestry SNPS respectively. About 96.3% of ancestry-informative SNPs yielded were generated successfully in this study from 30 bone samples. Using the ancestry-informative SNP data, the major population bio-ancestry bone samples was determined to be European Ancestry since the bone cases are from Serbia except two samples which shown Asian Ancestry. The targeted sequencing data produces high depths of coverage, thereby enhancing the possibility of successful analysis of challenging forensic specimens such as mixtures and highly degraded DNA samples, since the bones have a range of degradation index (1-40) [11]. Fig 4, Pie chart showing results of the Haplogroup Predictor online tool [2] for the male bone samples Y-STR haplotypes and ancestry SNPs biographic predictor using Illumina Forensic signature Tool. The bone samples is predicted to belong to haplogroup I2a (44%) followed by E1b1b (17%), G2a (11%), R1b (11%), J2a (6%), C2b1a (5%) and N2 (6%), whilst the results for the bones are with a range of probability of (71% -100%), (Table 1 -Supplementary material). Sample number 25 showing results of the Nevgen Haplogroup Predictor for the male bone samples Y-STR haplotypes and ancestry biographic predictor using ForenSeqTM Universal software, it shows almost the same Haplogroup in both tools which is R1B and shows the ancestry Y-Haplogroup with a 32% match in the World Wide in the Y-Haplogroup for this bone sample that is originated from Western European. 5. Conclusion In this work Forensic Signature DNA kit analyses of degraded bone samples indicated that the allele recovery for auSTRs was significantly higher than the GlobalFiler™ kit. All samples sowed concordant results for GlobalFiler™ and ForenSeq™ kit. MPS data showed concordant and superior quality auSTR profiles. The availability of information about the visible phenotypic traits along with identity and ancestry information and traditional STR data can tremendously enhance the value of the MPS analyses in forensic casework. This work has shown clear advantages of MPS analyses of degraded human remains in terms of the amount of relevant information gained for forensic casework and mass disaster victim identification. Overall 230 markers of were detected in all of 30 of the bone samples that includes femur and tooth samples.

Page 4 of 9

References 1.

Børsting, C. and Morling, N. Next generation sequencing and its applications in forensic genetics. Forensic Sci. Int. Genet. 2015; DOI: http://dx.doi.org/10.1016/j.fsigen.2015.02.002

2. ATHEY, T. W. 2006. Haplogroup prediction from Y-STR values using a Bayesian-allele-frequency approach. J Genet Geneal, 2, 3439.

Jo

ur

na

lP

re

-p

ro of

3. CHAITANYA, L., PAJNIČ, I. Z., WALSH, S., BALAŽIC, J., ZUPANC, T. & KAYSER, M. 2017. Bringing colour back after 70 years: Predicting eye and hair colour from skeletal remains of World War II victims using the HIrisPlex system. Forensic Science International: Genetics, 26, 48-57. 4. COLLINS, J. R., STEPHENS, R. M., GOLD, B., LONG, B., DEAN, M. & BURT, S. K. 2003. An exhaustive DNA micro-satellite map of the human genome using high performance computing. Genomics, 82, 10-19. 5. DIVNE, A.-M., EDLUND, H. & ALLEN, M. 2010. Forensic analysis of autosomal STR markers using Pyrosequencing. Forensic science international: genetics, 4, 122-129. 6. DRAUS-BARINI, J., WALSH, S., POŚPIECH, E., KUPIEC, T., GŁĄB, H., BRANICKI, W. & KAYSER, M. 2013. Bona fide colour: DNA prediction of human eye and hair colour from ancient and contemporary skeletal remains. Investigative genetics, 4, 1. 7. EKBLOM, R. & GALINDO, J. 2011. Applications of next generation sequencing in molecular ecology of non-model organisms. Heredity, 107, 1-15. 8. FORDYCE, S. 2011. MC Á vila-Arcos, E. Rockenbauer, C. Børsting, R. Frank-Hansen, FT Petersen, et al., High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform. BioTechniques, 51, 127-133. 9. GRUNENWALD, A., KEYSER, C., SAUTEREAU, A.-M., CRUBÉZY, E., LUDES, B. & DROUET, C. 2014. Adsorption of DNA on biomimetic apatites: Toward the understanding of the role of bone and tooth mineral on the preservation of ancient DNA. Applied Surface Science, 292, 867-875. 10. KIM, E. H., LEE, H. Y., YANG, I. S., JUNG, S.-E., YANG, W. I. & SHIN, K.-J. 2016. Massively parallel sequencing of 17 commonly used forensic autosomal STRs and amelogenin with small amplicons. Forensic Science International: Genetics, 22, 1-7. 11. KING, T. E., FORTES, G. G., BALARESQUE, P., THOMAS, M. G., BALDING, D., DELSER, P. M., NEUMANN, R., PARSON, W., KNAPP, M. & WALSH, S. 2014. Identification of the remains of King Richard III. Nature communications, 5. 12. MEYER, M., FU, Q., AXIMU-PETRI, A., GLOCKE, I., NICKEL, B., ARSUAGA, J.-L., MARTÍNEZ, I., GRACIA, A., DE CASTRO, J. M. B. & CARBONELL, E. 2014. A mitochondrial genome sequence of a hominin from Sima de los Huesos. Nature, 505, 403-406. 13. MOROZOVA, O., HIRST, M. & MARRA, M. A. 2009. Applications of new sequencing technologies for transcriptome analysis. Annual review of genomics and human genetics, 10, 135-151. 14. ROCKENBAUER, E., HANSEN, S., MIKKELSEN, M., BØRSTING, C. & MORLING, N. 2014. Characterization of mutations and sequence variants in the D21S11 locus by next generation sequencing. Forensic Science International: Genetics, 8, 68-72. 15. REGUEIRO, M., RIVERA, L., DAMNJANOVIC, T., LUKOVIC, L., MILASIN, J. & HERRERA, R. J. (2012). High levels of Paleolithic Y-chromosome lineages characterize Serbia. Genetics, 498, 59-67. 16. TOOM, V., WIENROTH, M., M’CHAREK, A., PRAINSACK, B., WILLIAMS, R., DUSTER, T., HEINEMANN, T., KRUSE, C., MACHADO, H. & MURPHY, E. 2016. Approaching ethical, legal and social issues of emerging forensic DNA phenotyping (FDP) technologies comprehensively: Reply to ‘Forensic DNA phenotyping: Predicting human appearance from crime scene material for investigative purposes’ by Manfred Kayser. Forensic Science International: Genetics, 22, e1-e4. 17. WALSH, S., CHAITANYA, L., CLARISSE, L., WIRKEN, L., DRAUS-BARINI, J., KOVATSI, L., MAEDA, H., ISHIKAWA, T., SIJEN, T., DE KNIJFF, P., BRANICKI, W., LIU, F. & KAYSER, M. 2014. Developmental validation of the HIrisPlex system: DNA-based eye and hair colour prediction for forensic and anthropological usage. Forensic Science International: Genetics, 9, 150-161. 18. WALSH, S., LIU, F., WOLLSTEIN, A., KOVATSI, L., RALF, A., KOSINIAK-KAMYSZ, A., BRANICKI, W. & KAYSER, M. 2013. The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Science International: Genetics, 7, 98-115. 19. Illumina, Inc. Forenseq™ DNA Signature Prep Guide, February 2016 (Part#15049528 Rev. D) http://support.illumina.com/content/dam/illumina-support/documents/documentation/chemistry_documentation/forenseq/forenseqdna-signature-prep-guide-15049528-d.pdf 20. ZENG, X., KING, J. L., STOLJAROVA, M., WARSHAUER, D. H., LARUE, B. L., SAJANTILA, A., PATEL, J., STORTS, D. R. & BUDOWLE, B. 2015. High sensitivity multiplex short tandem repeat loci analyses with massively parallel sequencing. Forensic Science International: Genetics, 16, 38-47.

ro of

-p

re

lP

na

ur

Jo

Page 5 of 9

Page 6 of 9

B

lP

re

-p

ro of

A

Jo

ur

na

Figure 1. (A & B) Results of 30-bone samples run on FGx-MiSeq showing a cluster density of 866 K/mm2. A total 95% of clusters generated run passed filters, which showed that the reaction was highly successful. The average intensity of the reads was 200000.

Page 7 of 9

B

A

D22S1045

4430

D21S11

1214 9439

D20S482

D19S433

3338 2228 2827 1327 1069 1953 1056 435

D18S51 D17S1301 D16S539

PentaE D13S317 D12S391 vWA

TH01

7487

D10S1248

ro of

2278

D9S1122

4858 4371

D8S1179

D7S820

1685

D6S1043

4054

CSF1PO

1308 1314 2219

D5S818

-p

FGA

D4S2408

3987 4746

D3S1358

lP

re

D2S1338

2103

D2S441

4240

TPOX

2056

D1S1656

698 564

Amelogenin 0

2000

4000

6000

8000

Number of Reads in MPS

Jo

ur

na

Figure 2. (A & B) DNA Profile of a degraded bone sample (DI = 13) showing a dropout of 7 auSTR loci using GlobalFiler™ kit. MPS results for various markers from the same sample show high efficacy and enhanced discrimination power of Illumina® ForenSeq™ DNA Signature Prep kit

10000

na

lP

re

-p

ro of

Page 8 of 9

Jo

ur

Figure 3. Phenotype estimation of a Serbian bone sample showing a high prediction for blond hair and blue eyes. The sample was predicted to have an ancestry of Eastern European population correctly

Page 9 of 9

C2b1a 5%

R1b 11% N2 6%

E1b1b 17%

G2a2 11%

na

I2a1 44% C2b1a

lP

re

-p

ro of

J2a2 6%

ur

E1b1b

G2a2

I2a1

J2a2

N2

R1b

Figure 4. Pie chart showing NevGen Haplogroup Predictor tool results for 18 male bone samples showing the distribution of

Jo

haplogroups.