Forensic Science International 171 (2007) 237–249 www.elsevier.com/locate/forsciint
Announcement of population data
Population data from sub-populations of the Northern Territory of Australia for 15 autosomal short tandem repeat (STR) loci Carmen Eckhoff a,1, Simon J. Walsh b,c,*, John S. Buckleton d b
a Northern Territory Police, Fire and Emergency Services, PO Box 39764, Winnellie, NT 0821, Australia Centre for Forensic Science, University of Technology Sydney, PO Box 123, Broadway, NSW 2007, Australia c Science & Justice Consulting, PO Box 83, Surry Hills, NSW 2010, Australia d ESR Ltd., Private Bag 92-021, Auckland, New Zealand
Received 2 May 2005; received in revised form 5 March 2006; accepted 8 June 2006 Available online 1 August 2006
Abstract It is a requirement that forensic DNA profiling evidence be accompanied by an estimation of its weight, in order that the court can assign an appropriate probative value to the evidence during legal proceedings. There are various models by which this estimation can be made, but each relies on approximations of the allele frequencies in the relevant population. It is also important to assess relevant population genetic features of the available data. This report provides allele frequencies and estimates of common population genetic parameters for the major sub-populations of the Northern Territory of Australia genotyped at 15 autosomal short tandem repeat (STR) loci. # 2006 Elsevier Ireland Ltd. All rights reserved. Keywords: DNA profiling; Autosomal microsatellites; Population data; Allele frequencies; Profiler plus; Identifiler; Independence testing; Northern Territory
Population: Forensic biology personnel from the Northern Territory Police, Fire and Emergency Services (NTPFES) have compiled sub-population datasets from five major subpopulation groups of Northern Australia (Table 1). The subpopulation of origin of each donor was assigned by a combination of self-declaration or assignment by a third party. Extraction: DNA was extracted from blood stain cards and buccal swabs. The reference samples were air-dried before being stored at 20 8C. A small portion was removed from the reference sample and DNA was extracted using the Chelex method as described by Walsh et al. [1]. Amplification: The NTPFES Forensic Biology Laboratory uses both the Applied Biosystems AMPFlSTR1 Profiler PlusTM kit and AMPFlSTR1 IdentifilerTM kit for routine casework and database samples. In a single PCR reaction the kits amplify 9 and 15 tetranucleotide loci, respectively, as well
* Corresponding author. Tel.: +61 2 9514 4064; fax: +61 2 9514 8206. E-mail addresses:
[email protected] (C. Eckhoff),
[email protected] (S.J. Walsh). 1 Tel.: +61 8 8922 1324; fax: +61 8 8922 1326. 0379-0738/$ – see front matter # 2006 Elsevier Ireland Ltd. All rights reserved. doi:10.1016/j.forsciint.2006.06.009
as a gender identification marker. The procedures used by the NTPFES Biology Laboratory are proportionate to the manufacture’s protocols for both kits, using a total amplification volume of 20 ml, not 50 ml as recommended. The samples were amplified on the Applied Biosystems GeneAmp1 PCR System 9700 thermal cycler in accordance to the respective Applied Biosystems protocol for each kit. Detection: Aliquots of 1–2 ml of the amplified product were combined with 24 ml of HiDi Formamide containing 0.4 ml GeneScanTM 500RoxTM (Profiler PlusTM) or 0.4 ml GeneScanTM 500LizTM (IdentifilerTM) and analysed using an automated sequencer, the ABI PrismTM 310 Genetic Analyzer, as per the Applied Biosystems protocols. GeneScanTM and Genotyper software were used to verify the fragment size of the alleles and thus determine the DNA profile for each of the reference samples. All DNA results were independently verified by a second reader. Results:The allele frequencies for the 15 STR loci have been tabulated within each of the five sub-population categories listed in Table 1. The data are represented in Tables 2–6. Estimates of observed and expected heterozygosity (HO and HE, respectively; Tables 2–6) and Fisher’s exact test [2] for allelic association was undertaken with 10,000 shuffles using the Genetic Data Analysis [3] software provided courtesy of Paul
238
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
Table 1 Description of sub-population categories for the NT Sub-population
Description
NT Caucasian NT Declared Aboriginal NT Pure Aboriginal NT Asian NT East Timorese
Data Data Data Data Data
compiled compiled compiled compiled compiled
from from from from from
individuals individuals individuals individuals individuals
who who who who who
have self-declared their ethnicity as Caucasian have self-declared their ethnicity as Aboriginal (or Indigenous Australian) are known to be Indigenous Australian and who are resident of remote tribal communities have self-declared their ethnicity as Asian have self-declared their ethnicity as East Timorese
Lewis (Department of Ecology and Evolutionary Biology, The University of Connecticut). Departures from Hardy–Weinberg equilibrium ( p < 0.05) were detected at some loci (Table 7). However, in this dataset we have what is termed the ‘multi-testing problem’ as we have undertaken 15 Hardy–Weinberg and 105 linkage-equilibrium tests per ethnic group. In addition, some of these comparisons were based on small datasets. Hypothesis testing on very small datasets is ineffective [4,5]. Accordingly the results from tests on datasets of less than 200 profiles were not considered. This meant that six loci (TH01, TPOX, CSF, D16, D2 and D19) from the Asian sub-population dataset were excluded from consideration. There are various ways offered to deal with the multitesting problem. These include the Bonferonni correction, a simple graphical examination (including p–p plots) or the truncated product method. The Bonferonni correction suffers from the handicap that it lowers the power of an already weak test [6]. For these reasons we prefer the other two methods. Graphical representations of these data are given in Fig. 1. If the hypotheses of Hardy–Weinberg and linkage equilibrium are true then the p-values should be distributed uniformly between 0 and 1; p U[0,1]. As an example, the x = y trend-line in the p–p plots (Fig. 1) represents equilibrium and deviations from that trend-line can be seen as mild departures from equilibrium. The 95% confidence limit is also displayed on the p–p plots as the region within the two curved lines. The p–p plot for the Caucasian dataset (Fig. 1) shows evidence of mild departure. The values are plotting away from the linear trend-line (which represents a state of Hardy– Weinberg and linkage equilibrium) although remain within the 95% confidence limit envelope. The p–p plots for both the Declared and Pure Aboriginal (Fig. 1) datasets show evidence of departure from independence as values plot away from the linear trend-line (which represents a state of Hardy– Weinberg and linkage equilibrium) and occur mostly outside the 95% confidence limit envelope. The shape of the curve is more pronounced in the Declared Aboriginal dataset, than in the Pure Aboriginal dataset, however, the difference in the sizes of these datasets limits our ability to compare these outcomes. The p–p plots for both the Asian and East Timorese (Fig. 1) sub-populations do not show any evidence of departure. The truncated product method gives the test statistics provided in Table 8. This elegant method was described by Zaykin et al. [7]. This method involves forming the sum of 2 ln( p) across, say, x independent tests. This sum is expected
to have a chi-square distribution with 2x degrees of freedom and a p-value is estimated (Table 8). It is important to note that these tests are not independent and hence the truncated product method can only be used as a guide. The truncated product test statistics do not suggest any strong evidence for departure from independence for the Asian or East Timorese datasets ( p > 0.05) but do give such an indication for the Caucasian, Declared and Pure Aboriginal datasets ( p < 0.05). Discussion: Due to the range of loci analysed in the Northern Territory sub-population datasets, their sizes vary widely for different loci and different sub-population categories. The Caucasian and Aboriginal datasets are large by world standards for 9 of the 15 loci analysed (corresponding with the loci of the Profiler PlusTM system), and moderately sized for the remaining six loci. The Asian and East Timorese datasets are small by comparison, but of suitable size for allele frequency estimation at 9 of the 15 loci (again corresponding with the loci of the Profiler PlusTM system). Caution must be exercised when interpreting data from small datasets. We have advised in the past, and re-iterate here, the need for adequate correction for sampling error when using allele frequency data from moderate to small datasets [8]. A discussion of approaches has been given by Curran and colleagues [9,10]. Caucasian populations worldwide are believed to be very close to equilibrium and previously when we have analysed datasets of this size we have found little or no evidence for departure [11–14]. However, some evidence of departure was observed in the NT Caucasian sub-population dataset (Fig. 1, Table 8). This mild departure is not grossly beyond expectation. The size of the dataset is approaching a level that permits the detection of small-scale departures. Aboriginal populations are believed to be sub-structured with complex patterns of relatedness within and between communities and/or tribal groups. Additionally, certain Aboriginal populations of Australia have undergone admixture with other non-indigenous sub-populations. Hence, the expectation for Aboriginal datasets is moderate disequilibrium. The combined Aboriginal dataset analysed here is extensive. It has also been partitioned into declared and pure categories. Testing these sub-sets revealed a greater extent of disequilibrium in the Declared Aboriginal dataset. This may be indicative of more extensive admixture, or may also be related to the increased power of the tests to find departures, due to the greater size of the Declared Aboriginal dataset relative to the Pure Aboriginal dataset.
Table 2 Allele frequencies for the 15 IdentifilerTM loci for the NT Caucasian sub-population Allele
D3 TM
– – – – – – – – – 0.0001 – 0.0004 – – – 0.0011 – – 0.1063 – – 0.0975 – 0.2223 – 0.2671 – 0.2001 – 0.0886 – 0.0152 – 0.0011 – – 0.0001 – – – – – – – – – –
– – – – – – – – – – – – – – – – – – – – – 0.0001 – 0.0003 – 0.0009 – 0.0144 0.0001 0.0591 0.0002 0.1370 0.0021 0.1791 – 0.0028 0.1799 0.0090 0.0001 0.1467 0.0045 0.1390 0.0007 0.0817 0.0004 0.0317 0.0001
D8
D21 – – – – 0.0210 – 0.0131 – 0.0887 0.0785 – 0.1423 0.0001 – – 0.3215 – – 0.1919 – – 0.1076 – 0.0313 – 0.0037 – 0.0003 – – – – – – – – – – – – – – – – – – –
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.0010 – 0.0009 0.0019 –
D18 – – – – – – 0.0004 – 0.0078 0.0147 0.0001 0.1443 – – – 0.1260 – 0.0001 0.1643 0.0001 0.0001 0.1491 – 0.1375 – 0.1070 0.0001 0.0711 – 0.0436 – 0.0180 0.0002 0.0081 – – 0.0049 – – 0.0015 – 0.0004 – 0.0003 – 0.0001 –
D5
D13 – 0.0002 0.0007 – 0.0024 – 0.0369 – 0.0704 0.3632 – 0.3588 – – – 0.1541 – – 0.0106 – – 0.0026 – – – – – – – – – – – – – – – – – – – – – – – – –
– – 0.0003 – 0.1342 – 0.0773 – 0.0654 0.3058 – 0.2801 – – – 0.0936 – – 0.0429 – – 0.0005 – – – – – – – – – – – – – – – – – – – – – – – – –
D7
TH01 – – 0.0234 0.0001 0.1587 – 0.1588 – 0.2665 0.2086 – 0.1459 – – – 0.0307 0.0001 – 0.0067 – – 0.0006 – – – – – – – – – – – – – – – – – – – – – – – – –
0.0016 0.2322 0.1885 – 0.1328 0.0005 0.1525 0.2814 0.0093 0.0011 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
TPOX – 0.0012 – – 0.5125 – 0.1241 – 0.0546 0.2696 – 0.0374 – – – – – – – – – 0.0006 – – – – – – – – – – – – – – – – – – – – – – – – –
CSF – – 0.0018 – 0.0072 – 0.0235 – 0.2398 0.3060 – 0.3325 – – – 0.0759 – – 0.0120 – – 0.0012 – – – – – – – – – – – – – – – – – – – – – – – – –
D16 – – – – 0.0088 – 0.1103 – 0.0574 0.2926 – 0.3309 – – – 0.1721 – – 0.0265 – – 0.0015 – – – – – – – – – – – – – – – – – – – – – – – – –
D2 – – – – – – – – – – – – – – 0.0015 – – – – – – 0.0030 – 0.0519 – 0.1840 – 0.0801 – 0.1039 – 0.1350 – 0.0282 – – 0.0341 – – 0.1068 – 0.1187 – 0.1276 – 0.0223 –
D19 – – – – – – – – 0.0015 0.0044 – 0.0746 0.0015 – – 0.2515 – 0.0058 0.3480 – 0.0190 0.1725 0.0570 0.0512 0.0073 0.0044 – – 0.0015 – – – – – – – – – – – – – – – – – –
239
loci – – – – – – – – – 0.0010 – 0.0004 – – – 0.0029 – – 0.1175 – – 0.2745 – 0.2505 – 0.2014 – 0.1373 0.0001 0.0134 – 0.0008 – 0.0001 0.0001 – – – – – – – – – – – –
FGA
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
Identifiler 5 6 7 7.3 8 8.3 9 9.3 10 11 11.2 12 12.1 12.2 12.3 13 13.1 13.2 14 14.1 14.2 15 15.2 16 16.2 17 17.1 18 18.2 19 19.2 20 20.2 21 21.1 21.2 22 22.2 22.3 23 23.2 24 24.2 25 25.2 26 26.1
vWA
240
Table 2 (Continued ) Allele
vWA
FGA
D8
D21
D18
D5
D13
D7
TH01
TPOX
CSF
D16
D2
D19
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.789 0.789 9702
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.810 0.808 9684
– 0.0079 0.0014 – 0.0004 – – – 0.0001 – – – – – – – – – – – – – – – – – – 0.864 0.861 9692
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.812 0.806 9696
0.0002 0.0356 0.1519 0.0001 0.2159 0.0001 0.0010 0.0001 0.2409 0.0335 0.0719 0.1058 0.0148 0.0845 0.0009 0.0299 0.0003 0.0002 0.0039 0.0010 0.0011 0.0001 0.0005 0.0007 0.0007 0.0004 0.0001 0.845 0.846 9686
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.876 0.881 9588
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.709 0.707 9696
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.789 0.779 9668
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.812 0.813 9574
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.791 0.785 1830
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.645 0.639 1684
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.732 0.745 1660
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.760 0.776 680
– 0.0015 0.0015 – – – – – – – – – – – – – – – – – – – – – – – – 0.885 0.896 674
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.775 0.807 684
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
26.2 27 28 28.2 29 29.1 29.2 29.3 30 30.2 31 31.2 32 32.2 33 33.2 34 34.1 34.2 35 35.2 36 36.2 37.2 38.2 39.2 40.2 HE HO n
D3
Table 3 Allele frequencies for the 15 IdentifilerTM loci from the NT Declared Aboriginal sub-population Allele
D3 TM
– – – – – – – – – – – – – 0.0001 – 0.0356 – – 0.0702 – – 0.2427 – 0.2701 0.2181 – 0.1398 – – 0.0219 – – 0.0014 0.0001 – – – – – – – – – – – – –
D8 – – – – – – – – – – – – – 0.0001 – – – – – – – 0.0017 – 0.0052 0.0258 – 0.0729 0.0001 0.0001 0.1050 0.0001 0.0011 0.1087 – 0.0037 – 0.1671 0.0056 0.1639 0.0034 0.1642 0.0037 0.1139 0.0001 0.0002 0.0403 0.0118
D21 – – – 0.0028 – 0.0025 – – – 0.0313 0.0889 0.1531 – 0.2372 – 0.1812 – – 0.1907 – 0.0033 0.0813 – 0.0233 0.0041 – 0.0002 – – – – – – – – – – – – – – – – – – – –
D18 – – – – – – – – – – – 0.0001 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.0003 – – 0.0022 0.0065 0.0106
D5 – – – – – – – – – 0.0028 0.0585 0.0471 0.0002 0.2038 0.0001 0.1694 0.0001 – 0.1216 – – 0.1297 – 0.0737 0.0781 0.0001 0.0619 0.0001 0.0001 0.0257 0.0001 – 0.0175 – – 0.0001 0.0080 – 0.0010 – 0.0001 – – – – – –
D13 – 0.0007 – 0.0006 – 0.0771 – – – 0.2201 0.2772 0.2822 – 0.1273 – 0.0137 – – 0.0011 – – – – – – – – – – – – – – – – – – – – – – – – – – – –
D7 0.0007 0.0028 – 0.3074 – 0.0513 – – – 0.0376 0.3184 0.2120 – 0.0517 – 0.0182 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
TH01 0.0003 0.0085 0.0040 0.3697 0.0001 0.0847 0.0001 – – 0.1987 0.2261 0.0909 – 0.0160 – 0.0008 – – 0.0001 – – – – – – – – – – – – – – – – – – – – – – – – – – – –
0.3615 0.2559 – 0.1227 – 0.1925 – – 0.0599 0.0065 0.0012 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
TPOX 0.0006 0.0006 – 0.2239 – 0.4325 – – – 0.0486 0.2790 0.0142 – 0.0006 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
CSF – 0.0012 – 0.0090 – 0.0127 – 0.0006 – 0.2398 0.2795 0.3560 – 0.0886 – 0.0114 – – 0.0012 – – – – – – – – – – – – – – – – – – – – – – – – – – – –
D16 – 0.0009 – 0.0091 – 0.0356 – – – 0.1005 0.3958 0.2331 – 0.1764 – 0.0439 – – 0.0046 – – – – – – – – – – – – – – – – – – – – – – – – – – – –
D2
D19 – – – – – – – – – – – – – – – – – – – – – 0.0064 – 0.0681 0.1031 – 0.2320 – – 0.0958 – – 0.0617 – – – 0.0773 – 0.1510 – 0.1197 – 0.0672 – – 0.0175 –
– – – – – – – – – – 0.0027 0.0605 – 0.3484 0.0045 0.4161 – 0.0045 0.1227 0.0054 – 0.0289 0.0027 0.0027 0.0009 – – – – – – – – – – – – – – – – – – – – – –
241
loci – – – – – – – – – – 0.0013 0.0004 – 0.0028 – 0.0444 – – 0.3773 – – 0.2448 – 0.2158 0.0967 – 0.0131 – – 0.0023 0.0003 – 0.0002 0.0007 – – – – – – – – – – – – –
FGA
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
Identifiler 6 7 7.3 8 8.3 9 9.1 9.2 9.3 10 11 12 12.2 13 13.2 14 14.1 14.2 15 15.2 15.3 16 16.2 17 18 18.2 19 19.1 19.2 20 20.1 20.2 21 21.1 21.2 21.3 22 22.2 23 23.2 24 24.2 25 25.1 25.2 26 27
vWA
242
Table 3 (Continued ) Allele
D3
FGA
D8
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.794 0.783 10744
0.0001 – 0.0011 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.875 0.835 0.853 0.833 10732 10750
D21
D18
– – 0.1131 – 0.0001 – 0.2258 – 0.0003 – 0.0018 – 0.0001 – 0.1948 – 0.0386 – 0.0413 – 0.1142 – 0.0135 – 0.0909 – 0.0014 – 0.0468 – 0.0001 – 0.0005 – 0.0076 – 0.0237 – 0.0001 – 0.0203 – 0.0001 – 0.0174 – 0.0155 – 0.0078 – 0.0034 – 0.0010 – 0.870 0.876 0.845 0.849 10714 10564
D5
D13
D7
TH01
TPOX
CSF
D16
D2
D19
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.773 0.746 10746
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.752 0.735 10710
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.757 0.746 10530
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.749 0.725 1704
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.683 0.635 1688
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.730 0.728 1660
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.745 0.697 1094
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.871 0.856 1086
– – – – – – – – – – – – – – – – – – – – – – – – – – – 0.686 0.713 1108
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
27.1 – 28 – 28.2 – 29 – 29.1 – 29.2 – 29.3 – 30 – 30.2 – 31 – 31.2 – 32 – 32.2 – 33 – 33.2 – 34 – 34.1 – 34.2 – 35.2 – 36 – 36.2 – 37 – 37.2 – 38.2 – 39.2 – 40.2 – 41.2 – HE 0.740 0.721 HO n 10756
vWA
Table 4 Allele frequencies for the 15 IdentifilerTM loci from the NT Pure Aboriginal sub-population Allele
D3 TM
– – – – – – – – – – – 0.0350 – 0.0650 – – 0.2359 – 0.2906 0.2222 0.1274 – 0.0231 – 0.0009 – – – – – – – – – – – – – – – – – – – – – –
– – – – – – – – – – – – – – – – 0.0017 – 0.0017 0.0299 0.0589 – 0.1032 0.0017 0.1101 – 0.0034 0.1894 0.0051 0.1613 0.0034 0.1596 0.0034 0.1135 – 0.0418 0.0119 – – – – – – – – – –
D8
D21 – – – 0.0026 0.0026 – 0.0274 0.0915 0.1641 0.2248 – 0.1615 – 0.2145 – 0.0017 0.0846 – 0.0222 0.0026 – – – – – – – – – – – – – – – – – – – – – – – – – – –
– – – – – – – – 0.0009 – – – – – – – – – – – – – – – – – – – – – – – – – 0.0034 0.0043 0.0094 0.1026 0.2291 0.0034 0.2077 0.0385 0.0376 0.1179 0.0094 0.0778 0.0009
D18 – – – – – – 0.0009 0.0575 0.0334 0.2144 – 0.1801 – 0.1184 – – 0.1407 – 0.0540 0.0780 0.0635 0.0009 0.0309 – 0.0197 – – 0.0069 – 0.0009 – – – – – – – – – – – – – – – – –
D5
D13 – 0.0009 – – 0.0701 – 0.2598 0.2427 0.2667 0.1385 – 0.0162 – 0.0051 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
– 0.0060 – 0.3353 0.0350 – 0.0512 0.3106 0.2022 0.0410 – 0.0188 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
D7
TH01 – 0.0111 0.0077 0.3699 0.0616 – 0.1978 0.2568 0.0796 0.0137 – 0.0017 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
0.3808 0.2598 – 0.1379 0.1842 0.0329 0.0036 0.0009 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
TPOX – 0.0028 – 0.1955 0.4609 0.0475 0.2812 0.0121 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
CSF – – – 0.0123 0.0113 – 0.2155 0.3299 0.3393 0.0870 – 0.0038 – 0.0009 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
D16 – – – 0.0084 0.0253 – 0.1132 0.3632 0.2736 0.1841 – 0.0321 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
D2 – – – – – – – – – – – – – – – – 0.0098 – 0.0623 0.0574 0.3049 – 0.0918 – 0.0656 – – 0.0689 – 0.1361 – 0.1262 – 0.0623 – 0.0148 – – – – – – – – – – –
D19 – – – – – – – – 0.0984 0.3419 0.0048 0.4081 0.0032 0.0952 0.0048 – 0.0371 0.0016 0.0016 0.0032 – – – – – – – – – – – – – – – – – – – – – – – – – – –
243
loci – – – – – – – – – 0.0026 – 0.0401 – 0.3942 – – 0.2278 – 0.2099 0.1024 0.0205 – 0.0017 – – 0.0009 – – – – – – – – – – – – – – – – – – – – –
FGA
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
Identifiler 6 7 7.3 8 9 9.3 10 11 12 13 13.2 14 14.2 15 15.2 15.3 16 16.2 17 18 19 19.1 20 20.2 21 21.1 21.2 22 22.2 23 23.2 24 24.2 25 25.2 26 27 28 29 29.2 30 30.2 31 31.2 32 32.2 33
vWA
244
Table 4 (Continued ) Allele
D3
vWA
FGA
D8
D21
D18
D5
D13
D7
TH01
TPOX
CSF
D16
D2
D19
– – – – – – – – – – 0.737 0.700 1172
– – – – – – – – – – 0.789 0.761 1170
– – – – – – – – – – 0.871 0.858 1172
– – – – – – – – – – 0.834 0.800 1170
0.0487 0.0009 0.0060 0.0222 0.0214 0.0179 0.0009 0.0214 0.0094 0.0085 0.867 0.834 1170
– – – – – – – – – – 0.870 0.852 1166
– – – – – – – – – – 0.779 0.749 1170
– – – – – – – – – – 0.745 0.746 1172
– – – – – – – – – – 0.748 0.728 1168
– – – – – – – – – – 0.734 0.740 1124
– – – – – – – – – – 0.670 0.649 1074
– – – – – – – – – – 0.722 0.677 1058
– – – – – – – – – – 0.746 0.770 592
– – – – – – – – – – 0.845 0.816 610
– – – – – – – – – – 0.698 0.645 620
TH01
TPOX
Table 5 Allele frequencies for the 15 IdentifilerTM loci from the NT Asian sub-population Allele
D3 TM
Identifiler 6 7 8 9 9.3 10 11 12 12.2 13 13.2 14 14.2 15 15.2 16 16.2 17 18 18.2
loci – – – – – – – – – 0.0044 – 0.0267 – 0.3178 – 0.3489 – 0.2178 0.0800 –
vWA – – – – – – – – – – – 0.1733 – 0.0511 – 0.1667 – 0.2844 0.2289 –
FGA – – – – – – – – – – – – – – – 0.0044 – 0.0022 0.0222 0.0022
D8 – – 0.0022 0.0044 – 0.1222 0.1222 0.0889 – 0.2311 – 0.2044 – 0.1356 – 0.0667 – 0.0178 0.0044 –
D21 – – – – – – – – – – – – – – – – – – – –
D18 – – – – – 0.0022 0.0089 0.0578 – 0.1067 – 0.2244 – 0.1956 – 0.1489 – 0.1089 0.0667 –
D5 – 0.0156 – 0.0600 – 0.2178 0.2911 0.2467 – 0.1533 – 0.0156 – – – – – – – –
D13 – 0.0022 0.2701 0.1339 – 0.1027 0.2589 0.1496 – 0.0625 – 0.0201 – – – – – – – –
D7 – 0.0045 0.1875 0.0670 – 0.1763 0.3304 0.1808 – 0.0513 – 0.0022 – – – – – – – –
0.1613 0.3226 0.0726 0.3145 0.0726 0.0484 0.0081 – – – – – – – – – – – – –
– – 0.4919 0.1129 – 0.0645 0.3145 0.0161 – – – – – – – – – – – –
CSF – – 0.0081 0.0242 – 0.1935 0.2339 0.4435 – 0.0726 – 0.0161 – 0.0081 – – – – – –
D16 – – 0.0250 0.2250 – 0.1500 0.3500 0.2000 – 0.0250 – 0.0250 – – – – – – – –
D2 – – – – – – – – – – – – – – – 0.0750 – 0.0750 0.0500 –
D19 – – – – – 0.0250 0.0500 0.0500 0.0250 0.2250 0.0250 0.2750 0.0750 0.0750 0.0500 0.0250 0.0750 0.0250 – –
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
33.2 34.1 34.2 35.2 36.2 37.2 38 38.2 39.2 40.2 HE HO n
0.0044 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.724 0.702 450
0.0822 0.0133 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.801 0.804 450
0.0822 0.0600 0.0022 0.1267 0.0111 0.1867 0.0111 0.1600 0.0044 0.1711 0.0044 0.0778 0.0067 0.0578 0.0044 0.0022 – – – – – – – – – – – – – – – – 0.876 0.858 450
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.846 0.836 450
– – – – – – – – – – – – – – 0.0133 0.0756 0.0044 0.2467 0.0022 0.2133 0.0244 0.1000 0.0800 0.0222 0.1444 0.0022 0.0600 0.0022 0.0022 0.0022 0.0022 0.0022 0.848 0.862 450
0.0200 0.0200 – 0.0267 – 0.0111 – – – – – – – 0.0022 – – – – – – – – – – – – – – – – – – 0.858 0.831 450
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.781 0.738 450
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.807 0.817 448
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.787 0.777 448
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.764 0.806 124
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.647 0.597 124
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.711 0.742 124
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.782 0.700 40
0.2000 0.2000 – 0.0250 – 0.0250 – 0.0750 – 0.2250 – 0.0250 – 0.0250 – – – – – – – – – – – – – – – – – – 0.869 0.850 40
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.868 0.900 40
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
19 20 20.2 21 21.2 22 22.2 23 23.2 24 24.2 25 25.2 26 27 28 28.2 29 29.2 30 30.2 31 31.2 32 32.2 33 33.2 34 34.2 36.2 38.2 39.2 HE HO n
245
246
Table 6 Allele frequencies for the 15 IdentifilerTM loci from the NT East-Timorese sub-population Allele
D3
vWA
FGA
D8
D21
D18
D5
D13
D7
TH01
TPOX
CSF
D16
D2
D19
– – – – – – – – – – – – 0.0922 – 0.0461 – 0.1986 – 0.2376 – 0.3156 0.0816 0.0248 0.0035 – – – – – – – – – – – – – – – – – – – – 0.789 0.745 282
– – – – – – – – – – – – – – – – – – – – – 0.0714 0.0821 0.1143 0.1643 0.2107 0.1214 0.1107 0.0893 0.0214 0.0107 0.0036 – – – – – – – – – – – – 0.871 0.850 280
– – – 0.0072 – 0.0435 0.0978 – 0.0797 – 0.2246 – 0.2572 – 0.1630 – 0.0833 – 0.0435 – – – – – – – – – – – – – – – – – – – – – – – – – 0.833 0.746 276
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.0179 0.0964 0.2679 0.0036 0.1821 0.0071 0.1071 0.1214 0.0107 0.1214 0.0036 0.0536 0.0036 0.0036 0.844 0.886 280
– – – – – – 0.0250 0.0036 0.0571 – 0.0607 – 0.2321 – 0.1286 – 0.1500 – 0.1179 – 0.0821 0.0643 0.0250 0.0214 0.0107 0.0071 0.0071 – 0.0036 – 0.0036 – – – – – – – – – – – – – 0.877 0.850 280
– 0.0036 – 0.0288 – 0.2986 0.1727 – 0.3309 – 0.1511 – 0.0108 – 0.0036 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.750 0.741 278
– – 0.2163 0.0816 – 0.0993 0.2695 – 0.2447 – 0.0603 – 0.0284 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.803 0.816 282
– 0.0252 0.2338 0.0432 – 0.1763 0.3165 – 0.1835 – 0.0216 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.780 0.813 278
0.1168 0.1729 0.3271 0.3084 0.0187 0.0561 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.754 0.738 214
– 0.0047 0.3679 0.3443 – 0.0330 0.2217 – 0.0283 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.698 0.726 212
– – 0.0100 0.0400 – 0.2350 0.2550 – 0.3650 – 0.0750 – 0.0200 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.743 0.780 200
– – 0.0381 0.1619 – 0.1810 0.2952 – 0.2381 – 0.0667 – 0.0190 – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – 0.795 0.848 210
– – – – – – – – – – – – – – – – – – 0.0446 – 0.0347 0.1386 0.0891 0.0248 0.1337 0.2426 0.2178 0.0743 – – – – – – – – – – – – – – – – 0.844 0.802 202
– – – – – – – – 0.0421 0.0047 0.2383 0.0093 0.2850 0.0561 0.0794 0.2009 0.0421 0.0374 – 0.0047 – – – – – – – – – – – – – – – – – – – – – – – – 0.811 0.813 214
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
IdentifilerTM loci 6 – 7 – 8 – 9 – 9.3 – 10 – 11 – 11.2 – 12 0.0035 12.2 – 13 0.0142 13.2 – 14 0.0213 14.2 – 15 0.2660 15.2 – 16 0.3582 16.2 – 17 0.2979 17.2 – 18 0.0390 19 – 20 – 21 – 22 – 23 – 24 – 25 – 26 – 26.1 – 27 – 28 – 29 – 29.2 – 30 – 30.2 – 31 – 31.2 – 32 – 32.2 – 33 – 33.2 – 35.2 – 36.2 – 0.713 HE 0.730 HO n 282
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249
247
Fig. 1. p–p Plots for each of the five population groups. The two curves on the Aboriginal plot represent the Pure (filled points) and Declared (unfilled points) Aboriginal datasets.
Variant alleles at the HumD21S11 locus were observed in the Caucasian and Aboriginal datasets (notably 37.2, 38.2, 39.2, 40.2 and 41.2). These off-ladder incomplete alleles in the upper range of the repeat region have been observed in other Australian populations [11,14]. Further molecular analysis of these variant alleles [15] observed in the Western Australian Aboriginal dataset revealed a structural conformation (allele type V) that had previously only been observed in samples from a Papuan population [16,17]. A recent publication reports the observation of a HumD21S11 37.2 allele in a population from East Timor [18]. The findings of similar alleles at the HumD21S11 locus in the Northern Territory Aboriginal Australian datasets implies these variant alleles appear related to peoples of the Australasian region. The structural confirmation of these variant alleles also provides support for an ancestral
link between the Australian Aborigine and the Papuan people. Asian and East Timorese populations in Australia are expected to be both sub-structured and undergoing admixture. There is little evidence of that here, although the data are uninformative in this respect due to the limited size of the datasets. In summary, population data from the major sub-populations of the Northern Territory of Australia at the 15 STR loci of the AMPFlSTR1 IdentifilerTM STR multiplex system has been statistically examined to the level required by the international forensic community. We consider these databases suitable for use in matters of disputed paternity or DNA-based forensic evidence. This paper follows the specified approach outlined by Lincoln and Carracedo [19].
Table 8 Truncated product method statistics for examining p-values from the results of Fisher’s exact test on NT sub-populations Caucasian
HWE
2 ln( p) p(sum)
41.335 0.081
LE
2 ln( p) p(sum)
260.753 0.010
0.556 0.013 0.761 – 0.856
0.885 0.624 0.028 – 0.697
Test statistic
0.029 0.005 0.397 – 0.967
Asian
East Timorese
120.469 0.000
79.528 0.000
15.477 0.629
35.684 0.219
526.518 0.000
310.543 0.000
87.984 0.097
218.399 0.331
References
0.388 0.071 0.753 0.715 0.903
0.551 0.032 0.048 0.147 0.389
Pure Aboriginal
The authors wish to acknowledge the cooperation of management of the Northern Territory Police, Fire and Emergency Services Forensic Services Branch, and the contribution of staff of the forensic biology section. We also thank ESR internal reviewers Catherine McGovern and Dr. SallyAnn Harbison whose comments greatly improved this manuscript.
Some loci in the Asian dataset were not analysed due to the small sample size (indicated by a dash).
0.008 0.002 0.395 0.274 0.398 0.593 0.000 0.132 0.529 0.555 0.091 0.000 0.058 0.080 0.627 0.540 0.336 0.023 0.960 0.275 0.679 0.016 0.222 0.777 0.000 0.012 0.199 0.001 0.732 0.157 0.719 0.393 0.116 0.654 0.118 Caucasian Declared Aboriginal Pure Aboriginal Asian East Timorese
Declared Aboriginal
Acknowledgements
0.437 0.030 0.003 – 0.623
0.806 0.521 0.045 – 0.597
D2 D16 CSF TPOX TH01 D7 D13 D5 D18 D21 D8 FGA vWA D3
Table 7 Results of Fisher’s exact test for allelic association within loci (Hardy–Weinberg equilibrium) in the five major categories of sub-populations of the Northern Territory
0.723 0.003 0.061 – 0.841
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249 D19
248
[1] P.S. Walsh, D.A. Metzger, R. Higuchi, Chelex1 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material, BioTechniques 10 (1991) 506–513. [2] S.W. Guo, E.A. Thompson, Performing the exact tests of Hardy–Weinberg proportion for multiple alleles, Biometrics 48 (1992) 361–372. [3] P.O. Lewis, D. Zaykin. Genetic Data Analysis: computer program for the analysis of allelic data. Version 1.0 (d16c), 2001. Free program distributed by the authors over the internet from http://lewis.eeb.uconn.edu/ lewishome/software.html. [4] B. Law, J.S. Buckleton, C.M. Triggs, B.S. Weir, Effects of population structure and admixture on exact tests for association between loci, Genetics 164 (2003) 381–387. [5] J.S. Buckleton, J.M. Curran, S.J. Walsh, R v Bropho: the need for careful interpretation of independence testing in courtroom decision-making, Aust. Law J. 79 (2005) 709–721. [6] J.S. Buckleton, S.J. Walsh, S.A. Harbison, The fallacy of independence testing and the use of the product rule, Sci. Justice 41 (2001) 81–84. [7] D. Zaykin, L.A. Zhivotovsky, B.S. Weir, Truncated product method for combining p-values, Genet. Epidemiol. 22 (2002) 170–185. [8] J.S. Buckleton, Validating databases, in: J.S. Buckleton, C.M. Triggs, S.J. Walsh (Eds.), Forensic DNA Evidence Interpretation, CRC Press LLC, Boca Raton, FL, 2005, pp. 149–196. [9] J.M. Curran, J.S. Buckleton, C.M. Triggs, B.S. Weir, Assessing uncertainty in DNA evidence caused by sampling effects, Sci. Justice 42 (2002) 29–37. [10] J.S. Buckleton, J.M. Curran, Sampling effects, in: J.S. Buckleton, C.M. Triggs, S.J. Walsh (Eds.), Forensic DNA Evidence Interpretation, CRC Press LLC, Boca Raton, FL, 2005, pp. 195–215. [11] A. Bagdonavicius, G.R. Turbett, J.S. Buckleton, S.J. Walsh, Western Australian sub-population data for the thirteen AMPFlSTR Profiler PlusTM and COfilerTM STR loci, J. Forensic Sci. 47 (2002) 1149– 1153. [12] S.J. Walsh, J.R. Cullen, S.A. Harbison, Allele frequencies for the four major sub-populations of New Zealand at the 10 AMPFlSTR1 SGM PlusTM loci, Forensic Sci. Int. 122 (2001) 189–195. [13] J. Lee, S.J. Walsh, Population data for the AMPFlSTR Profiler PlusTM STR loci from residents of the Australian Capital Territory (ACT), J. Forensic Sci. 50 (2005) 984–985. [14] V. Ientile, B. Blair, P. Clausen, M. Hadfield, J.S. Buckleton, S.J. Walsh, Population data for the AMPFlSTR Profiler PlusTM STR loci for Queensland
C. Eckhoff et al. / Forensic Science International 171 (2007) 237–249 Caucasian, Aboriginal Australian and Asian sub-populations. Journal of Forensic Sciences, submitted for publication. [15] S.J. Walsh, G.R. Turbett, S.L. Robinson, N.P. Davies, A.N. Wilton, Characterisation of variant alleles at the HumD21S11 locus implies unique Australasian genotypes and re-classification of nomenclature guidelines, Forensic Sci. Int. 135 (2003) 35–41. [16] B. Brinkmann, E. Meyer, A. Junge, Complex mutational events at the HumD21S11 locus, Human Genet. 98 (1996) 60–64.
249
[17] B. Brinkmann, A. Junge, E. Meyer, P. Wiegand, Population genetic diversity in relation to microsatellite heterogeneity, Human Mutat. 11 (1998) 135–144. [18] L. Souto, C. Alves, L. Gusmao, E. Ferreira, A. Amorim, F. Corte-Real, D.N. Vieira, Population data on 15 autosomal STRs in a sample from East Timor, Forensic Sci. Int. 155 (2005) 77–80. [19] P. Lincoln, A. Carracedo, Publication of population data of human polymorphisms, Forensic Sci. Int. 110 (2000) 3–5.