Back to Publications

Letter

Perlin, M.W., Butt, N., and Wilson, M.R. Commentary on: Thompson WC. Uncertainty in probabilistic genotyping of low template DNA: a case study comparing STRmix™ and TrueAllele®. J Forensic Sci. 2023;68(3):1049–63.

Downloads

Article
Journal of Forensic Sciences

Letter

Editor:

This Letter is a response to “Uncertainty in probabilistic genotyping of low template DNA: A case study comparing STRmix™ and TrueAllele®”, a JFS Case Report published in February of this year (1).

Case Background

In a California criminal case, a man was accused of drug possession. At the defendant’s request, two drug packages were tested for DNA using short tandem repeat (STR) markers. Both items were two-person mixtures that gave similar match statistic results.

On one item, Cybergenetics TrueAllele^® probabilistic genotyping (PG) software found a strong exclusionary match statistic for the defendant of one over 1.2 million, with a false negative error rate of one over 222 million. On the same item, ESR’s STRmix™ PG program produced a weaker exclusionary match statistic of one over twenty-four.

There was no trial. Based on the exculpatory DNA evidence, the prosecutor dropped the more serious DNA-related possession charge and offered a plea agreement. The court accepted the defendant’s plea in March of this year.

The TrueAllele and STRmix PG software programs qualitatively agreed. Their likelihood ratio (LR) match statistics both supported the hypothesis that the defendant did not contribute his DNA to the drug package evidence. However, the magnitude of the LR match statistics differed between the software programs.

This letter briefly explains why the two PG software results differed. As JFS requested, we address some issues raised in the Case Report (1). A more extensive response (2) to the paper (1) was posted online in May, discussing 20 topics and examining 120 assertions.

Data Usage

The two programs were given different amounts of STR input data. TrueAllele is a fully Bayesian system capable of looking at all the (allelic and non-allelic) peak data without relying on laboratory-imposed data thresholds. Most other PG software applies peak height thresholds to limit the amount of input data. Peak heights are measured in relative fluorescent units (rfu).

TrueAllele used 210 data peaks across all 21 GlobalFiler™ STR loci, or 10 peaks per locus. At a 40 rfu threshold, the STRmix program saw 24 peaks across 14 loci, or just 1.7 peaks per locus. This 1.7 peak density is insufficient for an informative analysis of a two-person mixture, since at least 3 or 4 peaks would be needed. The 88% reduction in STRmix data peaks, relative to TrueAllele input, accounts for the observed LR output differences.

We tested STRmix on the STR data at different thresholds, ranging from 0 rfu to 90 rfu, in 10 rfu increments. The weakest STRmix subsource LR value in our sensitivity study was 1 over 3.35 (using 11 peaks at a high 90 rfu threshold), while the strongest LR was 1 over 30.5 million (38 peaks at a low 20 rfu threshold). Less STRmix input data gave less output identification information; more data yielded more information.

At a 10 rfu threshold (54 peaks), the STRmix LR of one over 4.8 million was close to TrueAllele’s reported one over 1.2 million. Given more data, STRmix got about the same LR results as TrueAllele. The difference in data input explains the difference between the reported TrueAllele and STRmix LR values in this case. The Case Report’s "opinions" (3) did not.

Comparison Methodology

The Case Report assumed that TrueAllele and STRmix software should produce similar LR answers on the same DNA evidence. With abundant DNA, where thresholds aren’t an issue, the two programs often agree. But TrueAllele’s hierarchical modeling is specifically designed to process low-template DNA data. Different statistical models can lead to different answers.

The Case Report compared TrueAllele and STRmix probabilistic genotypes. However, TrueAllele numerically represents contributor genotypes using posterior probability, while STRmix uses likelihood-derived genotype “weights.” Probability and likelihood are different concepts whose numbers cannot be directly compared (4).

The Case Report compared TrueAllele and STRmix mixture weights (MW). TrueAllele examined 10 peaks per locus at all 21 STR loci. This is enough STR pattern data for hierarchical MW modeling of a two-person mixture with differential DNA degradation. However, STRmix analyzed just 14 loci, averaging only 1.7 peaks per locus, which is insufficient genotyping data for determining MW. The Case Report looked at only a few nonrepresentative loci showing short STR molecules with little degradation.

The Case Report compared TrueAllele and STRmix LR reporting language. TrueAllele separates complex mixture data into probabilistic contributor genotypes, producing LR values that compare single-contributor genotypes (5). STRmix calculates LR values based on how well a set of genotypes jointly explain unseparated mixture data (6). The two approaches compute the same LR value (7), each having appropriate reporting language for their calculation method.

The Case Report took issue with reporting a “match”. However, the separated single-contributor LR language reports a match probability ratio, not a "match" (2). Reporting "match" statistics (e.g., random "match" probability) has long been standard in forensic science (8).

TrueAllele Approach

The Case Report speculated at length on why TrueAllele would give zero probability to two genotype values: locus D1 allele pair 14 14, and D22’s 11 17. However, TrueAllele had assigned those allele pairs nonzero probabilities of 0.00022 and 0.00018, respectively.

TrueAllele can use more data from low-template DNA than other programs because it hierarchically models baseline noise and PCR variance (5). This extra modeling obviates the need for peak height thresholds, considering more STR data for deriving more LR information.

TrueAllele constructs high-resolution LR distributions (9) for calculating LR error rates. This comprehensive method supports both false positive rates for inclusionary match statistics, and false negative rates for exclusionary results (10, 11).

TrueAllele Validation

The Case Report cited only three TrueAllele validation studies (12-14). In fact, from 2009 onward there have been eight peer-reviewed studies, validating TrueAllele interpretation for mixtures containing 2 to 10 unknown contributors (5, 15-18).

The Case Report suggested that TrueAllele uses an “ad hoc” LR cutoff. In fact, as presented at AAFS in 2013, the LR floor is based on a validation study of the impact of single or double allele drop out on under-sampled LR values (19).

At PCAST's 2016 meeting, Dr. Perlin gave the committee 34 validation studies, including 7 peer-reviewed papers (20). In 14 of these studies, false inclusion error rates (i.e., false incrimination) were specifically addressed.

Conclusions

Defendants and victims are entitled to meaningful DNA evidence. With low-level mixtures, more data and more variables can deliver more LR information, whether exculpatory or inculpatory. The JFS Case Report advised crime labs to "punt" when they are unable to interpret DNA data using potentially limited software. But, as this case shows, advanced PG software that can use more data lets them "go for the goal" of truth.

Mark W Perlin PhD MD PhD¹
Nasir Butt PhD²
Mark R Wilson PhD³

¹Cybergenetics, Pittsburgh, Pennsylvania, USA
²Cuyahoga County Medical Examiner’s Office, Cleveland, Ohio, USA
³George Mason University, Forensic Science Program, Fairfax, Virginia, USA

Correspondence

Mark W Perlin PhD MD PhD,
Cybergenetics, Pittsburgh, PA 15213 USA.
Email: perlin@cybgen.com

References

Thompson WC. Uncertainty in probabilistic genotyping of low template DNA: A case study comparing STRmix™ and TrueAllele®. Journal of Forensic Sciences. 2023;68(3):1049-63.
Perlin MW, Allan WP, Bracamontes JM, Danser KR, Legler MM. Reporting exclusionary results on complex DNA evidence, a case report response to 'Uncertainty in probabilistic genotyping of low template DNA: A case study comparing STRmix™ and TrueAllele®' software [Preprint]. 2023.
United States v. Damond Reynard Lockett. Judge Brian Jackson. Motion to Suppress Denied: Middle District of Louisiana; 2023.
Edwards AWF. Likelihood. Expanded ed. Baltimore: Johns Hopkins University; 1992.
Perlin MW, Legler MM, Spencer CE, Smith JL, Allan WP, Belrose JL, et al. Validating TrueAllele® DNA Mixture Interpretation. J Forensic Sci. 2011;56(6):1430-47.
Kelly H, Bright J-A, Coble MD, Buckleton JS. A description of the likelihood ratios in the probabilistic genotyping software STRmix™. WIREs Forensic Science. 2020;2(6):e1377.
Perlin MW. Explaining the likelihood ratio in DNA mixture interpretation. Promega's Twenty First International Symposium on Human Identification; San Antonio, TX 2010.
National Research Council. Evaluation of Forensic DNA Evidence: Update on Evaluating DNA Evidence. Washington, DC: National Academies Press; 1996.
Perlin MW. Efficient construction of match strength distributions for uncertain multi-locus genotypes. Heliyon. 2018;4(10):e00824.
Perlin MW, inventor, Method, apparatus and computer software program for determining probability of error in identifying evidence. US patent 10,489,233. 2019.
Perlin MW, inventor, Method, apparatus and computer software program for determining probability of error in identifying evidence. US patent 11,385,955. 2022.
Perlin MW, Hornyak J, Sugimoto G, Miller K. TrueAllele® Genotype Identification on DNA Mixtures Containing up to Five Unknown Contributors. J Forensic Sci. 2015;60(4):857-68.
Greenspoon SA, Schiermeier-Wood L, Jenkins BA. Establishing the Limits of TrueAllele® Casework: A Validation Study. J Forensic Sci. 2015;60(5):1263-76.
Bauer DW, Butt N, Hornyak JM, Perlin MW. Validating TrueAllele® Interpretation of DNA Mixtures Containing up to Ten Unknown Contributors. J Forensic Sci. 2020;25(2):380-98.
Perlin MW, Sinelnikov A. An Information Gap in DNA Evidence Interpretation. PLoS ONE. 2009;4(12):e8327.
Perlin MW, Belrose JL, Duceman BW. New York State TrueAllele® Casework Validation Study. J Forensic Sci. 2013;58(6):1458-66.
Ballantyne J, Hanson EK, Perlin MW. DNA mixture genotyping by probabilistic computer interpretation of binomially-sampled laser captured cell populations: Combining quantitative data for greater identification information. Sci Justice. 2013;53(2):103-14.
Perlin MW, Dormer K, Hornyak J, Schiermeier-Wood L, Greenspoon S. TrueAllele® Casework on Virginia DNA Mixture Evidence: Computer and Manual Interpretation in 72 Reported Criminal Cases. PLoS ONE. 2014;9(3):e92837.
Perlin MW, Dormer K, Hornyak J, Meyers T, Lorenz W, editors. How inclusion interpretation of DNA mixture evidence reduces identification information (A123). AAFS 65th Annual Scientific Meeting; 2013; Washington, DC: American Academy of Forensic Sciences.
Perlin MW. Transparency in DNA evidence Washington, DC: President’s Council of Advisors on Science and Technology (PCAST); 2016.

previous next