forensic biochemistry 2 exam 4

Description and Tags

assigning DNA profiles, probability, frequency, mixture interpretation, next generation sequencing, testimony

Last updated 9:03 PM on 4/28/26

97 Terms

New cards

defined as the probability of discriminating between 2 unrelated individuals

power of discrimination

New cards

formula for probability of match (PM)?

sum of (frequency of genotype at a locus)²

New cards

power of discrimination formula?

1- PM
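A minimal sketch of the two formulas above for a single locus. The genotype frequencies are made-up illustrative numbers, not population data, and the function names are hypothetical.

```python
def probability_of_match(genotype_freqs):
    """PM: sum of the squared genotype frequencies at one locus."""
    return sum(f ** 2 for f in genotype_freqs)


def power_of_discrimination(genotype_freqs):
    """PD = 1 - PM."""
    return 1.0 - probability_of_match(genotype_freqs)


# Hypothetical genotype frequencies at one locus (they sum to 1).
example_freqs = [0.5, 0.3, 0.2]
pm_value = probability_of_match(example_freqs)
pd_value = power_of_discrimination(example_freqs)
print(round(pm_value, 2), round(pd_value, 2))
```

The fewer common genotypes a locus has, the smaller PM becomes and the closer PD gets to 1.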

New cards

steps for determining the probability of a single source DNA profile?

determine alleles at each locus, find allele frequency from relevant populations, calculate expected genotype frequency, report multilocus results
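The steps above can be sketched for a three-locus profile. This assumes Hardy-Weinberg proportions with no subpopulation correction and independence between loci (so per-locus frequencies are multiplied); the locus names and allele frequencies are invented for illustration.

```python
from math import prod


def genotype_frequency(p, q=None):
    """Expected genotype frequency: p^2 for a homozygote, 2pq for a heterozygote."""
    return p * p if q is None else 2 * p * q


# Hypothetical profile: locus -> (allele frequency 1, allele frequency 2 or None if homozygous)
profile = {
    "locus1": (0.10, 0.20),   # heterozygote
    "locus2": (0.30, None),   # homozygote
    "locus3": (0.05, 0.25),   # heterozygote
}

per_locus = {name: genotype_frequency(p, q) for name, (p, q) in profile.items()}
profile_frequency = prod(per_locus.values())  # multilocus result
print(per_locus, profile_frequency)
```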

New cards

when calculating expected genotype frequency using Hardy-Weinberg, what are the potential subpopulation corrections?

correction for profile probability or correction for match probability

New cards

aggregates and harmonizes data across many large-scale sequencing projects to create summary allele frequency statistics

gnomAD database

New cards

_______________ is distinct from either race or ethnicity

genetic ancestry

New cards

reflects an individual’s demographic history and refers to the specific lines of descent through a family tree by which an individual inherited DNA from specific ancestors

genetic ancestry

New cards

sociopolitical constructs used to group individuals based on perceived shared ancestry, biological characteristics, or on perceived shared cultural heritage

race and ethnicity

New cards

genetic ancestry is a ____________________

continuous measure

New cards

homozygote loci formula without population substructure correction

p²

New cards

heterozygote loci formula without population substructure correction

2pq

New cards

homozygote loci formula with inbreeding population

p²+pF(1-p)

New cards

heterozygote loci formula with inbreeding population

2pq(1-F)

New cards

subpopulation theory is similar to inbreeding but F (which is really ______) becomes __________

F_IS, F_ST

New cards

F_ST refers to

probability that 2 alleles randomly drawn from the population are identical by descent

New cards

5 main points of the 1996 NRC Report on Forensic DNA Evidence

validated DNA evidence, new formulas to calculate the likelihood of a match for better understanding for jurors, protecting suspects from false incrimination, recommending the use of a DNA profile database specific to the racial background of the sample, assuring DNA profiling is reliable

New cards

conservative value to be used for θ (in the US) in this formula p² + p(1-p)θ, when the exact genotype can be determined

0.01

New cards

recommendation 4.1 of the NRC report stated

profile frequency of heterozygotes needs to use H-W without theta correction

New cards

heterozygote formula to be used per the 4.1 recommendation?

2p_i p_j

New cards

why did recommendation 4.1 change the formula for heterozygotes?

formula with theta was overestimating the frequency of a genotype

New cards

homozygote formula with a subpopulation?

p_i² + p_i(1 - p_i)θ_ii

New cards

heterozygote formula with a subpopulation?

2p_i p_j(1 - θ_ij)
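A small worked sketch of the homozygote and heterozygote formulas from these cards, using a single θ for both. The allele frequencies are invented, θ = 0.01 is the conservative US value quoted earlier, and the function names are hypothetical.

```python
def homozygote_freq(p, theta=0.0):
    """p^2 + p(1 - p)*theta; reduces to p^2 when theta is 0."""
    return p ** 2 + p * (1 - p) * theta


def heterozygote_freq(p, q, theta=0.0):
    """2pq(1 - theta); reduces to 2pq when theta is 0."""
    return 2 * p * q * (1 - theta)


# Hypothetical allele frequencies.
p, q, theta = 0.10, 0.20, 0.01

hom_plain = homozygote_freq(p)             # no correction
hom_theta = homozygote_freq(p, theta)      # with correction
het_plain = heterozygote_freq(p, q)        # no correction
het_theta = heterozygote_freq(p, q, theta) # with correction
print(hom_plain, hom_theta, het_plain, het_theta)
```

With these numbers the correction raises the homozygote frequency and lowers the heterozygote frequency.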

New cards

what did recommendation 4.2 of the NRC report say?

use allele frequencies from the subgroup the sample came from; if the subgroup is unknown, use the subpopulation-corrected formula

New cards

signs that sample is a mixture

loci with more than 2 alleles, severe peak imbalance, abnormally high stutter

New cards

expected severe peak balance of _______% in a mixture sample

60-70

New cards

expected high stutter of _______% in mixture samples

15-20

New cards

minimum height requirement at and above which detected peaks can be reliably distinguished from background noise

analytical threshold

New cards

the analytical threshold or AT is typically around _______ RFUs

25-50

New cards

peak height value below which it is reasonable to assume that, at a given locus, allelic dropout of a sister allele in a heterozygous pair may have occurred

stochastic threshold

New cards

stochastic threshold or ST is typically around _______ RFUs

200

New cards

steps for interpreting a mixture

identify presence of a mixture, designate allele peaks, identify number of contributors, estimate relative ratio of individuals contributing to the mixture, consider all possible genotypes, compare reference samples, statistical interpretation

New cards

formula for determining minimum number of contributors

Nalleles/2 then rounded up (using the locus with the most alleles)
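The rounding rule on this card can be sketched as follows. Taking the largest allele count across loci is an assumption made here (each person carries at most 2 alleles per locus, so the busiest locus sets the floor); the counts are invented.

```python
from math import ceil


def min_contributors(alleles_per_locus):
    """Minimum number of contributors: ceil(max allele count at any locus / 2)."""
    return ceil(max(alleles_per_locus) / 2)


# Hypothetical allele counts observed at four loci.
observed_counts = [2, 3, 5, 4]
estimate = min_contributors(observed_counts)
print(estimate)  # 5 alleles at one locus needs at least 3 people
```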

New cards

relative ratio considers

the peak heights of the whole profile

New cards

all possible genotype combinations for 4 peaks (A, B, C, D) in a 2-person mixture

A, B + C, D

A, C + B, D

A, D + B, C

New cards

all possible genotype combinations for 3 peaks (A, B, C) in a 2-person mixture

A, A + B, C

B, B + A, C

C, C + A, B

A, B + A, C

B, C + A, C

A, B + B, C

New cards

all possible genotype combinations for 2 peaks (A, B) in a 2-person mixture

A, A + A, B

A, B + A, B

A, A + B, B

A, B + B, B
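The three lists above can be reproduced by brute force. This sketch assumes exactly two contributors, no dropout or drop-in, and treats the pair as unordered (it does not say which person has which genotype); the function name is hypothetical.

```python
from itertools import combinations_with_replacement


def two_person_genotype_sets(peaks):
    """All unordered pairs of genotypes that together show exactly the observed peaks."""
    observed = set(peaks)
    # Every genotype that can be built from the observed alleles (AA, AB, ...).
    genotypes = list(combinations_with_replacement(sorted(observed), 2))
    sets = []
    for g1, g2 in combinations_with_replacement(genotypes, 2):
        if set(g1) | set(g2) == observed:  # every peak must be explained
            sets.append((g1, g2))
    return sets


for peaks in ("AB", "ABC", "ABCD"):
    combos = two_person_genotype_sets(peaks)
    print(peaks, len(combos), combos)
```

The counts come out as 4, 6, and 3 for two, three, and four peaks, matching the cards.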

New cards

accounts for if a single peak below the stochastic threshold results from the homozygous genotype or the heterozygous genotype

2p rule

New cards

one or more of the mixture components could comprise low template DNA, as such we need to take into account

allele drop out and drop in

New cards

2p rule is used to calculate if

an actual allele dropped out or if the sample is a homozygote

New cards

2p rule formula

2pa-pa² < 2pa
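A quick numeric check of the inequality above. Under Hardy-Weinberg, 2p - p² equals p² + 2p(1 - p), the chance that a genotype carries at least one copy of the allele, so 2p is a slightly larger and therefore conservative value. The allele frequency is invented.

```python
def two_p_rule(p):
    """Return (2p - p^2, 2p) for an allele with frequency p."""
    at_least_one_copy = 2 * p - p ** 2  # homozygote p^2 plus heterozygote 2p(1 - p)
    conservative_value = 2 * p          # value reported under the 2p rule
    return at_least_one_copy, conservative_value


exact, bound = two_p_rule(0.10)  # hypothetical allele frequency
print(exact, bound)
```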

New cards

probability that the DNA of a randomly chosen person has the same DNA profile as the DNA of the casework sample

RMP or random match probability

New cards

sum of the probabilities for all of the genotypes that represent the possible contributors to a DNA mixture under the assumption of a defined number of contributors

RMP calculation or modified RMP

New cards

how is modified RMP different from the combined probability of Inclusion (CPI)?

modified RMP assumes a defined number of contributors; CPI doesn’t use assumptions about the number of contributors

New cards

estimate of the probability that a randomly selected, unrelated individual would be included as a possible contributor to a mixture

combined probability of inclusion or CPI

New cards

probability that a randomly selected, unrelated individual would be excluded as a contributor to the mixture

combined probability of exclusion or CPE
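A sketch of how CPI and CPE are commonly computed. The cards do not state the formula, so this assumes the usual approach: at each locus square the sum of the observed allele frequencies, multiply across loci for CPI, and take CPE = 1 - CPI. Allele frequencies are invented.

```python
from math import prod


def locus_probability_of_inclusion(allele_freqs):
    """(sum of frequencies of all alleles observed at the locus) squared."""
    return sum(allele_freqs) ** 2


def combined_probability_of_inclusion(loci):
    """Product of the per-locus inclusion probabilities."""
    return prod(locus_probability_of_inclusion(freqs) for freqs in loci)


def combined_probability_of_exclusion(loci):
    return 1.0 - combined_probability_of_inclusion(loci)


# Hypothetical mixture: frequencies of the alleles observed at two loci.
mixture = [[0.10, 0.20, 0.20], [0.30, 0.20]]
cpi = combined_probability_of_inclusion(mixture)
cpe = combined_probability_of_exclusion(mixture)
print(cpi, cpe)
```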

New cards

if it is determined that there is allele dropout at a given locus, the locus ______________

will be excluded from the match probability

New cards

steps for calculating the likelihood ratio for a 2-person mixture

condition the number of contributors, state the alternative hypothesis, evaluate the probability of the evidence under the defense proposition, evaluate the probability of the casework sample under the prosecution proposition, calculate the likelihood ratio, report the likelihood ratio
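A minimal single-locus sketch of these steps for one hypothetical scenario, not a general method. Assumptions: two contributors; four peaks A, B, C, D all above the stochastic threshold; a known contributor typed A,B; a person of interest typed C,D; peak heights ignored; Hardy-Weinberg with no θ correction. Prosecution proposition: known contributor + person of interest. Defense proposition: known contributor + an unknown, unrelated person.

```python
def lr_conditioned_two_person(p_c, p_d):
    """LR = P(evidence | prosecution) / P(evidence | defense) for the scenario above."""
    # Under the prosecution proposition the two named people explain all four peaks.
    prob_under_prosecution = 1.0
    # Under the defense proposition the unknown person must be a C,D heterozygote.
    prob_under_defense = 2 * p_c * p_d
    return prob_under_prosecution / prob_under_defense


# Hypothetical allele frequencies for C and D.
likelihood_ratio = lr_conditioned_two_person(0.10, 0.20)
print(likelihood_ratio)
```

With these invented frequencies the evidence is about 25 times more probable under the prosecution proposition than under the defense proposition.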

New cards

refers to the use of biological modeling, statistical theory, computer algorithms, and probability distributions to calculate likelihood ratios and/or infer genotypes for the DNA typing results of forensic samples

probabilistic genotyping

New cards

why do we use probabilistic genotyping?

statistically interprets mixture samples

New cards

PG continuous models consider _______ as a continuous variable

peak heights

New cards

probabilistic genotyping considers _________ in order to deconvolute a DNA profile into a list of genotype sets

observable data, models, calibration data, and unknowables

New cards

specific for a set of laboratory hardware and DNA typing kit

calibration data

New cards

refers to the specifics of the actual DNA profile being analyzed

unknowables

New cards

the unknowables of PG continuous models include

number of contributors, DNA amounts of each contributor, degradation of each contributor, amplification efficiency of each locus, replicate amplification strength, level of peak height variability within the sample

New cards

“mass parameters” or the total allelic product within PG continuous models includes

DNA amounts of each contributor, degradation of each contributor, amplification efficiency of each locus, replicate amplification strength

New cards

assumes degradation is exponential but allows each contributor to have a different curve

total allelic product modeling

New cards

total allelic product modeling tests different mass parameters to form a ____________

probability density

New cards

iterative re-sampling process; in each iteration, genotype combinations and biological parameters (mass parameters) are proposed to describe the profile

Markov Chain Monte Carlo

New cards

how does the Markov Chain Monte Carlo deconvolution work?

a genotype set and a set of parameter values are proposed in every iteration and compared to the observed results to see how well they explain the data

New cards

preliminary MCMC run to ensure the post burn-in MCMC begins in an area of high probability space

burn-in

New cards

parameters for MCMC burn-in?

8 independent chains must reach 100,000 accepted iterations

New cards

occurs after burn-in and uses the same number of chains to achieve ~50,000 accepted iterations

post burn-in

New cards

occurs at completion of MCMC and normalizes the number of genotype sets accepted during post burn-in

weight

New cards

an MCMC weight of 0 means

observed data cannot be explained by the proposed genotype set

New cards

an MCMC weight of 1 means

the proposed genotype set is the only one that explains the DNA profile
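A toy sketch of how weights could be obtained by normalizing acceptance counts, following the weight cards above. The list of accepted genotype sets is invented and far shorter than a real run; this is not any particular software's implementation.

```python
from collections import Counter

# Hypothetical genotype sets accepted during post burn-in (10 accepted iterations).
accepted_sets = ["AB+CD"] * 6 + ["AC+BD"] * 3 + ["AD+BC"] * 1

counts = Counter(accepted_sets)
total_accepted = sum(counts.values())

# Weight = share of accepted iterations that used each genotype set.
weights = {genotype_set: n / total_accepted for genotype_set, n in counts.items()}
print(weights)
```

A genotype set that is never accepted gets a weight of 0, and a set accepted in every iteration gets a weight of 1.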

New cards

the progression of the MCMC is influenced by a “seed” set by a __________

random number generator

New cards

process of calculating the probability density of each peak in the profile, comparing it with the proposed model, measuring its “fit”, and accepting or rejecting the proposed values

Metropolis-Hastings

New cards

the Metropolis-Hastings Algorithm operates

within the Markov Chain Monte Carlo framework

New cards

when working with the Metropolis-Hastings algorithm, the ________ the probability density the better the fit of the parameter values to the observed profile

higher

New cards

within the Metropolis-Hastings algorithm, the proposed values for the genotypes and mass parameters are either accepted or rejected depending on ________

probability density
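A toy Metropolis-Hastings sketch of the accept/reject idea in these cards. It is heavily simplified and is not how any forensic software is implemented: it estimates a single stand-in "mass parameter" (the expected peak height `mu`) from four invented peak heights, assuming a normal peak-height model with a fixed spread, a flat prior, and a symmetric random-walk proposal. All names and numbers are hypothetical.

```python
import math
import random


def log_density(mu, peaks, sigma=50.0):
    """Log probability density (up to a constant) of the peaks given expected height mu."""
    return -sum((height - mu) ** 2 for height in peaks) / (2 * sigma ** 2)


def metropolis_hastings(peaks, start, n_iter, step=25.0, seed=0):
    rng = random.Random(seed)  # the seed fixes how the chain progresses
    current = start
    current_ld = log_density(current, peaks)
    chain = []
    for _ in range(n_iter):
        proposal = current + rng.gauss(0.0, step)  # propose a new value
        proposal_ld = log_density(proposal, peaks)
        # A higher density is always accepted; a lower one only sometimes.
        accept_prob = math.exp(min(0.0, proposal_ld - current_ld))
        if rng.random() < accept_prob:
            current, current_ld = proposal, proposal_ld
        chain.append(current)
    return chain


observed_peaks = [980.0, 1010.0, 1020.0, 990.0]  # invented peak heights (RFU)
chain = metropolis_hastings(observed_peaks, start=500.0, n_iter=6000)
post_burn_in = chain[1000:]  # discard the early iterations as burn-in
estimate = sum(post_burn_in) / len(post_burn_in)
print(round(estimate, 1))
```

The chain starts far from the data on purpose, so the discarded burn-in iterations show why the early part of a run is not used.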

New cards

after deconvolution, a likelihood ratio can be assigned to any POI based on ____________

propositions considered

New cards

parameters requiring optimization for probabilistic genotyping

analytical threshold, stutter ratios, saturation limit, drop-in parameters, allele/stutter peak height variance, LSAE variance, relevant population parameters

New cards

year QIAGEN developed the first DNA purification method in forensics

1998

New cards

year QIAGEN launched its first STR kit

2010

New cards

QIAGEN workflow steps

collection, pre-treatment, sample preparation, array setup, quantification, STR/NGS analysis

New cards

traditional DNA analysis workflow

sample collection, extraction and quantification, PCR, CE & data analysis

New cards

why use next generation sequencing over CE?

add more loci targets, not limited by amplicon bp size, can use STRs and SNPs, visible trait estimation

New cards

ForenSeq Human Identification workflow?

sample collection, extraction & quantification, library preparation, sequencing & data analysis

New cards

why sequence STRs?

smaller amplicons, looks at the whole sequence not length, can target STRs and SNPs

New cards

the ForenSeq Signature Plus is the only QIAGEN kit that has

STR analysis, kinship, and externally visible characteristics

New cards

STRs are used over SNPs bc

need way more SNPs for a match

New cards

the MainstAY and MainstAY SE kits can identify relatives of the ________ degree

first

New cards

the Signature Plus kit can identify relatives to the ________ degree

first or second

New cards

Kintelligence can identify relatives to the _________ degree

fourth or fifth

New cards

how are libraries prepared?

amplify and tag targets, attach indexes and adapters, purify, dilute sample to make loci all the same concentration

New cards

what is the purpose of indexes in QIAGEN NGS?

provide a unique marker specific to that allele and sample

New cards

how does the sequencing part of the QIAGEN NGS work?

samples get pulled onto the flowcell, make a U shape on the cell to be read, one nucleotide is added and read during each cycle

New cards

a really special feature about sequencing is that it is able to

easily determine number of contributors

New cards

steps of PCR

extraction, quantification, amplification, analysis

New cards

STRmix is used to

help declutter mixture samples

New cards

forensic scientists/biologists can only speak to the _______ level of testimony

source or sub-source

New cards

occurs when the conclusion is restated in a manner that bolsters the hypothesis of the prosecutor, typically by transposing the conditional and making the evidence seem more exclusive

prosecutor’s fallacy

New cards

error in logic on the part of the defense counsel that bolsters the defense’s hypothesis and favors the defendant, typically by relating the probability to a specific population to make the profile seem more inclusive

defendant’s fallacy

New cards

fallacy in which the statistic is bolstered by relating it directly to the profile being compared in relation to the general population

uniqueness fallacy

New cards

occurs when the probability statement is taken from one level within the hierarchy of propositions to a higher level

association fallacy