VNTRs and Electrophoresis – Vocabulary Flashcards
Core forensic idea: Non-coding ("junk") DNA makes up ≈ 80 % of the human genome and changes freely → rich source of person-specific markers
“Junk” DNA & Its Forensic Relevance
Lies between functional genes; exerts little to no selective pressure
Replication errors are rarely corrected → high mutation rate (especially in repetitive regions)
Four functional classes in the genome
Protein-coding DNA (genes)
Repetitive DNA (includes VNTRs, telomeres, satellites)
Pseudogenes (non-functional copies of genes)
Unique non-coding DNA (regulatory, intronic, etc.)
Why useful?
Extremely polymorphic → likelihood of two unrelated people sharing lengths is very low (except identical twins)
Inherited in Mendelian fashion → trace family and paternity relationships
Genes, Alleles & Chromosomal Context
Genes spread unevenly across the 23 pairs of chromosomes, but reside at the same locus on maternal & paternal copies
Alternative DNA sequences at a locus = alleles; diploid humans carry max. 2 alleles per locus
Gene stretches are separated by non-coding DNA; VNTRs often sit near chromosome ends (telomeres)
Polymorphisms
Definition: inter-individual DNA differences
Sequence polymorphism → one base replaced by another (SNP)
Size/Length polymorphism → bases inserted or deleted (indels, VNTRs)
More abundant in non-coding regions than coding ones
Variable Number Tandem Repeats (VNTRs)
A subtype of size polymorphisms; a.k.a. mini-satellites
Basic characteristics
Repeat unit length: 6 – 100 bp
Copy number: a few to many thousands in tandem (head-to-tail)
Entire locus can span hundreds→thousands of base pairs
Mutation mechanism
DNA polymerase slippage during replication forms loops → whole repeat units added or deleted (never partial repeats)
Change occurs one whole repeat at a time
Telomeric association
VNTRs frequently follow the telomeric repeat \text{TTAGGG} which “caps” chromosomes
VNTR Terminology
Repeat Unit = number of bases per single repeat (e.g. “6 bp”)
Repeat Sequence = exact base string (e.g. “CGTCAT”)
Copy Number = consecutive repetitions of the unit (e.g. 6)
Example (Page 8):
Sequence: ACGTGCT ACGT CAT CGT CAT CGT CAT CGT CAT CGT CAT AACGTGC
Repeat unit = 6 bp, repeat sequence = “CGTCAT”, copy number = 6
VNTR Alleles: Diversity, Inheritance & Counting
Each distinct copy number = one allele (thousands possible)
Alleles change by ±1 repeat unit per mutational event
Allele count formula when repetitive block length is known: \text{Number of alleles}=\frac{\text{Total repeat region length}}{\text{Repeat unit length}}
Example: 500 bp block / 10 bp repeat = 50 possible length alleles
Mendelian inheritance: one maternal + one paternal allele per person; different homologs may bear different copy numbers
Probability of two unrelated people sharing an allele combination is low; identical twins are the only guaranteed match
VNTR Testing – Five-Step Workflow
DNA Collection
Restriction Enzyme (RE) Digestion
Gel Electrophoresis
Southern Blot Transfer
Probe Hybridisation (plus optional re-probing)
Step 1 – DNA Collection
Requires relatively large, good-quality samples (blood, semen, tissue)
Process: extraction → purification → quantification → concentration normalisation for cross-sample comparison
Step 2 – Restriction Enzyme Digestion
REs = bacterial immune proteins recognising palindromic sequences and cutting DNA
Read 5'→3' identically on both strands (e.g. GAATTC / CTTAAG)
Two cut types
Blunt ends – cut exactly in the middle
Sticky ends – staggered cleavage, leaving single-stranded overhangs
Cutting frequency
Formula (approx.): \text{Cuts}=\frac{N}{4^{n}}
N = genome size, n = recognition site length (bp)
Shorter sites → more cuts; e.g. in human genome (3.2 × 10^9 bp):
HaeIII (4-bp site “CCGG”) → ≈ 12.5 M cuts
EcoRI (6-bp “GAATTC”) → ≈ 781,250 cuts
NotI (8-bp “GCGGCCGC”) → ≈ 48,828 cuts
Post-digestion mixture
Contains thousands of fragments; only some hold VNTRs, each fragment’s length reflects flanking RE sites + VNTR copy number
Step 3 – Gel Electrophoresis
Porous agarose/polyacrylamide gel acts as molecular sieve
Electric field drives negatively charged DNA toward the positive electrode
Separation strictly by size: shorter fragments run faster/farther
Features
Multiple wells → load ladder (size standard) + numerous samples side-by-side
DNA ladder fragments of known sizes (acts like a ruler)
VNTR smear: because a digest contains many fragments of similar/overlapping lengths, bands smear instead of discrete lines → requires blotting/probing to visualise specific VNTRs
Step 4 – Southern Blotting
Invented by Edwin Southern
Steps
Gel placed on wick soaked in high-salt/alkali solution; nitrocellulose/nylon membrane laid on top
Buffer wicks upward, carrying ssDNA from gel → membrane; alkaline conditions break H-bonds so DNA becomes single-stranded
Membrane now holds immobilised, single-stranded fragments (still a smear)
Advantages: membrane far sturdier, can undergo harsh washing & multiple probings
Step 5 – Probe Hybridisation (Probing)
Synthetic ssDNA (or RNA) oligonucleotides complementary to VNTR repeat sequence
Labelled radioactively (e.g. ^{32}\text{P}) or fluorescently
Membrane sealed with probe solution → probes hybridise to complementary VNTR-containing fragments
Excess probe washed away → membrane exposed to X-ray film / imager
Result: only VNTR fragments with the targeted repeat sequence appear as bands (“bar-code” pattern)
Multi-locus probes: recognise all VNTR loci with that repeat unit → many bands per lane
Re-probing: strip off first probe (boil/alkali) and repeat with different probe to survey additional VNTRs (time-consuming)
Single-Locus vs Multi-Locus VNTR Analysis
Multi-locus probes
Advantages: one hybridisation reveals many loci simultaneously
Disadvantages: need large DNA quantity, slow, can’t detect mixture/contamination, poor for degraded samples
Single-locus VNTRs
Each probe targets one unique genomic VNTR → each person yields ≤ 2 bands (diploid)
Enhances speed, simplifies interpretation, flags mixed/contaminated samples (extra bands)
1980-90s standard panel: six markers (D1S7, D2S44, D4S139, D10S28, D14S13, D17S79) – all highly polymorphic (>19 alleles each)
Probability Calculations with VNTR Alleles
For a locus with n distinct alleles, number of diploid genotype combinations (order irrelevant) = \frac{(n+1)\times n}{2}
Example: D1S7 (28 alleles) → \frac{29\times 28}{2}=406 combinations → random match probability 1/406
D4S139 (19 alleles) → \frac{20\times 19}{2}=190 combinations → 1/190 chance
Combined probability across loci (assuming independence) = product of individual probabilities
Example (D1S7 & D4S139): \frac{1}{406}\times\frac{1}{190}=\frac{1}{77,140}
Six-marker panel example: \frac{1}{406}\times\frac{1}{351}\times\frac{1}{190}\times\frac{1}{300}\times\frac{1}{465}\times\frac{1}{190}=\frac{1}{7.18\times10^{14}} (≈1 in 7 × 10^14)
Caveats: allele frequencies differ among ethnic groups → real calculations use population-specific databases; nevertheless, adding loci always lowers chance of coincidental match
Applications
Criminal investigations: compare DNA from evidence (blood, semen, hair roots) to suspects
Example slide: Crime-scene semen matched Suspect 2, excluded Suspect 1
Paternity & familial tests: child must share one VNTR allele with each biological parent
Patterns lacking parental allele lead to exclusion; multiple markers increase confidence
Mass-disaster victim ID, missing-person cases, migration studies of human populations
Advantages & Disadvantages Summary
Advantages of VNTR approach:
Extremely polymorphic; high discriminatory power
Mendelian inheritance allows kinship assessment
Early technique that proved DNA could identify individuals → foundation for modern STR/NGS methods
Disadvantages (particularly multi-locus):
Requires large, undegraded DNA amounts
Time-consuming (week-long)
Complex interpretation when samples are degraded or mixed
Outdated: largely replaced by PCR-based STR typing today
Exam Preparation Checklist (from Page 51)
Define VNTR & list its key characteristics (repeat unit size, copy-number variability, telomeric proximity)
Outline ALL five steps in VNTR testing, detailing techniques & rationale
Explain restriction enzymes: palindromic recognition, blunt vs sticky ends, cut-frequency formula \frac{N}{4^{n}}
Describe electrophoresis mechanics (charge, gel matrix, ladders, lane organisation)
Discuss pros/cons of multi-locus vs single-locus VNTRs
Justify why testing multiple VNTRs boosts reliability (qualitative probability argument sufficient)
Provide real-world examples: e.g. paternity, crime-scene matching
Ethical, Practical & Historical Notes
Ethical: need for population allele-frequency databases; ensure privacy of genetic fingerprints; identical twins as special case
Practical: poor for old/degraded samples; mixture detection better with single-locus probes; modern forensics shifted to PCR STRs & capillary electrophoresis for speed & sensitivity
Historical significance: VNTR fingerprinting first brought DNA evidence into court, leading to 1986 Colin Pitchfork conviction (UK)
Take-Home Message
VNTRs exploit length variability in non-coding tandem repeats; their high polymorphism underpins early DNA fingerprinting
Testing involves digestion, separation, blotting & hybridisation; interpretation relies on allele length patterns inherited from parents
More loci = exponentially lower random-match probability; nonetheless, current practice now favours STR PCR due to greater speed & lower DNA requirements