Lecture 1: Course Introduction and Overview
Instructor:
Tristan Baecklund
Office DNA B108.7
Office Hours: Wed 1-3 pm
Demonstrator for MScFS program
Recent graduate of ENLS program
PhD in host population genetics in context of disease (arctic fox and bats)
Course Description:
Applied: Know the potential applications
Molecular: Know the molecular technique
Genetics: Understand the target gene function and biology, whit it was selected
Learning Outcomes:
Understand the fundamental principles of DNA marker characterization and profiling through a history of genetic and genomic marker development
Understand how the characteristics of DNA markers can be used directly to applications in diverse fields and species
Gain perspective on the pace of technological developments in this area will be assessed by examining current news about genomics
Understand how the area of molecular genetics is rapidly changing due to the emergence of novel technologies and be able to apply these underlying technologies to applications in diverse fields of study
Genetics Terminology:
Heredity: The transmission of traits from one generation to the next
Genetics: The scientific study of heredity and variation in heredity
Gregor Mendel documented discrete heritable units (genes) that were passed from one generation to the next
Molecular genetics: The study of structure and function of genes at the molecular level
Applied molecular genetics: Using this information for research, conservation, management, and law enforcement
Classical Genetics:
Classical genetics: Refers to the use of crosses to breed new strains of organisms and to understand how traits are transmitted
An important tool of classical genetics is the isolation of mutants affecting particular traits which can then be compared to normal strains
The first genetic maps of genes on chromosomes were generated through this approach
Common in health research
Biological process → identify mutants → find the gene → biochemical function
Phenotype → genotype
Discover new phenotypes
Show that is has genetic basis (heritable trait)
Find the gene that has mutated
Understand what/how the wild-type gene functions
Modern Genetics:
Modern/reverse genetics: The study of genes at the molecular level
It has been made possible because of advances in cloning and sequencing of genes or complete genomes
Modern genetics has provided an enormous understanding about the physical nature of genes, their expression patterns, their role in development and disease, and how their products interact
Gaining traction in human health and forensics:
23 & Me
Commercially available profiling kits that encompass both STRs and informative loci (phenotypes, hair colour, eye colour, etc)
Gene in hand → create mutants → phenotype? → biological process
Genotype → phenotype
Change something in a known gene
Observe phenotypic effect
Find out why you see what you see
Understand what/how the wild type gene functions
In non-model organisms we often go directly from gene to phenotype
Does not have a published genome or is not an ideal species for lab research
Transmission of Genetic Information:
The laws of inheritance were first elucidated by Gregor Mendel
1856-1863
Mendel made careful crosses between pea plants that differed in particular characteristics (traits), counted the number of times the traits appeared in subsequent generations, and interpreted the results
Law of segregation:
Mendel crossed the F1 hybrids with many of the F2 plants having purple flowers; however, some had white flowers
Mendel observed a ratio of 3:1, purple to white flowers in the F2 generation
From these observations he developed a model to explain inheritance pattern
Inherited characteristics are determined by indivisible factors (genes) and alternative versions of genes (alleles) account for variations in inherited characters
For each characteristic, a diploid organism inherits two alleles, one from each parent
If two alleles at a locus differ, then one (dominant) determines the organism’s appearance, and the other (recessive) has no noticeable effect on appearance
The two alleles for a heritable character separate (segregate) during gamete formation
Law of independent assortment:
Mendel identified his second law of inheritance by following two characters at the same time: seed colour (Y, y) and seed shape (R, r)
Crossing two true-breeding parents differing in two characters produces dihybrids in the F1 generation, heterozygous for both characters (YyRr)
A dihybrid cross between F1 dihybrids (YyRr) can determine whether two characters are transmitted to offspring dependently or independently
Each pair of alleles segregates independently during gamete formation
Applies to genes on different chromosomes or genes that are located very far apart on the same chromosome
Genes located near each other on the same chromosome tend to be inherited together
The relationship between genotype and phenotype is rarely as simple as in the pea plant characteristics Mendel studied
Most heritable traits are not determined by a single gene (today referred to as one gene of large effect) rather many genes of small effect
However, the basic principles of segregation and independent assortment apply even to more complex patterns of inheritance
Genetic Disorders Following Mendelian Inheritance:
Cystic fibrosis: Recessive
Sickle-cell anemia: Recessive
Tay-Sachs disease: Recessive
Phenylketonuria: Recessive
Hemophilia: Sex-linked recessive
Huntington’s disease: Dominant
Muscular dystrophy (Duchenne): Sex-linked recessive
Congenital hypothyroidism: Recessive
Hypercholesterolemia: Dominant
DNA as the Genetic Material:
DNA is a polymer of nucleotides, each consisting of a nitrogenous base, sugar, and phosphate group
1950: Chargaff reported that DNA composition varies from one species to the next
Two findings became known as Chargaff’s “rules”
The base composition of DNA varies between species
In any species, the number of A and T bases are equal and the number of G and C bases are equal
Building a Structural Model of DNA:
After DNA was accepted as the genetic material, the challenge was to determine how its structure accounts for its role in heredity
Wilkins and Franklin were using a technique called X-ray crystallography to study molecular structure
1952: Franklin produced a picture of the DNA molecule using this technique
Watson and Crick thought the bases paired like with like (A with A and so on), but such pairings did not result in uniform width
Purine = Adenine and Guanine
Pyrimidine = Cytosine & Thymine
Instead, pairing a purine with a pyrimidine resulted in a uniform width consistent with the X-ray data
They determined that adenine (A) paired only with thymine (T) and guanine (G) paired only with cytosine (C)
The Watson-Crick model explains Chargraff’s rules: in any organism the amount of A = T and G = C
1953: Watson and Crick introduced the double-helical model for the structure of DNA
DNA, the substance of inheritance, is the most celebrated molecule of our time
Hereditary information is encoded in DNA and reproduced in all cells of the body
DNA strands are antiparallel and their backbones are negatively charged
Sources of Genetic Variation:
Many phenotypic differences between organisms are the results of differences in the genes they carry
Genetic differences arise through the process of:
Mutations: Change in genetic material (substitutions, deletions, insertions, translocations)
Recombination and crossing over: Exchange of chromosomal material between homologous chromosomes at meiosis
Random fertilization: billions of gametes – laws of segregation and assortment
Mutations:
Mutations are changes in genetic material of a cell or virus
Spontaneous mutations can occur during DNA replication, recombination, or repair
Approximately 110 mutations introduced to your offspring
Mutagens are physical or chemical agents that can cause mutations
Germline mutations: occur in gametes
Significant because they can be transmitted to offspring
Most from paternal line
Somatic mutations: occur in other cells of the body
May have little effect on the organism because they are confined to just one cell and its daughter cells
Cannot be passed onto offspring
The behaviour of chromosomes during meiosis and fertilization is responsible for most of the variation that arises in each generation
Three mechanisms contribute to genetic variation:
Independent assortment of chromosomes:
Each pair of chromosomes sorts maternal and paternal homologs into daughter cells independently (randomly) of the other pairs at metaphase I of meiosis
The number of combinations possible when chromosomes assort independently into gametes is 2^n, where n is the haploid number
For humans n = 23, there are more than 8.4 million (2^23) possible combinations of chromosomes
Random fertilization:
Adds genetic variation because any sperm can fuse with any unfertilized egg
The fusion of two gametes produces a zygote with any of about 70 trillion diploid combinations
Crossing over adds even more variation; each zygote has a unique genetic identity
Replication Variation in SARS-CoV2:
1: Spike protein on the virion binds to ACE2, a cell-surface protein
2: The virion releases its RNA
3: Some RNA is translated into proteins by the cell’s machinery
4: Some of these proteins form a replication complex to make more RNA
Proteins and RNA are assembled into a new virion in the golgi
6: New virions are released
This is non-mendelian transmission
Drug treatments typically target spike protein binding (1), RNA translation (3), and replication complex formation (4)
Rapid antigen tests detect spike proteins (1) and qPCR tests target proteins in the replication complex (4)
Variants arise from the replication complex (4)
Importance of Basic Concepts:
Our understanding of inheritance and genetics has evolved over the last 170 years
Understanding patterns of inheritance and generation of variation is fundamental to its application
Multiple disciplines exist that focus on improving our understanding in this area
Gene-phenotype receiving lots of attention in basic and applied research
Molecular genetics employs known principles of DNA structure and function to investigate the molecular basis for genotype-directed phenotypes under normal and pathological conditions
The term applied molecular genetics is used to describe the set of laboratory-based research tools that exploit the information potential or organismal DNA and use them for a variety of applications
Databases play an important role when it comes to application
Lecture 2: DNA Markers and Platforms
Key Terminology:
Gene: Any region of the genome (not restricted to protein coding)
Allele: Variation of a gene
Genotype: A combination of alleles
Haplotype: A combination of linked alleles
PCR: Polymerase chain reaction – makes millions of copies of a gene
Primer: Used to tell the PCR what gene to amplify
Goal of Molecular Methods:
Genetic methods simply aim to visualize and quantify differences
The Molecular Era:
Before the mid-1960s, no methods were available for directly assessing molecular genetic variation
In the mid 1960s, protein electrophoresis was invented
Direct assessment of heritable variation became possible
Protein Electrophoresis and Allozyme Markers:
Proteins move in the electric field (in gel medium)
Relative speed (distance) depends on the charge, size, and shape of the protein
Early protein electrophoresis studied revealed that the extent of genetic variation is much higher than previously thought
An early application was a study on northern elephant seals
Uniform homozygosity among 21 proteins encoded by 24 loci identified allele fixation and helped explain the decimation of the species and vulnerability of the species
Genetic Marker Acronyms:
Restriction enzyme and length variation:
RFLPs: Restriction Fragment Length Polymorphisms
AFLPs: Amplified Fragment Length Polymorphisms
Repetitive DNA (size of fragment = # repeats):
Minisatellites/VNTRs: Variable Number of Tandem Repeats
Microsatellites/STRs: Short Tandem Repeats
Direct assessment of nucleotide variation:
SNPs: Single Nucleotide Polymorphisms
Mitochondrial sequence
Chloroplast sequence
REs and DNA:
Restriction enzymes recognize sequence motifs and cut DNA at the motif
Cleavage products can be Sticky or Blunt:
5’ protruding ends (sticky)
3’ protruding ends (sticky)
Blunt
How enzymes cleave nucleotides can be leveraged for downstream applications
Making sequencing competent molecules
In an idealized genome:
RE cuts every 4^length of motif
This is a base of 4 because there are 4 nucleotides
RE that has a 4bp motif cuts every 4^4 of 1 in every 356 bp
Some REs do not recognize methylated sites and cutting is often not 100%; there is variation in repeatability
Genomes vary in nucleotide composition, which is important for experimental design and genome coverage
The RFLP Principle:
RFLPs were detected by hybridizing radioactively labelled probes to DNA, transferred from a gel to a filter “Southern Blotting”
DNA restriction fragments
Agarose gel electrophoresis
Fragments separated by size
Transfer of DNA fragments from gel to nitrocellulose filter
Radioactive DNA probe hybridized to fragments on filter
Autoradiograph showing positions of hybrids DNA
Parentage test with RFLP:
RFLPs were detected by hybridizing radioactively labelled probes to DNA, transferred from a gel to a filter (“southern blotting”)
AFLP (and RAPDs):
AFLPs are essentially RFLPs but followed by a PCR step to be more selective
Random Amplified Polymorphic DNA: No enzyme, rather a random set of primers and PCR
Produces a profile for small fragments
RAPDs rely on primers; a string of nucleotides
Given what we know about RE motifs, how often would we expect to find a matching 10 bp primer across a genome? 4^10 = every 1 million
Minisatellites:
Repeat unit > 9 bp, usually 30 bp
Number of repeat units are scored when they are used as genetic markers
Among the first markers to be used for DNA fingerprinting
Microsatellites:
Di, tri, tetra… repeat units
Shorter than VNTRs
Humans vs non-humans
Replication Slippage:
Microsatellites arise through polymerase slippage during DNA replication and a higher rate of replication slippage with more repeats
Dissociation and subsequent mispairing gives change in number of repeats
The length of strand increases the risk of slippage
Point mutations can terminate replication
Scoring Microsatellites in Practice:
You score the number of repeats of a particular repeat unit
You do not know (or care) if it's an imperfect repeat
Score both alleles (or variants) in each individual based on size
Allele or variants gets named and scored in individuals
Huge Number of Applications:
Abundant in the genome (human genome contains > 400,000); highly variable and neutral with respect to fitness (phenotype); simple and cheap to genotype
Many microsatellites give you a genotype; variation in genotypes leads to population structure; variation in genotypes leads to population structure (non-random mating)
Microsatellites as Genetic Markers:
Some problems with using microsatellites as genetic markers include:
Time and labour-consuming to develop primers from non-model species
Often do not transfer well between species (may not amplify or may not be variable)
Null alleles or PCR-induced mutations can cause problems
Difficulties in modeling the mutation process poses problems for population genetics
Basic Principles of DNA Profiling:
The genome of each individual is unique with the exception of most identical twins
Probe subset of genetic variation in order to differentiate between individuals; statistical probabilities of a random match are used
DNA typing must be performed efficiently and reproducibly as information must hold up in court
Current standard DNA tests do not look at genes: they give little to no information about predisposal to disease or phenotypical information is obtained
DNA Typing Applications:
Forensic science:
Human and non-human
Solving crime (compare suspect with evidence)
Identifying accident victims, soldiers, missing persons, mass disasters
Paternity testing
Conservations genetics
Deciphering evolutionary relationships
Pathogen detection and monitoring
Diagnosis of disease
Breeding in agriculture
DNA Typing in Forensics:
A DNA profile by itself is fairly useless because it has no context
DNA analysis for identity only works by comparison, you need a reference sample (databases)
Crime scene evidence compared to suspect(s)
Child compared to alleged father
Victim’s remains compared to biological relative
Soldier’s remains compared to direct reference sample
Three DNA typing outcomes:
Exclusion (no match)
Non-exclusion (inclusion)
Inconclusive result (no result or complex mixture)
Single Nucleotide Polymorphisms:
A SNP is a single nucleotide variation at a specific location in the genome
Common in most genomes: In humans, average 1 SNP per 1000 bp across the genome
Fewer SNPs in coding regions, wide variation in SNP density across most genomes
Genes must be highly conserved, vital to organism
SNPs are popular genetic markers because:
They are abundant
They can be genotyped in a high throughput manner
The mutation mechanism is well established
Most often are biallelic
SNPs are frequently used in associations studies and in population and evolutionary genetics
Humans, agricultural animals
You need to know where your SNPs are; first do SNP identification
SNPs are usually identified by sequencing a limited pool of individuals
By this procedure, only common SNPs are found
This can create an “ascertainment bias”
Less power of discrimination per locus for individual identification compared to microsatellites generally need more SNPs compared to microsatellites
Can give you clues into phenotypes
Mitochondrial Sequence:
Haploid and maternally inherited
Approximately 17000 BP (37 genes)
All SNPs (alleles) are linked and inherited together
mtDNA halotypes
Variation in haplotypes is very useful for species identification and provenance
Cytochrome b is used for mammalian ID
Control region (faster evolving, species and provenance)
Chloroplast Sequence:
dsDNA
Plants and algae
120-247kbp (most are small)
100+ genes
IR sequence
Repetitive regions
Vary in size and position between species
Why SNPs Might Not Replace STRs
Large databases containing STR information; would need to replace data on existing samples with new DNA markers
Databases for STRs are already established
Mixture detection and interpretation benefits from marker systems with alleles; SNPs only have two alleles and three genotype possibilities
Degraded DNA can be successfully analyzed with STRs in many cases, thus removing a primary motivation for using SNPs
SNPs and Microsatellite Profiling:
Probability of identity (PI)
Likelihood of two people having the same profile (genotype)
Different than RMP, which represents the estimated frequency of a profile occurring in a given population database
Based on H-W principles
p is the frequency of allele A in a population
q is the frequency of allele a in a population
p^2 and q^2 are frequency of homozygotes
2pq is the frequency of heterozygotes
Why is there a difference between SNPs and microsatellites?
Example locus of 4 alleles with equal frequency
p^2 = aa,bb,cc,dd = 0.0625 for each homozygote
2pq = ab, ac, ad, bc, bd, cd = 0.135 each heterozygote
P1 = sum of each genotype frequency squared
(0.0625)^2…+(0.135)^2… = 0.109375
H-W Example:
Population contains: A, A, A, A, a
p = ⅘ = 0.8
q = ⅕ = 0.2
Frequency of AA = (0.8)^2 = 0.64
Frequency of aa = (0.4)^2 = 0.04
Frequency of Aa = 1 - 0.64 - 0.04 = 0.32
SNP Sanger Sequencing:
Chain termination with ddNTP (dideoxynucleotide)
4’ OH group replaced with an H atom to terminate synthesis
ABI 3730 series genetic analyzer is the largest fully automated system
Other smaller models can be used to best suit lab needs
Sanger sequencing = chain termination sequencing
Automated sanger sequencing with dye termination:
Only one lane is needed
ddNTPs are labelled with different fluorescent dyes (one per base)
Hit with a laser and photograph
Shorter fragments migrate faster through the capillary tube
Produces good quality sequences up to 1 kb (1000 bps) but normally 60-700 is good quality
Somewhat costly and not high-throughput (each sample requires a sequencing reaction)
Dominated the sequencing market until very recently and is still extensively used
Best method for around 3 decades until Next Gen Sequencing
SNPs are good for small data, high quality
Provides the most complete set of genetic information but has until recently been to expensive for large-scale studies
Next-Generation Sequencing:
Massively parallel sequencing, short-read sequencing, second-generation sequencing, high-throughput sequencing
A suite of new sequencing technologies capable of producing a lot more sequence information, in much shorter time than Sanger sequencing
Emerged from development in nano-technologies
Generally (Illumina) produces shorter reads with lower quality at each base than Sanger sequencing
The longest read chemistry available is 350 bp, the current norm is 125-150 bp
Length and quality issues are outweighed by the massive number of sequences obtained (compared to Sanger)
Huge implications for all commercial and research programs that rely on genetic information (medical genetics, evolutionary biology, population genetics, forensics)
There are several new technologies:
Roche 454: Titanium, 500 bp
Illumina: NextSeq, 150-300 bp
ABI SOLiD sequencing: ligation based, 100 bp
Pacific Biosciences: Single Molecule (SMRT) Sequencing
Ion Torrent: ion semiconductor chips, 100-200 bp
Different technologies have advantages for different applications
Illumina Technology:
DNA is modified to make it compatible with the flow cell
1. DNA fragmentation and adapter ligation
2. Bridge amplification on solid support: Each fragment is amplified to generate a cluster
3. Sequence cluster in real time
Ion Torrent:
Hydrogen ions are released when a nucleotide is added
Similar concept to pyrosequencing
The electrical current change which occurs when hydrogen is released is recorded
PacBio:
Start with high-quality double-stranded DNA
Ligate SMRTbell adapters and size select
Anneal primers and bind DNA polymerase
Circularized DNA is sequenced in repeated passes
The polymerase reads are trimmed of adapters to yield subreads
Concensus is called from subreads
RADseq:
High-throughput sequencing of AFLPs
Genomic Data:
Defined as high-throughput sampling of the genome
Our genome is 3,000,000,000 basepairs
Most of the genome is not polymorphic
We are now getting 1000,000s of SNPs
Information on recombination
Information on methylation
For now, genomics = 1000s to 1000,000s of SNPs
Biggest shift in the past couple years has been to long-read single molecule sequencing
Massively Parallel Sequencing:
The critical difference between Sanger sequencing and HTS is sequencing volume
Sanger method only sequences a single DNA fragment at a time
HTS is massively parallel, sequencing millions of fragments simultaneously per run
This high-throughput process translate into sequencing hundred to thousands of genes at one time
Poses a computational challenge in putting together millions of short reads into a meaningful sequence
Easier is there is a reference genome sequence
Tools are rapidly being improved and developed
Bioinformatics is a necessary skill to have
Applications to MPS technologies:
Genome sequencing
Expression analysis (RNA) including ID of noncoding RNAs
Analysis of methylation and chromatin packaging
Population genomics and genotyping
Epigenetic Assays:
Most approaches use the above sequencing techniques
PacBio simply detects methylation
Bisulfite conversion involves the deamination of unmodified cytosines to uraci
Lecture 3: Non-Human Forensic Applications
Marker Terms and What they Identify:
Genes: Any region in a genome
Alleles: Variation in a gene
Haplotype: Alleles inherited together
SNPs: Phenotypes, individuals, populations
Microsatellites: Individuals and populations
Mitochondria: Populations and species
Chloroplast: Species
Wildlife Forensics:
Native species
Domestrics
Exotics
Wildlife conservation: Enforcement of harvesting laws and agreements
Largely based on microsatellites and mtDNA
DNA applications:
Native species
Species ID
Sex ID
Individual ID
Population assignment
Parentage
Novel Applications:
Environmental DNA
Models of illegal trade
The NRDPFC Forensic Laboratory:
1987: First case submitted
HAs seen more than 1000 cases
Provided evidence for every province and territory in Canada, plus some US states
1991: Provided the first DNA evidence for a wildlife infraction to be accepted into a North American court
2002: Involved in the first human homicide investigation with the OPP
Lab involved in case against man found with Black Bear gallbladders
Why do we need Wildlife Forensics?
Native species:
Has seen more than 1000 cases
Poaching (diminish natural resources)
Illegal movement of animals (introduction of non-native species)
Ranched animals and domestics:
Track source and spread of disease
Animal cruelty cases
Exotic species:
Illegal trade in animal parts: CITES and WAPPRIITA
Threats to species sensitive to overexploitation
Wildlife Investigations:
Investigation by wildlife officers similar to police investigators
Gather evidence regarding offences, verify accusations, carry out searches, testify in court
The ID of tens of thousands of protected species, their parts, and derived products
Requires broad scientific knowledge: DNA largely bypasses that need
The Convention on International Trade in Endangered Species (of wild fauna and flora)
Sets controls on the international trade and movement of animal and plant species that are/will be threatened
Appendix 1: Species are rare or endangered, and trade will not be permitted for primarily commercial purposes
Appendix 2: Species are not rare or endangered at present but could become so if trade is not regulated
Appendix 3: Species are not endangered but are managed within the listed nation
Wild Animal and Plant Protection and Regulation of Interprovincial and International Trade Act
Legislative vehicle by which Canada meets its obligations under CITES
Various national and taxa specific laws
World Wildlife Fund
Monitors wildlife trade
Founded to assist in the implementation of CITES
Illegal animal trade is a multi billion dollar industry
There are about 200-300 agents/officers in the US monitoring illegal trade
In Ontario:
Fish and Wildlife Conservation Act
Endangered Species Act
Pangolin Trade Case Study:
Most common trafficked mammal
1,000,000 likely traded between 2006-2015
Moved to CITES Appendix 1 in 2016
200,000 trafficked in 2019
All 8 species at risk of extinction
Trafficked for their scales which are powdered or made into paste for medicinal purposes
There is no scientific basis for their use; the scales are made of keratin, the same material as our fingernails
Fetuses are consumed in soup and are used to demonstrate social status
Loss of pangolins is detrimental to ecosystems and communities, they eat termites which would otherwise eat agricultural crops
Narwhal Tusks Case Study:
The Marine Mammal Protection Act in the United States
Man smuggled 250 narwhal tusks worth up to 3 million dollars into US
Marine mammals parts include any part of a marine mammal, both hard and soft, including parts derived from tissues, such as cell lines and DNA, but do not include urine or feces
Elephant Ivory Trade Case Study:
20,000 elephants illegally killed in 2015
Legal sale of 107 tonnes of ivory in 2008 intended to flood market and crash the illegal trade market
This backfired, illegal poaching increased
Molecular markers (microsatellites) have been used to identify the networks of elephant trafficking
Collection of elephant dung samples across Africa to create a DNA map of their genetics
DNA extracted from ivory could then be cross-referenced to this database to figure out where the elephant was poached from
Tracking ivory is linked to terrorist activity and has allowed for the identification of poaching hotspots and trafficking networks
Armed guards posted in hotspots to protect wildlife
More policing/.governance of animals has lead to a decrease in death
The GDP of China increases as the number of poached elephants increases
Rhino Ivory Trade Case Study:
Rhinoceros horns are worth up to 40,000$/kg
Horns used in medicine, carvings, statues, and status symbols in Africa
Blood and urine used for traditional Chinese medicine
Poaching is projected to exceed natural growth rates in much of Africa
DNA forensic tests track rhino poachers; daggers or powdered medicines are tracked back to source populations
A DNA database out of South Africa (RHODIS) has over 15,000 animals
24 microsatellites, sex, and species markers
Over 2,000 forensic cases, multiple with >10 year jail terms
In these cases the horn is often linked to the carcass or tool
Sexing rhinos: Zinc finger Gene found on the X and Y chromosome but there is a length polymorphism between X and Y
The male sex has two peaks (heterozygous)
Chilean Sea Bass Case Study:
Also called Patagonian toothfish
Long lived (50 years old)
Unsustainably fished
Became overharvested; Wholefoods stopped selling it
Seafood Watch suggested avoiding this fish
One population is certified sustainable (Marine Stewardship Council eco-label)
Whole Foods started selling MSC eco-labeled fish
Researchers looked at whether chilean sea bass carrying the MSC eco-label actually come from the certified fishery
Mitochondrial data was used in this study
This a haplotype that is inherited all at once
8% of eco-labeled fish were not Chilean Sea Bass to begin with
15% did not come from the certified population
This is not restricted to Sea Bass, adulteration is widespread
Now: Sea bass populations are rebounding and approximately 60% of stocks are considered sustainable
Monitoring Harvested Species:
Harvested for sport, food, fur, and other derivatives (bones, urine, gallbladder for asian medicines)
Not always clear how many animals and from what populations
Use DNA profiling to define populations and non-invasively census
Use DNA to enforce wildlife laws in the illegal harvest and trade of wildlife and better manage harvests
Goat Poaching Case Study:
Hunter in possession of a goat claimed it was female
The Alaska Dept. of Fish and Game and State Troopers believed the goat was male
5 DNA samples from different tissues were used for individual and species identification
DNA was extracted and amplified
Used gel electrophoresis for sex determination
All samples were from male goat
All samples were from the same individual
Wildlife Profiling Issues:
Development of species-specific markers
Low genetic variation and lower power of inclusion
Need to establish databases for each species across their distributions
Need to identify species-specific markers
Markers for closely related species can be transferable
No standardized loci among wildlife labs, needs to happen as the number of labs increases
Calculations to establish PI need to be conservative when few individuals in database from region of crime scene
Main difference to human forensics: Need to identify and distinguish between a variety of species (mtDNA or cpDNA)
Might also include:
Sex determination (X/Y)
Individual identification (microsatellites)
Population identification (microsatellite/SNPs)
Parentage (microsatellites)
Microbial DNA Forensics:
The properties that make microorganisms potentially harmful for cultivation and outbreaks:
Accessibility
Culturability
Capability for large-scale production
Stability during preparation
Ability to retain potency during transport and storage
Ability for dissemination
Stability and retention of potency after dissemination
Incubation period
Infectivity
Lethality
Pathogenicity
Toxicity
Transmissibility
Virulence
1,415 species known to be pathogenic to humans
217 are viruses and prions
538 are bacteria and rickettsia
61% are zoonotic and 33% are transmissible among humans
Human forensics relies on 20 microsatellite loci that are useful for the majority of scenarios encountered: This is not the case for microorganisms
Unique genetic identification of a microorganism is likely never possible
Clonal nature of many microorganisms means we cannot pin down the source
Less than optimal population and phylogenetic data means it is difficult to develop genetic resources
Limited historical and epidemiological information means limited databases and understanding of trends
Widespread demand is somewhat limited but some well known examples are documented
Anthrax
Epidemiology Case Studies:
Aq handful of nosocomial (originated from a hospital) outbreak studies have been performed in real-time with the goal of reducing the duration and impact of transmission
Also being applied in the field (outside the hospital) as sequence data is useful to monitor outbreaks, but needs to be delivered in a timely fashion
Acinectobacter baumannii Case Study:
Multi-drug resistant microorganism
Opportunistic pathogen in humans, affecting people with compromised immune systems
Outbreak lasted almost 2 years
Genome sequencing revealed 57 potential transmission events that linked 55 patients by genomic (SNP) analysis
Could identify likely transmission events and sites
Burn-patient operating theatre and specialized burn-care bed were identified as contamination sources
A total of 7 genotypes were identified over the course of the outbreak
Reproduce several thousand times a day
The outbreak began with a contaminated bed; the patient left the hospital and the spread began three weeks later
A subsequent incident occurred in a contaminated burn theatre
Carabapenem-Resistant K. pneumoniae:
Bacterial pathogen Klebsiella pneumoniae is responsible for roughly 15% of Gram-negative infections in hospital ICUs
Primarily affects immuno-compromised patients, and outside the hospital is manifested as pneumonia
Infections caused by carabapenem-resistant strains; have few treatment options and are associated with mortality rates upwards of 50%
Genome sequencing is used to track an outbreak of carbapenem-resistant K. pneumoniae in the U.S. National Institutes of Health (NIH) Clinical Center that colonized 18 patient, with 10 deaths
Implementation of rigorous infection control procedures ultimately halted the outbreak
Sought to better understand how the outbreak progressed to learn how to control future outbreaks
Genome sequencing of isolates from the index (P1) patient taken from four body sites (urine, groin, throat, broncho-alveolar lavage)
Sequenced isolates from each of the affected patients to find SNPs (6Mb genome) to infer transmission links
Analysis of 41 SNPs to link patients
Suggested three independent transmission events from this patient lead to hospital-wide dissemination of the outbreak strain
Used sequencing and patient overlap in hospital wards to predict the most probable transmission sequence
First transmission through patient 3
Second transmission though patient 4 by other patients as silent transmission vectors (five suspects)
Third transmission linked to a contaminated ventilator
SARS-CoV-2 Study:
Coronaviruses (CoV) are a broad family of viruses named after the crown-like spikes on their surface
These viruses can be found in people and animals
CoV typically causes mild to moderate upper respiratory tract disease in humans, but can also cause more severe infections such as pneumonia
In Nov/Dec 2019, an outbreak of a novel coronavirus in Wuhan China originally infected 40+ people, most of whom developed pneumonia and has contact with the same seafood market
The N protein (end gene) in the SARS genome is used to identify different variants
PCR and rapid antigen test results:
Positive (CoV detected)
Negative (CoV not detected)
Inconclusive/invalid
Many mutations which lead to new variants occur at the end of the genome, where the membrane and spike proteins are encoded
Targeting these genes helps identify variants via sanger and Next Gen sequencing
SARS-CoV-2 is zoonotic; a variant emerged in white-tailed deer with deer-to-human transmission
What is a Microbiome?
A community of microorganisms
Can be detected through PCR amplification of the 16S ribosomal DNA gene sequences followed by Illumina sequencing
Measure aspects like diversity and species composition
The 16S approach:
Extract DNA from microbial community sample
Amplify and sequence 16S rRNA
Group similar sequences and quantification of OTUs representation
Complete sequencing technique:
Extract DNA from microbial community sample
Use shotgun and high-throughput metagenomics approaches
Sequence community DNA
Compare sequence to reference genomes and quantification
Microbiome Case Study Examples:
Mountain goats:
Compared fecal microbiomes of captive vs wild mountain goats
Identified genus more abundant in wild and captive communities
Microbiome is influenced by diet and environment
Pandas:
Captive pandas were reintroduced into the wild but only 6/9 survived
Wild pandas were found to have different gut/fecal microbiomes than captive pandas
Fecal microbiome transplants from wild to captive pandas to support reintroduction programs
The use of Microbiomes in Forensics:
Use of microbiome assessments provides associative evidence between people and objects with places or other people
Example: unknown soil on a suspect’s clothing or shoes could be compared with a known control soil sample from the crime scene
Soil contains a diverse community of microbes that can vary significantly in composition from one site to the next
Post-mortem interval (necrobiome)
For most PMIs, the focus becomes predictive modeling
Given a single microbiome profile, the PMI can be estimated
Machine learning algorithms and cross-validation are typically applied
Linear regression for PMI:
Y = B0 + B1X1 + B2X2 +...+ E, where B1X1 represents microbiome metrics
Automates variable selection and addresses multicollinearity
Cross-validation using test and training samples from the complete data
R^2 value is a score from 0-1 to evaluate how the linear regression evaluates the variation
Trace Soil Evidence in Forensics:
Traces of soil evidence may be associated with suspects and analysis can link the suspect to the crime:
Gas chromatography
Electron microscopy
Analysis of soil microorganisms has to date largely been ignored by the forensic scientific community
Physically look at soil, properties, elements
Mainly due to the limitations of traditional culturing techniques, which allow only a small subset of organisms to be isolated and characterized
Techniques have been developed to circumvent the requirement to isolate and culture microorganisms as a prerequisite to identification
Soil microbial diversity is now characterized using simple molecular techniques (mostly) based on amplified rRNA (gene) DNA
The potential exists for such technologies to be used to analyze microbial populations in many diverse environments for forensic purposes
Soil serves as powerful contact trace evidence because it is highly individualistic and has a high transfer and retention rate
Standard analyses examine intrinsic properties of soil including mineralogy, geophysics, texture, and colour
Soils can support a vast amount of organisms, which can be examined using DNA fingerprinting techniques:
Terminal restriction fragment length polymorphism (T-RFLP)
Denaturing gradient gel electrophoresis (DGGE)
Random whole metagenomic sequencing
High-throughput sequencing
In practice:
Amplify 16S rRNA gene of the bacterial soil community DNA (1300 bp)
Each primer is fluorescently labelled at its 5’ end
PCR products are digested with restriction enzymes and separated using capillary electrophoresis
The output is an electropherogram that shows the digested PCR products
Fragments differ based on peak intensity and fragment length
Variations in the number and size (+/- 1 bp) of the peaks for each profile were compared and a Sorenson’s similarity index:
Cs = 2Nab/(Na + Nb)
Cs = sorenson’s similarity index
Nab = Number of matching peaks
Na = Total number of peaks in soil A
Nb = Total number of peaks in soil B
Bacterial Community DNA Profiling Case Study:
A soil microbial community DNA profile was obtained from the small sample of soil recovered from the sole of a shoe and from soil stains on clothing
The profiles were representative of the site of collection and could potentially be used as associative evidence to support a link between suspects and crime scenes
Soil community profiles were obtained using the Terminal-RFLP fingerprinting method that uses fluorescent primer technology and semi-automated analysis techniques
1. DNA is extracted from the microbial sample
2. PCR is used to amplify the gene of interest
3. Fluorescent primers copy and label the gene fragments
4. Restriction digest cuts fragments
5. Electrophoresis separates the labeled fragments by size
6. Laser detection of colours representing OTUs
Forensic Comparison of Soils by Bacterial Community:
Differences in profiles with time are not unexpected as seasonal fluctuations in parameters such as rainfall and temperature, may impact on the microbial community causing population shifts
To be routinely applied to forensic casework the methodological parameters than can significantly influence variability in profiles need to be determined (optimization of DNA isolation, PCR conditions, PCR product digestion)
Likely require more sophisticated approaches to the statistical interpretation of Sorenson’s similarity index
DGGE Analysis of 16S rDNA of Soil Bacterial Populations:
The denaturing gradient gel electrophoresis uses gradient DNA denaturant to separate amplicons of roughly the same size based on sequence properties
A GC camp within the last 5 nucleotides of primer is used in a special primer to anchor the PCR fragments together once they have denatured
When the fragment reaches its melting point, it stops moving
Partially melted dsDNA can no longer migrate through the gel
1. Samples from different microbial communities
2. Extract DNA from each community
3. Add GC clamp (anchors DNA together once its denatured)
4. Each lane in gel electrophoresis represents one microbial community
5. Band alignments represent shared genes amongst communities while lone bands represent unique genes within the communities
Limitations:
There is a strong bias for dominant populations
Biases generated by differential DNA extraction and PCR amplification, and bands can migrate to the same position (different genes of same size)
Lower specificity
Case Study:
A blind test on soils from a crime an alibi scene and unrelated locations was conducted to evaluate the potential of environmental PCR and DGGE for use in forensic science
The primers for PCR were specific for conserved bacterial 16S rDNA sequences
Isolated PCR products of the expected size (550 bp) were subjected to DGGE
Digital image of gel and software performs cluster analysis
In most cases, soil patterns clustered according to soil type and location
Has potential but similar limitations to that of T-RFLP
Forensic Soil DNA Analysis using High-Throughput Sequencing:
High-throughput sequencing offers a means to improve discrimination between forensic soil samples by identifying individual taxa and exploring non-culturable species
Less than 1% of bacteria grow in a lab (no new class antibiotic has been discovered in the past 30 years)
Technology that permits screening 10,000s of genes in many individuals simultaneously
Previous genetic approaches relied on patterns of fragments length variation produced by amplification of unidentified microbial taxa in the soil extract and, these provide little resolution of spatial and temporal communities
HTS technologies provide the ability to rapidly generate a detailed picture of soil microbial communities but require bioinformatics and significant computational abilities
Reproducibility is a key area of concern when utilizing HTS in forensic casework
In DGGE, is it always the same genes that amplify? How is that impacted by sample quality? No, this is a limitation
Validation standards must be met, which includes demonstrating:
Reproducibility
Robustness
Reliability
Forensic Soil DNA Analysis Limitations:
Temporal effects could impact the DNA profile obtained since a time lapse often exists between the criminal activity and the recovery of reference samples during investigations
Storage conditions of evidence samples could adversely affect the soil community profile
When soil is removed from the environment by transfer to an object, such as a shoe sole, conditions will be altered and in many cases such soil will dry-out
Soil moisture and sample drying will be critical in establishing the robustness of this method in practice, and further determining the most reliable target data
Transfer of soil to objects also introduces further limitations:
Soil particles differ in persistence, with larger aggregates being primarily lost
Soil from the site of interest can be mixed with other soils, for example, a car tire may contain multiple layers of soils from different locations
These alter soil composition and presumably the microbiota detected within samples, therefore careful consideration is required prior to analysis and interpretation of data
Microbiome and PMI:
Current popular research topic but considerations:
Animal surrogates decompose differently and different microbiomes
Predictive models need to be developed for each region to account for factors like environment
Appreciate and understand the Mean Absolute Error of predictive models
Introduction to eDNA:
Species and population monitoring has traditionally relied on physical identification by visual surveys and counting of individuals
Complex sightability models of one species
In many cases these techniques fall short of performing efficient and standardized surveys
Phenotypic plasticity (same genotype, different phenotype)
Similar species
Sex and age class
Terrain and weather
Some traditional methods are invasive/destructive on the species or ecosystem under study
Physical removal
By-catch and physical damage
Morphological identification is heavily dependent on taxonomic expertise, which is often lacking or in rapid decline
eDNA:
Environmental DNA can be defined as trace DNA released from skin, mucous, saliva, sperm, secretions, eggs, faeces, urine, blood, roots, leaves, fruit, pollen, and rotting bodies
eDNA is a mixture of potentially degraded DNA from many different organisms
This definition remains controversial due to the sampling of whole microorganisms that might appear in an environmental sample
DNA based species identification or DNA barcoding is used in order to detect species through extracellular DNA or cellular debris, present in environmental samples, coming from cell lysis or living organism excretion or secretion
Can detect species without catching, seeing, or hearing them
eDNA methods are accelerating the rate of discovery, because no a priori information about the likely species found in a particular environment are required for ID of said species
Methodology overview:
1. Environmental sampling
2. DNA extraction
3. Amplification of a gene fragment using universal primers
Next-Generation sequencing
Comparison to reference databases
If a sample is not in a database, it is possibly a new species
Absence of evidence is not evidence of absence
Can to some degree give you trends
Who Works with eDNA:
Those working with invasive species, biodiversity surveys and understanding functional diversity, wildlife and conservation biology
Consulting companies now offer eDNA as a service
Many government agencies
US Army Corps of Engineers monitor Asian carp
General focus on aquatic environments but changing
The first application of the method in aquatic environments dates to 2008 where the invasive american bullfrog was successfully detected with an eDNA method in France
Applications are increasing far and wide
What is eDNA?
eDNA can be extracted from environmental samples such as soil, water and faeces without having to isolate the target organism
Often a standard or modified DNA extraction protocol
Composed of living and dead cells; intracellular DNA and extracellular DNA (from natural cell death and the subsequent destruction of its structure)
To date, eDNA studies have predominantly focused on:
The method (improving, optimizing, comparing to standard field techniques)
Identifying species specifics (pathogenic, endangered, invasive, genetically modified, and game species)
Reconstruction of diets and ancient communities
eDNA in the Environment:
DNA persistence in nature varies by environment (marine, freshwater or terrestrial) and substrates (water, soils, and sediments)
In marine and freshwater environments, eDNA persistence varies considerably between studies, ranging from a few hours up to a month
eDNA persistence may vary within the water column where higher levels of DNA degradation are observed at the top (warmer and exposed to UV radiation) compared to the bottom (colder) layer
In general, cold and dry conditions severely slow down DNA degradation whereas, warm and wet conditions rapidly facilitate degradation
In sediments and in terrestrial soils, a very high proportion of DNA can persist for long periods, adsorbed to organic or inorganic particles that protect it from several possible degradation agents
In marine sediments it was demonstrated that extracellular DNA turnover is around 200 times slower than in sea water (93 days in sediments vs 10 hours in sea water)
Factors Influencing eDNA:
DNA is degraded by endonucleases (enzymes that cut the nucleic acids into smaller fragments), water, UV radiation, and microorganisms
In aquatic ecosystems (or samples containing water), the main process that causes DNA damage is hydrolysis and the breakage of the DNA sugar backbone
In order to stop this process, samples should be quickly dried, or put in a solution containing a preservative (alcohol)
UV radiation can disrupt the DNA base-pair bonds
eDNA was not detectable in samples exposed to sunlight (8 days); detected in samples stored in the dark (up to 18 days)
Microorganisms consume DNA as a source of nutrients (carbon, nitrogen, phosphorus) and to repair their own DNA
The rupture of a cell releases DNA and cellular fluids into the environment, which stimulates the growth of microorganisms and leads to further DNA degradation
Increased chlorophyll has shown increased faster rates of eDNA degradation
Increased chloroplast = increased decomposition rate
Advantages of eDNA:
Studies have found higher detection probabilities using the eDNA method compared to traditional methods
This does not apply to all species, but is true for rare and secretive/cryptic species, or invasive alien species at early stages of the invasion
Once optimized, obtaining an environmental sample can be carried out in a very standardized manner across sites
Traditional methods rely on the taxonomic knowledge and experience by personnel carrying out the surveys
eDNA is a non-invasive method that inflicts little to no damage on the species or habitats under study
For several species, traditional surveys are difficult outside particular seasons or certain conditions
Several studies report shorter handling time and lower cost using eDNA compared to traditional monitoring techniques
eDNA is often more cost-efficient than traditional methods because of its higher detection probability
2.5 times cheaper, less time consuming than traditional methods for surveying invasive American bull frogs
For species that are easy to observe or catch, eDNA will be less cost effective than traditional methods (context dependent)
With eDNA methods, DNA-free materials are used between locations and many precautions are taken to prevent contamination with DNA
This helps to reduce the risk of unintentional translocations of invasive species, or transmission of pathogens into new areas
For example, the infectious amphibian disease chytridiomycosis, could be introduced into new areas by conventional field gear that is not disinfected
Use positive and negative controls to assess contamination
Limitations of eDNA:
The majority of eDNA studies use a mitochondrial gene as a marker because there are hundreds to thousands of copies of mtDNA per cell (this is barcoding)
Smaller, circular, highly conserved, abundant
This high copy number enhances the likelihood of detection of the DNA in degraded samples such as environmental samples; also think about its structure
Some mitochondrial genes (Cox1, Cytb, control region) also have an evolutionary rate, which makes them more appropriate for species ID based on the genetic variation
mtDNA is usually maternally inherited, which does not allow the ID of hybrid organisms, but only permits determination on the maternal species of the organism
This is a major limitation in cases where an invasive species hybridizes with a native species since they do not get DNA from both parents
Might also require conventional methods (morphological characteristics to identify hybrids)
Assay development and bioinformatics are not straightforward
eDNA is not universally entirely quantitative
Current methods thus are only capable of detecting the presence or absence of a species
Contrary to conventional methods, no information can be collected on life stages, demography, fecundity or health of the target species – all critical to management
There is no collateral knowledge, it is gained from observations with traditional methods
eDNA is not homogeneously distributed throughout a water body (depth, shoreline)
Sampling locations should be selected according to the habitat preference of the target species
Samples should be collected from areas where the target species is most likely to be detected
Weatherfish in the Netherlands were detected 2.5 times more often in habitats judged to be good
Random sampling or by non-specialists may not work
Lecture 5: Non-Human Models for Human Applications
Questions of Interest to Researchers at Trent (Shafer Lab):
Estimating time since deposition of blood
Estimating rate of secondary DNA transfer from blood
Clear applications to human forensics
Use cow blood and semen for all experiments (pigs currently being used for decomposition)
Why Cow Blood?
Easier to work with than human blood
Species specific attributes of the blood (viscosity, hematocrit levels, but useful (relevant) alternative and accessible
Time Since Deposition:
No accurate or robust way to measure time since deposition of blood on crime scenes
Important for establishing a time frame and we know blood undergoes a series of changes after being deposited ex vivo
Can we determine the TSD of blood by pairing spectroscopy and high-resolution automated electrophoresis
A tapestation machine is used for automated gel electrophoresis
Tapes containing gels with lanes fit into the machine
The standard ladder contains fragment sizes from 45000 bp to 250 bp
The top panel shows the size of any detectable DNA fragments
The bottom panel shows us the quantity of fragments at each size via quantification w intercalating dye detection
Spectroscopy (Luminometer) shows blood degradation over time
The alpha and beta Soret bands reflect the stages of heme over time
“A Blue Spectral Shift of the Hemoglobin Soret Band Correlates with the Age (Time Since Deposition) of Dried Bloodstains
Overall, DNA and RNA concentrations are expected to decline over time, but finer metrics such as shifts in DNA fragment length have not been quantified
Fragments should become shorter over time, decrease on long fragments, increase in short fragments
Methodology of Hemoglobin Soret Band Correlates with TSD of Dried Bloodstains:
15 uL fresh pathogen free bovine blood
10 samples for each (5 TapeStation, 5 Luminometer)
11 time intervals (0-36 hours)
Room temperature (Part 1)
Warm and cool environments (Part 2)
Look for a correlation to time and test predictability via cross-validation
FTA paper: preserves blood samples
Results:
Clear signature but not hourly; fresh vs old (days vs week)
Controlled environmental and high individual variation are current limitations
Difficult to implement in a court of law as we see environmental effects
Could RNA provide a finer resolution?
Automated Gel Electrophoresis (Tapestation) for RNA:
The RNA integrity number equivalent (RINe) delivers an objective assessment of eukaryotic and prokaryotic total RNA degradation
There is a strong inflection point at 28 hours for the ratio of RNA fragments above 200 bp (DV200)
RNA decays drastically after 24 hours; could this be a diagnostic window?
Could targeting specific RNA genes improve the model? qPCR techniques could be used to target specific genes
Time Since Deposition:
High variation, often non-human
Could this be used in a court of law?
Non-human surrogates are often used
Does our foundational understanding through human blood surrogates meet these standards
Often meet 3Rs but we must apply them to humans
If yes, how transferable are these patterns to human blood?
DNA Transfer:
How common is it?
What factors promote DNA transfer?
Important for understanding potential links to evidence; has also been used as a defence
Can determine the rate of secondary DNA transfer via qPCR and genotyping STRs
If a DNA profile is identified at a scene, does that mean the ID was involved in the crime?
DNA transfer examples:
It was not possible to say whether the DNA had been deposited directly (primary transfer) or indirectly (secondary transfer)
DNA Transfer on Denim Methodology:
Passive, pressure, friction, and control transfers
qPCR was used to amplify the cytochrome b sequence (mtDNA)
Results:
The results showed that DNA is transferred between materials and can be amplified
The pressure samples showed the best measure of variance
DNA does transfer, varies by substrate but differs in concentration between primary and secondary transfer
Often the transfer substrate did not have a blood stain (we knew where to look), whereas there is no visual of DNA transfer in real world cases
Can you amplify STRs? This is the primary marker used in human applications
Can we generate a profile from the DNA present?
Lab Studies to Date:
Incremental , multi-pronged assessment of real forensically relevant phenomenon with molecular tools
High variation, does make application challenging but lots of options
Large enough database might allow for diagnostic patterns to be identified and validated
A meta-analysis has been done on ex-vivo whole blood degradation
This is a statistical analysis of the data from independent primary studies and is not a literature review paper
This paper calls for:
Considering interdisciplinary research that integrates multiple analytical techniques
Use of the most readily available mammalian blood source for proof of concept TSD studies
Validation with human blood prior to crime scene implementation
Focus on research on non-porous substrates due to their larger effect size (appears most promising)
Report exact environmental conditions for room or ambient temperature, and humidity
Application of multiple analytical techniques where possible
The Role of Cats in Human DNA Transfer:
Background samples from the cat’s head, back, right, and left
Transfer from a cat to a gloved hand after petting of the neck
Transfer from an ungloved hand to a cat
20 cats
Quantifiler trio (specific to human DNA) to not amplify other sample type, gives female/male ratios as well
PowerPlex 21 (STR assay kit)
Assessed the background amount of DNA on the cats at the four locations
Detectable DNA was collected from:
95% of the back samples (19/20)
90% of the head samples (18/20)
80% of the right samples (16/20)
10% of the left samples (2/10)
The researcher patted and scratched a cat’s back of neck 8 times while wearing new gloves
The same, foreign to the household, researcher patted and scratched the cat’s chest and neck area 8 times using their bare dominant hand
Integrating Microbial Profiling and Machine Learning for Drowning Site Inference:
Drowning locations are typically difficult to pinpoint
Can we leverage our understanding of microbial communities to assist with this process?
Used mice and rabbit surrogates
Alpha diversity indices of eight sampling locations
Highlights variations in species richness and diversity across different sampling points
Results suggest that when inferring drowning sites across different animal species, relatively good accuracy can still be achieved, even when there are significant differences between water samples from the target location and those from other locations
Changes in Oral Health During Aging in Primate Model:
Marmoset system, comparing 3 stages of life (young, middle, old)
Some work in humans, largely gut microbiome
Looked at morphological changes in oral health and in the oral microbiome
Trying to establish marmoset model recapitulating the changes in oral health associated with human aging
Results:
Despite some differences in abundances, no significant findings associated with the microbiome and ageing
The changes in oral manifestations (tooth loss) and alpha diversity of the oral microbiome that failed to show any statistically significant differences between age groups
Could be due to the limited statistical power of our sample size (6)
Detecting Human DNA in Aquatic Environments – Potential of eDNA in Forensics:
Human eDNA is largely untouched in the area of investigation
Looked at the rate of DNA decay and profile recovery of temporal samples
Results demonstrated that human eDNA remains detectable for up to 36 hours in freshwater samples and up to 84 hours in saltwater samples
nuDNA profiles are less stable and reliable
Degradation index = ratio of small nuDNA fragments to large nuDNA fragments
Ratios below 1 have little to no degradation and those greater than 10 have severe degradation
After 24 hours in both conditions, no viable DNA remained for analysis
Comparison of NGS (SNPs) and Capillary Electrophoresis (STRs) in the Genetic Analysis of Human Remains:
Two laboratories
3 different methods; 2 CE-STR and 1 NGS-SNP approach
Powerplex ESX 17 Fast System (17 autosomal STRs)
Powerplex Y23 system (23 Y-STRs)
ForenSeq Kintelligence Kit (NGS; 10,230 SNP markers curated for forensic kinship)
Applied to 40-83 year old remains of war victims
Only obtained sufficient DNA from 16 samples to proceed with all workflows
The data support that high throughput SNPs could be the method of choice for missing persons cases
The current STR systems would not yield viable data or would not be able to be associated with family members
SNP data is best for smaller and lower quality strand samples
This is best for degraded samples and missing persons cases
Fourth Generation Sequencing:
In 2nd and 3rd generation sequencing, we do not know where DNA/RNA is coming from within samples
With 4th generation sequencing, we can track where DNA, RNA, and single cell samples are coming from