Genome Variation

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/13

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

14 Terms

1
New cards

How big is the human genome?

The human genome has roughly 3 billion bases and about 2000 genes. Only ≈1.5% codes for proteins, known as the exome. The rest include non-coding DNA, regulatory regions, and repetitive elements

2
New cards

Are we all genetically identical?

No, while around 99.7% of DNA is the same between any 2 people, around 9 million bars differ and these differences are called variants or polymorphisms. Some differences are harmless while others can cause disease eg point mutation in sickle cell anaemia

<p>No, while around 99.7% of DNA is the same between any 2 people, around 9 million bars differ and these differences are called variants or polymorphisms. Some differences are harmless while others can cause disease eg point mutation in sickle cell anaemia</p>
3
New cards

What is polymorphism?

Any point in the genome where DNA sequence varies between individuals.

Eg at position 17, some people have T and others have A

Reference allele: T

Minor allele: A

Minor allele frequency(MAF)= % of chromosomes with A(≈15%)

<p>Any point in the genome where DNA sequence varies between individuals.</p><p>Eg at position 17, some people have T and others have A</p><p>Reference allele: T</p><p>Minor allele: A</p><p>Minor allele frequency(MAF)= % of chromosomes with A(≈15%)</p>
4
New cards

What is a Single Nucleotide Variant (SNV) / Single Nucleotide Polymorphism (SNP)?

A change in a single base.

• Very common: 1 every 300 bases in the genome.

• Each person has ~3–5 million SNVs.

• Usually harmless, but some can alter protein function or gene regulation.

Origin: mistakes in DNA replication, often corrected by DNA mismatch repair.

<p><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">A change in a single base.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">• Very common: 1 every 300 bases in the genome.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">• Each person has ~3–5 million SNVs.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">• Usually harmless, but some can alter protein function or gene regulation.</span></p><p class="p1"></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Origin: mistakes in DNA replication, often corrected by DNA mismatch repair.</span></p>
5
New cards

Mutation vs. Polymorphism?

Polymorphism: MAF >1% in a population.

• Rare polymorphism: 1–5%

• Common polymorphism: >5

• Mutation: MAF <1% (rare, often recent or harmful)

👉 All variants begin as mutations. Evolutionary forces decide if they spread or remain rare.

6
New cards

What evolutionary forces affect variant frequency?

  1. Mutation – creates new alleles.

2. Gene flow – migration introduces variants into new populations.

3. Genetic drift – random changes in allele frequency (especially in small populations).

4. Selection – alleles increase or decrease depending on whether they are beneficial (positive selection) or harmful (negative selection).

7
New cards

What are microsatellites and how do they arise?

Short tandem repeats (e.g., CA repeats).

Caused by DNA polymerase slippage during replication.

Found across genome (in genes and non-coding regions).

Can change protein length if inside a coding sequence (e.g., Huntington’s disease = CAG repeat expansion).

Multiallelic: multiple repeat numbers possible at a locus.

<p><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Short tandem repeats (e.g., CA repeats).</span></p><p class="p1"></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Caused by DNA polymerase slippage during replication.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Found across genome (in genes and non-coding regions).</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Can change protein length if inside a coding sequence (e.g., Huntington’s disease = CAG repeat expansion).</span></p><p class="p1"></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Multiallelic: multiple repeat numbers possible at a locus.</span></p>
8
New cards

What is Copy Number Variation (CNV)?

Structural variant where a region of DNA is duplicated or deleted.

Usually >1 kb in size.

Caused by non-allelic homologous recombination during meiosis.

~12% of the genome is affected by CNVs.

Each person carries ~100 CNVs.

Can delete or duplicate genes → sometimes pathogenic.

<p><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Structural variant where a region of DNA is duplicated or deleted.</span></p><p class="p1"></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Usually &gt;1 kb in size.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Caused by non-allelic homologous recombination during meiosis.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">~12% of the genome is affected by CNVs.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Each person carries ~100 CNVs.</span></p><p class="p1"><span style="font-family: UICTFontTextStyleBody; font-size: 17px;">Can delete or duplicate genes → sometimes pathogenic.</span></p>
9
New cards

What are the main types of common genetic variation?

  1. SNPs – ~17 million known; ~3 million per genome.

2. Microsatellites – ~3% of genome.

3. CNVs – >2000 identified; ~100 per genome.

10
New cards

How do common variants relate to disease?

Most common variants are neutral, not causing single-gene (Mendelian) disorders.

They often influence complex traits like height, memory, diabetes risk, and even behaviours.

Used in research to find disease-causing genes via linkage studies and GWAS (Genome-Wide Association Studies).

11
New cards

How do SNVs arise through DNA mismatch repair?

During DNA replication, DNA polymerase can insert the wrong base. Normally, the mismatch repair system detects and fixes these errors:

• Recognition: proteins detect mismatched bases (e.g., A–G instead of A–T).

• Excision: enzymes cut out the incorrect base.

• Repair: DNA polymerase fills in the correct nucleotide, and ligase seals the strand

If this system fails, the wrong base remains → SNV (single nucleotide variant). Over generations, these become SNPs if frequent in populations.

12
New cards

How are microsatellites inherited?

Microsatellites = repeated short DNA sequences (e.g., CA repeats).

• Each person inherits one microsatellite allele from each parent.

• Alleles differ by number of repeats, making them multiallelic (many possible versions in a population).

• They follow Mendelian inheritance (dominant/recessive patterns depending on location in coding or non-coding regions)

Example: Huntington’s disease → expansion of CAG repeats in the HTT gene, where severity and onset depend on repeat number.

13
New cards

Book analogy for genome variation

Genome = entire book.

Chromosome = chapter. (Deleting/duplicating a chapter = CNV, large effect).

CNV = deleting/duplicating a paragraph.

Microsatellite = repeating words in a sentence (annoying but not fatal).

SNP = typo in a letter (often little effect).

14
New cards

Key glossary terms

Locus: unique position in genome (can be 1 base or large region).

Allele: a version of a sequence at a locus.

Genotype: the two alleles an individual carries at a locus (homozygous/heterozygous).

Biallelic = 2 alleles exist. Triallelic = 3. Multiallelic = >3.

Allele frequency = proportion of chromosomes in a population with that allele.