MCB104 - Lecture 14 - Genome Annotation and Assembly & Genotypic Variation

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/82

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

83 Terms

1
New cards

What else needed on top of having a full genome sequence?

  • Full genome sequence - ONLY STEP ONE

  • We need to annotate the sequence to indicate what sequences promote what tasks

  • Without annotating, genome sequence is just a series of letters

2
New cards

What are Omics?

  • Collective characterization of complete sets of specific type of biological information, typically in high throughput manner

    • Genome: All DNA Sequences

    • Transcriptome: All transcribed RNAs

    • Exome: All Exons

    • Proteome: All Proteins

    • Metabolome: All metabolites

    • Connectome: All neural pathways

  • All these methods involve massive amounts of data, requiring various tools to filter out meaningful info

3
New cards

What is Euchromatin?

NOT Heterochromatin

4
New cards

What are Genomes?

  • Genome: All DNA Sequences

5
New cards

What are Transcriptome?

All transcribed RNAs

6
New cards

What are Exome?

All Exons

7
New cards

What are Proteome?

All Proteins

8
New cards

What are Metabolome?

All metabolites

9
New cards

What are Connectome?

All neural pathways

10
New cards

How can you identify genes through ORFs?

  • An open reading frame is a reading-frame uninterrupted by stop codons

  • DNA can be read in 6 reading frames (3 from each strand due to codon length)

  • Coding sequences tend to have longer ORFs

<ul><li><p>An open reading frame is a reading-frame uninterrupted by stop codons</p></li><li><p>DNA can be read in 6 reading frames (3 from each strand due to codon length)</p></li><li><p>Coding sequences tend to have longer ORFs</p></li></ul><p></p>
11
New cards

How can you identify genes through Phylogenetics?

  • Comparing genomes can help track natural selection

  • Phylogenetic trees depicting relatedness can be made by comparing DNA sequences among organisms

    • Branch points represent a series of nested common ancestors

    • Number at each branch point is millions of years before the present

<ul><li><p>Comparing genomes can help track natural selection</p></li><li><p>Phylogenetic trees depicting relatedness can be made by comparing DNA sequences among organisms</p><ul><li><p>Branch points represent a series of nested common ancestors</p></li><li><p>Number at each branch point is millions of years before the present </p></li></ul></li></ul><p></p>
12
New cards

What is Genome Conservation?

  • When comparing complete genome sequences, there are far less conservation than when only look at coding DNA sequences (CDS)

13
New cards

What is Visualizing Homology?

  • A homology map for a 100kb region of the human genome compared to 3 other species shows a range of overlap

    • Zebrafish overlap is largely contained to exons only, increasing overlap proportional to relatedness of species

<ul><li><p>A homology map for a 100kb region of the human genome compared to 3 other species shows a range of overlap</p><ul><li><p>Zebrafish overlap is largely contained to exons only, increasing overlap proportional to relatedness of species</p></li></ul></li></ul><p></p>
14
New cards

What are Transcriptomics?

  • Sequencing RNA transcripts provides info about what sequences are actually transcribed

  • Directly sequencing RNA is difficult

    • Fragile/Unstable

    • Any given RNA sequence is pretty uncommon

    • Tech required is elaborate/expensive

  • Solution? CONVERT TO DNA

    • Use retroviral reverse transcriptase to copy RNA into more stable complementary DNA (cDNA)

15
New cards

How can cDNA Library be created?

  • Isolate RNA from sample

  • Add primers

    • Use oligo-dT primers to bind to poly-A tails

  • Add Reverse Transcriptase and dNTPs

  • DNA Strand is synthesized

  • Denature and use RNase to degrade mRNA

  • Add DNA polymerase to synthesize second strand

    • Can self prime: 3’ end forms a hairpin loop

    • If used, need S1 nuclease to cleave loop after

  • Now have dsDNA sequences of mRNAs

<ul><li><p>Isolate RNA from sample</p></li><li><p>Add primers</p><ul><li><p>Use oligo-dT primers to bind to poly-A tails</p></li></ul></li><li><p>Add Reverse Transcriptase and dNTPs</p></li><li><p>DNA Strand is synthesized</p></li><li><p>Denature and use RNase to degrade mRNA</p></li><li><p>Add DNA polymerase to synthesize second strand</p><ul><li><p>Can self prime: 3’ end forms a hairpin loop</p></li><li><p>If used, need S1 nuclease to cleave loop after</p></li></ul></li><li><p>Now have dsDNA sequences of mRNAs </p></li></ul><p></p>
16
New cards

How can cDNA library be preserved?

  • To preserve - clone it into vectors

  • Alternatively can be used as is for:

    • Next Gen Sequencing

    • Targeted PCR amplification

    • Many others

  • cDNA library includes only some exons

    • Only those transcribed in sample cells

  • Genomic library represents all regions of DNA equally, including introns

<ul><li><p>To preserve - clone it into vectors</p></li><li><p>Alternatively can be used as is for:</p><ul><li><p>Next Gen Sequencing</p></li><li><p>Targeted PCR amplification</p></li><li><p>Many others</p></li></ul></li><li><p>cDNA library includes only some exons</p><ul><li><p>Only those transcribed in sample cells</p></li></ul></li><li><p>Genomic library represents all regions of DNA equally, including introns </p></li></ul><p></p>
17
New cards

How does alternative splicing relate to cDNAs?

  • Alternative splicing complicates the prediction of proteome

  • Comparing cDNAs from different tissues can show where splicing occurs

    • Both “where in primary transcript” and “in which tissues”

<ul><li><p>Alternative splicing complicates the prediction of proteome</p></li><li><p>Comparing cDNAs from different tissues can show where splicing occurs</p><ul><li><p>Both “where in primary transcript” and “in which tissues”</p></li></ul></li></ul><p></p>
18
New cards

What is the Genome Architecture?

  • The human genome has about 28,000 genes

  • Most DNA sequences are introns and other noncoding DNA:

    • Exome = 1.5% to 2%

    • Remainder is: introns, centromeres, telomeres, transposable elements, etc

    • Variation in genome size mostly due to changes in noncoding DNA rather than gene number or size

19
New cards

What are Repetitive Sequences?

  • Most of DNA outside genes is repetitive

    • Particular DNA sequences found many times in genome

    • 2 types: multicopy tandem repeats and transposable elements

    • Repetitive DNA with no known function referred to ask junk DNA

20
New cards

Are Centromere/Telomere junk DNA?

  • NO

  • Centromeres anchor kinetochores

  • Telomeres protect the ends of DNA molecule and protect against replicative degeneration

21
New cards

What are gene-rich regions?

  • Chromosomal regions that have many more genes than expected from average gene density over entire genome

  • Example in human genome - class III region of major histocompatibility complex

22
New cards

What are gene deserts?

  • Regions that have no identifiable genes

  • Largest is 5.1 Mb on chromosome 5 with no identified genes

23
New cards

Describe gene-rich/gene deserts establishment with arrangement of genes

  • Biological significance of gene-rich regions and gene deserts isn’t well established

    • Both connected someway to gene regulation

24
New cards

What is MHC Region?

Class III regions of the human major histocompatibility (MHC) complex

  • Contains 60 genes within a 700 kb region

  • Most gene-rich region of human genome

25
New cards

What is Genomic Evolution

  • Exons often encode protein domains: sequence of amino acids that fold into functional units

  • Shuffling, addition, or deletion of exons during evolution can create new domain architectures

    • Domain architecture: number, kind and order of protein domains

26
New cards

What is Domain Analysis?

  • Function of a new protein can be deduced if it contains a domain known to play a role in other proteins

  • Shown below: Homeobox Domain (found in Hox genes)

<ul><li><p>Function of a new protein can be deduced if it contains a domain known to play a role in other proteins</p></li><li><p>Shown below: Homeobox Domain (found in Hox genes)</p></li></ul><p></p>
27
New cards

Can exon shuffling create new genes?

  • YES

  • After exon shuffling, protein products have novel domain architectures

  • Moving entire exons is more forgiving than moving parts of exons

    • Exons that have a specific function are more likely to stick when moved

<ul><li><p>YES</p></li><li><p>After exon shuffling, protein products have novel domain architectures</p></li><li><p>Moving entire exons is more forgiving than moving parts of exons</p><ul><li><p>Exons that have a specific function are more likely to stick when moved</p></li></ul></li></ul><p></p>
28
New cards

What are Gene Families?

  • Groups of genes closely related in sequence and function

  • Evolve via Duplication and Divergence

    • Duplicated DNA sequence products start out identical

    • Eventually diverge via accumulation of mutations

  • EX; Globin, Hox, Small Monomeric GTP (Rho, Ras, etc)

<ul><li><p>Groups of genes closely related in sequence and function</p></li><li><p>Evolve via Duplication and Divergence</p><ul><li><p>Duplicated DNA sequence products start out identical</p></li><li><p>Eventually diverge via accumulation of mutations</p></li></ul></li><li><p>EX; Globin, Hox, Small Monomeric GTP (Rho, Ras, etc) </p></li></ul><p></p>
29
New cards

What is Homologous Genes?

Any evolutionary related sequences

30
New cards

What is Orthologous Genes?

Arose form same gene in common ancestor, usually retain same function

<p>Arose form same gene in common ancestor, usually retain same function</p>
31
New cards

What is Paralogous Genes?

Arise by duplication, often refers to members of gene family

<p>Arise by duplication, often refers to members of gene family</p>
32
New cards

What are Psuedogenes?

  • Sequences that look like, but don’t function as genes

  • Rapidly accumulate mutations

  • Common features:

    • Missing promotor/start codon

    • Early frameshift/nonsense mutations

<ul><li><p>Sequences that look like, but don’t function as genes</p></li><li><p>Rapidly accumulate mutations</p></li><li><p>Common features:</p><ul><li><p>Missing promotor/start codon</p></li><li><p>Early frameshift/nonsense mutations</p></li></ul></li></ul><p></p>
33
New cards

What are de novo genes?

  • Genes without homologs

  • Young genes that evolved recently from ancestral intergenic sequences

34
New cards

What are synthetic blocks?

Homologous blocks of chromosomal sequence

35
New cards

What are chromosomal rearrangements:

The cutting and reassembling of chromosomal blocks accompanying evolutionary divergence

36
New cards

How does Chromosomal Rearrangements relate to Mouse/Human genomes

  • Mouse and human genomes diverged 85 mya, but can be compared via chromosomes to visualize similarities

37
New cards

Is the number of genes in the genome the number of proteins the genome can generate?

NO, IT IS NOT

38
New cards

What does combinatorics and complexity relate?

  • Combinatorial amplification allow a comparatively small number of genes to produce a large variety of proteins

    • Remember product rule

  • Can occur at different levels

    • DNA level: The DNA itself is rearranged into different combinations

    • RNA level: Alternative Splicing

    • Protein level: Multimeric proteins and post translational modifications

39
New cards

What is VDJ Recombination?

  • Best studied DNA-level combinatorial amplification

  • T-cell receptors have Variable (V), Diversity (D), Joining (J) and Constant (C) segments

  • DNA rearrangement in T cell precursors combines V, D, and J segments into an exon

    • Done by deleting intervening sections

  • Result is about 1000 different combinations

  • Only occurs once per T-cell precursor (all other are gone, DNA itself is edited, happens only ONCE)

40
New cards

What is an example of Combinatorial Amplification: Alternative Splicing

  • EX: Neurexin Genes

    • 2 alternative promoters; 5 sites for alternative splicing

    • 3 different neurexin genes

    • Can generate >2000 different mRNAs

41
New cards

What is Bioinformatics?

  • The science of using computational methods to decipher biological meaning of information contained in organismal systems

42
New cards

What is GenBank (Bioinformatics)

  • Database established by NIH in 1982

  • Online repository of sequence data

43
New cards

What is RefSeq (Bioinformatics)

  • Single, complete, annotated version of species’ genome

  • Agreed upon standard for comparison

  • Maintained by NCBI

44
New cards

What is Basic Local Alignment Search Tool (BLAST)

  • Aligns query sequences with sequences in a database and finds areas of homology

  • Has variants for nucleotide and amino acid sequence searches

    • Amino acid search often includes information on how similar 2 amino acids are to improve alignment

      • Similar to conservative vs. nonconservative subtitutions

      • i.e: Leucine vs. Isoeucine Aspartic Acid vs. glutamic acid

45
New cards

What is Hemoglobin?

  • Hemoglobin carries oxygen in blood

  • Adult hemoglobin consists of 4 polypeptide chains

    • 2 alpha (a) globins

    • 2 beta (b) globins

  • Each polypeptide chains surrounds a heme group that binds and releases oxygen

<ul><li><p>Hemoglobin carries oxygen in blood</p></li><li><p>Adult hemoglobin consists of 4 polypeptide chains</p><ul><li><p>2 alpha (a) globins</p></li><li><p>2 beta (b) globins</p></li></ul></li><li><p>Each polypeptide chains surrounds a heme group that binds and releases oxygen</p></li></ul><p></p>
46
New cards

How is hemoglobin during development?

  • Embryonic (Z and e) and fetal (y) hemoglobins bind more tightly to oxygen to facilitate transfer of oxygen from mother to embryo or fetus

  • Adult hemoglobin binds oxygen less tightly to allow delivery of oxygen to organs

<ul><li><p>Embryonic (Z and e) and fetal (y) hemoglobins bind more tightly to oxygen to facilitate transfer of oxygen from mother to embryo or fetus</p></li><li><p>Adult hemoglobin binds oxygen less tightly to allow delivery of oxygen to organs</p></li></ul><p></p>
47
New cards

What is the Hemoglobin Cluster?

  • Globin Genes are in 2 clusters: (a and B)

    • Face same direction and are in order they are expressed during development

  • Clusters are controlled by locus control regions (LCR)

    • LCR are long range cis-regulatory elements with many enhancer sites

  • One model for function is that LCR forms loops to specific sites and as development progresses, the available sites change

<ul><li><p>Globin Genes are in 2 clusters: (a and B)</p><ul><li><p>Face same direction and are in order they are expressed during development</p></li></ul></li><li><p>Clusters are controlled by locus control regions (LCR)</p><ul><li><p>LCR are long range cis-regulatory elements with many enhancer sites</p></li></ul></li><li><p>One model for function is that LCR forms loops to specific sites and as development progresses, the available sites change </p></li></ul><p></p>
48
New cards

What is the hereditary persistance of fetal hemoglobin?

  • Rare condition of continued fetal globin expression

    • Deletion of omega and beta genes (should be lethal)

    • y genes continue to be expressed

    • Results in near normal level of health

  • LCR cannot switch to adult conformation as looping sites were deleted

<ul><li><p>Rare condition of continued fetal globin expression</p><ul><li><p>Deletion of omega and beta genes (should be lethal)</p></li><li><p>y genes continue to be expressed</p></li><li><p>Results in near normal level of health</p></li></ul></li><li><p>LCR cannot switch to adult conformation as looping sites were deleted</p></li></ul><p></p>
49
New cards

What are globin related genetic disorders?

  • Hemolytic anemias

    • Changes amino acid sequence of alpha- or beta- globin chain

    • Causes destruction of red blood cells

    • Ex: Sickle Cell Anemia

  • Thalassemias

    • Mutations reduce or eliminate production of ½ globin polypeptides

    • Range of phenotypes

50
New cards

How are Thalassemias associated with alpha-globin deletions

  • Severity of thalassemia correlates with copy number

  • Requires at least 3 copies of alpha-globin to have normal blood

    • alpha 1 and 2 (a1 and a2) are relatively interchangeable

<ul><li><p>Severity of thalassemia correlates with copy number</p></li><li><p>Requires at least 3 copies of alpha-globin to have normal blood</p><ul><li><p>alpha 1 and 2 (a1 and a2) are relatively interchangeable</p></li></ul></li></ul><p></p>
51
New cards

What is B-thalassemia?

  • Can occur via deletions of various B-globins however

  • Severe B-thalassemia occurs when LCR is deleted

    • Silences all B-globin genes even if all other regulatory sequences are preserved

<ul><li><p>Can occur via deletions of various B-globins however</p></li><li><p>Severe B-thalassemia occurs when LCR is deleted</p><ul><li><p>Silences all B-globin genes even if all other regulatory sequences are preserved</p></li></ul></li></ul><p></p>
52
New cards

Is there a “THE” human genome?

No,

  • The genome sequences of only 3 people reveal over 5.6 million DNA polymorphisms - sequence differences

53
New cards

Do most polymorphisms have a phenotype?

  • Codons make up less than 2% of human genome

  • Many mutations in codons don’t change amino acid

  • Many deleterious mutations disappear from population through natural selection

54
New cards

What are categories of variation?

  • Single nucleotide polymorphisms (SNPs)

    • • One base pair changes, the most common genetic variant

  • Deletion-insertion polymorphisms (DIPs or InDels)

    • Short insertions or deletions of a single or a few base pairs

    • In protein-encoding regions, DIP variants are frameshift mutations, unless a multiple of 3

  • Simple sequence repeats (SSRs or microsatellites)

    • 1 to 10 base sequence repeated typically ~5-50 times in tandem, can rarely be >100

    • Most common repeating units are one-, two-, or three-base sequences

  • Copy number variants (CNVs)

    • Large blocks of genetic material up to 1 Mb in length that are variable in copy number in
      the genome

    • Most important mechanism producing CNVs is unequal crossing-over in meiosis I

55
New cards

What are single nucleotides polymorphisms (SNPs)

  • Single nucleotide polymorphisms (SNPs)

    • • One base pair changes, the most common genetic variant

56
New cards

What are Deletion-insertion polymorphisms (DIPs or InDels)?

  • Short insertions or deletions of a single or a few base pairs

  • In protein-encoding regions, DIP variants are frameshift mutations, unless a multiple of 3

57
New cards

What are Simple sequence repeats (SSRs or microsatellites)?

  • 1 to 10 base sequence repeated typically ~5-50 times in tandem, can rarely be >100

  • Most common repeating units are one-, two-, or three-base sequences

58
New cards

What are Copy number variants (CNVs)?

  • Large blocks of genetic material up to 1 Mb in length that are variable in copy number in
    the genome

  • Most important mechanism producing CNVs is unequal crossing-over in meiosis I

59
New cards

What are the frequencies of Variations?

60
New cards

What is the Origin of Variations?

  • To determine when a variation occured, must compare genomes

  • Comparison of a human and chimpanzee genomes reveals the changes that has occured since divergence of these species

  • The single base change at loci 2 is polymorphic in humans

    • C is ancestral, present in ancestral organisms

    • T is derived, changed nucleotide

<ul><li><p>To determine when a variation occured, must compare genomes</p></li><li><p>Comparison of a human and chimpanzee genomes reveals the changes that has occured since divergence of these species</p></li><li><p>The single base change at loci 2 is polymorphic in humans</p><ul><li><p>C is ancestral, present in ancestral organisms</p></li><li><p>T is derived, changed nucleotide</p></li></ul></li></ul><p></p>
61
New cards

What are CNVs?

  • CNVs are tandem sequence repeats more than 10bp long

  • Misalignment during meiosis leads to unequal crossing over

  • Not common event, so most CNVs are inherited rather than being a new mutation

  • Example: Humans are fewer than 1000 olfactory receptor genes are different loci

    • At each locus, copy number varies

<ul><li><p>CNVs are tandem sequence repeats more than 10bp long</p></li><li><p>Misalignment during meiosis leads to unequal crossing over</p></li><li><p>Not common event, so most CNVs are inherited rather than being a new mutation</p></li><li><p>Example: Humans are fewer than 1000 olfactory receptor genes are different loci</p><ul><li><p>At each locus, copy number varies </p></li></ul></li></ul><p></p>
62
New cards

What is an overview of PCR?

  • Polymerase chain reaction (PCR)

    • Method of making many copies of target region of DNA

    • First developed in 1985

    • Fast and extremely efficient: can amplify DNA from single cell

  • Hinges on thermostable DNA polymerase

    • Originally Taq

  • Exponential increase in targeted DNA

<ul><li><p>Polymerase chain reaction (PCR)</p><ul><li><p>Method of making many copies of target region of DNA</p></li><li><p>First developed in 1985</p></li><li><p>Fast and extremely efficient: can amplify DNA from single cell</p></li></ul></li><li><p>Hinges on thermostable DNA polymerase</p><ul><li><p>Originally Taq</p></li></ul></li><li><p>Exponential increase in targeted DNA</p></li></ul><p></p>
63
New cards

What are the steps of PCR?

CYCLE 1

1) Denature strands

2) Base-pairing of primers

3) Polymerization of primers along templates

CYCLE 2

1) Denature strands

2) Base-pairing of primers

3) Polymerization of primers along templates

<p>CYCLE 1</p><p>1) Denature strands</p><p>2) Base-pairing of primers</p><p>3) Polymerization of primers along templates</p><p></p><p>CYCLE 2</p><p>1) Denature strands</p><p>2) Base-pairing of primers</p><p>3) Polymerization of primers along templates</p>
64
New cards

How does Genotyping via Sequencing work?

  • PCR amplify a targeted sequence

  • Use same primers or nested one to sequence allele

    • Usually Sanger Sequencing variant

  • EX: Sickle Cell anemia is caused by SNP in Hbbeta gene (Hbb)

    • Genotyping can idetify carriers and homozygous individuals

<ul><li><p>PCR amplify a targeted sequence</p></li><li><p>Use same primers or nested one to sequence allele</p><ul><li><p>Usually Sanger Sequencing variant</p></li></ul></li><li><p>EX: Sickle Cell anemia is caused by SNP in Hbbeta gene (Hbb)</p><ul><li><p>Genotyping can idetify carriers and homozygous individuals</p></li></ul></li></ul><p></p>
65
New cards

How does Genotyping by PCR product size work?

  • Amplify target sequence

  • Run PCR product on gel

  • If alleles have different length then PCR products with those alleles will run at different speeds

  • Can distinguish between homo and heterozygotes

  • VERY EASY TO DO YAYYYYY

66
New cards

What is PCR Product Size Variations?

  • Size variations can be detected by gel electrophoresis

  • Ex: Huntington disease locus

    • Normal allele has <36 C A G repeats

    • Disease-causing alleles have 36 or more CAG repeats; alleles with 42 or more repeats are completely penetrant

<ul><li><p>Size variations can be detected by gel electrophoresis</p></li><li><p>Ex: Huntington disease locus</p><ul><li><p>Normal allele has &lt;36 C A G repeats</p></li><li><p>Disease-causing alleles have 36 or more CAG repeats; alleles with 42 or more repeats are completely penetrant</p></li></ul></li></ul><p></p>
67
New cards

What are Restriction Fragment Length Polymorphisms (RFLPs)

  • Amplify target region with PCR

  • Digest with restriction enzyme

  • If variant disrupts/creates a restriction site, can distinguish between them

<ul><li><p>Amplify target region with PCR</p></li><li><p>Digest with restriction enzyme</p></li><li><p>If variant disrupts/creates a restriction site, can distinguish between them</p></li></ul><p></p>
68
New cards

How can Fetal and Embryonic cells be genotyped using PCR

  • Prenatal genetic diagnosis

    • Genotyping fetal cells isolated by aminocentesis: fetal cells in amniotic fluid are extracted using a needle

  • Preimplanatation embryo diagnosis

    • Utilizes in vitro fertilization and PCR

    • Genotype embryos before placing in womb

<ul><li><p>Prenatal genetic diagnosis</p><ul><li><p>Genotyping fetal cells isolated by aminocentesis: fetal cells in amniotic fluid are extracted using a needle</p></li></ul></li><li><p>Preimplanatation embryo diagnosis</p><ul><li><p>Utilizes in vitro fertilization and PCR</p></li><li><p>Genotype embryos before placing in womb</p></li></ul></li></ul><p></p>
69
New cards

What are Hybridization Probes?

  • Hybridization of short (<40 bases0 oligonucleotides to sample (target) DNAs (allele-specific hybridization)

    • If no mismatch between probe and target, hybrid will be stable at high temperature

    • If mismatch between probe and target, hybrid will NOT be stable at high temperature

<ul><li><p>Hybridization of short (&lt;40 bases0 oligonucleotides to sample (target) DNAs (allele-specific hybridization)</p><ul><li><p>If no mismatch between probe and target, hybrid will be stable at high temperature</p></li><li><p>If mismatch between probe and target, hybrid will NOT be stable at high temperature</p></li></ul></li></ul><p></p>
70
New cards

What are Microarrays?

  • Allele-specific oligonucleotides ( A S Os) are attached to a solid support (like silicon chip)

    • 2 oligonucleotides are shown here, but many can be put on one array

<ul><li><p>Allele-specific oligonucleotides ( A S Os) are attached to a solid support (like silicon chip)</p><ul><li><p>2 oligonucleotides are shown here, but many can be put on one array</p></li></ul></li></ul><p></p>
71
New cards

How does microarray for genomic DNA work?

  • Preparation of genomic DNA for microarray

    • Fragmented

    • Adapter attached

    • Amplified by PCR denatured to make single stranded

    • Fluorescent dye coupled to end of single stranded DNA

<ul><li><p>Preparation of genomic DNA for microarray</p><ul><li><p>Fragmented</p></li><li><p>Adapter attached</p></li><li><p>Amplified by PCR denatured to make single stranded</p></li><li><p>Fluorescent dye coupled to end of single stranded DNA </p></li></ul></li></ul><p></p>
72
New cards

How can a microarray be read?

  • Fluorescent output is proportional to number of copies of each allele

    • Can sometimes distinguish between hets and homozygous

  • Up to 4 million loci can be genotyped simultaneously for approximately $100

<ul><li><p>Fluorescent output is proportional to number of copies of each allele</p><ul><li><p>Can sometimes distinguish between hets and homozygous</p></li></ul></li><li><p>Up to 4 million loci can be genotyped simultaneously for approximately $100</p></li></ul><p></p>
73
New cards

How does Positional Cloning work?

  • Positional Cloning

    • Object is to identify disease causing genes by genetic linkage to polymorphic loci

  • Strategy

    • Same as linkage analysis using 2 phenotypes, except one gene tracked by phenotype, the other by DNA genotype

    • Use microarrays to simultaneously analyze millions of 2 point crosses, each one a test for linkage between a disease locus and DNA marker

74
New cards

What are steps of positional cloning?

  • Region of interest narrowed by finding closely linked DNA markers

  • Candidate genes are located in region of interest

  • Sequence and expression of candidate genes are determined in normal and diseased individuals

<ul><li><p>Region of interest narrowed by finding closely linked DNA markers</p></li><li><p>Candidate genes are located in region of interest</p></li><li><p>Sequence and expression of candidate genes are determined in normal and diseased individuals</p></li></ul><p></p>
75
New cards

What is an example of using positional cloning?

  • NEUROFIBROMATOSIS

    • Autosomal, dominantly inherited

  • Causes proliferation of nerve tissue forming tumerous bumps

  • Positional cloning example determines whether or not SNP is linked to neurofibromatosis gene

  • Children in GEN III are in effect the result of testcross

    • RF = 0.125 (N too small to be confident though)

<ul><li><p>NEUROFIBROMATOSIS</p><ul><li><p>Autosomal, dominantly inherited</p></li></ul></li><li><p>Causes proliferation of nerve tissue forming tumerous bumps</p></li><li><p>Positional cloning example determines whether or not SNP is linked to neurofibromatosis gene</p></li><li><p>Children in GEN III are in effect the result of testcross</p><ul><li><p>RF = 0.125 (N too small to be confident though) </p></li></ul></li></ul><p></p>
76
New cards

What are some limitations of positional cloning?

  • Configuration of alleles are not always known

  • Not all mating are informative

  • Difficult to obtain pedigree data in humans

  • Identified SNPs aren’t necessarily causative

<ul><li><p>Configuration of alleles are not always known</p></li><li><p>Not all mating are informative</p></li><li><p>Difficult to obtain pedigree data in humans</p></li><li><p>Identified SNPs aren’t necessarily causative</p></li></ul><p></p>
77
New cards

How are large pedigrees used for positional cloning?

  • Mapping of Huntington disease

    • Detection of linkage between DNA marker G8 and HD locus

    • Segregation of G8 DNA marker (4 alleles - ABCD) in a large Venezuelan pedigree affected with HD

<ul><li><p>Mapping of Huntington disease</p><ul><li><p>Detection of linkage between DNA marker G8 and HD locus</p></li><li><p>Segregation of G8 DNA marker (4 alleles - ABCD) in a large Venezuelan pedigree affected with HD</p></li></ul></li></ul><p></p>
78
New cards

What is Lod Score?

  • Lod Score (Log of the odds) is used to determine if data is sufficient to conclude with confidence that a disease gene and a marker are linked

79
New cards

What is Lod Score compared to RF? Relationship?

  • Relationship between Log Score and assumed RF

    • Lod of 3 means log1000 so 1000x more likely to be linked than not

  • Can add Lod scores from different pedigrees together to increase confidence

    • RED: Lod score from Neurofibromatosis pedigree

    • BLUE: sum of Lod scores from 3 such pedigrees

<ul><li><p>Relationship between Log Score and assumed RF</p><ul><li><p>Lod of 3 means log1000 so 1000x more likely to be linked than not</p></li></ul></li><li><p>Can add Lod scores from different pedigrees together to increase confidence</p><ul><li><p>RED: Lod score from Neurofibromatosis pedigree</p></li><li><p>BLUE: sum of Lod scores from 3 such pedigrees</p></li></ul></li></ul><p></p>
80
New cards

Describe Locus heterogeneity:

  • Allelic heterogeneity: disease caused by different mutations in same gene

    • Compound heterozygotes (trans-heterozygotes) - individuals with different mutant alleles of the same gene

    • Individuals with certain alleles may respond to drug treatment, while others do not

  • Locus heterogeneity: disease caused by mutation in one or two more different genes

<ul><li><p>Allelic heterogeneity: disease caused by different mutations in same gene</p><ul><li><p>Compound heterozygotes (trans-heterozygotes) - individuals with different mutant alleles of the same gene</p></li><li><p>Individuals with certain alleles may respond to drug treatment, while others do not</p></li></ul></li><li><p>Locus heterogeneity: disease caused by mutation in one or two more different genes</p></li></ul><p></p>
81
New cards

Describe an example with completely sequenced genomes

  • MILLER SYNDROME - First sample with completely sequenced genome

  • Pedigree:

    • Identical regions: brother and sister share same alleles

    • Nonidentical: siblings share NO alleles

    • Haploidentical maternal: same allele from mother, different from father

    • Haploidentical paternal: same allele from father, different from mother

  • Geneticists studying disease in affected children could focus on identical regions

  • Identified compound heterozygote in DHOD

<ul><li><p>MILLER SYNDROME - First sample with completely sequenced genome</p></li><li><p>Pedigree:</p><ul><li><p>Identical regions: brother and sister share same alleles</p></li><li><p>Nonidentical: siblings share NO alleles</p></li><li><p>Haploidentical maternal: same allele from mother, different from father</p></li><li><p>Haploidentical paternal: same allele from father, different from mother</p></li></ul></li><li><p>Geneticists studying disease in affected children could focus on identical regions</p></li><li><p>Identified compound heterozygote in DHOD</p></li></ul><p></p>
82
New cards

How can Gene location be narrowed down?

  • Identifying causative alleles via filtering

83
New cards

What are evolutionary conserved amino acids?

  • Nic’s XIAP gene had a missense mutation that changed a single amino acid that was completely conserved among humans, frogs, flies, and other species

  • Assumption that nonconversative variations in conserved regions are more likely to be causative

<ul><li><p>Nic’s XIAP gene had a missense mutation that changed a single amino acid that was completely conserved among humans, frogs, flies, and other species</p></li><li><p>Assumption that nonconversative variations in conserved regions are more likely to be causative</p></li></ul><p></p>