Genomes and Their Evolution

Chapter 21: Genomes and Their Evolution

Objectives

  • 21.1 Describe how the Human Genome Project contributed to DNA sequencing technology.

  • 21.2 Explain how scientists use bioinformatics to analyze genomes and their functions.

  • 21.3 Compare and contrast genome size, number of genes, and gene density across diverse genomes.

  • 21.4 Describe the composition of the genome of a multicellular eukaryote, such as humans.

  • 21.5 Identify the changes to DNA that contribute to the evolution of the genome.

  • 21.6 Explain how comparing genome sequences and developmental processes helps in understanding evolution.

Introduction to Genomes

  • Genome: An entire set of DNA instructions found in a cell, which is required for the development and functions of organisms.

Human Genome Project

  • Officially begun in 1990, with the sequencing of the human genome published in 2006.

  • Goal of the Human Genome Project is to determine the complete nucleotide sequence of each chromosome.

  • J. Craig Venter utilized a whole-genome shotgun approach to sequence the entire genome, elucidating unique human genes compared to other organisms (e.g., dolphins, seaweed).

  • Whole-genome shotgun (WGS) approach:

    • Involves cloning and sequencing fragments of randomly cut DNA, which are then assembled into a continuous sequence.

Advancements in Sequencing Techniques

  • The WGS approach remains relevant; however, newer “next-generation” sequencing techniques are emerging.

  • Features of next-generation techniques:

    • Do not require cloning steps.

    • Facilitate a metagenomics approach (sequencing DNA from groups of species in environmental samples).

    • Capable of generating large amounts of data rapidly.

Genomics

  • Genomics: The study of whole sets of genes and their interactions.

  • Complete genome sequences exist for various organisms, including humans, chimpanzees, E. coli, and more.

  • Comparison of genomes provides insights into evolution and biological processes.

Bioinformatics

  • Bioinformatics: A field combining biology and informatics (computer science) for storing, analyzing, and distributing biological data, primarily DNA and protein sequences.

  • Key bioinformatics resources include:

    • National Center for Biotechnology Information (NCBI)

    • European Molecular Biology Laboratory

    • DNA Data Bank of Japan

    • BGI in Shenzhen, China.

  • GenBank: The NCBI database for sequences, which is continuously updated.

  • NCBI BLAST:

    • Basic Local Alignment Search Tool that allows users to search specific DNA sequences, protein sequences, and common stretches of amino acids.

Protein Domains and Gene Annotation

  • Domains: Distinct functional and/or structural units within protein/DNA; usually responsible for particular functions.

  • Gene annotation involves automated identification of protein-coding genes, searching for translational start and stop signals, RNA splicing sites, and promoter sequences.

  • Comparison with genes across species aids in understanding gene functions.

Systems Biology

  • Proteomics: The study of large sets of proteins and their properties.

  • A proteome refers to the complete set of proteins expressed by a cell or group of cells.

  • Systems biology focuses on functional integration of genes and proteins, producing network-like functional maps of interactions.

Applications in Medicine

  • The Cancer Genome Atlas Project: Completed in 2018 with significant contributions to understanding tumor generation.

  • Utilizes DNA microarrays and RNA-seq to assess gene expression in cancer patients, enabling personalized treatments based on genetic profiles.

Variability in Genome Characteristics

  • Genomes vary in size, the number of genes, and gene density.

  • Bacteria and archaea genomes range from 1 to 6 million base pairs (Mb) with 1,500 to 7,500 genes.

  • Eukaryotic genomes are generally larger, with plants and animals exceeding 100 Mb; humans possess a genome of approximately 3,000 Mb.

  • There is no direct correlation between the number of genes and genome size in eukaryotes.

Noncoding DNA in Multicellular Eukaryotes

  • Multicellular eukaryotes, including humans, exhibit extensive noncoding DNA and multigene families.

  • Humans have relatively low gene density, with 98.5% of the genome noncoding, consisting of regulatory sequences, introns, and other nonfunctional DNA like pseudogenes and repetitive DNA.

Transposable Elements

  • Transposable elements are DNA segments that can move within the genome, constituting about 75% of human repetitive DNA.

  • Discovered through Barbara McClintock's experiments, transposable elements can impact genome evolution by altering gene functions and modulating gene expression.

Genome Evolution

  • Mutations are fundamental to genomic change and evolution. Increased genome sizes provide raw material for gene diversification.

  • Polyploidy occurs when chromosomes duplicate, allowing genes to diverge through mutation.

  • Unequal crossing during meiosis can lead to genetic deletions and duplications.

Gene Duplications and Divergence

  • Duplication of genes allows one copy to evolve new functions while the other maintains the original function.

  • Example: The lysozyme gene duplicated and evolved into α-lactalbumin, playing a crucial role in milk production in mammals.

Chromosomal Rearrangements

  • Humans exhibit 23 pairs of chromosomes while chimpanzees have 24 pairs due to ancestral chromosomal fusions post-divergence.

  • Such chromosomal changes can facilitate speciation.

Conservation of Genes

  • Genome comparisons across species reveal evolutionary relationships and functional insights regarding conserved genes, vital for understanding overall biological processes and developments.

  • Evo-devo: Evolutionary developmental biology that studies conserved developmental genes, such as Hox genes, which regulate body plans and developmental processes across many species.

FOXP2 Gene and Language Acquisition

  • The FOXP2 gene is implicated in vocalization and language acquisition; mutations within this gene lead to significant speech impairments in humans.

Homeotic Genes and Development

  • Homeotic genes possess a conserved sequence known as the homeobox, critical for developmental regulation in multicellular organisms.

  • Conservation highlights significant evolutionary links across diverse species and developmental processes.

Questions and Discussion

  • Invitation for discussion on key slides and concepts, encouraging explanation among peers as a form of reinforcing learning.