Human Genome Project Overview

Topic = The Human Genome Project and Genetic Mapping Techniques

Introduction to the Human Genome Project

  • Overview

    • Began in 1990 and completed in 2003, two years earlier than expected.

    • Allowed understanding of the human genetic blueprint.

    • Enhanced insight into genes and proteins.

    • Impact on various fields: medicine, biotechnology, agriculture.

What is the Human Genome Project?

  • The full complement of genetic material within a human cell.

  • Originally estimated to contain between 100,000 – 300,000 genes.

  • Actual count is approximately 30,000 genes.

  • Characterizes single and multi-gene disorders.

Contributors to the Project

  • Primary Creator: The US Government.

  • Additional contributions from:

    • Department of Energy

    • Genome Centres

    • Countries involved include:

    • Japan

    • United Kingdom

    • Other technologically advanced nations.

Goals of the Human Genome Project

  • Identify all 20,000-30,000 genes.

  • Determine the 3 billion bases of the human genome.

  • Store information in searchable databases.

  • Improve tools for data analysis.

  • Transfer related technologies to the private sector.

  • Address ethical, legal, and social issues.

  • Aim to sequence genomes of various organisms significant to medical research (e.g., mouse, fruit fly).

Milestones in the Human Genome Project

  • 1990: Project inception.

  • 2000: Completion of a working draft (90% coverage).

  • 2001: Publishing of the working draft.

  • 2003: Sequencing completed and the project declared finished, two years ahead of schedule.

Findings from the Human Genome Project

  • Human genome is 99.9% identical across all individuals.

  • Approximately 30,000 genes identified.

  • Functions of over half of the identified genes are still unknown.

  • Nearly half of human proteins have similarities with proteins from other organisms.

  • About 75% of the human genome is classified as non-coding.

Benefits of the Human Genome Project

  • Molecular Medicine:

    • Improves disease diagnosis and predisposition detection.

    • Enables new drug development based on molecular information.

    • Utilizes gene therapy and control systems.

    • Designs customized drugs (pharmacogenetics) tailored to individual genetic profiles.

  • Microbial Genomics:

    • Rapid detection and treatment of pathogens in clinical settings.

    • Development of new energy sources (biofuels).

    • Environmental monitoring for pollution.

    • Protection against biological warfare.

    • Safe and efficient toxic waste cleanup.

  • Risk Assessment:

    • Evaluates health risks from exposure to radiation, carcinogenic chemicals, and toxins.

Examples of Benefits from the Human Genome Project

  1. Hereditary Non-Polyposis Colon Cancer linked to a gene on chromosome 2.

  2. In March 2002, successful screening of embryos for a gene mutation related to early-onset Alzheimer’s.

Importance of Implantation in IVF

  • Failed implantation linked to the Leukemia Inhibitory Factor gene (LIF).

  • Micro-array analysis reveals 36 up-regulated genes and 27 down-regulated genes at the implantation site.

Polycystic Ovary Syndrome Findings

  • The CYP11A gene (encoding P450 side chain cleavage) identified as a major susceptibility locus for steroidogenic abnormalities.

  • MUC1: potential genetic link to infertility; women with unexplained infertility have small allele sizes linked to implantation failure.

Applications Beyond Medicine

  • Bioarchaeology and Anthropology:

    • Study evolutionary lines via germ line mutations.

    • Investigate population migrations through maternal inheritance.

    • Analyze Y chromosome mutations for male lineage tracing.

    • Correlate mutation evolution with population ages and historical events.

  • Agriculture and Livestock Breeding:

    • Cultivate disease, insect, and drought-resistant crops.

    • Breed healthier and more productive farm animals.

    • Develop bio-pesticides and edible vaccines incorporated into food.

    • Create new uses for plants in environmental cleanup, exemplified by tobacco.

DNA Identification and Forensics

  • Utilization for:

    • Identification of suspects.

    • Exoneration of the wrongly accused.

    • Paternity testing.

    • Prosecution of poachers.

    • Detection of pollutants (e.g., harmful bacteria).

    • Matching organ donors and recipients.

    • Pedigree determination for livestock.

    • Authenticating consumables like wine and caviar.

Common Uses of the Human Genome Project

  • Primarily applies to humans, but also extends to:

    • Crops

    • Animals

    • Biotechnology.

Techniques Employed in the Human Genome Project

  • Genetic Mapping:

    • Identifies locations and distances between genes on chromosomes through specific molecular markers.

  • Linkage Analysis:

    • Establishes links between genes and assists in genetic testing.

    • Genetic markers inherit together due to proximity on the same chromosome.

Pros and Cons of the Human Genome Project

  • Pros:

    1. Identified a variety of genes.

    2. Initiated genetically modified food development.

    3. Enhanced crop growth rates.

    4. Increased pest resistance in plants.

    5. Facilitated the location of cancer cells and mental illnesses.

    6. Identified genetic mutations during pregnancy.

  • Cons:

    1. Lengthy process.

    2. Overall cost of $3 billion.

    3. Requirement for specialized skills.

    4. Complexity and duration of the research.

    5. Ethical dilemmas.

Outcomes of the Human Genome Project

  • Completed in 2003 as an international collaborative effort.

  • Achievements include:

    • Recording the complete base sequence of the human genome.

    • Discovery of 30,000 loci shaping research in diagnostics and pharmacology.

    • Uncovering new proteins and understanding their functions.

    • Facilitating DNA comparisons across species to identify evolutionary histories.

    • Establishment of bioinformatics and creation of genetic databases.

Bioinformatics and its Evolution from the HGP

  • Comprehensive analysis of entire genomes at once.

  • Facilitates scanning of genetic markers enabling faster research and diagnosis.

  • Promotes early illness detection and tests for multiple genes simultaneously.

  • Aids in predicting potential future illnesses by identifying alleles associated with risk (e.g., for Alzheimer’s, Parkinson’s).

Evolutionary Relationships Highlighted by Sequencing

  • Sequencing allows comparison of genomes to determine similarities and differences among species.

  • Close genome matches indicate closer evolutionary relationships.

  • Human chromosome 2 identified as a fusion of two ancestral chromosomes.

Ethical, Legal, and Social Implications

  • Positive Impacts:

    • Accelerated disease diagnosis and identification of new diseases.

    • Targeted production of novel medicines.

  • Concerns:

    • Genetic report cards could raise privacy issues.

    • Potential impact on insurance premiums.

    • Importance of regulated research to protect individuals whose DNA is analyzed.

Discussion Questions

  1. Would you undergo testing for a rare genetic disorder in your family if it were quick and easy? Why or why not?

  2. Would you share your genome with researchers to find cures for genetic disorders?

Gene Patenting Issues

  • 2009: Patenting of BRCA1 and BRCA2 genes by Myriad, restricting competition.

  • Raises ethical, legal, and social concerns among stakeholders.

  • 2013: Patent was overturned, allowing more researchers to work in the area, expediting progress.

The 1000 Genomes Project

  • Launched in January 2008 to sequence genomes across diverse populations.

  • Aims to resource comprehensive human genetic variation.

  • Goal to identify genetic variants with at least 1% frequencies in studied populations.

Phases of the 1000 Genomes Project

  • Phase 1: 1167 samples from 13 populations (2010, 2011)

  • Phase 2: 633 samples from 7 populations (2011)

  • Phase 3: 700 samples from 9 populations (2011, 2012).

Outcomes of the 1000 Genomes Project

  • Genome-Wide Association Studies (GWAS):

    • Identifies regions in the genome linked with diseases.

    • Detects single nucleotide polymorphisms (SNPs) associated with diseases.

    • SNPs in linkage equilibrium (LD) have higher or lower presence in populations.

  • Additional genotyping allows precise localization of disease-associated regions; reduces need for extensive sequencing.

Genetic Mapping Techniques

  1. Linkage Mapping

  2. Restriction Digestion

  3. DNA Sequencing

Learning Objectives on Gene Mapping

  • Understand usage of molecular markers (SNPs, microsatellites, VNTRs) in locating genes/loci.

  • Analyze cross-data (e.g., gel patterns) for linked genes.

  • Justify assessments with data interpretation on linkage.

  • Construct genetic maps using linkage information among various loci.

Linkage Mapping

  • Establishes distance between mutations through recombination frequencies.

  • Linkage maps can be constructed by measuring recombination between genomic DNA sites.

Understanding Linkage

  • Defined as the failure of two genes to assort independently due to close proximity on a chromosome.

  • Genes far apart will assort independently and will not be linked.

  • Determined by crossing over during meiosis 1.

Recombination Process Explained

  1. Nicks in DNA strands lead to crossover.

  2. Strand Invasion allows recombination through base-pairing and ligation.

  3. Results in the formation of recombinant chromosomes.

Lack of Linkage

  • Independent Assortment:

    • Far-apart genes assort independently; multiple combinations possible.

  • Linkage Without Recombination:

    • Genes closely linked show inheritance of specific combinations (e.g., A along with B).

  • Linkage With Recombination:

    • Even after recombination, genes remain linked in various combinations.

Linkage Mapping Process

  • Each gene occupies a fixed position on chromosomes.

  • Maps can identify gene locations, facilitating mutation studies.

  • Closer genes correlate with reduced likelihood of crossing-over events.

  • Recombination map unit known as centiMorgan (cM), named after Morgan.

Restriction Mapping

  • Constructed by cleaving DNA into fragments to measure distances between cleavage sites.

  • Large genomic changes can be recognized by variation in restriction fragment size.

Understanding RFLP

  • Restriction Fragment Length Polymorphism (RFLP):

    • Example: A loci with sequence variation resulting in distinct restriction site detection.

    • Individual analysis can reveal differences in restriction fragment sizes, aiding identification.

Detection and Analysis of RFLP

  • Use of fragment length detection allows tracking of genetic patterns based on family inheritance and variations due to mutations.

Inheritance and Pedigree Analysis of RFLPs

  • Demonstrates inheritance patterns through family lines, using fragment variations to trace allele presence.

Case Study: Distal Myopathy

  • Distal muscular dystrophy primarily affects extremities.

  • Genetic causes difficult to ascertain due to multiple potential gene mutations.

  • Inheritability patterns can be either autosomal dominant or recessive.

Microsatellite Markers for Disease Localization

  • Adjustable inheritance patterns through family studies to identify shared markers in affected individuals.

Electrophoresis Result Assessment

  • Assessing results to distinguish between affected and non-affected family members via linked genetic markers.

Common Haplotype Identification in Distal Myopathy

  • All affected individuals share a common haplotype (6, 2, 4, 2, 5), identifying exclusive markers related to the disorder.

Conclusion of Lecture 4

  • Coverage of the Human Genome Project and 1000 Genomes Project, emphasizing genetic mapping, linkage analysis, and comprehending genetic variations contributing to human health.