Molecular Genetics
Levene (1910)
Levene isolated two types of nucleic acid: ribonucleic acid (RNA) and deoxyribonucleic acid (DNA)
these molecules were made up of long chains of individual units called nucleotides
nucleotides are made up of a sugar, a phosphate group and one of five nitrogen-containing bases
adenine (A), thymine (T), cytosine (C), guanine (G) and uracil (U)
RNA
single stranded
ribose sugar
A, U, C, G bases
small molecules used for synthesis of proteins (rRNA, tRNA, mRNA)
DNA
double stranded
deoxyribose sugar (missing an oxygen)
A, T, G, C bases
large molecules used to store genetic information
Frederick Griffith (1928)
studied pathogenic bacteria that caused a pneumonia epidemic in London
discovered the transforming principle in his experiments when injecting mice with different forms of pneumonia causing bacteria
found that hereditary information passed from dead bacterial cells to live bacterial cells were transformed from a harmless form to a disease causing form
Oswald Avery, Colin MacLeod, Maclyn McCarthy (1944)
repeated the mouse studies but first separated the components (lipids, RNA, carbohydrates, proteins, DNA) of the hear-killed pathogenic bacteria before adding it to the mice containing non-pathogenic bacteria —> only mice injected with DNA died
therefore, DNA contains the information necessary to turn harmless bacteria into killer bacteria
Erwin Chargaff (1950)
noticed a pattern in the amounts of the four bases in DNA: adenine, guanine, cytosine, and thymine
he took samples of DNA from different cells and found that the amount of adenine was almost equal to the amount of thymine and that the amount of guanine was almost the same amount of cytosine —> A=T and G=C
this later became Chargaff’s rule
Chargaff’s rule states that adenine only bonds with thymine and guanine only bonds with cytosine.
2 Types of Nitrogenous Bases
purines - double ring structures
adenine and guanine
pyrimidines - single ring structures
cytosine, thymine, and uracil
Alfred Hershey and Martha Chase (1952)
later experiments by Hershey and Chase using a strain of virus known as a bacteriophage - consists of a protein coast surrounding a length of DNA
found that viral DNA, not viral protein, enters the cell causing the host to produce virus particles instead of carrying out its own functions —> DNA is the genetic information
Rosalind Franklin and Maurice Wilkins (1952)
Franklin was able to crystalize the DNA molecule and capture a photo of it which revealed the helical shape of the molecule - Photo 51
Wilkins shows this photo to James Watson without Franklin’s knowledge
James Watson and Francis Crick (1953)
Watson and Crick published a paper on the structure of DNA - the double helix, by using information gained from x-ray diffraction
Watson, Crick and Wilkins were awarded the Nobel Prize in 1952
Franklin died in 1958 from ovarian cancer and has never been recognized for her crucial contribution in solving the structure of DNA
DNA Structure
DNA is a double helix molecule
made up of two antiparallel strands of nucleotides
this means that the 5’ end of one strand is paired wit the 3’ end of its complimentary strand, complimentary base pairs
each nucleotide is composed of a five carbon sugar (deoxyribose) and a phosphate group (together these two are the backbone) and one of four nitrogen-containing bases (the rungs of a the ladder)
nucleotides are linked to each other by their phosphates groups which bind the 5’ end of one sugar to the 3’ end of another sugar
two strands linked by hydrogen bonding between complimentary nitrogen base pairs
A and T via 2 H-bonds
C and G via 3 H-bonds
it is estimated that the human genome contains about 3 billion base pairs of DNA in 46 chromosomes
uncoiled, each chromosome averages about 5 cm in length wrapped around histones
DNA Replication
occurs during S-phase
nucleotides required to build new DNA strands come from ingesting food
DNA replication is called semi-conservative - when DNA is replicated each new molecule of DNA contains one strand of the original DNA molecule and one of the new daughter strand
replication starts at a specific nucleotide sequence called the origin of replication and continues in both directions at once
eukaryotes have multiple origins of replication, forming multiple replication bubbles
Enzymes for Replication
DNA Helicase
separate the two strands of DNA at the replication forks, unwinds that DNA double helix
binding proteins attach to the separated strands to prevent them from twisting back up
Primase
used to make a shot segment of RNA (around 12 nucleotides) which is used to get replication started
DNA Polymerases
responsible for elongation of new DNA at the replication fork
elongates strands in 5’ to 3’ direction
can only add nucleotides to an existing 3’ end of an existing strand, needs a primer
the leading strand is built 5’ to 3’, following the replication fork and will elongate continuously
the lagging strand is built in the direction moving away from the replication fork so it must grow in length by the addition of short segments discontinuously called Okazaki fragments
a different DNA polymerase also removes the RNA primer and replaces it with DNA
also proofreads/double checks to make sure mistakes aren’t made during DNA replication (a mistake happens around once every 100 000 nucleotides)
if an incorrect nucleotide is added, the mismatched base is displaced and removed and DNA polymerase then adds the correct nucleotide
mistakes that are not corrected are mutations
DNA Ligase
joins the fragments together when synthesis is complete
How Genes Work
DNA replication is vital for cell to pass on their genetic information to future generations
the order of the nucleotides in the DNA sequence encodes genes that are used to direct the roles of the cell —> to produce proteins
proteins are the working molecules of the cell
proteins are made up of subunits - amino acids
the order of nucleotides in DNA is known as the genetic code
genetic code contains only four bases found in DNA (T, A, G, C) and has to encode 20 amino acids
the sequence of bases on the DNA determine the sequence of amino acids which determine the function of the protein
3 bases code for 1 amino acid
short segments of a protein - polypeptides
in 1960 it was confirmed that 3 bases are used to determine and amino acid
DNA never leaves the nucleus and protein synthesis takes place in the cytoplasm
the intermediate used is mRNA
Transcription
DNA does not leave the nucleus, therefore a copy of the DNA is required to make proteins
the copy is called messenger RNA (mRNA)
the template strand is the side of the DNA molecule that stored the information that is transcribed into mRNA
the other strand is the non-template strand or coding strand and it has the same nucleotide sequence as the mRNA
similar to DNA synthesis except”
RNA is a much shorter strand
RNA is single stranded
RNA contains no thymine, it has uracil that bonds with adenine instead
sugar backbone is made from ribose sugar instead of deoxyribose sugar
RNA polymerase is the only enzyme required to make mRNA
promoter sequences (the TATA box) and terminator sequences tell the RNA polymerase where to start and stop - these sequences are before and after the gene that needs to be made
RNA Processing
eukaryotic genes are initially copied into nonfunctional RNAs called primary transcripts
these primary transcripts need modification called RNA processing to generate mature and functional RNA
removal of introns and the joining together of exons is known as splicing and occurs in the nucleus
in eukaryotes alternative gene splicing means that one gene can produce different combinations of exons leading to different polypeptides
20 000 genes —> 100 000 polypeptides
following transcription the mRNA moves to the cytoplasm where it is used for protein synthesis on the ribosomes
Nucleotide - basic subunit of nucleic acid (DNA or RNA) made of sugar, phosphate and nitrogen base
Gene - a sequence of nucleotides on DNA which encode for a polypeptide
Triplet - a group of three nucleotides on DNA
Codon - a group of three nucleotides on mRNA
Transcription Unit - the genes required to make a functional protein
Translation
translation is the process by which proteins are made using the genetic code copied onto mRNA
translation takes place on the ribosome or rER which is made of protein and rRNA (ribosomal)
there are 64 codons but only 20 amino acids, therefore there is redundancy in the genetic code
Initiation
the mRNA attaches to the ribosome at the start codon (AUG) which encodes for the amino acid methionine
codons are three nucleotides on mRNA that encode for an amino acid
a tRNA (transfer) delivers this amino acid to the ribosome
tRNAs have two specific ends - one that compliments the mRNA codon (called the anticodon) and the other that attaches to the specific amino acid translating the message
Elongation
tRNAs continue delivering amino acids to the ribosome according to the mRNA code
these amino acids are joined together via a peptide bond
Termination
construction of the polypeptide ends once a stop codon (UAA, UAG, UGA) is read on the mRNA
the stop codons encode for a protein release factor that releases the finished polypeptide and causes the ribosome to break apart
Point mutations - a chemical change that involves one or a few base pairs in a gene
Types of Point Mutations
Substitution
replacement of one nucleotide and its partner from the complimentary DNA strand with another pair of nucleotides
some substitution mutations are silent - they have no effect on the coded protein
usually are missense mutations which still codes for an amino acid that makes sense, but not necessarily the right sense
can be nonsense mutations - if the substitution changes the amino acid codon into a stop signal then this leads to nonfunctional proteins
Insertion and Deletions
addition or loss of one or more nucleotide pairs of a gene
usually have a more disastrous effect
may alter the reading frame (triplet grouping) of the genetic message
called a frameshift mutation
Chromosomal Mutations
chromosomal mutations affect large portions of the genetic code or entire genes or chromosomes
Mutagens
physical, chemical or biological agents that interact with DNA to cause mutations
DNA Technology - Genetic Engineering
the science of manipulating genes that carry hereditary information
recombinant DNA, combines the DNA from two different sources
a gene is cut out of one organism’s genome and inserted into the genome of another organism
once inserted, that DNA will be replicated, transcribed and translated into functional proteins
DNA Cloning
uses recombinant DNA to transfer a DNA fragment of interest from one organism to a self-replicating genetic element, such as a bacterial plasmid
used to make multiple copies of a gene
Cohen-Boyer Experiment
conducted the first genetic engineering experiment in 1973 by transferring frog DNA into a bacterial plasmid
Uses of DNA Technology
to produce a protein product such as human growth factor or insulin
to a endow a particular organism with a metabolic capability it did not previously possess
pest resistance in crops, bacteria that digest oil, fish that are fluorescent
to create more copies of the gene itself so that it can be structured further - DNA cloning
Tools Used
Restriction Enzymes
enzymes originally found in bacteria that were used as a defense mechanism
role is to cut up intruding DNA from other organisms
the key to much of the advances in molecular genetics
restriction enzymes cut DNA into pieces when they recognize specific sequences of nitrogen bases
when they cut up DNA the ends are not cut evenly so the ends are sticky, sticky ends can be combined with other sticky ends
Ligase
used to rejoin sticky ends of DNA cut with restriction enzymes
Vectors
DNA engineered in a test tube and must be returned to a cell in order to function
vectors are either bacteria plasmids or viruses, since they are able to transform their DNA into host cells
Applications for Recombinant DNA
gene therapy - providing “fixed” genes to people with faulty genes, must use a vector to deliver these genes
high tech selective breeding - instead of crossing individuals with desirable traits hoping for a suitable offspring, they can take an offspring and specifically insert genes to promote resistance to disease or increase productivity
biological warfare - insert genes for harmful toxins into harmless bacteria, transfer it into food and water to infect with bacteria immune to antibiotics
Gel Electrophoresis
other DNA is cut with restriction enzymes, the DNA fragments are separated according to size and charge using gel electrophoresis
Polymerase Chain Reaction (PCR)
used to amplify DNA by repeated cycles of heat, primer, DNA polymerase and nucleotides
DNA Sequencing
the Human Genome Project has three basic goals
identify all of the 20 000 - 25 000 genes in human DNA
determine the order of nucleotides for 3 billion chemical base pairs that make up human DNA
store this information in databases
mostly completed in 2006 when the sequence of the last chromosome was published in Nature - the complete sequence of all chromosomes was published in Science in March 2022
the Human Genome Project uses probes, restriction enzymes and DNA sequencers to do their work
DNA Fingerprinting
on average about 99.9% of the DNA between two humans is the same
the remaining 0.1% is what makes us unique (unless you have an identical twin)
these 3 million base pairs contain short sequences (10-69 base pairs) of repetitive DNA called mini-satellites that show variation between people
DNA fingerprinting detects lots of mini-satellites in the genome to produce a pattern that is unique to an individual - DNA fingerprint
DNA Not Part of the Human Genome
some organelles such as the mitochondria and chloroplasts contain their own DNA - their jobs are such that they need to be able to function somewhat on their own
all mitochondria comes from the mother, therefore mitochondrial disorders are inherited from the mom
mitochondrial DNA is much more limited and has a structure more like that found in a bacteria