Assisstant Prof. Cemalettin Bekpen's lecture on Molecular Evolution, part of Computational Biology II MBG2004.
Significant sequence similarity implies function assignment to unknown proteins based on known proteins due to evolutionary relationships.
A gene/protein related to a second gene/protein by descent from a common ancestral gene via speciation.
Genes/proteins in different species evolved from a common ancestor via speciation, retaining the same function.
Genes/proteins related by duplication of a common ancestral gene. These evolve new functions, which may or may not relate to that of the ancestor.
Evolution of new gene/protein that is genetically independent from the ancestral gene.
Evolution of similar features/properties in genes/proteins of different genetic lineages.
Examples include Trypsin (3NKK), Chymotrypsin (1ACB), and Subtilisin (1SBT).
Stochastic single point changes in genetic material due to:
Slow but constant rate (molecular clock) of 10^{-9} to 10^{-8} mutations per base per generation. Includes splicing errors in eukaryotes that retain introns.
Exchange of genes (or portions) between different chromosomes to create new combinations.
Duplication of a gene (or portion). One copy retains the original function, while the other evolves and acquires new functions.
Incorporation of mRNA sequences back into DNA, often inserting into new locations with different expression patterns.
The mechanisms by which new genes/proteins arise enable sequence analysis to infer functional and structural relationships.
Consider watching the YouTube video on this topic. Also review the Khan Academy article on Darwin, evolution, & natural selection.
(Phyletic evolution): A single population transforms enough to be designated a new species.
Branching evolution: A new species arises from a small population that buds from a parent species. Most new species probably evolve by cladogenesis, the branching evolution that is the basis for biological diversity.
The shift in moth population color from light to dark during the Industrial Revolution in England is an example of directional selection.
Yearling male lazuli buntings with either bright or dull coloration being able to establish territories and breed, while those with intermediate plumage do not mate.
Stabilizing selection favors the most common phenotype as best adapted. It reduces variation by selecting against alleles that produce more extreme phenotypes. An example is birth weight; babies with weights too low or too high face increased risks.
Woodpeckers and wasps influence gall-fly populations, applying pressure that results in stabilizing selection.
Examples include the domestication of wolves into various dog breeds and the selective breeding of wild mustard into cabbage, brussel sprouts, cauliflower, broccoli, kale, and kohlrabi.
Contrast Lamarck's inheritance of acquired characteristics with natural selection, using the example of giraffe neck length. Also, consider the evolution of modern corn from Teosinte.
When natural selection favors a single allele and the allele frequency continuously shifts in one direction.
Red bars on the selection map indicate loci under negative selection. Blue bars represent loci under positive selection (95% credibility level). Strong evidence of selection at >99%.
To understand the presence and form of natural selection on genes by:
Compare a gene of interest for different species
Rate of synonymous mutations and rate of non-synonymous mutations
Changes in a protein sequence come from changes in the nucleotide sequence
Nonsynonymous changes are more likely to have functional consequences and are generally deleterious, thus removed from populations more rapidly. The rate of nonsynonymous change will be slower than the rate of synonymous change.
Measuring selection on protein-coding genes:
-->Selection ‘for’ particular amino acid changes.
Changes are synonymous or non-synonymous
AAA \rightarrow AAG \text{ (Lysine)}
AAA \rightarrow GAA \text{ (Glutamic Acid)}
*What if dN/dS = 1?
Estimate of positive selection
Frequencies of different codons for the same amino acid are different.
Codon usage bias is caused by:
The genetic code is redundant.
*Some amino acids coded by more than one codon.
*Proteins are more conserved during evolution.
DNA \rightarrow 4 \text{ letter alphabet}
Proteins \rightarrow 20 \text{ letter alphabet}
KA = Nd / N and KS = Sd / S
Ratio KA/KS as an indicator of evolutionary mode in each gene Basic analyses of the proportion of non-synonymous to synonymous divergence KA/KS
KA or dN: rate of non-synonymous divergence and KS or dS: rate of synonymous divergence between species
Estimates of non-synonymous and synonymous polymorphisms and substitutions provide insight into evolutionary processes By Analysing divergence and polymorphism:
KA and dN: rate of non-synonymous substitutions , KS and dS: rate of synonymous substitutions, PN: Amount of non-synonymous polymorphisms, PS: Amount of synonymous polymorphisms KA/Ks branch-specific estimate
Positive selection is selection on a particular trait - and the increased frequency of an allele in a population
In PRM1 exon 2, there are six differences between humans and chimpanzees, five of which alter amino acids.
branch-specific dN/dS estimates for OGP (oviductal glycoprotein) for multiple species
Alleles and Haplotypes that increase in frequency rapidly due to positive selection will carry lots of “hitch-hiking”, flanking DNA, creating a linkage disequilibrium signature
Geographic variation in allele frequencies and patterns
Genes such as AGT, CYP3A, SLC24A5, FY, IL4, IL13, CASP12, NAT2, LCT, TRPV6, and MMP3, show evidence of geographically restricted selection in humans related to climate, pathogens, or diet.
Extreme population differences in FYO allele frequency; The FYO allele, which confers resistance to P. vivax malaria.
Brains, food, reproduction and parasites
Most-significant categories showing positive selection in the human lineage include:
Immunity and defense, T-cell-mediated immunity, Chemosensory perception, Biological process unclassified, Olfaction, Gametogenesis, Natural killer-cell-mediated immunity, Spermatogenesis and motility, Inhibition of apoptosis, Interferon-mediated immunity, Sensory perception and B-cell- and antibody-mediated immunity.
Chemosensory perception, Olfaction, Gametogenesis, Spermatogenesis and motility, Fertilization, Other carbohydrate metabolism, Electron transport, Chromatin packaging/remodeling, MHC-I-mediated immunity, Steroid metabolism, Lipid and fatty acid binding, mRNA transcription initiation , Protein modification, Vitamin/cofactor transport, Phosphate metabolism and Peroxisome transport
Human-specific duplicates evolving under adaptive natural selection include a surprising number of genes involved in neuronal and cognitive functions.
Neuroblastoma breakpoint family NBPF and others such as, FAM75A ,Williams Beuren syndrome region 19 and etc.
Small (~430 cc v ~1,400 cc) but otherwise ~normal brain, only mild mental retardation. Can be due to loss of activity of the ASPM gene, Abnormal spindle-like microcephaly associated, or MCPH1 gene
ASPM is still evolving adaptively in human lineage?! and May related to forms of human language, tonal and non-tonal convergence
relating human adaptive molecular evolution to human disease, Crespi 2010, Evol. Appl. Genes subject to recent positive selection in humans are differentially involved in neurological diseases
FOXP2 is highly conserved throughout mammals and beyond but for three nucleotide substitutions that change the FOXP2 protein between humans and the mouse, and two have occurred along the human lineage Examination of human genetic variation suggests that the region surrounding the gene underwent a selective sweep in the past 200,000 years
Brains of individuals with FOXP2 mutations have reduced grey matter in the frontal gyrus which includes Broca’s area and Functional abnormalities in Broca’s area during language tasks
The Derived FOXP2 Variant of Modern Humans Was Shared with Neandertals
FOXP2: two genetic variants (SNPs) are associated with risk of some neurodevelopmental disorders involving speech and language, schizophrenia and autism
Many genes related to primate brain development have been subject to positive selection Have identified several positively-selected genes related to brain size and language in humans, but we do not know how they work These same genes are also involved in human disorders related to the brain and language
All infants have high lactase enzyme activity to digest the sugar lactose in milk In most humans, activity declines after weaning, but in some it persists: LCT*P
Linkage and LD studies show association of lactase persistence with the T allele of a T/C polymorphism 14 kb upstream of the lactase gene and Lactase level is controlled by a cis-acting element
Genetic Signatures of Strong Recent Positive Selection at the Lactase Gene and Convergent adaptation of human lactase persistence in Africa and Europe show selection
A SNP in the gene encoding lactase (LCT) (C/T-13910) is associated with the ability to digest milk as adults (lactase persistence) in Europeans, These data provide a marked example of convergent evolution due to strong selective pressure resulting from shared cultural traits-animal domestication and adult milk consumption and Genotyping across a 3-Mb region demonstrated haplotype homozygosity extending >2.0 Mb on chromosomes carrying C-14010, consistent with a selective sweep over the past ~7,000 years
Not just milk - use of starches also increased in human diet, Diet and the evolution of human amylase gene copy number variation and Evidence for selection of suite of genes 'for' meat-eating adaptation
Better food, smaller guts, adaptations to meat, Humans adapted genetically to a novel diet that includes dairy products, grains, and more meat The selection involved has been strong The molecular adaptations involved in dietary adaptations tend to be local geographically, and still exhibit genetic polymorphisms
Rapid Evolution of Reproductive Proteins in Mammals, Rapid Evolution of Fertilization Proteins and Male-female conflicts, Egg laying (increased). Receptivity to mating sperm displacement and more
Correlation between SEMG2 Evolution and Primate Sexual Traits, comparative evidence for molecular adaptation related to sperm mobility, with implications for human male fertility different primate species and Carlson et al, research in fertile
Evidence from segregation in South Amerindian families: A strong signature of balancing selection in the 5' cis-regulatory region of CCR5
How selection on the human genome is related to disease; Strong, recent positive selection can create maladaptations as byproducts (via pleiotropy); Balancing selection creates maladapted homozygotes (as a form of tradeoff); Locally-selected adaptations become maladaptive with changes in the environment (such as recent human migrations)-local adaptation is common; Selection on brain, dietary, reproductive and disease genes has generated very rapid, recent, ongoing change, which helps in understanding human adaptation and disease
Immune recognition molecules: KIRS, LIRAS, HLAS, TRs Multiple changes, Inactivation or deletion, Duplication and Functional amino acid changes
Pkr Evolution in Response to Viral Mimicry - YouTube