Linkage, Recombination, and Gene Mapping Study Guide

Independent Assortment and Genetic Linkage

  • Mendel’s Law of Independent Assortment: This law assumes that the four possible allele combinations (e.g., TBTB, TbTb, tBtB, tbtb) formed by two traits are all equally likely.
  • Chromosomal Alignment: Two chromosome pairs can align in two different orientations during metaphase I of meiosis.     * Orientation 1: Gametes contain ABAB or abab alleles.     * Orientation 2: Gametes contain aBaB or AbAb alleles.
  • Physical Distance Requirement: Independent assortment occurs only for genes that lie at a significant distance from each other on the same chromosome or are located on different chromosomes.
  • Genetic Linkage Definition: Genes that do not assort independently are said to be genetically linked. If Gene A and Gene B are linked, the alleles inherited from one parent (e.g., AA with BB) tend to stay together.

Recombination and Recombination Frequency (RfRf)

  • Recombinant vs. Nonrecombinant Chromosomes:     * Nonrecombinant: Chromosomes that are identical to those inherited from the parents (e.g., if Dad provided ABCABC and Mom provided abcabc, the gametes remain ABCABC or abcabc).     * Recombinant: Chromosomes resulting from crossing over during meiosis, yielding new allele combinations (e.g., AbcAbc or aBCaBC for genes AA and CC relative to BB).
  • Calculating Recombination Frequency (RfRf):     * RfRf is the probability that a gamete contains a recombinant combination.     * Formula: Rf=Number of recombinant progenyTotal number of progeny×100\text{Rf} = \frac{\text{Number of recombinant progeny}}{\text{Total number of progeny}} \times 100     * Maximum Value: The maximum value of RfRf is 50%50\%. Even if a recombination breakpoint always falls between two genes, only two of the four chromatids in the tetrad participate in a single crossover, limiting recombinants to half the gametes.
  • Independent Assortment and RfRf:     * If genes assort independently (A/aA/a and C/cC/c), all combinations (ACAC, AcAc, aCaC, acac) are equally likely (25%25\% probability each).     * The RfRf for independently assorting genes is 50%50\% (25% Ac+25% aC=50%25\% \text{ Ac} + 25\% \text{ aC} = 50\%).
  • Complete Linkage: In complete linkage, the RfRf is 0%0\%. Recombinant allele combinations never appear.

Factors Influencing Recombination Frequency

  • Gene Distance: The closer two genes lie on a chromosome, the smaller the RfRf between them.
  • Chromosomal Position: Recombination occurs more frequently near the telomeres (ends of the chromosome) than near the centromere. Genes separated by the same physical distance will have a higher RfRf if they are near the telomere.
  • Chromosome Variation: Different chromosomes exhibit different RfRf values for genes separated by the same physical distance.
  • Biological and Demographic Factors: RfRf can vary between men and women, as well as between different ethnic groups.

Effects of Linkage on Phenotypic Ratios

  • Mendelian Expectations: A dihybrid cross (AaBb×AaBbAaBb \times AaBb) traditionally yields a 9:3:3:19:3:3:1 phenotypic ratio based on two assumptions:     1. Allele combinations (ABAB, AbAb, aBaB, abab) appear equally often in gametes.     2. All gamete combinations are equally likely to create viable offspring.
  • Linkage Distortion: If genes are linked and Rf < 50\%, the first assumption is violated. Some meioses will not produce recombinant chromosomes, causing nonrecombinant phenotypes to predominate.
  • Summary of Dihybrid Test Cross (AaBb×aabbAaBb \times aabb):     * Unlinked (Independent Assortment): Progeny are 25%25\% Dominant/Dominant, 25%25\% Dominant/Recessive, 25%25\% Recessive/Dominant, and 25%25\% Recessive/Recessive. Half are recombinant.     * Completely Linked: Only nonrecombinant progeny are produced (50%50\% Dominant/Dominant and 50%50\% Recessive/Recessive if alleles were in coupling).     * Linked with some crossing over: Nonrecombinant progeny predominate (number > 50%50\%), and recombinant progeny are present but less frequent.

Configuration: Coupling and Repulsion

An individual heterozygous for two genes (p+/pp^+/p and b+/bb^+/b) can have alleles in two physical arrangements:

  • Coupling (Cis Orientation): Two wild-type alleles are on the same chromosome, and two mutant alleles are on the other (p+b+pb\frac{p^+ b^+}{p b}).
  • Repulsion (Trans Orientation): One wild-type and one mutant allele are on each chromosome (p+bpb+\frac{p^+ b}{p b^+}).
  • Significance: The phenotypes of the offspring in a test cross are the same regardless of configuration, but the numerical distribution (which phenotypes are nonrecombinant vs. recombinant) differs.

Genetic Mapping and Map Units

  • Physical Maps: Describe distance in basepairs (bpbp).
  • Genetic Maps: Describe distances in terms of RfRf.     * Unit: map units (m.u.m.u.) or centiMorgans (cMcM).     * Conversion: 1m.u.=1cM=1%1\,m.u. = 1\,cM = 1\% recombination.
  • Additivity: Map distances can be added. If AB=5cMA-B = 5\,cM and BC=20cMB-C = 20\,cM, then AC=25cMA-C = 25\,cM. This allows for two equivalent representations: ABCA-B-C or CBAC-B-A.
  • Limitations:     * If Rf=50%Rf = 50\%, the map cannot distinguish if genes are far apart on the same chromosome or on different chromosomes.     * Double Crossovers: Observed RfRf often underestimates actual distance because double crossovers between two genes are not detected.

Three-Point Test Crosses

  • Purpose: To determine the correct order and genetic distance of three linked genes.
  • The Double Crossover Rule: Double recombination moves only the middle gene's alleles. By comparing the most common progeny (nonrecombinants) with the least common (double recombinants), the middle gene can be identified.
  • Procedure Example (Genes: st, e, ss):     * Identify Nonrecombinants (NR): Most frequent (e.g., wild type and triple mutant).     * Identify Double Recombinants (DR): Least frequent.     * Compare NR and DR: If total linkage was st+e+ss+st^+ e^+ ss^+ and the DR results in st+e+ssst^+ e^+ ss, then ssss must be the middle gene because only its allele changed relative to the others.
  • Calculating Distances:     * To find distance between Gene 1 and Gene 2: Rf=Sum of single crossovers in that interval+Double crossoversTotal progeny\text{Rf} = \frac{\text{Sum of single crossovers in that interval} + \text{Double crossovers}}{\text{Total progeny}}     * Example calculation: 50+52+5+3755=0.146=14.6%\frac{50 + 52 + 5 + 3}{755} = 0.146 = 14.6\% between stst and ssss.

Interference and Coefficient of Coincidence (CC)

  • Concept: A crossover in one region may influence (typically reduce) the likelihood of another crossover nearby.
  • Theoretical Double Recombinations: Calculated by multiplying the probabilities of single crossovers (Pinterval 1×Pinterval 2P_{\text{interval 1}} \times P_{\text{interval 2}}).     * Example: .146×.122=0.0178.146 \times .122 = 0.0178. For 755755 offspring, expected double recombinants = 0.0178×755=13.40.0178 \times 755 = 13.4.
  • Coefficient of Coincidence (CC): C=Observed double crossesExpected double crossesC = \frac{\text{Observed double crosses}}{\text{Expected double crosses}}.
  • Interference (II): I=1CI = 1 - C.     * If I > 0, there are fewer double crossovers than expected.     * If I < 0, there are more double crossovers than expected.

Mapping Human Genes

  • Linkage Analysis: Tracing the inheritance of disease mutations alongside polymorphic markers in families over multiple generations.     * Markers: Known polymorphic sequences (e.g., 2q11.2, 2q13).     * Strategy: Identify marker alleles inherited by all affected family members but not unaffected ones. Recombinants help narrow the genome region containing the mutation.
  • LOD Scores (Logarithm of Odds): This statistical method overcomes the small size of human families.     * LOD_{Rf} = \log\left(\frac{\text{Probability of results if linked at a specific Rf}}{\text{Probability of results if unlinked (Rf = 50%)}} ight)     * Interpretation: Positive scores favor linkage; a score of 3.03.0 indicates it is 1,0001,000 times more likely that genes are linked at that RfRf than unlinked.     * Calculating Odds: If unlinked, the probability of a specific outcome across nn children is (0.5)n(0.5)^n.
  • Case Study: Nail-Patella Syndrome and Blood Type:     * Observations: 2 recombinants out of 13 progeny.     * Calculated LOD scores at various RfRf:         * Rf=1%Rf = 1\%: LOD=0.134LOD = -0.134         * Rf=10%Rf = 10\%: LOD=1.41LOD = 1.41         * Rf=15%Rf = 15\%: LOD=1.49LOD = 1.49 (Most likely distace observed)         * Rf=20%Rf = 20\%: LOD=1.46LOD = 1.46         * Rf=25%Rf = 25\%: LOD=1.34LOD = 1.34

Association Studies and GWAS

  • Association Study (Case-Control Study): Compares allele frequencies of polymorphisms between a group with a disease (cases) and a group without (controls).     * Objective: Identify alleles found more frequently in cases, suggesting the gene influences disease risk.
  • Genome-Wide Association Study (GWAS): Genotypes subjects for a massive number of markers across the entire genome in a single scan.
  • Candidate Genes:     * If a polymorphism is within a gene, that gene is a "candidate."     * If not, the nearest genes are investigated based on their protein function.     * Validation: mRNA/protein level measurements or animal model manipulation.
  • Example: Asthma Risk Study:     * Gene 1 Data: Allele frequencies are similar in control and asthma groups; likely has no influence on risk.     * Gene 2 Data: Alleles 1 and 5 appear much more frequently in the asthma group (0.400.40 vs. 0.100.10 and 0.050.05 respectively). These are "risk-increasing alleles." Alleles 2, 3, and 4 are more frequent in controls and are "risk-decreasing alleles."

Questions & Discussion

  • Quiz 1: If a person who is AaBbAaBb (where A/BA/B are in trans) undergoes recombination, what are the 4 possible outcomes? Which are recombinants and which are nonrecombinants?     * Self-Correction/Logic: If trans, the chromosomes are AbAb and aBaB. Therefore, nonrecombinants are AbAb and aBaB. Recombinants are ABAB and abab.
  • Quiz 2: Draw a map indicating the relative positions of Genes A, B, C, D, and E on the same chromosome based on the following RfRf data (cMcM):     * AB=10AB = 10, AC=50AC = 50, AD=29AD = 29, AE=28AE = 28, BC=41BC = 41, BD=39BD = 39, BE=18BE = 18, CD=50CD = 50, CE=23CE = 23, DE=50DE = 50.