Study Notes on Gene Duplication and Natural Variation of Gene Expression in Evolution of Metabolism
Abstract
Background: Most eukaryotic genomes have experienced whole genome duplications throughout their evolution. Duplicated genes can diverge in function from their ancestral genes via neo-functionalization and sub-functionalization. Furthermore, gene duplicates may segregate functions among different genotypes, promoting genetic diversity. The divergence ability of duplicates may be influenced by their biological functions.
Methodology/Principal Findings
I assessed the impacts of gene duplication and metabolic functions on gene expression variability within Arabidopsis thaliana.
Key Findings: Younger tandem duplicated genes exhibit a higher level of intraspecific gene expression variation than the average gene in Arabidopsis. Older segmental duplicates also show elevated intraspecific variation but at lower levels than tandem duplicates.
The biological function affects the level of intraspecific variation in gene expression. Major pathways in energy metabolism and biosynthesis show decreased variability, indicating constraints on their expression. In contrast, herbivory defense pathways exhibit significant intraspecific variation, indicating selection pressures for diversity in response to herbivory.
Conclusion
The data reveal that intraspecific variation in gene expression is enhanced by the interaction between gene duplication and biological function, influencing plant metabolic diversity.
Citation Information
Kliebenstein DJ (2008) A Role for Gene Duplication and Natural Variation of Gene Expression in the Evolution of Metabolism. PLoS ONE 3(3): e1838. doi:10.1371/journal.pone.0001838
Received December 21, 2007; Accepted February 19, 2008; Published March 19, 2008
Copyright © 2008 Daniel Kliebenstein. Open-access under Creative Commons License.
Funding: NSF grant DBI#0642481 to DJK. No competing interests declared.
Introduction
Overview of Genome Duplication
Eukaryotic genomes have predominantly undergone whole genome duplications, especially in angiosperms.
Whole genome duplications and smaller local tandem duplication events are noted with younger tandem duplicated genes following segmental duplications.
Duplicated genes may become non-functional due to harmful mutations or retain dual functions if beneficial over time.
Divergence of Duplicated Genes
Duplicated genes can either undergo sub-functionalization, acquiring different expression patterns or neo functionaliz ation, acquiring new functions.
There have been observed associations between gene duplication and diverse expression patterns in response to various environmental stimuli in different species.
Genetic differentiation among species may emerge due to functional partitioning driven by gene duplication.
Gene Function and Expression Patterns
The biological function of genes plays a role in their evolutionary conservation.
Essential biological processes (e.g., primary metabolism) are believed to have constraints limiting their gene expression variations.
Secondary metabolite genes should exhibit more variation due to their involvement in diverse ecological interactions, driven by selection pressures.
Specific Example: Glucosinolates
Glucosinolates: a class of sulfur-rich secondary metabolites that pose a defense mechanism against herbivores in Arabidopsis.
Diversity in glucosinolates is subjected to different selective regimes, leading to changes in gene expression and gene duplication in these pathways.
Aim of the Study
Investigate the interplay between gene duplication, metabolic pathways, and gene expression variation i n natural accessions of Arabidopsis thaliana.
Compare developmental and abiotic stress-induced variations in gene expression to contextualize findings from duplicate genes.
Results
Transcript Accumulation Analysis
Utilized comprehensive datasets from Affymetrix ATH1 microarrays capturing intraspecific gene expression variation across Arabidopsis accessions and in response to development or abiotic stress.
Key Metrics: Variance mean ratio (VMR) and coefficient of variance (CV) assessed for diversity across expression datasets.
Duplicated Genes and Variance Estimates
Tandem duplicated genes typically exhibit increased variance across datasets as compared to the whole genome average, indicating functional diversity.
Segmental duplicated genes also demonstrated variance elevation in various datasets, although less pronounced compared to tandem duplicates.
Comparison of Genetic Variation and Treatment Effects
A factorial dataset focused on two accessions exposed to abiotic treatment indicated that genetic differences led to greater expression variance than treatment effects.
Tandem and segmental duplicates exhibited significant enhancements in expression variance in comparison to unique genes.
Metabolic Pathways Analysis
Transcript Variance Estimates
Investigated pathway levels of variance (CVs) across gene expressions within metabolic pathways utilizing the AraCyc database.
Essential pathways displayed reduced variance while secondary metabolic pathways reflected substantial diversity in expression, suggesting adaptive functions.
Significant Findings
Aliphatic glucosinolate pathways showed elevated expression diversity, indicative of selection for increased variability as a response to fluctuating insect pressures.
The exploration into the genetic architecture of expression variance highlighted the contributions of gene duplication status within specific metabolic pathways.
Discussion
Genetic Constraints and Pathway Analysis
Results indicate elevated variations in expression are associated with gene duplications, particularly within herbivore defense pathways.
Essential metabolic pathways maintain constraints due to their critical roles in cellular survival and development.
Implications for Evolution
Gene duplication provides raw evolutionary material, with biological functions determining whether these duplicates lead to enhanced diversity or functional redundancy.
The results support a model where the evolutionary capacity of duplicated genes is influenced by their context within biochemical pathways.
Materials and Methods
Microarray Datasets Overview
Included four datasets analyzed via normalization to capture responses to abiotic stresses, developmental stages, and natural accession variability across Arabidopsis genotypes.
Analyses Conducted
Evaluated transcript accumulation variance using established statistical measures, controlling for potential cross-hybridizations in microarray data.
Comparison of eQTL mapping data provided insights into the genetic control of transcript abundance regarding different gene classes concerning duplication status.
Supporting Information
Tables and figures detailing statistical analyses and additional data can be referenced in the original article due to their volume, including per gene statistical values, pathways, and variance estimates.