Study Notes on Gene Duplication and Natural Variation of Gene Expression in Evolution of Metabolism

Abstract

  • Background: Most eukaryotic genomes have experienced whole genome duplications throughout their evolution. Duplicated genes can diverge in function from their ancestral genes via neo-functionalization and sub-functionalization. Furthermore, gene duplicates may segregate functions among different genotypes, promoting genetic diversity. The divergence ability of duplicates may be influenced by their biological functions.

Methodology/Principal Findings

  • I assessed the impacts of gene duplication and metabolic functions on gene expression variability within Arabidopsis thaliana.

    • Key Findings: Younger tandem duplicated genes exhibit a higher level of intraspecific gene expression variation than the average gene in Arabidopsis. Older segmental duplicates also show elevated intraspecific variation but at lower levels than tandem duplicates.

    • The biological function affects the level of intraspecific variation in gene expression. Major pathways in energy metabolism and biosynthesis show decreased variability, indicating constraints on their expression. In contrast, herbivory defense pathways exhibit significant intraspecific variation, indicating selection pressures for diversity in response to herbivory.

Conclusion

  • The data reveal that intraspecific variation in gene expression is enhanced by the interaction between gene duplication and biological function, influencing plant metabolic diversity.

Citation Information

  • Kliebenstein DJ (2008) A Role for Gene Duplication and Natural Variation of Gene Expression in the Evolution of Metabolism. PLoS ONE 3(3): e1838. doi:10.1371/journal.pone.0001838

  • Received December 21, 2007; Accepted February 19, 2008; Published March 19, 2008

  • Copyright © 2008 Daniel Kliebenstein. Open-access under Creative Commons License.

  • Funding: NSF grant DBI#0642481 to DJK. No competing interests declared.

Introduction

Overview of Genome Duplication

  • Eukaryotic genomes have predominantly undergone whole genome duplications, especially in angiosperms.

  • Whole genome duplications and smaller local tandem duplication events are noted with younger tandem duplicated genes following segmental duplications.

  • Duplicated genes may become non-functional due to harmful mutations or retain dual functions if beneficial over time.

Divergence of Duplicated Genes

  • Duplicated genes can either undergo sub-functionalization, acquiring different expression patterns or neo functionaliz ation, acquiring new functions.

  • There have been observed associations between gene duplication and diverse expression patterns in response to various environmental stimuli in different species.

  • Genetic differentiation among species may emerge due to functional partitioning driven by gene duplication.

Gene Function and Expression Patterns

  • The biological function of genes plays a role in their evolutionary conservation.

    • Essential biological processes (e.g., primary metabolism) are believed to have constraints limiting their gene expression variations.

    • Secondary metabolite genes should exhibit more variation due to their involvement in diverse ecological interactions, driven by selection pressures.

Specific Example: Glucosinolates

  • Glucosinolates: a class of sulfur-rich secondary metabolites that pose a defense mechanism against herbivores in Arabidopsis.

  • Diversity in glucosinolates is subjected to different selective regimes, leading to changes in gene expression and gene duplication in these pathways.

Aim of the Study

  • Investigate the interplay between gene duplication, metabolic pathways, and gene expression variation i n natural accessions of Arabidopsis thaliana.

  • Compare developmental and abiotic stress-induced variations in gene expression to contextualize findings from duplicate genes.

Results

Transcript Accumulation Analysis

  • Utilized comprehensive datasets from Affymetrix ATH1 microarrays capturing intraspecific gene expression variation across Arabidopsis accessions and in response to development or abiotic stress.

    • Key Metrics: Variance mean ratio (VMR) and coefficient of variance (CV) assessed for diversity across expression datasets.

Duplicated Genes and Variance Estimates

  • Tandem duplicated genes typically exhibit increased variance across datasets as compared to the whole genome average, indicating functional diversity.

  • Segmental duplicated genes also demonstrated variance elevation in various datasets, although less pronounced compared to tandem duplicates.

Comparison of Genetic Variation and Treatment Effects

  • A factorial dataset focused on two accessions exposed to abiotic treatment indicated that genetic differences led to greater expression variance than treatment effects.

  • Tandem and segmental duplicates exhibited significant enhancements in expression variance in comparison to unique genes.

Metabolic Pathways Analysis

Transcript Variance Estimates

  • Investigated pathway levels of variance (CVs) across gene expressions within metabolic pathways utilizing the AraCyc database.

    • Essential pathways displayed reduced variance while secondary metabolic pathways reflected substantial diversity in expression, suggesting adaptive functions.

Significant Findings

  • Aliphatic glucosinolate pathways showed elevated expression diversity, indicative of selection for increased variability as a response to fluctuating insect pressures.

  • The exploration into the genetic architecture of expression variance highlighted the contributions of gene duplication status within specific metabolic pathways.

Discussion

Genetic Constraints and Pathway Analysis

  • Results indicate elevated variations in expression are associated with gene duplications, particularly within herbivore defense pathways.

  • Essential metabolic pathways maintain constraints due to their critical roles in cellular survival and development.

Implications for Evolution

  • Gene duplication provides raw evolutionary material, with biological functions determining whether these duplicates lead to enhanced diversity or functional redundancy.

  • The results support a model where the evolutionary capacity of duplicated genes is influenced by their context within biochemical pathways.

Materials and Methods

Microarray Datasets Overview

  • Included four datasets analyzed via normalization to capture responses to abiotic stresses, developmental stages, and natural accession variability across Arabidopsis genotypes.

Analyses Conducted

  • Evaluated transcript accumulation variance using established statistical measures, controlling for potential cross-hybridizations in microarray data.

  • Comparison of eQTL mapping data provided insights into the genetic control of transcript abundance regarding different gene classes concerning duplication status.

Supporting Information

  • Tables and figures detailing statistical analyses and additional data can be referenced in the original article due to their volume, including per gene statistical values, pathways, and variance estimates.