Eukaryotes possess more genes than prokaryotes, enabling greater functional diversity and cell specialization through complex regulatory networks.
Cell specialization depends on the regulation of gene expression, with certain genes being turned on or off based on cellular requirements to fulfill specific roles.
Examples of cell types based on gene expression include:
Skeletal muscle cells (express genes for contractile proteins like actin and myosin)
Osteocytes (express genes for bone matrix components like collagen and osteocalcin)
Differential Gene Expression
Definition: Differential gene expression refers to the phenomenon where some genes are expressed (turned on) while others are not (turned off) within the same organism.
This allows each cell type to perform distinct functions despite sharing the same genetic material, a core principle of the central dogma of molecular biology.
Important Note: Only about 1% of human DNA is involved in coding for proteins, highlighting that the vast majority of our DNA is non-coding and may have regulatory, structural, or evolutionary remnant functions, or its purpose is still unknown.
Types of Coding Genes
Single Copy Genes:
Genes that exist in a single copy per diploid genome, such as genes for the Rh factor or enzymes in metabolic pathways.
Example given:
Attached vs. free earlobes, a Mendelian trait determined by a single gene locus, illustrating dominant and recessive alleles.
Multigene Families:
Collections of similar or identical genes that code for related functions, often arising from gene duplication events. These allow for the rapid production of large quantities of gene products or for specialized versions of proteins active at different developmental stages.
Examples include:
RNA genes: genes coding for rRNA, mRNA, tRNA, and RNA polymerase, crucial for protein synthesis.
Hemoglobin genes: includes genes for alpha (α) and beta (β) subunits, with distinct fetal and adult forms (e.g., embryonic hemoglobin, fetal hemoglobin, and adult hemoglobin) adapted to varying oxygen requirements.
Non-coding DNA:
Comprises DNA sequences that do not code for proteins but can have significant regulatory or structural roles. It includes:
Vestigial genes (pseudogenes): sequences resembling functional genes but are no longer functional due to mutations.
Introns: non-coding sections within genes that are transcribed into pre-mRNA but are spliced out before translation; important during mRNA processing and can play regulatory roles.
Short Tandem Repeats (STRs), Variable Number Tandem Repeats (VNTRs), and transposons (jumping genes), which can insert or remove themselves within the genome, contributing to genetic variation and genome evolution.
Short Tandem Repeats (STRs)
STRs are sequences of DNA consisting of short segments (2–5 base pairs) that repeat multiple times in a head-to-tail fashion.
Example:(gata)n repeating several times, where 'n' denotes the number of repeats.
STRs are considered non-coding DNA and are highly polymorphic (unique in repeat numbers) across individuals, making them invaluable for DNA profiling in forensic analysis and paternity testing.
Linked to CODIS (Combined DNA Index System) for criminal investigations, comparing DNA profiles from crime scenes to known offender databases.
Variable Number Tandem Repeats (VNTRs)
VNTRs are similar to STRs but typically involve longer repeat units (10–100 base pairs) and vary in the number of repeats among individuals.
They can provide information about genetic differences and relationships between family members, as siblings can share some VNTR alleles, while others will be unique, useful in genetic mapping.
Single Nucleotide Polymorphisms (SNPs)
SNPs represent variations in a single nucleotide base pair at a specific position in the DNA sequence.
Example: An adenine (A) may be replaced with a guanine (G) at a particular locus.
Often located within introns or other non-coding areas, SNPs are useful for familial linkage studies, population genetics, and identifying genetic predispositions to diseases.
Gene Regulation Mechanisms
Pre-transcriptional Controls
Heterochromatin:
DNA that is tightly packed around histone proteins, forming a condensed chromatin structure. This restricts the access of transcription machinery (RNA polymerase and transcription factors), effectively silencing gene expression.
Histone Modifications:
Acetylation: The addition of acetyl groups to histone tails neutralizes their positive charges, loosening their binding to the negatively charged DNA. This makes the chromatin structure more open and accessible, thereby facilitating gene expression. Enzymes called histone acetyltransferases (HATs) mediate this.
Methylation: The addition of methyl groups, typically to cytosine bases in DNA (CpG islands) or to histone tails. DNA methylation often tightens histone binding and promotes a condensed chromatin state, effectively silencing genes. This is a key epigenetic mechanism involved in long-term gene inactivation.
Transcription Factors:
Proteins that help initiate or regulate the transcription of specific genes by binding to specific DNA sequences, usually in the promoter or enhancer regions.
General Transcription Factors: Are essential for the transcription of all protein-coding genes.
Specific Transcription Factors (Activators and Repressors): Bind to control elements (enhancers or silencers) and either boost or slow down transcription. Enhancers and activators facilitate transcription, while bending proteins assist in bringing distantly bound transcription machinery into proximity with the gene.
Post-transcriptional Controls
RNA Processing:
Involves several modifications to pre-mRNA in the nucleus, including splicing out introns (non-coding regions) and joining exons (coding regions) to create a mature, functional mRNA molecule. Alternative splicing allows a single gene to code for multiple protein isoforms.
RNA Stability:
The incorporation of a 5' cap (a modified guanine nucleotide) and a poly-A tail (a string of adenine nucleotides) at the 3' end of the mRNA molecule. These modifications protect the mRNA from degradation by nucleases and facilitate its transport from the nucleus to the cytoplasm, as well as aid in ribosome binding for translation, thus determining the lifespan of mRNA.
siRNA (Small Interfering RNAs):
Short, double-stranded RNA molecules that bind to complementary mRNA sequences, leading to its degradation or inhibiting its translation. This mechanism, known as RNA interference (RNAi), effectively silences gene expression.
Ubiquitination:
A post-translational modification process where proteins are covalently tagged with ubiquitin molecules. This tag signals for the targeted protein's degradation by a cellular machine called the proteasome, thus regulating protein abundance and function.
**MicroRNAs (miRNAs):
Small, non-coding RNA molecules (typically 20-22 nucleotides long) that bind to specific messenger RNA (mRNA) molecules, primarily in the 3' untranslated region. This binding typically prevents translation or promotes mRNA degradation, subsequently regulating gene expression.
Cell Specialization
Refers to the differentiation of embryonic stem cells or adult stem cells into specific cell types (e.g., neurons, muscle cells, blood cells), guided by intricate regulation of gene expression patterns.
Critical for tissue repair and regeneration, responding to injury through modulated cellular communication and coordinated gene activation.
Types of Cellular Communication
Gap Junctions:
Found in animal cells, these protein channels (connexons) connect adjacent cells directly, allowing the passage of ions, small molecules, and electrical impulses, facilitating rapid signal transmission and metabolic coupling.
Plasmodesmata:
Microscopic channels that traverse the cell walls of plant cells, connecting their cytosols and allowing direct intercellular transport of water, solutes, proteins, and RNA molecules, comparable in function to gap junctions.
Cell Recognition:
Involves direct contact between cell surface molecules, such as antigen-receptor interactions for immune identification.
Example: T cells recognizing specific peptide antigens presented by Major Histocompatibility Complex (MHC) molecules on the surface of infected or abnormal cells, initiating an immune response.
Signaling Mechanisms
Paracrine Signaling:
Local signaling where a secreting cell releases signaling molecules (local regulators) into the extracellular fluid, influencing the behavior of nearby target cells within the immediate vicinity.
Example: The SRY gene product (a transcription factor) directing sexual differentiation in the embryo by influencing adjacent cells to develop male characteristics. Growth factors are also common paracrine signals.
Synaptic Signaling:
Involves neurons transmitting signals across specialized junctions called synapses. An electrical signal along the neuron triggers the release of neurotransmitters, which diffuse across the synaptic cleft and bind to receptors on the target cell (another neuron or muscle cell), eliciting a response.
Hormonal Signaling (Endocrine Signaling):
Long-distance signaling through hormones, which are secreted by endocrine glands into the bloodstream and travel to distant target cells throughout the body. Target cells possess specific receptors to bind and respond to these hormones.
Hormones can be classified as water-soluble (e.g., insulin, growth hormone) or fat-soluble (e.g., steroid hormones like testosterone and estrogen, thyroid hormones).
Hormones and Protein Synthesis
Water-soluble hormones:
Hydrophilic and cannot cross the lipid bilayer of cell membranes. They bind to specific protein receptors on the cell surface, initiating a signal transduction pathway, often involving a phosphorylation cascade (e.g., activating protein kinases) and secondary messengers (e.g., cAMP), ultimately leading to a cellular response without directly entering the nucleus to affect gene expression.
Example: Human Chorionic Gonadotropin (HCG) binds to cell surface receptors to maintain progesterone production by the corpus luteum during early pregnancy.
Fat-soluble hormones:
Hydrophobic (lipophilic) and can readily diffuse across the cell membrane. Once inside the cell, they bind to intracellular receptors (either in the cytoplasm or nucleus) to form a hormone-receptor complex. This complex then acts as a transcription factor, binding directly to specific DNA sequences (hormone response elements) to promote or inhibit the transcription of target genes, thereby directly regulating protein synthesis.
Mutations and Cancer
Mutations:
Permanent changes in the DNA sequence can alter the genetic code, subsequently affecting RNA transcripts and potentially leading to changes in the structure and function of proteins. Mutations may result in a loss of function (the protein no longer works or is expressed at lower levels) or a gain of function (the protein acquires a new, often deleterious, activity or is hyperactive).
Examples:
Cystic Fibrosis: A loss of function mutation in the CFTR gene, which codes for a chloride channel, leading to defective ion transport and thick mucus accumulation.
Huntington's Disease: A gain of function mutation (trinucleotide repeat expansion) in the HTT gene, resulting in an abnormally long protein that causes neurodegeneration.
Cancer Regulation:
Cancer often arises from an accumulation of mutations in genes that regulate cell growth and division. These include:
Oncogenes: Genes that, when mutated (gain of function), can promote uncontrolled cell growth. For instance, the RAS protein (a G protein involved in cell signaling) abnormally stimulates cell division pathways when mutated to an oncogenic form.
Tumor Suppressor Genes: Genes that normally constrain cell growth. The p53 protein, known as "the guardian of the genome," typically functions as a tumor suppressor by activating DNA repair, cell cycle arrest, or apoptosis. Loss of its function (often due to loss of function mutations) can lead to uncontrolled cell growth and proliferation, contributing significantly to cancer development.