3. genomics and transcriptomicsn

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/13

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

14 Terms

1
New cards

What are the fields of genomics and transcriptomics and how do they differ

Genomics

  • study of whole organisms genetic material

  • Technologies like whole genome sequencing, whole exome sequencing, amplicon sequencing, and single cell genomics

Transcriptomics

  • Complete set of RNA transcripts produced BY GENOME

  • Involves whole transcriptome sequencing, targeting gene expression analysis, single cell transcriptomics, and spatial transcriptomics

While genomics provide DNA, transcriptomics show real time expression and regulation (RNA) of genes

2
New cards

What is sanger sequencing

Selectively incorporates chain-terminating dideoxynucleotides (ddNTPs) during DNA replication

  • Relies on the condensation reaction where DNA polymerase extends the primer by adding deoxynucleotides

  • Used ddNTPs which lack the 3’-OH group, causing extension stopping and allowing fragments to be size separated to determine sequence

3
New cards

How does the chain termination method in Sanger sequencing work

  1. Reaction begins with a primer annealed to a DNA template

  2. DNA polymerase extends the chain by adding standard deoxynucleotides

  3. When an dideoxynucleotiden(lacking the 3’OH group) is added, it prevents the addition of further nucleotides, terminating the chain

  4. Result fragments of various lengths are then separated (usually by capillary electrophoresis) to read the DNA sequence

<ol><li><p>Reaction begins with a primer annealed to a DNA template</p></li><li><p>DNA polymerase extends the chain by adding standard deoxynucleotides</p></li><li><p>When an dideoxynucleotiden(lacking the 3’OH group) is added, it prevents the addition of further nucleotides, terminating the chain</p></li><li><p>Result fragments of various lengths are then separated (usually by capillary electrophoresis) to read the DNA sequence </p></li></ol>
4
New cards

What is illumina sequencing (next-generation sequencing)

  • Involved library preparation where DNA/RNA fragments are lighted with adapters

  • Uses cluster generation on a flow cell, allowing massively parallel sequencing

  • The sequence-by-synthesis process records the incorporation of nucleotides in real time

  • a bioinformatics pipeline converts the raw data (typically stored in FASTQ files) into interpretable sequences.

5
New cards

Outline the main stages of Illumina sequencing and explain why each step is important.

  1. DNA or RNA isolation from human or animal tissues

  2. library preparation

    • fragmentation of DNA/RNA samples

    • fragmented DNA is end-repaired to ensure all fragments have blunt ends

    • An adenosine (A) overhang is added to the 3’ ends to allow sufficient ligation of adaptors

  3. cluster generation

    • illumina adaptors are added to complementary oligonucleotides

    • birdge amplication creates clusters of clonal DNA, which improves signal detection

  4. priming sites

    • barcode (index) sequencing for multiplexing multiple samples

    • P5 + P7 flow sites for immunobilization during sequencing

    • detection via imaging

  5. Bioinformatic analysis

    • raw data are output as FASTQ files containing sequence reads and quality scores

6
New cards

Define bioinformatics

leverages computational tools to manage, analyze, and interpret biological data such as DNA and RNA sequences.

7
New cards

define the role of FastQ file formats in sequencing data analsysi

A standard text-based format that stores both nucleotide sequences and their corresponding quality scores.

<p>A standard text-based format that stores both nucleotide sequences and their corresponding quality scores. </p><p></p>
8
New cards

hat are the key steps in a bioinformatics pipeline for transcriptomics data, and what tools are used at each stage?

  1. Quality control

    • Tool: FastQC

    • purpose: evaluates sequence quality scores (%) of each base and detects adapter contamination

  2. Trimming of adaptor sequencing

    • Tool: Trim Galore

    • purpose: removes adapter sequences (e.g. the first 13 bp of illumina adapters) and low-quality regions

  3. Alignment

    • Tool: Star aligner

    • purpose: maps reads to a reference genome, providing mapping quality and splice junction information

  4. Alignment quality assessment

    • tool: RESeQC

    • purpose: generates detailed statistics on the alignment (e.g. total records, duplicate reads, unmapped reads)

  5. feature counting:

    • tool: featurecounts

    • purpose: determines the number of reads mapping to specific genes or features

  6. differential expression analysis

    • tool: DESeq2

    • purpose: compares expression levels between different sample groups (e.g. cancerous vs. healthy tissue) to identify altered genes

<ol><li><p>Quality control</p><ul><li><p>Tool: FastQC</p></li><li><p>purpose: evaluates sequence quality scores (%) of each base and detects adapter contamination</p></li></ul></li><li><p>Trimming of adaptor sequencing </p><ul><li><p>Tool: Trim Galore</p></li><li><p>purpose: removes adapter sequences (e.g. the first 13 bp of illumina adapters) and low-quality regions</p></li></ul></li><li><p>Alignment </p><ul><li><p>Tool: Star aligner</p></li><li><p>purpose: maps reads to a reference genome, providing mapping quality and splice junction information</p></li></ul></li><li><p>Alignment quality assessment</p><ul><li><p>tool: RESeQC</p></li><li><p>purpose: generates detailed statistics on the alignment (e.g. total records, duplicate reads, unmapped reads)</p></li></ul></li><li><p>feature counting:</p><ul><li><p>tool: featurecounts</p></li><li><p>purpose: determines the number of reads mapping to specific genes or features</p></li></ul></li><li><p>differential expression analysis</p><ul><li><p>tool: DESeq2</p></li><li><p>purpose: compares expression levels between different sample groups (e.g. cancerous vs. healthy tissue) to identify altered genes </p><p></p></li></ul></li></ol><p></p>
9
New cards

Why is quality control essential in sequencing,

  • Ensures the raw sequencing data is accurate and reliable.

  • Identifies issues such as adapter contamination, low base quality scores, and biases which can adversely affect downstream analysis

10
New cards

what specific metrics does FastQC provide?

  • Quality Score Distribution: Assesses the overall quality of bases across reads.

  • Base Composition: Evaluates the percentage of each nucleotide at every position.

  • Adapter Content: Detects the presence of non-biological adapter sequences in the data.

  • Read Length Distribution: Verifies consistency in the length of sequencing reads.

11
New cards

Explain the purpose and importance of alignment in the transcriptomics pipeline using STAR.

Purpose of Alignment:

  • To map each sequencing read to its corresponding location in a reference genome.

  • Provides the basis for quantifying gene expression and identifying splice junctions.

mportance:

  • Accurate alignment is critical for ensuring that the subsequent counts (via FeatureCounts) are reliable.

  • It enables the detection of anomalies such as non-unique mappings (multiple mapping) and helps to assess overall data quality using alignment quality tools like RESeQC.

12
New cards

What is differential gene expression analysis, and how does DESeq2 facilitate this process?

  • Differential Gene Expression Analysis:

    • A statistical method used to determine differences in gene expression levels between sample groups (e.g., diseased vs. healthy).

  • DESeq2:

    • Performs normalization of count data to account for sequencing depth and technical variation.

    • Estimates variance and uses a robust statistical model to determine which genes are significantly upregulated or downregulated.

    • Provides visualizations such as volcano plots

13
New cards

How can the integration of genomics and transcriptomics data advance pharmacogenomics and personalized medicine?

Integration Strategy:

  • Combining genomic data (e.g., mutations, copy number variations) with transcriptomic data (e.g., gene expression profiles) provides a comprehensive view of an individual's molecular landscape.

Pharmacogenomics Impact:

  • Enables prediction of drug efficacy based on a patient’s specific omic profile.

  • Helps in identifying biomarkers for drug response or resistance.

14
New cards

Discuss a case study or literature example that leverages next-generation sequencing for pharmacogenomics or personalized medicine

creation of a living biobank of breast cancer organoids.

  • Breast cancer organoids are derived from patient tumors and maintained ex vivo.

  • Next-generation sequencing is used to profile both genomic mutations and transcriptomic expression patterns.

  • The data helps capture disease heterogeneity, providing insights into which pharmacological agents may be most effective.