1/80
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
|---|
No study sessions yet.
What is genome sequencing?
The determination of the complete DNA sequence of an organism's genome.
What insights does genome sequencing provide?
It provides insights on genetic basis of evolutionary relationships, diseases, and genes, including coding and non-coding regions.
What are the three generations of sequencing?
First-generation, second-generation, and third-generation sequencing.
What was the significance of Rosalind Franklin's work in 1952?
She photographed X-ray diffraction of DNA, providing crystallographic data crucial for understanding DNA structure.
Who solved the three-dimensional structure of DNA?
James Watson and Francis Crick in 1953.
What is first-generation sequencing?
Sanger Sequencing, which uses a chain termination method and is considered the 'gold standard' for accuracy.
Who developed Sanger Sequencing?
Frederick Sanger in 1977.
What are dNTPs?
Deoxyribonucleoside triphosphates, the building blocks for DNA replication.
What are ddNTPs?
Dideoxyribonucleoside triphosphates, which act as chain-terminating inhibitors in Sanger Sequencing.
What is the role of DNA polymerase in Sanger Sequencing?
It synthesizes new DNA strands by adding nucleotides to a growing chain.
What is the main challenge associated with genome assembly?
Dealing with repetitive sequences and accurately aligning reads to reconstruct the genome.
Why is annotation important in bioinformatics?
It helps identify the locations of genes and other features in the genome, providing functional insights.
What does the term 'coding regions' refer to?
Parts of the genome that are translated into proteins.
What does the term 'non-coding regions' refer to?
Parts of the genome that do not code for proteins but may have regulatory or other functions.
What is the historical significance of the year 1953 in genetics?
It marks the year Watson and Crick published their model of the DNA double helix.
What is the primary method used in Sanger Sequencing?
Chain termination method using dideoxynucleotides.
What is the purpose of primers in Sanger Sequencing?
Primers are short sequences that initiate DNA synthesis during the sequencing process.
What is the difference between dNTPs and ddNTPs?
dNTPs have one less oxygen than ribose, while ddNTPs have two less, preventing further elongation of the DNA strand.
What is the relevance of sequencing in understanding diseases?
Sequencing can reveal genetic mutations associated with diseases, aiding in diagnosis and treatment.
What does 'assembly' refer to in genome sequencing?
The process of piecing together short DNA sequences to form a complete genome.
What are the main challenges in second-generation sequencing?
Higher error rates and difficulties in assembling short reads into longer contiguous sequences.
What advancements characterize third-generation sequencing?
The ability to sequence longer DNA fragments in real-time, improving assembly and accuracy.
What is the main mechanism of Automated Sanger Sequencing?
It uses four ddNTPs labeled with different fluorescent tags.
What is the difference between manual and automated Sanger sequencing?
Manual uses radioisotopes and polyacrylamide gel slabs, while automated uses capillary electrophoresis and dye-labeled ddNTPs.
What is the Human Genome Project?
An ambitious research effort to decipher the entire human genetic code, published in 2001 and finalized in 2003.
What are the basic steps of second-generation sequencing?
What are the common features of second-generation sequencing?
Highly parallel, microscale reactions, fast results, and low-cost genome sequencing.
What is 454 GS20?
The first NGS technology developed by Roche, allowing massive parallel sequencing.
What are the advantages of Illumina Sequencing?
Allows high-throughput sequencing at reduced costs and produces shorter reads.
What is the significance of Illumina Sequencing in NGS data generation?
It accounts for about 80% of all NGS data generated.
What is the process of bridge amplification in Illumina Sequencing?
Template DNA makes U-shaped loops attached to the surface, generating dense clusters of DNA.
What is PacBio SMRT sequencing?
A third-generation sequencing method that does not require amplification of template DNA.
What is Nanopore Sequencing?
A sequencing technology that uses ionic current signals to read DNA, producing longer reads.
What is a major challenge after sequencing?
Ensuring high quality of the assembled and annotated genomic sequence.
What is the primary goal of the Human Genome Project?
To identify genes associated with rare and common diseases and examine ethical implications.
What are the characteristics of second-generation sequencing?
It is fast, low-cost, and allows for high-throughput sequencing.
What are the disadvantages of 454 GS20 technology?
It is prone to errors, especially in indels and homopolymer regions.
What is the purpose of adapter ligation in library preparation?
To prepare fragmented DNA for sequencing.
What is the difference between single-end and paired-end sequencing?
Single-end sequences from one end, while paired-end sequences from both ends of the DNA fragment.
What is the role of DNA polymerase in Illumina Sequencing?
To synthesize new DNA strands during the sequencing process.
What does 'base calling' refer to in sequencing?
The process of determining the identity of the first base in a sequencing reaction.
What is the significance of massively increased throughput in sequencing?
It allows for parallelization of many reactions, enhancing efficiency.
What is the purpose of sequencing primer in second-generation sequencing?
To initiate the addition of bases to the template DNA.
What are the key differences between first, second, and third-generation sequencing?
First generation is manual, second generation is high-throughput and cost-effective, and third generation allows real-time sequencing without amplification.
What are the implications of genetic technologies examined by the Human Genome Project?
Ethical, legal, and social implications related to genetics.
What is the output of Illumina Sequencing?
Shorter reads that can be sequenced from one or both ends.
What is the primary aim of genome assembly?
To create a genome assembly with the longest possible sequences (least fragmented) and the smallest number of mis-assemblies.
What does the phrase 'garbage in, garbage out' imply in bioinformatics?
The quality of output is determined by the quality of the input data.
Why is quality control important in genome assembly?
To avoid erroneous downstream applications and conclusions.
What is the FASTQ file format?
A format that contains 4 lines per read: Read Name, Sequence, Plus sign, and Quality Scores.
What does the Phred Quality Score (Q Score) measure?
The probability of a correct base call in sequencing data.
What is FastQC?
A commonly used tool for read quality assessment that can be run from both web-based and command line interfaces.
What is adapter trimming in genome assembly?
The removal of adapter sequences from the ends of DNA fragments to ensure only the actual target DNA is analyzed.
What is low-quality end trimming?
The removal of poor-quality base calls at the ends of reads to ensure only high-quality bases are present.
What is genome assembly?
A computational process of deciphering the genetic material within the cell of an organism using numerous short sequences called reads.
What are the two main types of genome assembly?
Reference assembly and de novo assembly.
What is a reference genome?
A representative example of a set of chromosomes for a species, ideally produced from the DNA of one member of that species.
What challenges exist in genome assembly?
Repetitive regions can cause gaps, rearrangements, and inaccurate repetitions in the assembly.
What are short tandem repeats?
Repetitive sequences in the genome that consist of short sequences repeated in tandem, such as ATATATATA.
What are long interspersed nuclear elements (LINES)?
Repetitive sequences in the genome that are approximately 7000 base pairs long.
What is scaffolding in genome assembly?
The process of stitching assembled contigs together based on information from paired short reads.
What does N50 measure in genome assembly?
The length of the smallest contig such that the sum of contig lengths covers 50% of the total size of contigs.
What is the purpose of gap filling in genome assembly?
To fill in gaps using actual sequences to improve the continuity of the assembly.
What tools are commonly used for adapter trimming?
PrinSEQ and Trimmomatic.
What is the significance of misassemblies in genome assembly?
Misassemblies need to be corrected before scaffolding to ensure accurate genome representation.
What is the role of Quast in genome assembly?
A tool used to compare metrics between different genome assemblies.
What is the difference between short read and long read assembly?
Short read assembly uses shorter sequences for assembly, while long read assembly uses longer sequences, which can provide more context.
What is genome annotation?
The process of deriving structural and functional information of a protein or gene from raw data using various analysis techniques.
What are the two main components of genome annotation?
(a) Identifying elements on the genome (gene prediction) and (b) attaching biological information to these elements.
What are the three categories of genome annotation?
What does structural annotation involve?
Attaching biological meaning to genome sequences by analyzing their sequence structure and composition.
What is the output of structural annotation?
Gene maps and location of elements.
What does functional annotation assign?
Biologically relevant information to predicted polypeptides and the features they derive from, such as genes and mRNA.
What are the outputs of functional annotation?
Biological processes, cellular components, and molecular functions.
What factors should be considered in genome assembly?
What is the General Feature Format (GFF)?
Often the output of a genome annotation, used to submit data to databases for improved availability and findability.
What is the relevance of genome annotation in bioinformatics?
It translates raw genetic data into understandable biological information from physical and genetic maps.
What are the three sequencing technologies?
What are the challenges associated with genome assembly?
Presence of repeating sequences, short reads, sequencing errors, and computational requirements.
Why is quality control important prior to bioinformatic analyses?
To ensure the accuracy and reliability of the data being analyzed.
What are the two types of genome annotation?
Structural annotation and functional annotation.