Lecture 18: Genomics and Sequencing

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/47

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

48 Terms

1
New cards

genome

The _______ is all the genetic information that defines an organism.

2
New cards

Various processes

________ are encoded by the genome. Ex. Energy and nutrient sources (catabolism), growth products (biosynthesis), factors that promote fitness (ecology), molecular adaptions (evolution).

3
New cards

Genomics

science of mapping, sequencing, analyzing and comparing genomes between species.

4
New cards

Haemophilus influenzae

The first bacterial genome that was sequenced was ________________ in 1955.

5
New cards

Open reading frame (ORF)

A region of DNA that begins with the start codon and ends with a stop codon.

6
New cards

more complex, greater

Generally the _________ an organism the ______ the genome size.

7
New cards

Sequence, Assemble, Annotate

Genome of an organism can be derived using the following steps

8
New cards

1. Sequence

Determining the order of the bases within DNA

9
New cards

2. Assemble

Piece together different parts of the genome in the correct order.

10
New cards

3. Annotate

Convert this raw data into a list of genes whose function may or may not be known.

11
New cards

Bioinformatics

is the process of using a computer to store and analyze the sequences of structures of DNA and proteins

12
New cards

functions

Once sequenced a genome can provide insight into how an organism ________.

13
New cards

DNA synthesis

sanger sequencing designed by Fred Sanger this technique uses __________ to determine the sequence of a region of DNA>

14
New cards

replicating, amplifying

Sanger sequencing involves ________ and ________ a strand of DNA using DNA polymerase. This is done via a similar method to PCR.

15
New cards

terminated, fluorescently labelled, ddNTPs (dideoxyribonucleotides)

At random points DNA replication is ___________, leaving a _____________ residue at the end. This is done using fluorescently labeled _____________ of all four bases in the reaction.

16
New cards

specific, sequence

Each base has a __________ fluorescent label of a different color. This allows us to determine the ________ of the DNA.

17
New cards

Dideoxyribose-NTPs (ddNTPs)

can be integrated into the DNA and differ from dNTPs in two ways

18
New cards

They lack a 3' OH, terminal nucleotides

This means there is no way to extend the chain past the ddNTP, as further monomers cannot be added. They are referred to as _______________.

19
New cards

random, very single

This integration is _______, with millions of DNA strands made during sequencing it is inevitable that a ddNTP will be present in ___________ residue

20
New cards

base

Each ddNTP has a different fluorescent which enables us to determine which terminal ________ is present at the each of each strand of DNA created.

21
New cards

DNA replication mixture (Step 1 of Sanger sequencing)

Fluorescently labeled ddNTPS and regular dNTPs are added to a ____________, with the target strand of interest and a known primer.

22
New cards

replication stops, ddNTP, every position (Step 2 of Sanger sequencing)

DNA synthesis continues until a ddNTP is integrated. At this point ___________. There will be a ______ integrated at _________ of the DNA.

23
New cards

capillary electrophoresis (Step 3 of Sanger sequencing)

The fragments are separated by size using ____________, the shorter the fragment the faster it will migrate down the column.

24
New cards

assembled (Step 4 of Sanger Sequencing)

Automated sequencer reads outputs, inputting the corresponding base, DNA sequence can be __________.

25
New cards

smallest, further

Remember, the _________ fragment will always migrate ________.

26
New cards

molecular sieve

Smaller fragments migrate further because the column acts as a ____________ therefore it takes larger samples longer to get through the pores.

27
New cards

smaller, earlier, reverse complementary

The ____ the fragment the ______ it comes in sequence. The sequence __________ to the sequence we determine as synthesis is 5' to 3'.

28
New cards

primer, known sequence

Sanger sequencing requires the use of a _______. This must bind to a __________.

29
New cards

800, known, large-scale

There are several limitations to Sanger sequencing. Limited to around _______ nucleotide per reaction. Can only be sequence from _________ sequence. Cannot be used in _________ genome sequencing.

30
New cards

fragment the DNA, numerous reactions

To sequence an entire genome or a larger region of DNA you would have to first _________ into multiple, smaller pieces. ______________ would be required to completely sequence the genome.

31
New cards

assembled

Once a genome has been sequenced it must be ________ before it can be analyzed.

32
New cards

correct order, overlaps

Assembly entails putting the fragments in the _______ to reconstruct the chromosome and eliminating any _________.

33
New cards

looks for overlaps

To assemble a genome the computer analyzes the sequences produced by next-generation sequencing and _____________.

34
New cards

jigsaw puzzle, consensus genome

Like a _________ the overlaps indicate that pieces should fit next to one another. This produces a ___________.

35
New cards

annotated

Once the genome has been assembled it can be _________.

36
New cards

Gene annotation

the identification of genes and other functional regions within the DNA

37
New cards

open-reading frames, start codon.

The annotation relies on finding __________, which begin at a _____________.

38
New cards

protein sequence, stop codon

Once a start codon is found in annotation the software will look for a potential ________ ending at a _________.

39
New cards

ribosome binding site

The software will then determine whether there is a canonical ______________. (for annotation)

40
New cards

valid or not

The software will predict whether the found ORF is _________ based on a database (annotation).

41
New cards

translated

If an ORF is determined to be valid the DNA sequence is ____________. (annotation)

42
New cards

protein functional domains

The protein sequence is compared to that of known __________________. (annotation)

43
New cards

share a functional domain, common, similar

Proteins which ______________ often have a _________ evolutionary ancestor that may share a ___________ role. (annotation)

44
New cards

BLAST (basic local alignment search tool)

Many bioinformatic tools are available to help annotate a protein correctly, such as GenBank's ________.

45
New cards

protein sequence

When attempting to determine the function of a ORF the _________ is used instead of the DNA sequence.

46
New cards

redundancy, identical

We use the protein sequence instead of the DNA sequence when determining the ORF because codons have _________, several codons encode the same amino acid. Therefore some genes may not have a similar sequence, but protein sequence may be __________.

47
New cards

different

Different organisms have ________ codon usage.

48
New cards

horizontally transferred

The difference in codon frequency usage can indicate which genes have been _____________. (ex. An E. coli gene which encodes purely AGG arginine codons may have come from another bacteria.)