COMPSCI 260 Unit 1 Midterm Practice Flashcards

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/49

flashcard set

Earn XP

Description and Tags

Vocabulary-style flashcards covering biological domains, genomics, sequence analysis, algorithmic complexity, and dynamic programming based on the COMPSCI 260 Unit 1 Midterm.

Last updated 6:32 AM on 4/29/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

50 Terms

1
New cards

Three Domains of Life

The three primary classifications of biological organisms which, in alphabetical order, are Archaea, Bacteria, and Eukarya.

2
New cards

Human Genome Size

Approximately 3imes1093 imes 10^9 base pairs in scientific notation.

3
New cards

SARS-CoV-2 Genome Size

Approximately 3imes1043 imes 10^4 nucleotides in scientific notation.

4
New cards

Transcription

The biological process where DNA is transformed into mRNA, primarily involving the agent RNA polymerase.

5
New cards

Translation

The biological process where mRNA is transformed into proteins, primarily involving the agent known as the ribosome.

6
New cards

Replication

The biological process where DNA is used to create more DNA, primarily involving the agent DNA polymerase.

7
New cards

Stop Codon

A specific nucleotide triplet that signals the molecular machinery to cease the growth of a polypeptide chain.

8
New cards

ORF-finder

A computational procedure used to exhaustively scan the genome of a simple organism for subsequences that might correspond to protein-coding genes.

9
New cards

False Positive (ORF-finder)

An output where the code identifies a subsequence as a gene when it is not actually a protein-coding gene.

10
New cards

False Negative (ORF-finder)

A real gene that is excluded by the ORF-finder code due to specific filtering parameters.

11
New cards

Reads

The resulting sequences produced when fragments of DNA from an organism are processed by a sequencing instrument.

12
New cards

Read Mapping

The process of aligning reads to an existing complete reference genome by finding exact substring matches.

13
New cards

Genome Assembly

The algorithmic task of stitching DNA reads together into longer sequences to determine an organism's genome sequence.

14
New cards

Contigs

Longer sequences of consecutive nucleotides output by an assembly algorithm after identifying and stitching overlapping reads.

15
New cards

Variable X (Sequencing)

The total length of all the reads that are sequenced by a sequencing instrument.

16
New cards

Variable Y (Assembly)

The total length of all the contigs that are output by a genome assembly algorithm.

17
New cards

Y vs X Relationship

In genome assembly, the value of Y is less than the value of X because the algorithm stitches overlapping reads together.

18
New cards

Genbank

A gene database that contains genetic sequences from a wide variety of organisms.

19
New cards

DNA Base Pairing

The principle where specific nucleotides hybridize to form double-stranded DNA; for example, G complements C and A complements T.

20
New cards

5’ Overhang

A structural feature of a DNA fragment where one strand extends beyond the other at the 5’ end, often referred to as a sticky end.

21
New cards

Recursive Function

A function that solves a problem by calling itself with smaller instances of the same problem.

22
New cards

Asymptotic Running Time

The measurement of how an algorithm's execution time grows as the input size increases toward infinity.

23
New cards

Bogosort

A sorting algorithm which is generally the asymptotically slowest among common sorting methods.

24
New cards

Merge Sort

A sorting algorithm known for being asymptotically faster than Bubble, Insertion, or Selection sort.

25
New cards

Recognition Site

A specific sequence of DNA that a restriction enzyme recognizes and binds to for the purpose of cutting the DNA.

26
New cards

Restriction Enzyme Symbol: r

In a recognition site, this lowercase letter represents that the enzyme accepts either A or G at that position.

27
New cards

Restriction Enzyme Symbol: y

In a recognition site, this lowercase letter represents that the enzyme accepts either C or T at that position.

28
New cards

Restriction Enzyme Symbol: w

In a recognition site, this lowercase letter represents that the enzyme accepts either A or T at that position.

29
New cards

Restriction Enzyme Symbol: s

In a recognition site, this lowercase letter represents that the enzyme accepts either C or G at that position.

30
New cards

\Theta(n \log n)$^

The asymptotic running time for efficient sorting algorithms like Merge Sort.

31
New cards

Θ(n2)\Theta(n^2)

The asymptotic running time for algorithms like Bubble Sort, Insertion Sort, and Selection Sort in their average or worst cases.

32
New cards

Θ(n3)\Theta(n^3)

A cubic asymptotic running time; if the problem size increases by a factor of 10, the running time increases by a factor of 1000.

33
New cards

Θ(2n)\Theta(2^n)

An exponential asymptotic running time where the time required doubles with each incremental increase in problem size.

34
New cards

Elementary Multiplication Operations

The basic numeric operations required to compute the product of matrices, where the number of operations depends on the dimensions of the matrices.

35
New cards

Fibonacci Numbers

An infinite series of integers defined by the recurrence relation fi=fi1+fi2f_i = f_{i-1} + f_{i-2}, starting with f1=1f_1 = 1 and f2=1f_2 = 1.

36
New cards

Mathematical Induction

A proof method used to prove a claim for all positive integers, consisting of a base case and an inductive step.

37
New cards

Base Case (Induction)

The first component of an induction proof that demonstrates the truth of the claim for the smallest possible value (e.g., n=1n=1).

38
New cards

Inductive Step

The second component of an induction proof that demonstrates if the claim holds for nn, it must also hold for n+1n+1.

39
New cards

Overlapping Subproblems

A scenario in recursive algorithms where the same sub-calculations are performed multiple times, leading to slow performance.

40
New cards

Memoization

A strategy used to speed up recursive functions by storing the results of subproblems in a lookup table to avoid redundant calculations.

41
New cards

Dynamic Programming

An algorithmic technique that solves a problem by building a table of solutions for smaller subproblems before solving larger ones.

42
New cards

DP Table

A table, often designated as V, used in dynamic programming to store the solutions to subproblems parameterized by indices.

43
New cards

Recursive Update Equation

The formula at the heart of a dynamic programming algorithm used to calculate a cell's value from previously computed cells.

44
New cards

Base Cases (Dynamic Programming)

The initial cells in a DP table that must be filled manually before the recursive update equation can be applied to the rest of the table.

45
New cards

TRANSMOGRIFY Function

A recursive function described in the text that splits an input list into thirds (left, middle, right) and processes them.

46
New cards

ZAP Function

A function used within TRANSMOGRIFY that processes three lists and returns a final transmogrified result.

47
New cards

Constant Time (\Theta(1))

An execution time that remains the same regardless of the size of the input data.

48
New cards

Linear Time (\Theta(n))

An execution time that grows in direct proportion to the size of the input data.

49
New cards

Central Dogma

The conceptual framework representing the flow of genetic information from DNA to RNA to Protein.

50
New cards

Pseudo-code

A simplified, high-level description of a computer program's logic that resembles code but is intended for human reading rather than machine execution.