L4: Genome assembly and annotation

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/20

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

21 Terms

1
New cards

How to get to complete chromosomes

long reads shotgun sequencing

assembly

scaffolding

2
New cards

scaffold

a larger DNA sequence formed by linking together multiple contigs

3
New cards

scaffolding requires -

scaffolding does not add - to your assembly, it just - the pieces you have

longer range information

missing sequences, orders

4
New cards

main scaffolding approaches

  • reference-based

  • genetic maps

  • optical maps

  • chromatin conformation capture

5
New cards

chromatin organization in nucleus

fractal globule: DNA is highly organized in chromosomal regions and topologically associated domains (TADs)

6
New cards

Hi-C: chromatin conformation capture

interacting regions are joined and sequenced (with short reads)

more interactions = physical proximity

7
New cards
<p>Long-range Hi-C interaction can - contigs</p>

Long-range Hi-C interaction can - contigs

connect/order

pixel intensity in the matrix indicates how often a pair of genomic positions interact

8
New cards

scaffolding builds - into -

contigs, chromosomes

9
New cards

how to QC contiguity in scaffolds?

scaffold N50: the length of the shortest scaffold for which longer and equal length scaffolds cover at least 50% of the assembly

10
New cards
<p>Hi-C is the high throughput version of -</p><p>to study the link between - and -</p><p>A compartment?</p><p>B compartment?</p><p>TADs</p>

Hi-C is the high throughput version of -

to study the link between - and -

A compartment?

B compartment?

TADs

chromatin conformation capture

chromatin conformation, genome function

A compartment = transcriptionally active

B compartment = transcriptionally inactive

TADs are genes and enhancers

11
New cards

alleles and genetic variants on the same copy of a chromosome will be - together

inherited

12
New cards

phasing allows for -

phasing: haplotype-aware genome assembler can distinguish which reads belong to the - in - individuals

two chromosome complements are assembled - (haplotypes)

separation of homologous chromosomes

same copy of a chromosome, heterozygous

independently

13
New cards

trio binning

short read data from parental individuals are used to separate long reads from different haplotypes

<p>short read data from parental individuals are used to separate long reads from different haplotypes</p>
14
New cards

inbred diploid individuals have - copies of each chromosome, so theres no need for - when assembling genomes

identical, phasing

15
New cards

three complementary approaches to gene annotation

  • ab initio

  • homology-based

  • experimental

16
New cards

ab initio gene annotation

coding genes have distinctive features

bioinformatic software can scan the genome for possible genes

method of predicting genes in a genome by analyzing its intrinsic properties, without using external evidence like protein or RNA sequences

17
New cards

homology-based gene annotation

takes advantage of sequence conservation across species to determine if a gene is “real”

more powerful between closely related species

18
New cards

experimental support for gene annotation

uses available data to identify possible genes such as

  • gene expression data

  • protein sequence data

  • methylation pattern

19
New cards

tandem or interspersed repeats

sequences of various length occurring in multiple copies throughout the genome

20
New cards

transposons

mobile genetic elements, can move or copy themselves across the genomea

21
New cards

automated gene annotation

bioinformatic pipelines combine different lines of evidence

can require extensive manual curation

Explore top flashcards