Bioinformatics lecture 6 (kinda important)

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/38

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

39 Terms

1
New cards

What is the exome? ๐Ÿงฌ

The exome is all the exons in the genome, meaning the protein-coding parts of genes.

2
New cards

Why is the exome important? โญ

Although it is only about 1โ€“2% of the genome, around 85% of known disease-causing mutations occur in exons.

3
New cards

Why sequence the exome instead of the whole genome? ๐Ÿ’ฐ

Exome sequencing is cheaper, faster, easier to analyse, and focuses on the most biologically informative regions.

4
New cards

How many genes and exons are in the human exome? ๐Ÿ”ข

About 20,000 genes and roughly 150,000 exons.

5
New cards

What types of problems is exome sequencing especially good for? ๐Ÿง 

Rare Mendelian disease discovery, cancer mutation detection, and clinical diagnostics.

6
New cards

What is a FASTQ file? ๐Ÿ“„

A sequencing file format that stores DNA sequences together with their quality scores.

7
New cards

What are the four lines in a FASTQ entry? 4๏ธโƒฃ

Identifier, DNA sequence, separator (+), and quality scores.

8
New cards

What do quality scores (Phred scores) represent? ๐ŸŽฏ

They show how confident we are that each base is correct.

9
New cards

What does a Phred score of 20 mean? โœ…

It means 99% accuracy, or about 1 error in 100 bases.

10
New cards

What is paired-end sequencing? ๐Ÿ”

A method where both ends of a DNA fragment are sequenced.

11
New cards

Why is paired-end sequencing useful? ๐Ÿงฉ

It improves read mapping, alignment accuracy, and variant detection.

12
New cards

What is the first step in the exome sequencing analysis pipeline? ๐Ÿงญ

Alignment โ€“ mapping sequencing reads to the reference genome.

13
New cards

What is the goal of variant calling? ๐Ÿ”

To identify positions where the sample DNA differs from the reference genome.

14
New cards

What is variant annotation? ๐Ÿท๏ธ

Adding biological meaning to variants, such as which gene they affect and how.

15
New cards

About how many variants are found in a typical exome? ๐Ÿ˜ฎ

Around 23,000 variants per individual.

16
New cards

Why is exome analysis mainly a filtering problem? ๐Ÿงน

Because only one or a few variants cause disease, but thousands are detected.

17
New cards

What information does variant annotation add? ๐Ÿ“š

Gene name, exon location, mutation type, known databases, and predicted functional effects.

18
New cards

What is a SNP? ๐Ÿ”ค

A single-nucleotide polymorphism, meaning a one-base change in DNA.

19
New cards

Do most SNPs cause disease? โŒ

No, most SNPs are harmless and have no effect on health.

20
New cards

Why does exome sequencing focus on coding variants? ๐ŸŽฏ

Because changes in coding regions are easier to interpret and more likely to affect proteins.

21
New cards

What is the main challenge in finding a disease-causing variant? ๐Ÿง 

Finding the one harmful variant among thousands of harmless ones.

22
New cards

What is the key idea behind variant filtering? ๐Ÿ”Ž

Systematically removing unlikely variants to narrow down candidates.

23
New cards

What is the first filtering step for rare disease analysis? ๐Ÿšซ

Remove common variants found in public databases.

24
New cards

Why are common variants usually removed? ๐Ÿ“‰

Rare diseases are unlikely to be caused by variants common in the population.

25
New cards

What types of variants are prioritised during filtering? โญ

Protein-altering variants in conserved regions.

26
New cards

Why are conserved regions important? ๐Ÿงฌ

They are preserved by evolution, so changes there are more likely harmful.

27
New cards

Why compare variants across affected individuals? ๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘งโ€๐Ÿ‘ฆ

True disease-causing variants should be shared among affected people.

28
New cards

What does the โ€œRAREโ€ filtering mnemonic stand for? ๐Ÿง 

Remove common variants, Affect protein, Retained in affected, Evolutionarily conserved.

29
New cards

What is OMIM? ๐Ÿ“˜

Online Mendelian Inheritance in Man, a curated database of Mendelian diseases and their genes.

30
New cards

Why is OMIM useful in exome analysis? ๐Ÿ”—

It links genes to known disease phenotypes, helping prioritise candidates.

31
New cards

What was the key lesson from the Miller syndrome exome study? ๐Ÿงช

Exome sequencing can identify new disease genes by filtering a small number of patients.

32
New cards

What are major limitations of exome sequencing? โš ๏ธ

It misses intronic and regulatory variants and may not explain complex diseases.

33
New cards

Why can exome sequencing fail for some diseases? ๐Ÿงฉ

Some diseases are multifactorial or caused by non-coding variants.

34
New cards

What are incidental findings in genomic medicine? โš ๏ธ

Unexpected variants that may indicate unrelated health risks.

35
New cards

Why do ethical issues arise in exome sequencing? ๐Ÿ›ก๏ธ

Because of privacy concerns, uncertain results, and informed consent.

36
New cards

What is the future direction of genomic medicine? ๐Ÿš€

A move toward whole-genome sequencing and personalised medicine.

37
New cards

What does personalised medicine aim to do? ๐ŸŽฏ

Use genetic information to predict risk, guide treatment, and choose the best drugs.

38
New cards

Why is bioinformatics essential in genomic medicine? ๐Ÿ’ป

It turns raw sequencing data into meaningful diagnoses and clinical decisions.

39
New cards

Explore top flashcards