lectures 3 and 4 - the human genome project

5.0(1)
studied byStudied by 6 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/56

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

57 Terms

1
New cards

Range of sizes of human chromosomes

55×10^6bp to 250×10^6bp

2
New cards

smallest chromosome

Y

3
New cards

what other chromosome is X chromosome approx the same size as?

chromosome 8

4
New cards

what were the 4 main goals of the human genome project?

  • determine the sequence of the 3 billion base pairs

  • identify all human genes and their position

  • attempt to predict all genes’ function

  • use this information to understand disease, develop better medicine and see how humans compare to other species etc

5
New cards

What grant was provided to Francis Collins and team to carry out the HGP?

$200 million/year

6
New cards

Craig Venter

founder of Celera, the private competitor of the publicly funded HGP

7
New cards

how was work divided within the human genome project?

individual labs concentrated on a single chromosome

8
New cards

Why were robotic production lines required for the HGP?

to carry out the large number of PCR tests and sequencing reactions

9
New cards

sharing of information - HGP

The project was publicly funded so information was shared worldwide as it was found out on the internet for anyone to access - including Celera

10
New cards

HGP phase I

produce high res chromosomal maps by positioning genetic markers and creating libraries of BAC clones for sequencing

11
New cards

HGP phase II

sequence each BAC DNA

12
New cards

HGP phase III

assemble all sequences to produce final draft and annotate to identify genes

13
New cards

what is shotgun cloning?

breaking up DNA sequences into lots of small pieces then sequencing each and reassembling the sequence by using computers to look for regions of overlap

14
New cards

integrated maps of human chromosomes

genetic and physical maps were combined to put all BACs in the right order - logged as library, plate, row and column

15
New cards

What had to happen for the final phases of the HGP to occur?

Sequencing technology needed to improve. Other species’ genomes were sequenced in the wait for technology to become advanced enough

16
New cards

Old Sanger sequencing

radioactive 35S labelled DNA fragments were ran on a gel with 4 separate dideoxy reactions taking place (one for each base) then exposed to xray film

17
New cards

New Sanger sequencing

similar to PCR but fluorescently labelled bases act as terminators

18
New cards

HGP phase II and the Sanger method

advances in Sanger sequencing technology were taken advantage of, speeding up the process and reducing cost

19
New cards

how were DNA sequences assembled in phase II?

Computationally into ‘contig’ (set of overlapping DNA segments brought together)

20
New cards

what’s needed for one-fold coverage (phase II)?

approx 3 million separate sequencing reactions producing 1,000 bases each

21
New cards

draft sequence coverage/depth

4 fold coverage

22
New cards

complete/finished sequence coverage/depth

9 fold coverage

23
New cards

disadvantages of the clone-by clone approach (used by publicly funded international project)

very slow and expensive

24
New cards

advantages of the clone-by clone approach (used by publicly funded international project)

very effective at getting over regions of highly repetitive DNA sequences and it’s possible to retrieve clones later

25
New cards

Disadvantages of Celera’s approach

had to rely on public sequence databases and mapping information to assemble the sequence generated and then fill in gaps left by the repeats

26
New cards

Venter/Collins simultaneous publication

Both sides were only just able to produce a readable genome at the big announcement as Celera didn’t have enough sequence of their own and the public sequence was poor quality

27
New cards

White House announcement June 2000

the majority (80%) of the human genome had been sequenced in a working draft. Completion of 99.99% of the genome occurred in July 2003

28
New cards

genes of unknown function

20 years ago a figure was published saying 4061 (23.6%) of our genes’ functions were unclassified. This figure has barely changed since due to changes in public interest and direction of funding

29
New cards

How many people were sequenced during the human genome project?

4 in the public project and 1 at Celera. As the sequences were aligned, variants emerged

30
New cards

people studied in the international HapMap project

4 populations first studied (270 people total from Nigeria, Japan, China and Utah). More recently up to 1301 from across the world

31
New cards

purpose of the international HapMap project

to find the most common SNP variants in the world’s population. Reported locations of 1 million+ human SNPs published in Nature 2005

32
New cards

Moore’s law

describes the electronics field where chips get logarithmically faster and cheaper as time goes on - sequencing technologies are advancing at an even greater rate

33
New cards

Why are big sequencing studies becoming more routine?

Human genome sequencing is becoming very cheap and much faster. What once took 10 years can now take 24 hours

34
New cards

According to the 2022 draft ‘T2T-CHM13’, how many genes do humans have in their DNA, and how many of these are protein coding?

63,494 genes. 19,969 are predicted to code for proteins

35
New cards

What do non-coding genes do?

code for tRNAs, rRNAs, miRNAs and long non-coding RNAs (lncRNAs) involved in transcription

36
New cards

alternative mRNA splicing

humans have a relatively small number of genes. This process allows more protein isoforms to be produced per gene

37
New cards

What pathways are highly conserved across species?

pathways important to life eg ribosome proteins, tRNAs and ABC transporters (pump ions/hormones across membrane)

38
New cards

TBC1D3 function

regulates growth factors and has role in RAS-mediated cancers. Found only in humans

39
New cards

genes specific to humans

850 genes found with human specific features. 50+ entirely human specific genes found

40
New cards

What genes show evidence for fast/recent human evolution?

pathogen response, DNA & protein metabolism, reproduction, neuronal activity, skin pigmentation

41
New cards

How do most HIV strains enter cells?

using CCR5 as the main co-receptor (with CD4). Some people are naturally resistant as they’re homozygous for delta32 mutation in CCR5 gene

42
New cards

HIV has only been around since the 70s/80s - the prevalence of the CCR5 D32 variant would have required millenia - how?

the mutation may have been selected for in ancestral populations against earlier HIV-like epidemics or possibly smallpox

43
New cards

FOXP2

gene involved in human speech and language disorders

44
New cards

FOXP2 in humans and their ancestors

May have been key in human-specific language development - difference of 2 amino acids between chimps and humans/Neanderthals/Denisovans

45
New cards

human specific loss of olfactory (smell) genes

humans have far fewer functional olfactory genes than other species, suggesting that we rely more heavily on other senses eg sight and hearing for survival

46
New cards

linkage disequilibrium blocks

Collection of linked marker alleles on one chromosome over a short distance and tend to remain intact - not disrupted by meiotic recombination

47
New cards

what is the genetic evidence for humans travelling out of Africa to populate the world?

African populations today are far more genetically diverse than the rest of the world as those that left were only a small population - example of a genetic bottleneck

48
New cards

humans as ancestors of Neanderthals and Denisovans

All populations have some form of genetic contribution from archaic ancestors. Approx 2 - 2.5% of a non-African genome derives from the genomes of Neanderthals and Denisovans

<p>All populations have some form of genetic contribution from archaic ancestors. Approx 2 - 2.5% of a non-African genome derives from the genomes of Neanderthals and Denisovans</p>
49
New cards

genetic differences affecting results of drugs (pharmacogenomics/genetics)

  • (in)activation by oxidative pathways

  • conjugation for excretion through kidney

  • target sensitivity

  • toxicity

  • disease mutation type

50
New cards

How could genome sequencing be used for personalised medicine?

to predict those who will respond well or badly (death/severe adverse reaction) and prescribe accordingly

51
New cards

screening pharmacogenomics and warfarin

screening for CYP2C9 and VKORC1 gene variants identifies those who should start on a low warfarin dose to reduce risk of internal bleeding

52
New cards

CYP2C19 variants and antidepressants

Antidepressant escitalopram affects brain serotonin levels and different gene variants affect the serum concentration and treatment success

53
New cards

(PegIFN-a)-2b/2a combined with RBV for Hep C treatment

Treatment really successful for individuals with CC SNP but not as much for TT SNP patients

54
New cards

most common mutation causing cystic fibrosis in Europeans

DF508 - no protein produced

55
New cards

Ivacaftor - CF drug

treats the 5% with the mutation that just damages protein function by helping to refold the protein therefore screening CF patients would find treatable cases. However would cost 200k/year/patient

56
New cards

Drug-Induced Liver Injury (DILI) and Flucloxacillin

SNP marker in complete LD with HLA-B*5701 immune cell surface protein vary significantly associated with DILI risk from antibiotic which affects 8.5/100,000. Screening would prevent prescription to those with the marker

57
New cards