How was generating a reference genome sequence done in the HGP:
break genome into large fragments and insert into clones
order the clones
break individual clones into small pieces
generate thousands of sequence reads and assemble sequence of clones
assemble sequences of overlapping clones to create reference sequence
How is generating a person's genome done
break genome into smaller pieces
generate millions of sequence reads
align sequence reads to existing reference sequence
deduce starting sequence and identify differences from reference sequence
What was the main accomplishment of the HGP: It was the first high quality sequence of >90% of the human genome
What was the biggest problem with sequencing a whole genome in the HGP: cost and quality of the data analysis
exon sequencing: targeting 1.5% of the whole genome which is 10x cheaper than whole genome sequencing
What is next gen sequencing: uses nanopores to create fewer reads of a longer length for sequencing both DNA and RNA
what is the challenge of next-gen sequencing: it needs a lot of storage, tens of terabytes are needed