BIOINFORMATICS

0.0(0)
studied byStudied by 1 person
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/134

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

135 Terms

1
New cards

BIOINFORMATICS

is a combination of information, technology, and molecular biology

2
New cards

BIOINFORMATICS

It is being used largely in the field of human genome research

3
New cards

BIOINFORMATICS

is also used to store and organize the different discovery in the sequence which will be stored in the software

4
New cards

BIOINFORMATICS

It can also used in understanding diseases, new molecular targets, drug discovery, etc

5
New cards

BIOINFORMATICS

The study of how information is represented and transmitted in biological systems, starting at the molecular level

6
New cards

BIOINFORMATICS

is the merger of biology with information technology

7
New cards

COMPUTATIONAL BIOLOGY

Bioinformatics dedicated specifically to handling sequence information is a form of ____?

8
New cards

BIOINFORMATICS

also used to store and organize large amount of data into databases such as those used in clinical sequence analysis

9
New cards

BIOINFORMATICS

used due to vast amount of data arising from the sequence discovery.

10
New cards

BIOINFORMATICS

the science of computer technology and developing computer databases to facilitate biological research.

11
New cards

Standard expression of sequence data

is important for the clear communication and organized storage of sequence data

12
New cards
  • Interpretation of sequence variants

  • Used in epidemiology to speciate organisms or to find homologies within or between species

  • Identification of new sequences

  • Useful for test and primer design

Uses of Sequence Information:

13
New cards

Pneumocystis jirovecii or Pneumocystis carinii

was first thought to be a protozoan that is present in the sputum, but it doesn’t align with the protozoan sequence; it matches with the sequence of a fungi

14
New cards

NCBI

Commonly used database

15
New cards

SEQUENCE INFORMATION

includes the principles, practical aspects, and structural analysis

16
New cards

Polymorphic or heterozygous sequences

are written as consensus sequences with proportional representation of the polymorphic bases

17
New cards

International Union of Pure and Applied Chemistry and the International Union of Biochemistry and Molecular Biology (IUB)

have assigned a universal nomenclature for mixed, degenerate, or wobble bases

18
New cards

Consensus Sequences

if there is a mutation in the heterogeneous sequences, there may be more than 1 base or mix bases at the same position in the sequence.

19
New cards

A, G

symbol: R

bases: ??

Mnemonic: PURINE

20
New cards

C, T

symbol: Y

bases: ??

Mnemonic: PYRAMIDINE

21
New cards

G, T

symbol: K

bases: ??

Mnemonic: KETO

22
New cards

A, C

symbol: M

bases: ??

Mnemonic: AMINO

23
New cards

C, G

symbol: S

bases: ??

Mnemonic: 3 H BONDS

24
New cards

A, T

symbol: W

bases: ??

Mnemonic: 2 H BONDS

25
New cards

A, C, T

symbol: H

bases: ??

Mnemonic: NOT G

26
New cards

C, G, T

symbol: B

bases: ??

Mnemonic: NOT A

27
New cards

A, C, G

symbol: V

bases: ??

Mnemonic: NOT T

28
New cards

A, G, T

symbol: D

bases: ??

Mnemonic: NOT C

29
New cards

A, C, G, T

symbol: N

bases: ??

Mnemonic: ANY

30
New cards

UNKNOWN

symbol: X, ?

bases: ??

Mnemonic: A or C or G or T

31
New cards

DELETION

symbol: O, -

bases: ??

32
New cards

Basic Local Alignment Search Tool

BLAST

33
New cards

GENE SEQUENCE

FASTA format =

34
New cards

ARRANGED

GenBank =

35
New cards

Basic Local Alignment Search Tool

System used for homology searches

36
New cards

Basic Local Alignment Search Tool

searches GenBank in National Center for Biotechnology Information (NCBI)

37
New cards

Basic Local Alignment Search Tool

Useful in epidemiology too. You can also confirm bacteria with the same genus through their DNA sequence.

38
New cards

Basic Local Alignment Search Tool

uses GenBank which is also a database for all DNA sequences that were discovered.

39
New cards

Basic Local Alignment Search Tool

is a tool used to aligned 2 sequences

40
New cards

Basic Local Alignment Search Tool

Comparing gene and protein sequences against others in public databases

41
New cards

Basic Local Alignment Search Tool

is a set of sequence comparison algorithms used to search databases for optimal local alignments to a query

42
New cards

Basic Local Alignment Search Tool

It breaks the query and databases sequences into fragments and seeks matches between them

43
New cards

Basic Local Alignment Search Tool

is a computer algorithm that is available for use online at the National Center for Biotechnology Information (NCBI) website and many other sites

44
New cards

Local Alignment

finding similarities on a specific region of a DNA.

45
New cards

Global Alignment

finding similarities from one end to another end, whether they are matching or mismatching.

46
New cards

Basic Local Alignment Search Tool

is the most widely used program in the Bioinformatics

47
New cards

FASTA, GENBANK FORMAT

Input sequences in either of these 2 formats

48
New cards

HTML, plain text, and XML formatting

BLAST output can be delivered in a variety of formats. These formats include ___?

49
New cards

Expect value (E)

is a parameter that describes the number of hits one can "expect" to see by chance when searching a database of a particular size

50
New cards

OUTPUT

shows all the records matching the query

  • Most of the time, it is in HTML format

51
New cards

mismatching

the higher the background noise, the higher the _____ sequence

52
New cards

matching

The lower the E, the lower the background noise, the higher the ___ sequence?

53
New cards

match

E value = 10-12 = ?

54
New cards

Nucleotide BLAST

sequences of the DNA

55
New cards

Protein BLAST

sequences of the amino acids (sequences of the amino acids were also made from the information of the DNA)

56
New cards

High Scoring Segment Pair (HSP)

local alignment used for aligning 2 DNA without a graph

57
New cards

High Scoring Segment Pair (HSP)

We have match, mismatch, and a gap – all of these have a score.

58
New cards

match

= +2

59
New cards

mismatch

= -2

60
New cards

gap

= 0

61
New cards

HSP

The higher the ___, the higher the amount of match.

62
New cards
  • EMBL

  • GenBank

  • DDBJ (DNA Data Bank of Japan)

PRIMARY BIOLOGICAL DATABASE OF NUCLEIC ACID?

63
New cards
  • PIR

  • MIPS

  • SWISS-PROT

  • TrEMBL

  • NRL-3D

PRIMARY BIOLOGICAL DATABASE OF PROTEIN?

64
New cards

PRIMARY BIOLOGICAL DATABASE

Also known as Archival Database

65
New cards

GenBank

best for nucleic acid, you can also find protein sequences here.

66
New cards

FASTA

stands for fast-all” or “FastA”

67
New cards

FASTA

It was developed by W.R. Pearson and Lipman and this algorithm can be accessed from EBI site

68
New cards

FASTA

It was the first database similarity search tool developed, preceding the development of BLAST

69
New cards

FASTA

The alignment in diagonals is then refined

70
New cards

FASTA

Finds regions of similarity by first breaking the sequence into short subsequences, then searching for diagonals with highest density of words that match

71
New cards

FASTA

Its fast but is not guaranteed to find the best alignment, it may miss matches

72
New cards

FASTA

Its fast but is not guaranteed to find the best alignment, it may miss matches

73
New cards

FASTA

gives better results for nucleotide sequences than protein

74
New cards

FastP

is for protein sequences

75
New cards

FASTX and FASTY

compares DNA query to a protein database.

76
New cards

TFASTA

compares a protein query to a DNA database.

77
New cards

FASTA format

is a text-based format that represents either the nucleotide sequence or the protein sequence in which that bases or base pairs are represented using a single letter code.

78
New cards

FASTA

can be used for both Local and Global Alignment

79
New cards

FASTA, BLASTA

  • to infer relationship between sequences,

  • to identify members of the gene families

  • as a searching tool for the matching sequences

80
New cards

FASTA GRAPH

simple technique. You just have to find similarities, mismatching, gap, by scoring and tracing back to find the local similarities (or even global similarities).

81
New cards

LOCAL ALIGNMENT

write only the parts of the DNA sequence that are similar or matching.

82
New cards

GLOBAL ALIGNMENT

write both matching and mismatching from end to end of the DNA sequence.

83
New cards

GENBANK FILE FORMAT

Genetic sequence database sponsored by NIH in USA

84
New cards

PubMed

searching tool for journals

85
New cards

SWISS-PROT FILE FORMAT

Protein database sponsored by Medical Research Group of UK (Europe)

86
New cards
  • Basic Local Alignment Search Tool (BLAST)

  • Gene Recognition and Assembly Internet Link (GRAIL)

  • FAST-All derived from FAST-P (protein)

  • FAST-N (nucleotide) search algorithms (FASTA)

  • Phred

  • Polyphred

  • Phragment Assembly Program (Phrap)

  • The Institute for Genomic Research (TIGR Assembler)

  • Factura (Factura)

  • SeqScape (SeqScape)

  • Assign

  • Matchmaker

SOFTWARE PROGRAMS USED TO ANALYZE AND APPLY SEQUENCE DATA

87
New cards

Basic Local Alignment Search Tool

Compares an input sequence with all sequences in a selected database

88
New cards

Gene Recognition and Assembly Internet Link (GRAIL):

Finds gene-coding regions in DNA sequences

89
New cards

FAST-All derived from FAST-P (protein) and FAST-N (nucleotide) search algorithms (FASTA)

Rapid alignment of pairs of sequences by sequence patterns rather than individual nucleotides

90
New cards

Phred

Reads bases from original trace data and recalls the bases, assigning quality values to each base

91
New cards

Polyphred

Identifies single nucleotide polymorphisms (SNPs) among the traces and assigns a rank indicating how well the trace at a site matches the expected pattern for an SNP

92
New cards

Phragment Assembly Program (Phrap)

Uses user supplied and internally computed data quality information to improve accuracy of assembly in the presence of repeats

93
New cards

The Institute for Genomic Research (TIGR Assembler)

Assembly tool developed by TIGR to build a consensus sequence from smaller-sequence fragments

94
New cards

Factura

Identifies sequence features such as flanking vector sequences, restriction sites, and ambiguities.

95
New cards

SeqScape

Mutation and SNP detection and analysis, pathogen subtyping, allele identification, and sequence confirmation

96
New cards

Assign

Allele identification software for haplotyping

97
New cards

New primer or probe

sequence query the primer or probe sequence to confirm that it belongs to the correct species and is not duplicated in multiple places in a genome

98
New cards

misprimes and off-target products

Primers and probes with multiple potential binding sites will produce ??

99
New cards

query

We have to ___ first the primer to confirm if there is similar DNA or if we can anneal it to a similar DNA.

100
New cards

PRIMER DESIGN

Also used to check the size of the amplicons