BIOINFORMATICS

0.0(0)

Studied by 1 person

Call with Kai

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/134

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

135 Terms

New cards

BIOINFORMATICS

is a combination of information, technology, and molecular biology

New cards

BIOINFORMATICS

It is being used largely in the field of human genome research

New cards

BIOINFORMATICS

is also used to store and organize the different discovery in the sequence which will be stored in the software

New cards

BIOINFORMATICS

It can also used in understanding diseases, new molecular targets, drug discovery, etc

New cards

BIOINFORMATICS

The study of how information is represented and transmitted in biological systems, starting at the molecular level

New cards

BIOINFORMATICS

is the merger of biology with information technology

New cards

COMPUTATIONAL BIOLOGY

Bioinformatics dedicated specifically to handling sequence information is a form of ____?

New cards

BIOINFORMATICS

also used to store and organize large amount of data into databases such as those used in clinical sequence analysis

New cards

BIOINFORMATICS

used due to vast amount of data arising from the sequence discovery.

New cards

BIOINFORMATICS

the science of computer technology and developing computer databases to facilitate biological research.

New cards

Standard expression of sequence data

is important for the clear communication and organized storage of sequence data

New cards

Interpretation of sequence variants
Used in epidemiology to speciate organisms or to find homologies within or between species
Identification of new sequences
Useful for test and primer design

Uses of Sequence Information:

New cards

Pneumocystis jirovecii or Pneumocystis carinii

was first thought to be a protozoan that is present in the sputum, but it doesn’t align with the protozoan sequence; it matches with the sequence of a fungi

New cards

NCBI

Commonly used database

New cards

SEQUENCE INFORMATION

includes the principles, practical aspects, and structural analysis

New cards

Polymorphic or heterozygous sequences

are written as consensus sequences with proportional representation of the polymorphic bases

New cards

International Union of Pure and Applied Chemistry and the International Union of Biochemistry and Molecular Biology (IUB)

have assigned a universal nomenclature for mixed, degenerate, or wobble bases

New cards

Consensus Sequences

if there is a mutation in the heterogeneous sequences, there may be more than 1 base or mix bases at the same position in the sequence.

New cards

A, G

symbol: R

bases: ??

Mnemonic: PURINE

New cards

C, T

symbol: Y

bases: ??

Mnemonic: PYRAMIDINE

New cards

G, T

symbol: K

bases: ??

Mnemonic: KETO

New cards

A, C

symbol: M

bases: ??

Mnemonic: AMINO

New cards

C, G

symbol: S

bases: ??

Mnemonic: 3 H BONDS

New cards

A, T

symbol: W

bases: ??

Mnemonic: 2 H BONDS

New cards

A, C, T

symbol: H

bases: ??

Mnemonic: NOT G

New cards

C, G, T

symbol: B

bases: ??

Mnemonic: NOT A

New cards

A, C, G

symbol: V

bases: ??

Mnemonic: NOT T

New cards

A, G, T

symbol: D

bases: ??

Mnemonic: NOT C

New cards

A, C, G, T

symbol: N

bases: ??

Mnemonic: ANY

New cards

UNKNOWN

symbol: X, ?

bases: ??

Mnemonic: A or C or G or T

New cards

DELETION

symbol: O, -

bases: ??

New cards

Basic Local Alignment Search Tool

BLAST

New cards

GENE SEQUENCE

FASTA format =

New cards

ARRANGED

GenBank =

New cards

Basic Local Alignment Search Tool

System used for homology searches

New cards

Basic Local Alignment Search Tool

searches GenBank in National Center for Biotechnology Information (NCBI)

New cards

Basic Local Alignment Search Tool

Useful in epidemiology too. You can also confirm bacteria with the same genus through their DNA sequence.

New cards

Basic Local Alignment Search Tool

uses GenBank which is also a database for all DNA sequences that were discovered.

New cards

Basic Local Alignment Search Tool

is a tool used to aligned 2 sequences

New cards

Basic Local Alignment Search Tool

Comparing gene and protein sequences against others in public databases

New cards

Basic Local Alignment Search Tool

is a set of sequence comparison algorithms used to search databases for optimal local alignments to a query

New cards

Basic Local Alignment Search Tool

It breaks the query and databases sequences into fragments and seeks matches between them

New cards

Basic Local Alignment Search Tool

is a computer algorithm that is available for use online at the National Center for Biotechnology Information (NCBI) website and many other sites

New cards

Local Alignment

finding similarities on a specific region of a DNA.

New cards

Global Alignment

finding similarities from one end to another end, whether they are matching or mismatching.

New cards

Basic Local Alignment Search Tool

is the most widely used program in the Bioinformatics

New cards

FASTA, GENBANK FORMAT

Input sequences in either of these 2 formats

New cards

HTML, plain text, and XML formatting

BLAST output can be delivered in a variety of formats. These formats include ___?

New cards

Expect value (E)

is a parameter that describes the number of hits one can "expect" to see by chance when searching a database of a particular size

New cards

OUTPUT

shows all the records matching the query

Most of the time, it is in HTML format

New cards

mismatching

the higher the background noise, the higher the _____ sequence

New cards

matching

The lower the E, the lower the background noise, the higher the ___ sequence?

New cards

match

E value = 10-12 = ?

New cards

Nucleotide BLAST

sequences of the DNA

New cards

Protein BLAST

sequences of the amino acids (sequences of the amino acids were also made from the information of the DNA)

New cards

High Scoring Segment Pair (HSP)

local alignment used for aligning 2 DNA without a graph

New cards

High Scoring Segment Pair (HSP)

We have match, mismatch, and a gap – all of these have a score.

New cards

match

= +2

New cards

mismatch

= -2

New cards

gap

= 0

New cards

HSP

The higher the ___, the higher the amount of match.

New cards

EMBL

GenBank
DDBJ (DNA Data Bank of Japan)

PRIMARY BIOLOGICAL DATABASE OF NUCLEIC ACID?

New cards

PIR
MIPS
SWISS-PROT
TrEMBL
NRL-3D

PRIMARY BIOLOGICAL DATABASE OF PROTEIN?

New cards

PRIMARY BIOLOGICAL DATABASE

Also known as Archival Database

New cards

GenBank

best for nucleic acid, you can also find protein sequences here.

New cards

FASTA

stands for fast-all” or “FastA”

New cards

FASTA

It was developed by W.R. Pearson and Lipman and this algorithm can be accessed from EBI site

New cards

FASTA

It was the first database similarity search tool developed, preceding the development of BLAST

New cards

FASTA

The alignment in diagonals is then refined

New cards

FASTA

Finds regions of similarity by first breaking the sequence into short subsequences, then searching for diagonals with highest density of words that match

New cards

FASTA

Its fast but is not guaranteed to find the best alignment, it may miss matches

New cards

FASTA

Its fast but is not guaranteed to find the best alignment, it may miss matches

New cards

FASTA

gives better results for nucleotide sequences than protein

New cards

FastP

is for protein sequences

New cards

FASTX and FASTY

compares DNA query to a protein database.

New cards

TFASTA

compares a protein query to a DNA database.

New cards

FASTA format

is a text-based format that represents either the nucleotide sequence or the protein sequence in which that bases or base pairs are represented using a single letter code.

New cards

FASTA

can be used for both Local and Global Alignment

New cards

FASTA, BLASTA

to infer relationship between sequences,

to identify members of the gene families
as a searching tool for the matching sequences

New cards

FASTA GRAPH

simple technique. You just have to find similarities, mismatching, gap, by scoring and tracing back to find the local similarities (or even global similarities).

New cards

LOCAL ALIGNMENT

write only the parts of the DNA sequence that are similar or matching.

New cards

GLOBAL ALIGNMENT

write both matching and mismatching from end to end of the DNA sequence.

New cards

GENBANK FILE FORMAT

Genetic sequence database sponsored by NIH in USA

New cards

PubMed

searching tool for journals

New cards

SWISS-PROT FILE FORMAT

Protein database sponsored by Medical Research Group of UK (Europe)

New cards

Basic Local Alignment Search Tool (BLAST)
Gene Recognition and Assembly Internet Link (GRAIL)
FAST-All derived from FAST-P (protein)
FAST-N (nucleotide) search algorithms (FASTA)
Phred
Polyphred
Phragment Assembly Program (Phrap)
The Institute for Genomic Research (TIGR Assembler)
Factura (Factura)
SeqScape (SeqScape)
Assign
Matchmaker

SOFTWARE PROGRAMS USED TO ANALYZE AND APPLY SEQUENCE DATA

New cards

Basic Local Alignment Search Tool

Compares an input sequence with all sequences in a selected database

New cards

Gene Recognition and Assembly Internet Link (GRAIL):

Finds gene-coding regions in DNA sequences

New cards

FAST-All derived from FAST-P (protein) and FAST-N (nucleotide) search algorithms (FASTA)

Rapid alignment of pairs of sequences by sequence patterns rather than individual nucleotides

New cards

Phred

Reads bases from original trace data and recalls the bases, assigning quality values to each base

New cards

Polyphred

Identifies single nucleotide polymorphisms (SNPs) among the traces and assigns a rank indicating how well the trace at a site matches the expected pattern for an SNP

New cards

Phragment Assembly Program (Phrap)

Uses user supplied and internally computed data quality information to improve accuracy of assembly in the presence of repeats

New cards

The Institute for Genomic Research (TIGR Assembler)

Assembly tool developed by TIGR to build a consensus sequence from smaller-sequence fragments

New cards

Factura

Identifies sequence features such as flanking vector sequences, restriction sites, and ambiguities.

New cards

SeqScape

Mutation and SNP detection and analysis, pathogen subtyping, allele identification, and sequence confirmation