Biological Databases & Websites Overview

Description and Tags

A collection of flashcards focusing on key terms and definitions related to biological databases and bioinformatics.

41 Terms

FASTA format

A text-based format for representing nucleotide or peptide sequences, in which each sequence is preceded by a single header line beginning with ">".
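
As a rough illustration of the header-then-sequence layout, the sketch below reads FASTA text into a dictionary. It assumes headers start with ">" and that sequence lines may be wrapped; the function name `parse_fasta` and the example records are made up for this note and are not real database entries.

```python
def parse_fasta(text):
    """Return a dict mapping each header (without '>') to its sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Sequence lines may be wrapped, so join the pieces of each record.
    return {h: "".join(parts) for h, parts in records.items()}


# Made-up example records for illustration only.
example = """>seq1 hypothetical example
ACGTACGT
ACGT
>seq2 another example
MKVL
"""
parsed = parse_fasta(example)
print(parsed)
```

For real work, an established parser (for example the one in Biopython) is usually a safer choice than a hand-written one.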

FASTQ format

A file format for storing both a biological sequence and its corresponding quality scores, typically used in sequencing data.
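
A minimal sketch of reading FASTQ, assuming the common unwrapped layout of four lines per record: an "@" identifier line, the sequence, a "+" separator line, and a quality string of the same length as the sequence. The function name `parse_fastq` and the reads are invented for illustration.

```python
def parse_fastq(text):
    """Return a list of (identifier, sequence, quality) tuples."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if len(lines) % 4 != 0:
        raise ValueError("FASTQ input must contain four lines per record")
    records = []
    for i in range(0, len(lines), 4):
        header, seq, plus, qual = lines[i:i + 4]
        record_no = i // 4 + 1
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError(f"malformed record number {record_no}")
        if len(seq) != len(qual):
            raise ValueError(f"sequence/quality length mismatch in record {record_no}")
        records.append((header[1:], seq, qual))
    return records


# Two made-up reads; the quality strings are arbitrary examples.
example = "@read1\nACGT\n+\nIIII\n@read2\nGGCA\n+\n!5?I\n"
reads = parse_fastq(example)
print(reads)
```

Reading in fixed groups of four lines matters because a quality string may itself begin with "@", so the "@" character alone cannot mark the start of a record.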

NCBI (National Center for Biotechnology Information)

A division of the US National Library of Medicine, part of the National Institutes of Health, that provides access to biomedical and genomic information.

GenBank

A primary database of nucleotide sequences maintained by NCBI.

XML (Extensible Markup Language)

A markup language that defines rules for encoding documents in a format that is both human-readable and machine-readable.

Entrez

NCBI’s search and retrieval system that provides a unified interface to multiple databases.

RefSeq

A curated and non-redundant database of reference sequences for genomes, transcripts, and proteins.
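
RefSeq accessions carry a two-letter prefix that indicates the molecule type (for example NM_ for mRNA and NP_ for protein). The sketch below splits an accession into prefix, number, and optional version. The prefix table covers only some common prefixes, the helper name `describe_refseq` is hypothetical, and the accession in the example is a placeholder showing the format, not a claim about a real record.

```python
import re

# A partial table of common RefSeq prefixes (not exhaustive).
REFSEQ_PREFIXES = {
    "NC": "complete genomic molecule",
    "NG": "genomic region",
    "NM": "mRNA",
    "NR": "non-coding RNA",
    "NP": "protein",
    "XM": "predicted mRNA",
    "XR": "predicted non-coding RNA",
    "XP": "predicted protein",
}

# Two letters, underscore, digits, optional ".version".
ACCESSION_RE = re.compile(r"^([A-Z]{2})_(\d+)(?:\.(\d+))?$")


def describe_refseq(accession):
    """Return a dict describing the accession, or None if it is not recognized."""
    match = ACCESSION_RE.match(accession)
    if match is None:
        return None
    prefix, number, version = match.groups()
    molecule = REFSEQ_PREFIXES.get(prefix)
    if molecule is None:
        return None
    return {
        "prefix": prefix,
        "number": number,
        "version": int(version) if version else None,
        "molecule": molecule,
    }


# Placeholder accession used only to show the format.
info = describe_refseq("NM_123456.1")
print(info)
```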

Genome browser

A web-based tool that visualizes genomic data along with annotations and features.

Metadata

Data that provides information about other data, often used to describe the characteristics of biological sequences.

ASCII files

Plain-text files encoded in ASCII that are readable by humans; many sequence data files are stored this way.

SNP (Single Nucleotide Polymorphism)

A variation at a single position in a DNA sequence among individuals.
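
To make "a variation at a single position" concrete, the sketch below compares two already-aligned sequences of equal length and lists the positions where they differ. This is only a toy comparison: real SNP calling works from many reads and many individuals, and the function name and sequences here are invented.

```python
def find_single_base_differences(reference, sample):
    """Return (1-based position, reference base, sample base) for each mismatch."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i + 1, ref_base, sample_base)
        for i, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


# Made-up sequences that differ at one position.
differences = find_single_base_differences("ACGTTAGC", "ACGTCAGC")
print(differences)
```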

Public repositories

Databases that hold biological data accessible to the public, such as those hosted by NCBI, EBI, and DDBJ.

Biological Process (Gene Ontology)

A larger process accomplished by multiple molecular activities.

Cellular Component (Gene Ontology)

Locations relative to cellular structures where functions are performed.

Molecular Function (Gene Ontology)

Activities that occur at the molecular level, such as enzyme activity or binding.

Gene Ontology (GO)

A collaborative project that aims to standardize the representation of gene and gene product attributes.

Biological Sequence Databases

Databases that store and provide access to biological sequences.

Unix Server

A server running a Unix or Unix-like operating system, commonly used to host and process large-scale bioinformatics data.

Programming utilities

Software tools provided by database hosts to facilitate bulk data access.
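
One well-known example of such tools is NCBI's Entrez Programming Utilities (E-utilities), which are accessed through URLs. The sketch below only builds an ESearch request URL and does not send it; the helper name `esearch_url` and the search term are illustrative assumptions, and NCBI's usage guidelines (rate limits, API keys) should be checked before making real requests.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def esearch_url(db, term, retmax=20):
    """Build an ESearch URL for the given database and query term."""
    params = {"db": db, "term": term, "retmax": retmax, "retmode": "json"}
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


# Example query term chosen for illustration.
url = esearch_url("nucleotide", "BRCA1[Gene Name] AND human[Organism]", retmax=5)
print(url)
```

Building the URL with `urlencode` keeps spaces and brackets in the query term properly escaped.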

Quality values in FASTQ

Per-base Phred quality scores, each encoded as a single ASCII character, that represent the estimated probability that a base was called incorrectly.
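
A short sketch of how the encoding can be decoded, assuming the Phred+33 convention (the quality score Q is the character's ASCII code minus 33, and the error probability is 10^(-Q/10)). Some older data used an offset of 64 instead, so the `offset` parameter is left adjustable; the function names are made up for this example.

```python
def phred_scores(quality, offset=33):
    """Convert a FASTQ quality string to a list of Phred scores."""
    return [ord(ch) - offset for ch in quality]


def error_probabilities(quality, offset=33):
    """Convert a FASTQ quality string to per-base error probabilities."""
    return [10 ** (-q / 10) for q in phred_scores(quality, offset)]


# Arbitrary example quality string.
scores = phred_scores("!5?I")
probs = error_probabilities("!5?I")
print(scores)  # [0, 20, 30, 40]
print(probs)
```

A score of 20 corresponds to a 1 in 100 chance of error, 30 to 1 in 1,000, and 40 to 1 in 10,000.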

BioProject

An NCBI database that organizes data associated with a biological research project.

BLAST (Basic Local Alignment Search Tool)

A tool that finds regions of local similarity between biological sequences by comparing a query sequence against a sequence database.

Genome Workbench

An integrated software suite for studying and analyzing genetic data.

MarkerDB

A database consolidating information on clinical and pre-clinical biomarkers.

DrugBank

A comprehensive resource for drug discovery that integrates various biomedical data.

Genome Data Viewer

A tool provided by NCBI to visualize and explore genomic data.

Proteomics

The large-scale study of proteins, particularly their functions and structures.

Taxonomy

The science of classifying and naming organisms.

Gene ID

A unique identifier assigned to a gene for reference in various databases.

Entrez Help Manual

A guide that provides instructions on using NCBI's Entrez system.

Clinical significance

The importance of a specific genetic variant in terms of its influence on health.

SRA (Sequence Read Archive)

A database that stores raw sequencing data and related information.

EBI (European Bioinformatics Institute)

An organization that provides freely available data and services in bioinformatics.

DDBJ (DNA Data Bank of Japan)

A database that collects and disseminates nucleotide sequence data.

PubMed

A free search engine accessing primarily the MEDLINE database of references and abstracts on life sciences and biomedical topics.

Clinical trials

Research studies performed in human participants to evaluate a medical, surgical, or behavioral intervention.

Unique identifier (SeqID)

A distinct code used to label biological sequences in databases.

Common languages in bioinformatics

Typically include programming languages such as Python and R for data analysis.

Open reading frame (ORF)

A stretch of DNA that begins with a start codon and runs, in the same reading frame, to a stop codon, and so could be translated to give a protein.
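
A simplified sketch of an ORF scan, assuming the standard start codon ATG and stop codons TAA, TAG, and TGA, and searching only the three forward-strand reading frames. Real ORF finders also scan the reverse complement and usually apply a larger minimum length; the function name `find_orfs` and the test sequence are invented for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=1):
    """Return (start, end, sequence) for ORFs on the forward strand.

    start and end are 0-based, end-exclusive indices; the stop codon is
    included in the returned sequence. min_codons counts codons before
    the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, dna[start:i + 3]))
                start = None  # look for the next ORF in this frame
    return orfs


# Made-up sequence containing one short ORF.
orfs = find_orfs("CCATGAAATTTTAGGG")
print(orfs)  # [(2, 14, 'ATGAAATTTTAG')]
```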

Accession number

A unique identifier assigned to a biological sequence record in a database.

Submission Portal (NCBI)

An online entry point for submitting data to various NCBI databases.