Bioinformatics: Data Management

22 Terms

1
What is a database?

A repository of information with a specific structure that enables entering and extracting data.

2
How is a database structured?

In tables or files, each containing numerous records whose data are distributed across fields.
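
As a minimal sketch of this structure, a table can be pictured as a list of records that all share the same named fields. The field names and values below are made up for illustration and do not come from any real database.

```python
# A hypothetical "table": each dict is one record, each key is a field.
sequences = [
    {"accession": "SEQ001", "organism": "Homo sapiens", "length": 1200},
    {"accession": "SEQ002", "organism": "Mus musculus", "length": 950},
]

# Extracting data: read one field from every record.
lengths = [record["length"] for record in sequences]

# Entering data: append a new record with the same fields.
sequences.append({"accession": "SEQ003", "organism": "Danio rerio", "length": 700})
```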

3
What are the different database types?

  • Flat-file → GenBank

    • The simplest form

    • Large single text files or collection of different text files

  • Relational → PFAM

  • Object-Oriented

  • Data warehouses

  • Distributed

4
Which type of database is specifically used for distributing raw data and can be read by many different programs?

Flat-file databases
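
Because a flat file is plain text, it can be read with ordinary string handling and no special software. The sketch below assumes a heavily simplified, made-up record layout in the style of a GenBank file (keyword lines, records ending in `//`). Real GenBank records also have continuation lines and feature tables, which this does not handle; `parse_records` is a hypothetical helper.

```python
# Simplified, made-up records in the style of a GenBank flat file.
FLAT_FILE = """\
LOCUS       DEMO0001     12 bp    DNA
DEFINITION  Hypothetical example sequence.
ACCESSION   DEMO0001
//
LOCUS       DEMO0002     9 bp    DNA
DEFINITION  Another hypothetical example.
ACCESSION   DEMO0002
//
"""

def parse_records(text):
    """Split the text on '//' terminators and read 'KEYWORD value' lines."""
    records = []
    current = {}
    for line in text.splitlines():
        if line.startswith("//"):
            # End of one record: store it and start a new one.
            records.append(current)
            current = {}
        elif line.strip():
            keyword, _, value = line.partition(" ")
            current[keyword] = value.strip()
    return records

records = parse_records(FLAT_FILE)
```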

5
What is a key disadvantage of complex database structures (like relational databases) compared to flat-files?

They often depend on specific, and often expensive, software to be read and analyzed.

6
What does indexing mean?

In bioinformatics, indexing is the process of creating a separate, optimized data structure (an index) that allows for extremely fast searching and retrieval of specific information from a large, primary dataset.

7
What is the primary purpose of creating an index for a large biological database like GenBank?

To enable fast searching and retrieval of specific records without scanning the entire database.
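
One way to picture an index is as a lookup table from an identifier to the position of its record in the primary file. The sketch below is an illustration under assumptions, not how GenBank itself is indexed: the record layout is made up, and `build_index` and `fetch` are hypothetical helpers.

```python
import io

# Made-up flat-file content held in memory; a real file opened in "rb" mode
# would behave the same way.
data = io.BytesIO(
    b"ACCESSION   DEMO0001\nSEQ acgtacgt\n//\n"
    b"ACCESSION   DEMO0002\nSEQ ttgaccaa\n//\n"
)

def build_index(handle):
    """Map each accession to the byte offset where its record starts."""
    index = {}
    record_start = handle.tell()
    line = handle.readline()
    while line:
        if line.startswith(b"ACCESSION"):
            index[line.split()[1].decode()] = record_start
        elif line.startswith(b"//"):
            # The next record begins right after the terminator line.
            record_start = handle.tell()
        line = handle.readline()
    return index

def fetch(handle, index, accession):
    """Jump straight to one record instead of scanning from the top."""
    handle.seek(index[accession])
    lines = []
    for line in iter(handle.readline, b""):
        if line.startswith(b"//"):
            break
        lines.append(line.decode().rstrip("\n"))
    return lines

index = build_index(data)
record = fetch(data, index, "DEMO0002")
```

The index is built once with a full scan; after that, each lookup costs one dictionary access and one seek.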

8
What is eXtensible Markup Language (XML)?

A general-purpose format for storing data and information in plain text files. Data are enclosed in identifiers called tags.
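
A small sketch of what tagged data looks like and how it can be read, using Python's standard `xml.etree.ElementTree` module. The document and its tag names are made up for illustration and do not follow any real bioinformatics schema.

```python
import xml.etree.ElementTree as ET

# Made-up XML document; the tag names are illustrative only.
xml_text = """
<proteins>
  <protein id="P1">
    <name>Example kinase</name>
    <length>350</length>
  </protein>
</proteins>
"""

root = ET.fromstring(xml_text)
protein = root.find("protein")

# Each value is retrieved by the name of the tag that encloses it.
name = protein.find("name").text
length = int(protein.find("length").text)
protein_id = protein.get("id")  # attributes are stored on the tag itself
```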

9
HTML is a type of what, and what does it allow you to do?

It is a markup language related to XML (not strictly a type of XML; the XML-conformant variant is XHTML), and it is used to create web pages.

10
What is a significant advantage of XML databases that makes them a serious alternative to flat-file and relational databases?

Their great flexibility in designing specialized ways to classify data.

Data extraction is similar to relational databases, using a query language (XQuery).

Possible disadvantage: lower complexity.
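
To give a feel for query-style extraction from XML, the sketch below uses the limited XPath subset supported by Python's `xml.etree.ElementTree`. This is a stand-in, not XQuery itself, and the document and tag names are made up.

```python
import xml.etree.ElementTree as ET

# Made-up XML data with an attribute used as a filter criterion.
xml_text = """
<proteins>
  <protein family="kinase"><name>Alpha</name></protein>
  <protein family="protease"><name>Beta</name></protein>
  <protein family="kinase"><name>Gamma</name></protein>
</proteins>
"""

root = ET.fromstring(xml_text)

# Select only the <protein> elements whose family attribute is "kinase".
kinases = [
    p.find("name").text
    for p in root.findall(".//protein[@family='kinase']")
]
```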

11
In a relational database, what is the term for a unique field used to relate one table to another?

A key (or primary key). The value of a key must be unique for each record.
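
The uniqueness rule can be seen in a small sketch using Python's built-in `sqlite3` module. The `family` table and its contents are hypothetical; SQLite is used here only because it needs no setup.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.execute("CREATE TABLE family (family_id TEXT PRIMARY KEY, description TEXT)")
conn.execute("INSERT INTO family VALUES ('FAM1', 'Example domain family')")

# A second record with the same key value violates the primary key.
try:
    conn.execute("INSERT INTO family VALUES ('FAM1', 'Duplicate key')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

row_count = conn.execute("SELECT COUNT(*) FROM family").fetchone()[0]
```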

12
How can data from different tables be combined?

Through queries, which allow the extraction of useful data.
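
A minimal sketch of such a query, again with Python's `sqlite3` module. The two tables and their rows are made up; the shared `family_id` field is what lets the query combine them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE family (family_id TEXT PRIMARY KEY, description TEXT);
CREATE TABLE protein (
    protein_id TEXT PRIMARY KEY,
    family_id  TEXT REFERENCES family(family_id)
);
INSERT INTO family VALUES ('FAM1', 'Kinase domain'), ('FAM2', 'Zinc finger');
INSERT INTO protein VALUES ('P1', 'FAM1'), ('P2', 'FAM2'), ('P3', 'FAM1');
""")

# Combine the two tables through the key they share.
rows = conn.execute("""
    SELECT protein.protein_id, family.description
    FROM protein
    JOIN family ON protein.family_id = family.family_id
    ORDER BY protein.protein_id
""").fetchall()
```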

13
What is the main advantage of a relational database over a flat-file database?

The ability to easily extract and combine related data from multiple tables, plus greater consistency of the data.

14
Are relational databases built on general or specific programming languages?

Specific

15
Which language is specifically designed for managing and querying data in a relational database?

SQL (Structured Query Language)

16
What are the different operators in relational databases?

  • Mathematical (+,-,*,/)

  • Data handling (queries)

  • SQL
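
As a rough illustration, mathematical operators can be used directly inside an SQL query, both to compute a new value and to filter on it. The `gene` table and its coordinates are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE gene (name TEXT PRIMARY KEY, start_pos INTEGER, end_pos INTEGER)"
)
conn.executemany(
    "INSERT INTO gene VALUES (?, ?, ?)",
    [("geneA", 100, 400), ("geneB", 50, 1250)],
)

# Arithmetic computes each gene's length; the WHERE clause filters on it.
rows = conn.execute(
    "SELECT name, end_pos - start_pos + 1 AS length FROM gene "
    "WHERE end_pos - start_pos + 1 > 500"
).fetchall()
```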

17
What is PFAM?

A relational database that includes a collection of protein domain families.

18
The process of gathering, cleaning and modeling data to reveal meaningful insights is known as:

Data Analysis

19
What is the scientific field that uses insights from data analysis to support decision-making processes?

Data Analytics

20
In the context of biological databases, why is database maintenance critically important?

To ensure data quality and have confidence in the accuracy of the extracted data.

21
True or false: Flat-file databases are the most efficient structure for complex queries across multiple, related data types.

False: Relational databases are better suited for combining data from multiple tables with complex queries.

22
Which database format uses a plain text file with a structured but simple layout, as seen in a GenBank record?

A flat-file database.