D426 Data Management Foundations 1: Introduction to Databases

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/35

flashcard set

Earn XP

Description and Tags

Terms and concepts from WGU course D426: Data Management and Foundations. This set is for chapter 1 of the textbook.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

36 Terms

1
New cards

data

numeric, textual, visual, or audio information that describes real-world systems

2
New cards

scope

the amount of data produced and collected

3
New cards

format

how data is produced. i.e. numbers, text, image, audio, or video

4
New cards

access

whether a data source is private, publicly available, or a combination

5
New cards

analog

data encoded as continuous variations on various physical media.

Ex: audio recorded as vibrations impressed on vinyl disks

6
New cards

digital

data encoded as zeros and ones on electronic and magnetic media

7
New cards

Examples of public data sets

  1. Data.gov provides more than 250,000 U.S. government data sets to support research and application development

  2. Kaggle.com, owned by Google, supports an online community that allows users to find and publish data sets

  3. data.nasa.gov provides numerous data sets in categories such as aerospace, earth science, and space science

  4. cancer.gov/research collects and reports data and information relative to all forms of cancer

  5. opendata.cityofnewyork.us contains data collected by the New York City government to support continuous monitoring and improvements to NYC and residents’ health

8
New cards

database

a collection of data in a structured format, ensuring that similar data is stored in a standardized manner

9
New cards

database management system

DBMS; software that reads and writes data in a database, ensuring that data is secure, internally consistent, and available at all times

10
New cards

query

a request to retrieve or change data in a database

11
New cards

query language

a specialized programming language, designed specifically for database systems that can read and write data efficiently and differs significantly from general-purpose languages such as Python, Java, and C++

12
New cards

database application

software that helps business users interact with database systems

13
New cards

information management system

a software application that manages corporate data for a specific business function, which usually includes a database system as well as other components, such as a user interface, business logic, and interfaces to other systems

14
New cards

database administrator

the person who enforces procedures for user access and database system availability

15
New cards

database designer

determines the format of each data element and the overall database structure. Responsibilities include:

  • storage

  • response time

  • support for rules that govern data

16
New cards

database programmer

develops computer programs that utilize a database, combining query languages and general purpose languages

17
New cards

database user

a consumer of data in a database. Requests, updates, or uses stored data to generate reports or information. Usually accesses the database via applications but can also submit queries directly to the database.

18
New cards

database performance

how quickly a database responds to queries, affected by how the data is structured on storage media and how queries are processed

19
New cards

Requirements for large, complex databases with many users

  • Performance

  • authorization

  • security

  • rules

  • recovery

20
New cards

database authorization

many database users should have limited access to specific tables, columns, or rows of a database, so systems control which individual users can access specific data

21
New cards

database security

how systems ensure users only access data they are authorized to,a s well as protecting against hackers by encrypting data and restricting access

22
New cards

database rules

how systems ensure that data is consistent with structural and business rules

ex. when multiple copies of data are stored in different locations, copies must be synchronized as data is updated

ex. when a course # appears in a student registration record, the course must exist in the course catalog

23
New cards

database recovery

how systems recoup from system failures and restore the database to a consistent state without loss of data

24
New cards

transaction

a group of queries that must be either completed or rejected as a whole as execution of some, but not all, of the queries results in inconsistent or incorrect data

25
New cards

When processing transactions, database systems must:

  1. ensure transactions are processed completely or not at all

  2. prevent conflicts between concurrent transactions

  3. ensure transaction results are never lost

26
New cards

database system architecture

the internal components f a database system and the relationships between them

27
New cards

query processor

interprets queries, creates a plan to modify the database or retrieve data, and returns query results to the application

28
New cards

storage manager

translates the query processor instructions into low-level file system commands that modify or retrieve data using indexes to quickly locate data

29
New cards

transaction manager

ensures transactions are properly executed

30
New cards

log

a file containing a complete record of all inserts, updates, and deletes that the database has processed, which the transaction manager writes records to before applying changes and uses to restore the database in case of transaction or system failure

31
New cards

catalog/data dictionary

a directory of tables, columns, indexes, and other database objects that other components use to process and execute queries

32
New cards

metadata

data about the database, such as column names and the number of rows in each table, which is stored in the catalog

33
New cards

relational database

stores data in tables, columns, and rows, similar to a spreadsheet, with all the data in a column having the same format and all data in a row representing a single object, such as a person, place, product, or activity

34
New cards

SQL

“Structured Query Language;” a query language supported by all relational database systems that includes statements to read and write data, create and delete tables, and administer the database system

35
New cards

NoSQL

non-relational database systems optimized for big data, which became common since 2000 when the growth of the internet generated massive volumes of online data, often with poorly structured or missing information

36
New cards

Leading database products

  1. Oracle Database

  2. MySQL

  3. SQL Server

  4. PosgreSQL

  5. MongoDB (NoSQL)