Week 6: Databases and Information Management

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 63

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

64 Terms

1

Database

collection of data organized within a computing system to efficiently serve multiple applications by centralizing information and reducing redundancy

New cards
2

Structured Data

pre-defined data model, easy to search, and most mature applications exist

New cards
3

Unstructured Data

no pre-defined model, harder to search, applications are relatively recent and still developing

New cards
4

Semi-Structured

tag-driven structure that identifies data elements and their hierarchy, but does not follow the formal structure of a data model

New cards
5

Entity

person, place, thing, or event on which we store and maintain information

New cards
6

Attribute

each characteristic or quality describing a particular entity

New cards
7

Bit

represents the smallest unit of data a computer can handle

New cards
8

Byte

represents a single character, which can be a letter, number, or another symbol

New cards
9

Field

a grouping of characters into a group of words, or a complete number, such as person’s name or age

New cards
10

Record

a group of related fields

New cards
11

File

a group of records of same type

New cards
12

Database Management System (DBMS)

software that permits an organization to centralize data, manage them efficiently, and provide access to the stored data by application programs

New cards
13

Relationship

association between entities

New cards
14

Data Redundancy

the presence of duplicate data in multiple data files

New cards
15

Data Inconsistency

the presence of different values for same attribute when the same data are stored in multiple locations

New cards
16

Program-Data Dependence

the close relationship between data stored in files and the software programs that update and maintain those files. Any change in data organization or format requires a change in all the programs associated with those files

New cards
17

Requirement Analysis

based on business processes, what do users need? what should database do?

New cards
18

Conceptual (Logical) Design

high-level description (often E/R models)

New cards
19

Schema Refinement

consistency, normalization

New cards
20

Relational Database (SQL)

built around a single concept for modelling data: relations or tables of related data, represent data as two-dimensional tables

New cards
21

Records (tuples)

collections of different entities

New cards
22

Primary Key

used to identify rows (tuples) in the relation, to establish connections to other relations, and for storage purposes (most important key in a table)

New cards
23

Foreign Key

essentially a lookup field to look up data in another table using primary key

New cards
24

Select

creates a subset consisting of all records in the file that meet stated criteria

New cards
25

Join

combines relational tables to provide the user with more information than is available in individual tables

New cards
26

Project

creates a subset consisting of columns in a table, permitting the user to create new tables that contain only the information required

New cards
27

Data Definition (DBMS)

capability to specify the structure and the content of the DB.

New cards
28

Data Dictionary

stores definition of data elements and their characteristics

New cards
29

Data Manipulation

used to add, change, delete, retrieve, and querying and reporting the data in the DB

New cards
30

Data Manipulation Language

a language associated with a database management system that end users and programmers use to manipulate data in the database

New cards
31

Query

request for data from a database

New cards
32

Normalization

process of creating small, stable, yet flexible and adaptive data structures & substructures from complex groups of data

New cards
33

Referential Integrity

rules to ensure that relationships between coupled database tables remain consistent

New cards
34

Entity-Relationship Diagram

methodology for documenting databases illustrating the relationship between various entities in the database

New cards
35

Non-Relational Database Management System (NoSQL)

DMS for working with large quantities of structured and unstructured data that would be difficult to analyze with a relational model

New cards
36

Composite

both entity and relationship

New cards
37

One-To-One

single valued in both directions

New cards
38

One-To-Many

one entity has multivalued relationship with another (but not the reverse)

New cards
39

Many-To-Many

multivalued in both directions

New cards
40

Blockchain

distributed database technology that enables firms and organizations to create and verify transactions on a network nearly instantaneously without a central authority

New cards
41

Cloud Database

type of database service built and accessed through a cloud computing platform

New cards
42

Big Data

data sets with volumes so huge that they are beyond the ability of typical relational DBMS to capture, store, and analyze. The data are often unstructured or semistructured

New cards
43

Data Warehouse

a database, with reporting and query tools, that stores current and historical data extracted from various operational systems and consolidated for management reporting and analysis

New cards
44

Data Mart

a small data warehouse containing only a portion of the organization’s data for a specified function or population of users

New cards
45

Hadoop

open source software framework managed by the Apache Software Foundation that enables distributed parallel processing of huge amounts of data across inexpensive computers

New cards
46

In-Memory Computing

technology for very rapid analysis and processing of large quantities of data by storing the data in the computer’s main memory rather than in secondary storage

New cards
47

Analytic Platforms

preconfigured hardware-software system that is specifically designed for high-speed analysis of large datasets

New cards
48

Data Lake

repository for raw unstructured data or structured data that for the most part has not yet been analyzed, and the data can be accessed in many ways

New cards
49

Online Analytical Processing (OLAP)

Capability for manipulating and analyzing large volumes of data from multiple perspectives

New cards
50

Data Mining

analysis of large pools of data to find patterns and rules that can be used to guide decision making and predict future behaviour

New cards
51

Association

occurrences linked to a single event

New cards
52

Sequences

events are linked over time

New cards
53

Classification

recognizes patterns that describe the group to which an item belongs by examining existing items that have been classified and by inferring a set of rules

New cards
54

Clustering

no groups have yet been defined, a data mining tool can discover different groupings within data, such as finding affinity groups for bank cards or partitioning a database into groups of customers based on demographics and types of personal investments

New cards
55

Forecasting

series of existing values to _______ what other values will be

New cards
56

Text Mining

discovery of patterns and relationships from large sets of unstructured data

New cards
57

Sentiment Analysis

mining text comments in an email message, blog, social media conversation, or survey form to detect favourable and unfavourable opinions about specific subjects

New cards
58

Web Mining

discovery and analysis of useful patterns and information from the World Wide Web

New cards
59

Database Server

a computer in a client/server environment that is responsible for running a DBMS to process SQL statements and perform database management tasks

New cards
60

Data Quality Audit

structured survey of the accuracy and level of completeness of the data in an information system

New cards
61

Data Cleansing (data scrubbing)

consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant

New cards
62

Data Governance

encompasses policies and procedures through which data can be managed as an organizational resource

New cards
63

Data Administration

responsible for the specific policies and procedures through which data can be managed as an organizational resource

New cards
64

Database Administration

responsible for defining and organizing the structure and content of the database, and maintaining the DB

New cards
robot