Data Management Foundations- C175

4.0(1)
studied byStudied by 6 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/122

flashcard set

Earn XP

Description and Tags

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

123 Terms

1
New cards
Flat Files
a file having no internal hierarchy
2
New cards
Hashed Files
A file that has been encrypted for security purposes.
3
New cards
Heap File
An unsorted set of records.
4
New cards
Information
The transformation of raw data into useful facts.
5
New cards
Punch Card
A card that is perforated and can hold commands or data.
6
New cards
Structured Data
Information with a high degree of organization.
7
New cards
Unstructured Data
Information that does not have structure (such as text)
8
New cards
Binary Relationship
A relationship between two entity types.
9
New cards
Unary Relationship
An associate occurrence of an entity type with other occurrences of the same entity type.
10
New cards
Cardinality
The maximum number of entities that can be involved in a particular relationship.
11
New cards
E-R Model

*E-R \= Entity - Relationship
Diagram of entities together with their attributes and the relationship among them.
12
New cards
Intersection Data
It is data that describes a many-to-many relationship.
13
New cards
Modality
It is a minimum number of entity occurrences that can be involved in a relationship.
14
New cards
One-to-one Binary Relationship
It means that a single occurrence of one entity type can be associated with a single occurrence of the other entity type and vice versa.
15
New cards
Ternary Relationship
Involves three different entity types.
16
New cards
Unique identifier
It is used to uniquely identify each record in a database table.
17
New cards
Attribute
A property, characteristic, or fact that we know about an entity.
18
New cards
"A salesperson works in one office."

What is the name of this relationship?
One-to-one binary relationship
19
New cards
"A salesperson sells to many customers."

What is the name of this relationship?
One-to-many binary relationship
20
New cards
"A salesperson is authorized to sell many products, and a product can be sold by many salespersons."

What is the name of this relationship?
Many-to-many binary relationship
21
New cards
What is the positioning and meaning for Cardinality and Modality on an E-R model?
Cardinality is the outer symbol; represents the maximum.

Modality is the inner symbol; represents the minimum.
22
New cards
"A salesperson works in a minimum of one and a maximum of one office, and an office may be occupied by or assigned to a minimum of zero and a maximum of one salesperson."
23
New cards
"A salesperson may have no customers or many customers."
24
New cards
Describe the ER model for "Each salesperson is authorized to sell to at least one or many products, and each product can be sold by at least one or many salespeople."
25
New cards
"One salesperson backs-up another salesperson."

What is the name of this model?
One-to-one unary relationship
26
New cards
"A salesperson manages zero to many other salespersons, and a salesperson is managed by exactly one other salesperson."

What is the name of this model?
One-to-many unary relationship
27
New cards
"A product can either be part of no other products or be part of several other products, and a product can either be composed of no other products or be composed of several other products."

What is the name of this model?
Many-to-many unary relationship
28
New cards
What does 'refer' in Referential Integrity imply?
This revolves around the circumstance of trying to refer to data in one relation in the database, based on values in another relation.
29
New cards
Define the delete rule RESTRICT.
If the delete rule between two relations is RESTRICT and an attempt is made to delete a record on the "one side" of the one-to-many relationship, the system will forbid the delete to take place if there are any matching foreign key values in the relation on the "many side".
30
New cards
Define the delete rule CASCADE.
If the delete rule between two relations is CASCADE and an attempt is made to delete a record on the "one side" of the relationship, not only will the record be deleted but all of the records on the "many side" of the relationship that have a matching foreign key value will also be deleted.

In other words, the delete will "cascade" from one relation to the other.
31
New cards
Define the delete rule SET-TO-NULL.
If the delete rule between the two relations is SET-TO-NULL and an attempt is made to delete a record on the "one side" of the one-to-many relationship, that record will be deleted and the matching foreign key values in the records on the "many side" of the relationship will be set to null.

Essentially, it's exactly like the CASCADE delete option, but instead of completely deleting all possible values, the values are set to NULL instead.
32
New cards
Which entity is uniquely identified by concatenating the primary keys of the two entities it connects?
Associative Entity
33
New cards
Which type of entity is also called a dependent entity?
Weak Entity
34
New cards
Candidate Key
This is when a relation has more than one attribute or minimum group of attributes that represents a way of uniquely identifying the entity.
35
New cards
Concurrency Problem
When two or more users are trying to update the same record simultaneously.
36
New cards
Equijoin
Combines two or more tables based on a column that is common to the tables.

Example: Joining Client and Salesman tables that both contain the SalesmanID column which have the exact same values.
37
New cards
Foreign Key
When an attribute or group of attributes serves as the primary key of one relation and also appears in another relation.
38
New cards
Natural Join
Matches each row in a table against each row in another table based on common values found in columns sharing a common name and data type.
39
New cards
Tuple
Rows/records are referred to as tuples when talking about relations. They serve the exact same function, it just has a different name in the context of relations.
40
New cards
What are the five basic principles of The Database Concept?
1. The creation of a datacentric environment that is a significant company resource, which can be shared inside and outside the company.
2. The ability to achieve data integration while storing data in a non-redundant fashion.
3. The ability to store data representing entities involved in multiple relationships w/o introducing data redundancy.
4. Managing data control issues such as data security, backup and recovery, and concurrency control.
5. High degree of data independence.
41
New cards
What are the four major DBMS approaches?
- Hierarchical
- Network
- Relational
- Object-oriented
42
New cards
What are four key differences between a RELATION and a FILE?
- The columns of a relation can be arranged in any order w/o affecting the meaning of the data. That is not true of a file.
- Similarly, the rows of a relation can be arranged in any order, which is not true of a file.
- Every row/column position, sometimes referred to as a "cell", can have only a single value, which is not necessarily true in a file.
- No two rows of a relation are identical, which is not necessarily true in a file.
43
New cards
* in the SELECT clause
- It indicates that all attributes of the selected row are to be retrieved
44
New cards
AND operator
- It displays a record if more than one condition is true
45
New cards
AVG() function
- It returns the average value of a numeric column.
46
New cards
BETWEEN operator
- It allows you to specify a range of numeric values in a search.
47
New cards
DISTINCT operator
- It is used to eliminate duplicate rows in a query result.
48
New cards
IN operator
- It allows you to specify a list of character strings to be included in a search
49
New cards
JOIN clause
- It is used to combine rows from more than one table, based on a common field between them. Sometimes it is done by using the '\=' symbol.
50
New cards
LIKE operator
- It allows you to specify partial character strings in a "wildcard" sense.
51
New cards
OR operator
- It displays a record it either the first condition OR the second condition is true.
52
New cards
ORDER BY clause
- It simply takes the result of a SQL query and orders them by one or more specified attributes.
53
New cards
SELECT command
- Data retrieval in SQL is accomplished with the SELECT command.
54
New cards
Subquery
- When on SELECT statement is "nested" within another in a format, it is known as subquery. This is shown when there is a second SELECT phrase within a set of parenthesis.
55
New cards
Common DDL commands:
- DROP
- ALTER
- RENAME
- CREATE
- TRUNCATE
56
New cards
Common DML commands:
- UPDATE
- DELETE
- INSERT
- MERGE
- SELECT
57
New cards
Write the basic SQL query command:
SELECT
58
New cards
Write the SQL query to "Find the commission percentage and year of hire of salesperson 186":
SELECT COMMPERCT, YEARHIRE
FROM SALESPERSON
WHERE SPNUM\=186;
59
New cards
Write the SQL query to "Retrieve the entire record for salesperson 186":
SELECT *
FROM SALESPERSON
WHERE SPNUM\=186;
60
New cards
Write the SQL query to "List the salesperson numbers and salesperson names of those salespersons who have a commission percentage of 10.":
SELECT SPNUM, SPNAME
FROM SALESPERSON
WHERE COMMPERCT\=10;
61
New cards
Write the SQL query to "List the salesperson numbers, salesperson names, and commission percentages of the salespersons whose commission percentage is less than 12.":
SELECT SPNUM, SPNAME, COMMPERCT
FROM SALESPERSON
WHERE COMMPERCT
62
New cards
Write the SQL query to "List the customer numbers and headquarters cities of all customers that have a customer number of at least 1700":
SELECT CUSTNUM, HQCITY
FROM CUSTOMER
WHERE CUSTNUM\>\=1700;
63
New cards
Write the SQL query to "List the customer numbers, customer names, and headquarters cities of the customers that are headquartered in New York and that have a customer number higher than 1500":
SELECT CUSTNUM, CUSTNAME, HQCITY
FROM CUSTOMER
WHERE HQCITY\='New York'
AND CUSTNUM\>1500;
64
New cards
Write the SQL query to "List the customer numbers, customer names, and headquarters cities of the customers that are headquartered in New York OR that have customer numbers higher than 1500":
SELECT CUSTNUM, CUSTNAME, HQCITY
FROM CUSTOMER
WHERE HQCITY\='New York'
OR CUSTNUM\>1500;
65
New cards
Write the SQL query to "List the customers, customer names, and headquarters cities of the customers that are headquartered in New York or that satisfy the two conditions of having a customer number higher than 1500 and being headquartered in Atlanta":
SELECT CUSTNUM, CUSTNAME, HQCITY
FROM CUSTOMER
WHERE HQCITY\='New York'
OR (CUSTNUM\>1500
AND HQCITY\='Atlanta');
66
New cards
Write the SQL query to "List the customer records for those customers whose names begin with the letter 'A' ":
SELECT *
FROM CUSTOMER
WHERE CUSTNAME LIKE 'A%';
67
New cards
Write the SQL query to "Find the customer numbers, customer names, and headquarters cities of those customers with the customer numbers greater than 1000. List the results in alphabetic order by headquarters cities (and have the customer names within the same city alphabetized)":
SELECT CUSTNUM, CUSTNAME, HQCITY
FROM CUSTOMER
WHERE CUSTNUM\>1000
ORDER BY HQCITY, CUSTNAME;
68
New cards
Write the SQL query to "Find the average quantity of units of the different products that Salesperson 137 has sold":
SELECT AVG(QUANTITY)
FROM SALES
WHERE SPNUM\=137;
69
New cards
Write the SQL query to "Find the total quantity of units of all products that Salesperson 137 has sold":
SELECT SUM(QUANTITY)
FROM SALES
WHERE SPNUM\=137;
70
New cards
Write the SQL query to "Find the name of the salesperson responsible for Customer Number 1525":
SELECT SPNAME
FROM SALESPERSON, CUSTOMER
WHERE SALESPERSON.SPNUM\=CUSTOMER.SPNUM
AND CUSTNUM\=1525;
71
New cards
Write the SQL query to "List the NAMES of the products of which salesperson Adams has sold more than 2000 units":
SELECT PRODNAME
FROM SALESPERSON, PRODUCT, SALES
WHERE SALESPERSON.SPNUM\=SALES.SPNUM
AND SALES.PRODNUM\=PRODUCT.PRODNUM
AND SPNAME\='Adams'
AND QUANTITY\>2000;
72
New cards
CREATE TABLE command
The command that creates base tables and tells the system what attributes will be in them.
73
New cards
CREATE VIEW command
Specifies the base tables on which the view is to be based and the attributes and rows of the table that are to be included in the view.
74
New cards
DELETE command
Specify which row(s) of a table are to be deleted based on data values within those rows.
75
New cards
DROP TABLE command
Discards an entire table from a database.
76
New cards
DROP VIEW command
Discards views.
77
New cards
Normalization
The process of organizing the fields and tables of a relational database to minimize redundancy (duplication) and dependency.
78
New cards
Second Normal Form
All non-key attributes must be functionally dependent on the entire key of that table.
79
New cards
Third Normal Form
Non-key attributes are not allowed to define other non-key attributes.
80
New cards
What are three important points about Third Normal Form?
1. It is completely free of redundancy
2. All foreign keys appear where needed to logically tie together related tables.
3. It is the same structure that would have been derived from a properly drawn entity-relationship diagram of the same business environment.
81
New cards
Write the SQL query to "Add a new salesperson into the SALESPERSON table whose salesperson number is 489, name is Quinlan, commission percentage is 15, year of hire is 2011, and department number is 59.":
INSERT INTO SALESPERSON
VALUES
('489','Quinlan',15,'2011','59');

*Hint, this is DML, so remember that INSERT is one of the keywords for DML.
82
New cards
Write the SQL query to "Delete the row for salesperson 186 from the SALESPERSON table.":
DELETE FROM SALESPERSON
WHERE SPNUM \= '186';
83
New cards
What is the correct syntax of the INSERT command?
INSERT INTO table_name VALUES (value1,value2,value3,...):
84
New cards
What is the correct syntax of the CREATE VIEW command?
CREATE VIEW view_name AS
SELECT column_name(s)
FROM table_name
WHERE condition
85
New cards
What is called a decomposition process?
Data normalization
86
New cards
In which of the normal forms should every non-key attribute be fully functionally dependent on the entire key of a table?
Second form
87
New cards
What is the correct syntax of the CREATE TABLE command?
CREATE TABLE table_name (
column_name data_type(size),
);
88
New cards
What is the correct syntax of the UPDATE command?
UPDATE table_name
SET column1\=value1,column2\=value2,...
WHERE some_column\=some_value;
89
New cards
Association Rules
Association rules specify a relation between attributes that appears more frequently than expected if the attributes were independent.
90
New cards
Business Intelligence
The processes, technologies, and tools needed to turn data into information, information into knowledge, and knowledge into plans that drive profitable business action.
91
New cards
Classification
Classification involves examining the attributes of a particular object and assigning it to a defined class.
92
New cards
Clustering
Clustering is the task of taking a large collection of objects and dividing them into smaller groups of objects that exhibit some similarity.
93
New cards
Affinity Grouping
Affinity Grouping is a process of evaluating relationships or associations between data elements that demonstrate some kind of affinity between objects.
94
New cards
What are the values of Business Intelligence?
- Financial value associated w/ increased profitability.
- Productivity value associated with increased throughput.
- Trust value (customer, employee, supplier satisfaction) as well as increased confidence in forecasting.
- Risk value - decreased risk associated with decision making
95
New cards
What are the reasons for using the Dimensional Model for Business Intelligence?
- Simplicity.
- Lack of bias.
- Extensibility.
96
New cards
What are the fundamental aspects of a Data Warehouse?
- Centralized repository of information.
- Organized around relevant subject areas.
- Provides platform for queries.
- Used for analysis and not transactional
processing.
- Data is nonvolatile.
- Target location for integrating data from multiple sources.
97
New cards
What is the general theme of the ETL process?
1. Get the data
2. Map the data to staging area
3. Validate and clean the data
4. Apply necessary transformations
5. Map data to loading model
6. Move data to repository
7. Load data to warehouse
98
New cards
What is the key factor based on the need for linear scalability?
Performance
99
New cards
What is used for populating summaries or any cube dimensions that can be performed at the staging area (ETL)?
Aggregation
100
New cards
What data mining activity is a process of assigning some continuously valued numeric value to an object?
Estimation