Chapters 1, 2, 3, 4, 5, 6, 16 database systems

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/239

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

240 Terms

1
New cards

What is the key difference between data and information?

Data are raw facts; information is processed data given context and meaning, suitable for decision making.

2
New cards

What is a database?

A shared, integrated computer structure that stores end-user data and metadata to manage, retrieve, and secure information efficiently.

3
New cards

What is metadata?

Data about data that describes data characteristics, structure, and relationships used to integrate and manage end-user data.

4
New cards

What is a DBMS?

A collection of programs that manages the database structure, controls access, enforces rules, and serves as intermediary between users and data.

5
New cards

Name two main roles of a DBMS.

Provides an integrated view of data and enables secure, efficient sharing, improving access, security, and decision-making.

6
New cards

What does "ubiquitous" mean about data today?

That data are abundant, global, and available everywhere.

7
New cards

What does "pervasive" mean about data today?

That data are prevalent, persistent, and hard to avoid.

8
New cards

List the primary components of a database system.

Hardware; software; people; procedures; data.

9
New cards

What is the importance of database design?

Good design facilitates management and accurate reporting; poor design leads to hidden errors and poor decisions.

10
New cards

What is a single-user database?

Supports one user at a time; often implemented as a desktop database on a personal computer.

11
New cards

What is a multiuser database?

Supports multiple concurrent users, from small workgroups to enterprise-wide deployments.

12
New cards

Define centralized, distributed, and cloud databases.

Centralized: data at a single site; Distributed: data across multiple sites; Cloud: provided and managed by cloud services with defined SLAs.

13
New cards

What is an operational database?

Designed to support daily business operations and transaction processing.

14
New cards

What is an analytical database?

Stores historical data and metrics optimized for tactical or strategic decision making.

15
New cards

What is a data warehouse?

A storage system formatted and optimized for decision support and large-scale analytics.

16
New cards

What is OLAP?

Online Analytical Processing: tools and methods for retrieving, processing, and modeling data from data warehouses.

17
New cards

How are databases classified by structure of data?

Structured: formatted for processing; Semistructured: partially processed; Unstructured: raw original state.

18
New cards

What is XML's role in semistructured data?

XML represents data elements in a textual, tagged format to describe structure and content.

19
New cards

Give the basic file terminology: Data, Field, Record, File.

Data: raw facts; Field: a character/group with meaning; Record: connected fields describing an entity; File: collection of related records.

20
New cards

List common problems with file system data processing.

Lengthy development; slow answers; complex admin; poor security; limited sharing; extensive programming.

21
New cards

What is structural dependence vs. independence?

Structural dependence: programs rely on file structure; independence: file structure changes without breaking application access.

22
New cards

What is data dependence vs. data independence?

Data dependence: access changes when storage changes; independence: storage changes do not affect program access.

23
New cards

What is data redundancy and why is it bad?

Unnecessary duplicate storage across locations causing inconsistency, security issues, entry errors, and integrity problems.

24
New cards

Name the three data anomalies caused by redundancy.

Update anomalies; insertion anomalies; deletion anomalies.

25
New cards

How does a DBMS address file system weaknesses?

By centralizing data structures, relationships, access paths, reducing redundancy, anomalies, and dependence while managing access and security.

26
New cards

List key DBMS functions for reliability and integrity.

Multiuser access control, backup/recovery management, data integrity management, performance tuning, and security management.

27
New cards

What is the role of SQL in DBMS?

SQL is the de facto query and data access language allowing users to specify what to do without programming how to do it.

28
New cards

Give three disadvantages of database systems.

Higher costs; management complexity; vendor dependence and frequent upgrades.

29
New cards

What are common database career roles?

Examples: Database Developer, Designer, Administrator, Analyst, Architect, Consultant, Security Officer, Cloud Architect, Data Scientist.

30
New cards

Summary: core takeaways from Chapter 1.

Data are raw facts stored in databases; DBMSs centralize management and reduce redundancy; types of databases vary by users, location, usage, and structure; good design matters.

31
New cards

What is data modeling?

Creating a specific data model for a defined problem domain to represent real-world data structures simply and usefully.

32
New cards

What is a data model?

A simplified representation of complex real-world data structures used to support a specific problem domain.

33
New cards

Why are data models important?

They facilitate communication, provide multiple views of the database, organize data for users, and support good database design.

34
New cards

Define entity in data modeling.

An object (person, place, thing, or event) about which data will be collected and stored.

35
New cards

What is an attribute?

A characteristic or property of an entity.

36
New cards

What is a relationship in a data model?

An association among entities describing how instances of entities relate to each other.

37
New cards

Give the three common relationship types.

One-to-many (1:M), many-to-many (M:N), one-to-one (1:1).

38
New cards

What is a constraint?

A restriction placed on data to ensure integrity and correctness.

39
New cards

What are business rules?

Brief, precise, unambiguous policies or principles that establish entities, relationships, and constraints in a data model.

40
New cards

Name common sources for discovering business rules.

Company managers; policy makers; department managers; documentation; interviews with end users.

41
New cards

How do nouns and verbs in requirements translate into models?

Nouns → entities; verbs → relationships between entities.

42
New cards

What question determines relationship cardinality?

Ask how many instances of B relate to one instance of A and vice versa.

43
New cards

Why are naming conventions important?

Descriptive, familiar names for entities and attributes facilitate communication and self-documentation.

44
New cards

What characterizes the hierarchical model?

Upside-down tree of segments representing 1:M relationships; used for complex manufacturing and large datasets.

45
New cards

What is a key limitation of hierarchical models?

Navigational access; structural changes require changes in all application programs.

46
New cards

What characterizes the network model?

Allows records to have multiple parents; models complex relationships; introduced schema/subschema, DDL, and DML.

47
New cards

What is a relational model?

A model based on relations (tables) composed of tuples (rows) and attributes (columns) for precise data manipulation.

48
New cards

What is an RDBMS?

Relational DBMS that implements relational model functions, hides relational complexity, and manages tables and queries.

49
New cards

How do tables relate in a relational database?

Each table is independent; rows in different tables relate through common attribute values.

50
New cards

What role does the SQL engine play?

Executes queries and processes data requests in relational systems.

51
New cards

What is an ERD?

Entity Relationship Diagram; a graphical representation of entities and relationships in a database.

52
New cards

What is an entity instance?

A single occurrence (row) of an entity in a relational table.

53
New cards

What is the object-oriented data model?

A model where data and relationships are contained in objects that include operations and attributes.

54
New cards

Define class and inheritance in OODM.

Class: collection of similar objects with shared structure/behavior; Inheritance: subclasses inherit attributes and methods from parent classes.

55
New cards

What is UML used for?

Unified Modeling Language: diagrams and symbols to graphically model systems and object relationships.

56
New cards

What is the extended relational data model (ERDM)?

Relational model extended with OO features, extensible data types, and inheritance support.

57
New cards

What problems drive Big Data / NoSQL adoption?

Volume, velocity, and variety of web/sensor data; conventional structures and OLAP struggle with unstructured large-scale data.

58
New cards

What technologies are associated with Big Data?

Hadoop, HDFS, MapReduce, and NoSQL databases.

59
New cards

What are NoSQL databases optimized for?

Distributed architectures, high scalability, availability, fault tolerance, and handling large sparse/unstructured data.

60
New cards

Name trade-offs of many NoSQL systems.

Optimized for performance over strict transaction consistency; often eventual consistency.

61
New cards

List the four levels of data abstraction.

External (end-user views); Conceptual (global view); Internal (specific DB model); Physical (storage/access methods).

62
New cards

What is the external model?

Representation of end users' views; external schema shows specific representations of those views.

63
New cards

What is the conceptual model?

A global, software- and hardware-independent view used for logical design and identifying main data objects.

64
New cards

What is the internal model?

Mapping of the conceptual model to the DBMS constructs; provides logical independence from physical details.

65
New cards

What is the physical model?

Lowest level describing how data are stored on media and access methods; hardware- and software-dependent.

66
New cards

How do data-modeling requirements vary?

They depend on different data views (global vs local) and the chosen level of data abstraction.

67
New cards

Summary: core takeaways from Chapter 2.

Data models abstract real-world structures; entities, attributes, relationships, and business rules drive design; multiple model types exist to meet different needs; abstraction levels separate user views from storage.

68
New cards

What is the logical view of data in the relational model?

Represents data and relationships using relations (tables) to yield logical simplicity and effective database design.

69
New cards

List three advantages of using tables in the relational model.

Tables provide a two-dimensional row/column view; attribute-named columns; independent rows that represent entity occurrences.

70
New cards

Name four necessary characteristics of a relational table.

Rows are tuples; columns are attributes with domains; intersections are single values; table must have a unique identifier (key).

71
New cards

What is a key in a relational table?

One or more attributes that uniquely identify a row and establish relationships among tables to ensure integrity.

72
New cards

Define functional dependency.

When the value of one attribute (or a set) determines the value of another attribute.

73
New cards

What is a determinant and dependent attribute?

Determinant: attribute whose value determines another; Dependent: attribute whose value is determined by the determinant.

74
New cards

What is full functional dependence?

The entire set of attributes in the determinant is required to determine the dependent attribute.

75
New cards

Define composite key and candidate key.

Composite key: key composed of multiple attributes; Candidate key: a minimal superkey with no redundant attributes.

76
New cards

What is a superkey and a primary key?

Superkey: attribute(s) that uniquely identify rows; Primary key: chosen candidate key that must be unique and non-null.

77
New cards

What is a foreign key and referential integrity?

Foreign key: primary key from one table placed in another; Referential integrity requires non-null foreign keys to reference existing primary keys.

78
New cards

What is a secondary key?

A key used only for data retrieval (non-unique indexing purpose).

79
New cards

What is entity integrity?

Requirement that primary key values are unique and none of its attributes are null to guarantee each row's identity.

80
New cards

Give two ways to handle nulls in a table.

Use flags or special codes to indicate missing data; enforce NOT NULL or UNIQUE constraints where appropriate.

81
New cards

What is relational algebra?

Theoretical set of operators for manipulating relations, producing new relations and obeying closure.

82
New cards

What does the SELECT (restrict) operator do?

Produces a horizontal subset of rows satisfying a condition.

83
New cards

What does the PROJECT operator do?

Produces a vertical subset by selecting specific columns (attributes).

84
New cards

What is UNION and when are tables union-compatible?

UNION combines rows from two tables removing duplicates; union-compatible means same column count and compatible domains.

85
New cards

What does INTERSECT return?

Rows that appear in both union-compatible tables.

86
New cards

What does DIFFERENCE (minus) return?

Rows present in the first table but not in the second (requires union-compatibility).

87
New cards

What is the CARTESIAN PRODUCT operator?

Combines every row of one table with every row of another, yielding all possible row pairs.

88
New cards

Explain the DIVIDE operator in relational algebra.

Divides a double-column table by a single-column table, returning values associated with every row in the divisor.

89
New cards

What is a natural join and how does it differ from an equijoin?

Natural join links tables on common attribute names and values; equijoin links using explicit equality comparisons on specified columns.

90
New cards

Define inner join and outer join (left/right).

Inner join returns only matched rows; outer join retains unmatched rows from one side with nulls for missing matches; left/right indicate which table's unmatched rows are kept.

91
New cards

What is a theta join?

A join based on an inequality comparison operator (any condition other than equality).

92
New cards

What does an index provide in a relational database?

An orderly reference structure that speeds row lookup; index keys point to data locations; can be unique or non-unique.

93
New cards

What is a unique index?

An index whose key maps to exactly one data pointer, enforcing uniqueness of the indexed values.

94
New cards

What is the data dictionary and system catalog?

Data dictionary: user/designer descriptions of tables; System catalog: DBMS-managed metadata describing all database objects.

95
New cards

Why avoid homonyms and synonyms in a database design?

Homonyms (same name different attributes) and synonyms (different names same attribute) create ambiguity and confusion.

96
New cards

How is an M:N relationship implemented in a relational model?

By creating a composite (associative) entity that holds primary keys from the related tables, converting M:N into two 1:M relationships.

97
New cards

When might data redundancy be intentionally preserved?

To support crucial reporting needs or to preserve historical accuracy despite general goals to minimize redundancy.

98
New cards

List three common relationship types in relational databases.

One-to-many (1:M), one-to-one (1:1), many-to-many (M:N via associative entity).

99
New cards

Name three integrity constraints enforced by RDBMS.

Entity integrity, referential integrity, and column constraints (NOT NULL, UNIQUE).

100
New cards

What are two practical purposes of Codd's relational rules?

Ensure data is represented in tables and guarantee systematic access, treatment of nulls, metadata management, and integrity stored in catalogs.