Introduction to Database Management

Importance of Data

  • Decision Making:

    • In Managerial Contexts:

    • Finance

    • Personnel

    • Operations

    • Logistics

    • Marketing & Sales

    • In Personal Contexts:

    • Education

    • Career

    • Finances

    • Daily Life

  • Key Question: Is data important?

    • If data is essential, how do we store and manage it?

Databases Overview

  • Definition: Databases are software utilized to store and manage data.

  • Comparison between Database Management Systems (DBMS) and spreadsheets:

    • DBMS Examples: MySQL, Oracle, SQL Server

    • Spreadsheets Examples: Excel

  • Example: In a retail sales scenario with multiple tables, databases can manage thousands or hundreds of thousands of rows including:

    • Customers

    • Suppliers

    • Products

    • Orders

    • Order Details

    • Employees

Rationale for Studying Database Management

  • Prevalence of Databases:

    • Foundational to successful Information Systems (IS)

    • Examples of data storage systems include:

      • Amazon, Wal-Mart

      • Warehouse inventory systems

      • Social media messages (Facebook & Twitter)

      • iPhone App Store

      • Google search index

      • UPS/FedEx tracking systems

      • Dell/HP order processing

      • Airline reservations

      • eBay auctions

Advantages of Databases over Conventional File Processing Systems

  • Issues with File Processing Systems:

    • Spreadsheets do not scale well with data size.

    • Complex linking between multiple tables.

    • Slow query execution.

    • Errors due to denormalized data.

  • Database Management Advantages:

    • Data Independence

    • ANSI/SPARC architecture is essential for data handling.

Data Redundancy and Consistency

  • Data Redundancy: Refers to duplication of data.

  • Challenges: Can lead to issues with sharing data effectively.

Problems with File System Approach

  • Complexity in Administration

  • Data Integrity Issues:

    • Inconsistency can arise from flawed data quality for decision-making.

  • Program-Data Dependence: Programs cannot function without tightly coupled data structures.

  • Insufficient Backup and Recovery Mechanisms.

  • Concurrent Access Conflicts.

  • Data Security Vulnerabilities.

Database Management System (DBMS) Framework

  • Central Repository: Data is managed by DBMS and is stored in a standardized, accessible manner.

Advantages of Database Approach:
  1. Improved data consistency.

  2. Enhanced data sharing and productivity of application development.

  3. Enforcement of data standards.

  4. Superior data quality and accessibility.

  5. Reduced program maintenance needs.

  6. Better decision-making support mechanisms.

ANSI/SPARC 3-Layer Architecture

  • Three Layers:

    1. External Layer: View of data from users and application programs.

    2. Logical/Conceptual Layer: Represents how data is organized logically.

    3. Internal/Physical Layer: Details on how data is actually stored on disk.

  • Data Independence:

    • Physical Data Independence: Changes at the physical level do not require changes in the logical structure.

    • Logical Data Independence: Changes in the logical structure do not necessitate changes to external views.

Cost Considerations for Using DBMS

Main Costs:
  • High initial investment needed to establish a DBMS infrastructure.

  • Potential requirement for additional hardware resources.

  • Maintenance overhead costs.

  • Need for user training to effectively navigate the DBMS.

Situations where DBMS may be unnecessary:
  • When the application is simple, well-defined, and not expected to change.

  • In cases where stringent real-time requirements are needed.

  • When write access by multiple users is not required.

Fundamental Database Terminology

  • Data: These include facts, texts, graphics, sound, or video segments that carry meaning within the user's environment.

  • Information: Processed data that becomes useful in decision-making contexts.

  • Metadata: Detailed information that describes the properties or characteristics of other data; for example, headers of tables.

Importance of Core Data Operations

  • Core operations on data include:

    • Creation (adding new data)

    • Retrieval of existing data

    • Update or modification of existing data

    • Deletion of data

DBMS Platforms

  • Prominent DBMS platforms include:

    • Db2 (IBM)

    • MySQL (now owned by Oracle; open-source, free)

    • MariaDB (an open-source fork of MySQL, free)

    • Oracle Database

    • PostgreSQL (open-source, free)

    • Snowflake (cloud-based)

    • Microsoft SQL Server

    • SQLite (open-source, free)

    • Others include Amazon RDS, DynamoDB, MongoDB, etc.

Summary of Key Learnings

  • Understanding databases and their necessity within organizational frameworks.

  • Recognizing advantages of DBMS over traditional file processing systems.

  • Familiarization with the ANSI/SPARC 3-layer architecture promoting data independence.

  • Distinction between core concepts such as data, information, metadata, databases, and DBMS.

  • Discussion of DBMS components and their functionalities.

Review Questions

  • List and explain the advantages of DB systems compared to file processing systems.

  • What challenges does the ANSI/SPARC architecture address?

  • Why is data independence crucial in database management?

  • How do the two types of data independence differ?