Introduction to Database Management
Importance of Data
Decision Making:
In Managerial Contexts:
Finance
Personnel
Operations
Logistics
Marketing & Sales
In Personal Contexts:
Education
Career
Finances
Daily Life
Key Question: Is data important?
If data is essential, how do we store and manage it?
Databases Overview
Definition: Databases are software utilized to store and manage data.
Comparison between Database Management Systems (DBMS) and spreadsheets:
DBMS Examples: MySQL, Oracle, SQL Server
Spreadsheets Examples: Excel
Example: In a retail sales scenario with multiple tables, databases can manage thousands or hundreds of thousands of rows including:
Customers
Suppliers
Products
Orders
Order Details
Employees
Rationale for Studying Database Management
Prevalence of Databases:
Foundational to successful Information Systems (IS)
Examples of data storage systems include:
Amazon, Wal-Mart
Warehouse inventory systems
Social media messages (Facebook & Twitter)
iPhone App Store
Google search index
UPS/FedEx tracking systems
Dell/HP order processing
Airline reservations
eBay auctions
Advantages of Databases over Conventional File Processing Systems
Issues with File Processing Systems:
Spreadsheets do not scale well with data size.
Complex linking between multiple tables.
Slow query execution.
Errors due to denormalized data.
Database Management Advantages:
Data Independence
ANSI/SPARC architecture is essential for data handling.
Data Redundancy and Consistency
Data Redundancy: Refers to duplication of data.
Challenges: Can lead to issues with sharing data effectively.
Problems with File System Approach
Complexity in Administration
Data Integrity Issues:
Inconsistency can arise from flawed data quality for decision-making.
Program-Data Dependence: Programs cannot function without tightly coupled data structures.
Insufficient Backup and Recovery Mechanisms.
Concurrent Access Conflicts.
Data Security Vulnerabilities.
Database Management System (DBMS) Framework
Central Repository: Data is managed by DBMS and is stored in a standardized, accessible manner.
Advantages of Database Approach:
Improved data consistency.
Enhanced data sharing and productivity of application development.
Enforcement of data standards.
Superior data quality and accessibility.
Reduced program maintenance needs.
Better decision-making support mechanisms.
ANSI/SPARC 3-Layer Architecture
Three Layers:
External Layer: View of data from users and application programs.
Logical/Conceptual Layer: Represents how data is organized logically.
Internal/Physical Layer: Details on how data is actually stored on disk.
Data Independence:
Physical Data Independence: Changes at the physical level do not require changes in the logical structure.
Logical Data Independence: Changes in the logical structure do not necessitate changes to external views.
Cost Considerations for Using DBMS
Main Costs:
High initial investment needed to establish a DBMS infrastructure.
Potential requirement for additional hardware resources.
Maintenance overhead costs.
Need for user training to effectively navigate the DBMS.
Situations where DBMS may be unnecessary:
When the application is simple, well-defined, and not expected to change.
In cases where stringent real-time requirements are needed.
When write access by multiple users is not required.
Fundamental Database Terminology
Data: These include facts, texts, graphics, sound, or video segments that carry meaning within the user's environment.
Information: Processed data that becomes useful in decision-making contexts.
Metadata: Detailed information that describes the properties or characteristics of other data; for example, headers of tables.
Importance of Core Data Operations
Core operations on data include:
Creation (adding new data)
Retrieval of existing data
Update or modification of existing data
Deletion of data
DBMS Platforms
Prominent DBMS platforms include:
Db2 (IBM)
MySQL (now owned by Oracle; open-source, free)
MariaDB (an open-source fork of MySQL, free)
Oracle Database
PostgreSQL (open-source, free)
Snowflake (cloud-based)
Microsoft SQL Server
SQLite (open-source, free)
Others include Amazon RDS, DynamoDB, MongoDB, etc.
Summary of Key Learnings
Understanding databases and their necessity within organizational frameworks.
Recognizing advantages of DBMS over traditional file processing systems.
Familiarization with the ANSI/SPARC 3-layer architecture promoting data independence.
Distinction between core concepts such as data, information, metadata, databases, and DBMS.
Discussion of DBMS components and their functionalities.
Review Questions
List and explain the advantages of DB systems compared to file processing systems.
What challenges does the ANSI/SPARC architecture address?
Why is data independence crucial in database management?
How do the two types of data independence differ?