Database Systems Overview
Recap
Understanding Data vs Information:
- Data: Raw, unprocessed facts.
- Information: Processed data that provides context and meaning for decision-making.
What is a Database?
- A shared and integrated computer system that stores collections of related data.
- Types of data stored:
- End-user Data: Data that users interact with.
- Metadata: Data that describes other data.
Database Management Systems (DBMS):
- A collection of programs that manage database structure and controls data access.
- Functions as an intermediary between users and the databases.
Advantages of Using a DBMS:
- Improved data sharing and security.
- Reduces data inconsistency.
- Facilitates efficient data access leading to better decision-making.
Types of Databases:
- Single User vs. Multi-User:
- Single user databases are meant for one user to access at a time.
- Multi-user databases allow multiple users to access data concurrently.
- Centralized vs. Distributed:
- Centralized databases store data in one location.
- Distributed databases store data in multiple locations.
- Cloud Databases:
- Accessed remotely via cloud services.
- Operational vs. Analytical Databases:
- Operational databases handle day-to-day transactions.
- Analytical databases are used for business intelligence and analytics.
Evolution of File Systems
- Manual File Systems:
- Used traditional methods such as paper forms and filing cabinets.
- Computerized File Systems:
- Developed by data processing specialists, allowing tracking of data digitally.
Problems with File Systems
Data Redundancy:
- Overlap of data across multiple files, causing inconsistency.
Complex System Administration:
- Difficulty in managing and controlling data consistently.
Development and Retrieval Challenges:
- Lengthy development times and lack of quick retrieval methods.
Security Issues:
- Limited data sharing and vulnerability to unauthorized access.
Extensive Programming Needs:
- Complex system requires significant programming efforts to manage.
Structural Dependence vs. Independence
Structural Dependence:
- The ability to access a file is tied to its structure.
Structural Independence:
- Allows changes to be made to data structure without impacting data accessibility.
Data Dependence vs. Independence
Data Dependence:
- Changes in data storage characteristics affect how data is accessed.
Data Independence:
- Enables changes in data storage without affecting access methods.
Data Redundancy Implications
Poor Data Security:
- Multiple copies create risks for data breaches.
Data Inconsistency:
- Conflicting versions of data in various locations.
Types of Data Anomalies
Update Anomaly:
- Difficulty ensuring consistency when updating records in multiple files.
Insertion Anomaly:
- Issues arise when adding new entries (e.g., agents requiring dummy customer data).
Deletion Anomaly:
- Risk of losing valuable data when deleting certain entries.
Components of a Database System
Hardware: Physical devices used in a database system.
Software: Includes DBMS software, application programs, and utilities.
People: Users of the database system, including both technical and end users.
Procedures: Rules that guide the design and usage of the database system.
Data: The collection of facts stored in the database.
Functions of a DBMS
Data Management:
- Data dictionary management and data storage management for efficiency.
Security Management:
- Protects against unauthorized data access and enforces privacy.
Multiuser Access Control:
- Allows multiple users to access data without data corruption.
Backup and Recovery Management:
- Ensures data can be recovered after failures.
Data Integrity Management:
- Aims to minimize redundancy and maintain data consistency.
Disadvantages of Database Systems
Increased Costs:
- Higher operational costs associated with DBMS.
Management Complexity:
- Need for ongoing updates and maintenance.
Vendor Dependence:
- Risk of being tied to specific DBMS vendors.
Frequent Upgrades:
- Regular updates can be demanding.