TOPIC 1

Module 1: Introduction to Database Systems

Course Information

  • Lecture Schedule:

    • Tuesday, 4:00 – 6:00 PM, DKP2 (BPA)

  • Tutorial Slots:

    • Slot 1: Thursday, 2:00 – 4:00 PM, MAKMAL HAINA (FKI)

    • Slot 2: Thursday, 4:00 – 6:00 PM, MAKMAL HAINA (FKI)

Teaching Plan

  • Detailed outlines or weekly content coverage are likely provided in a separate teaching plan document, which is not included in this transcript.

Assessment Breakdown

  • Quiz: 10%

    • Quiz 1: 5%

    • Quiz 2: 5%

  • Lab Work: 10%

  • Group Assignments: 20%

    • Assignment 1: 10%

    • Assignment 2: 10%

  • Mid-Term Examination: 20%

  • Final Examination: 40%

  • CARRY MARKS TOTAL: 60%

Chapter Objectives

By the end of this chapter, you should be able to:

  1. Define the difference between data and information.

  2. Describe what a database is, the various types of databases, and why they are valuable assets for decision making.

  3. Explain the importance of database design.

  4. See how modern databases evolved from file systems.

  5. Understand flaws in file system data management.

  6. Outline the main components of the database system.

  7. Describe the main functions of a database management system (DBMS).

Why Databases?

  • Pervasive Nature of Databases:

    • Not discussed in detail; figure 1.1 referenced but contents are not available in the transcript.

Data versus Information

Raw Data and Processed Information
  • Data:

    • Consists of raw facts that have not yet been processed to reveal meaning to the end user.

  • Information:

    • The result of processing raw data to reveal the meaning; requires context.

  • Knowledge:

    • Implies familiarity, awareness, and understanding of information.

    • Accurate, relevant, and timely information is crucial for good decision-making.

  • Data Management:

    • A discipline focusing on proper generation, storage, and retrieval of data.

Transforming Raw Data into Information
  • Figure 1.2 is referenced but not detailed; likely illustrates the data processing pathway.

Introducing the Database

  • Database Definition:

    • A shared, integrated computer structure that stores a collection of:

    • End-user data: Raw facts of interest to the end user.

    • Metadata: Data about data which allows for the integration and management of end-user data.

      • Describes data characteristics and relationships linking the data.

  • DBMS Definition:

    • A collection of programs that manage the database structure and controls access to the data.

Role and Advantages of DBMS

User Interaction with Data
  • Single Integrated View:

    • DBMS presents end users with an integrated view of data.

Advantages of DBMS
  • Improved data sharing.

  • Improved data security.

  • Better data integration.

  • Minimized data inconsistency.

  • Improved data access and decision-making.

  • Increased end-user productivity.

Types of Databases

User and Accessibility Type
  1. Single-User Database:

    • Supports one user at a time (e.g., desktop databases).

  2. Multiuser Database:

    • Supports multiple users simultaneously.

    • Workgroup Database: Supports limited users in specific departments.

    • Enterprise Database: Supports many users across various departments.

Location-Based Classification
  • Centralized Database: Data located at a single site.

  • Distributed Database: Data spread across multiple sites.

  • Cloud Database: Created and maintained using cloud data services.

Data Type Classification
  1. General-Purpose Databases:

    • Contain a wide variety of data across multiple disciplines.

  2. Discipline-Specific Databases:

    • Focus on specific subject areas.

  3. Operational Database:

    • Supports a company’s daily operations.

  4. Analytical Database:

    • Stores data used for decision-making (historical data metrics).

    • Components:

      • Data Warehouse: Optimized for decision support.

      • Online Analytical Processing (OLAP): Tools for processing and modeling data.

Business Intelligence and Data Structuring
  • Business Intelligence:

    • Captures and processes business data to support decisions.

  • Data Structuring:

    • Unstructured Data: Original raw state; difficult to manage.

    • Structured Data: Formatted for easier storage and access.

    • Semi-Structured Data: Partially processed, already formatted.

XML and NoSQL Databases

  • XML Databases:

    • Support storage/management of unstructured XML data.

  • NoSQL Databases:

    • Not based on traditional relational models, designed for handling large volumes and varieties of data efficiently.

Knowledge Check Activity 1-1

  • Question: What is a DBMS?

  • Answer: A DBMS is best described as a collection of programs managing the database structure and controlling shared access to data.

Importance of Database Design

Key Points About Design
  • Definition:

    • Activities focused on designing the database structure for storing and managing end-user data.

  • Proper decomposition of integrated information is crucial.

  • A well-designed database aids in data management and ensures accuracy of information.

  • Poorly designed databases lead to difficult errors and poor decision-making.

Evolution of File System Data Processing

Traditional Manual Systems Versus Computerized Systems
  • Traditional file systems use manual methods (folders, cabinets).

  • Computerized systems developed to automate data tracking and reporting.

  • Modern users often misuse spreadsheet software as databases leading to inefficiencies.

Basic File Terminology

Term

Definition

Data

Raw facts with little meaning unless organized.

Field

A group of characters defining and storing data.

Record

A set of fields describing an entity (e.g., a customer).

File

A collection of related records (e.g., student data).

Problems with File System Data Processing

  • Issues Include:

    • Lengthy development times.

    • Difficulty obtaining quick answers.

    • Complexity in system administration.

    • Lack of security and limited data sharing.

    • Extensive programming requirements.

Structural and Data Dependence

  • Structural Dependence:

    • Access depends on the file structure; changes require program modifications.

  • Structural Independence:

    • Changing file structure without affecting data access.

  • Data Dependence:

    • Programs requiring changes based on data storage characteristics.

  • Data Independence:

    • Changes in data storage don’t require program changes.

Data Redundancy

  • Definition:

    • Unnecessary data duplication across systems, leading to various issues.

  • Characteristics Include:

    • Poor data security, inconsistency, errors, and integrity problems.

Data Anomalies

  • Definition:

    • Occur when not all required changes in redundant data are made.

  • Types of Data Anomalies:

    • Update anomalies

    • Insertion anomalies

    • Deletion anomalies

Knowledge Check Activity 1-2

  • Question: What is data redundancy, and which characteristics of the file system can lead to it?

  • Answer: Data redundancy arises when duplicated data exists in multiple locations without proper management, symptomatic of a file system's inability to maintain data relations.

Database Systems

Overview
  • Database System Definition:

    • Consists of logically related data in a unified repository, with physical distribution across storage facilities.

  • Current DBMS resolve file system issues by ensuring data consistency, managing structures, and defining access paths.

Database System Environment

Components of Database Systems
  1. Hardware.

  2. Software.

  3. People.

  4. Procedures.

  5. Data.

Effectiveness of Database Solutions
  • Must be cost-effective, tactically, and strategically effective.

DBMS Functions

Important Functions of a DBMS
  1. Data Dictionary Management:

    • Stores definitions and relationships in a data dictionary.

  2. Data Storage Management:

    • Manages structures for efficient data storage.

  3. Data Transformation and Presentation:

    • Transforms data to fit user expectations.

  4. Security Management:

    • Enforces security and privacy.

Continued Functions of a DBMS
  1. Multiuser Access Control:

    • Manages simultaneous user access.

  2. Backup and Recovery Management:

    • Ensures data safety and integrity, enabling recovery after failures.

  3. Data Integrity Management:

    • Enforces rules for minimal redundancy and maximum consistency.

Further DBMS Functionalities
  1. Database Access Languages:

    • Provides query languages for data access (e.g., SQL).

  2. Database Communication Interfaces:

    • Accepts end-user requests through various communication means.

Managing Database Systems

Considered Challenges
  • Disadvantages:

    • Increased costs

    • Management complexity

    • Maintaining currency in technology

    • Vendor dependence

    • Frequent upgrade and replacement cycles.

Preparing for a Career in Database Management

Job Titles and Skills Required

Job Title

Description

Skills Required

Database Developer

Create and maintain database applications

Programming, SQL, database fundamentals

Database Designer

Design and maintain databases

Systems design, SQL, database design

Database Administrator

Manage DBMS and databases

SQL, database fundamentals, vendor courses

Database Analyst

Develop databases for reporting

SQL, query optimization, data warehouses

Database Architect

Design database environments

Data modeling, SQL, hardware knowledge, DBMS fundamentals

Database Consultant

Advise on database technology usage

SQL, data modeling, DBMS knowledge

Database Security Officer

Implement data security policies

SQL, database administration, data security technologies

Cloud Data Architect

Design cloud database infrastructure

Internet technologies, data security, performance tuning

Data Scientist

Analyze data for insights and predictions

Data analysis, SQL, statistics, programming, machine learning

Knowledge Check Activity 1-3

  • Question: What are the main components of a database system?

  • Answer: The main components are hardware, software, people, procedures, and data.