Relational Database Design:

Upon completion, students will be able to:
- Identify problems tied to bad database design
- Explain Functional Dependency (FD)
- List various types of dependencies
- Explain Armstrong Inference Axioms
- Determine closures of attributes/attribute sets
- Determine closures of Functional Dependencies (FDs)
- Determine equivalences of FDs
- Identify redundant FDs
- Determine Canonical or Minimal Cover

Goal: Create relational schemas for storing information without unnecessary redundancy while maintaining ease of data retrieval.
Steps for Good Design:
1. Characterize data requirements through textual descriptions.
2. Choose an Entity-Relationship (ER) model for designing a conceptual schema.
3. Map the conceptual schema onto a relational data model (Logical Design).
4. Use the specific database schema during the Physical Design phase.

Bottom-up Approach (Design by Synthesis):
- Focuses on relationships among individual attributes to construct schemas.
- Challenges include identifying numerous binary relationships from attributes.
Top-down Approach (Design by Analysis):
- Starts with aggregating attributes into relations that naturally exist (e.g., an Invoice).
- Analyzes relations leading to decomposition until all desirable properties are achieved.

Anomalies in Poorly Planned Databases:
- Problems arise from storing all data in a single table, affecting data insertion, deletion, and updates.
- Example: Student Database which leads to inefficiencies.

Insertion Anomaly:
- Occurs when an attribute cannot be inserted due to the absence of another attribute.
- Example: Unable to record a new professor without course assignment.
Deletion Anomaly:
- Deleting an entry leads to loss of valuable information related to another entity.
- Example: Deleting a course inadvertently deletes associated professor information.
Updation Anomaly:
- Requires modulating the same attribute across several records, risking inconsistency.
- Example: Updating a student's phone number incorrectly affects the database state.

Functional Dependency is the building block of normalization

principles.

Definition: Basic principle of normalization.
- If attribute(s) A in relation schema R functionally determines attribute(s) B, then for a given value of A, there is a single, corresponding value of B.
- Symbolically: A → B (A is determinant, B is dependent).

Conditions to Ensure FD (A → B):
1. Unique values of A imply FD holds.
2. If all B values for given A values are the same, FD holds.
3. If same A values yield differing B values, the FD does not hold.

Trivial vs Non-Trivial FD:
- Trivial: If Y is a subset or equal to X (e.g., {Name, Course} → Course).
- Non-Trivial: Y is not a subset of X; e.g., Prof → Grade.
Multivalued Dependency:
- Occurs when multiple independent multivalued attributes exist downwards;
- Consider a bike manufacture company, havind attributes as (bike_model, mfg_year, color), here mfg_year and color are independent to eachother.
  e.g.:
```
bike_model ->> mfg_year; 
bike_model ->> color
```
Transitive Dependency:
- Indirect dependency formed by two functional dependencies (X → Z if X → Y and Y → Z).

Purpose: Infer FDs satisfied by a relation.
- Includes several fundamental rules:
- Reflexivity: If Y is a subset of X, then X → Y.
- Augmentation: If X → Y, then {X, Z} → {Y, Z}.
- Transitivity: If X → Y and Y → Z, then X → Z.

Definition: Set of attributes functionally defined from set 'A', denoted as A+.

Closure F+: the set of FDs logically implied by F, generated by applying axioms through combinations of attributes.

FDs F and G are equivalent if F+ = G+; can be verified through testing attribute closures.

Definition: An FD is redundant if it can be derived from other FDs in the set.
- Method to find redundancy involves testing each FD's derivability within the remaining set.

Definition: A canonical cover is a minimal set of FDs where:
1. F and Fc are equivalent.
2. Every FD is simple with one attribute on the RHS.
3. No FD is redundant.
4. The LHS attributes are irreducible.
Method: Achieve a cover by eliminating non-essential dependencies and reducing FDs.

Write FDs with one attribute on the RHS.
Check each FD's essentiality using attribute closures.
If FDs have more than one attribute on the LHS, reduce them using closure checks.