Database Basics

Basic Concepts and Definitions

Database Defined

  • A database is an organized collection of logically related data.

  • Databases can vary significantly in size and complexity.

    • Small database: A salesperson's customer contact list (megabytes).
    • Large database: A corporation's database (terabytes).
    • Very large data warehouses: Contain petabytes of data.

Database Size

  • Megabyte: Small Database for salesperson's customer contacts.
  • Terabyte: A trillion bytes; example: Corporation's mainframe computer for decision support.
  • Petabyte: A quadrillion bytes; example: Very large data warehouses.

Data and Its Types

  • Assumption: All databases are computer-based.

  • Historically, data referred to facts concerning objects and events recorded on computer media.

    • Example: Customer name, address, and telephone number in a salesperson's database.

Structured Data

  • Numeric, character, and dates are the most important structured data types.
  • Stored in tabular form: tables, relations, arrays, spreadsheets.
  • Common in traditional databases and data warehouses.

Unstructured Data

  • Includes documents, emails, tweets, Facebook posts, GPS information, maps, photographic images, sound, and video segments.
  • Example: A photo image of a customer contact or a video clip about the most recent product.

Multimedia Data

  • Structured and unstructured data combined, creating a true multimedia experience.

    • Example: An automobile repair shop combining customer and automobile data with images of damaged autos and scanned insurance claim forms.

Big Data Implications

  • Big data technologies can handle highly heterogeneous data.

Expanded Definition of Data

  • A stored representation of objects and events that have meaning and importance in the user's environment.