Informatics Lecture 9

Last updated 7:58 PM on 8/23/24

61 Terms

1

Database

A structured collection of data that can be easily accessed, managed, and updated.

2

Database Management System (DBMS)

A group of programs that manipulate the database and provide an interface between the database and its users.

3

Database Administrator (DBA)

A skilled IS professional who directs all activities related to an organization’s database.

4

Byte

A unit of data made up of eight bits.

5

Character

The basic building block of information.

6

Field

A name, number, or combination of characters that describes a business object or activity.

7

Record

A collection of related data fields.

8

File

A collection of related records.

9

Entity

A person, place, or thing for which data is collected, stored, and maintained.

10

Attribute

A characteristic of an entity.

11

Data Item

The specific value of an attribute.

12

Primary Key

A field or set of fields that uniquely identifies a record.
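
A minimal sketch of how a primary key enforces uniqueness, using Python's built-in `sqlite3` module. The table and column names (`student`, `student_id`) are hypothetical and chosen only for illustration.

```python
import sqlite3

# In-memory database; table and column names are assumptions for this example.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (student_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO student VALUES (1, 'Ada')")

try:
    # Reusing student_id 1 violates the primary key constraint.
    conn.execute("INSERT INTO student VALUES (1, 'Grace')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

# Only the first record was stored.
row_count = conn.execute("SELECT COUNT(*) FROM student").fetchone()[0]
```

Because each `student_id` can appear only once, the key value is enough to pick out exactly one record.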

13

Data Model

A diagram of data entities and their relationships.

14

Enterprise Data Modelling

Investigates the general data and information needs of the organization at the strategic level.

15

Entity-Relationship (ER) Diagrams

Data models that use graphical symbols to show organization entities and relationships.

16

Relational Model

Organizes data into collections of two-dimensional tables called relations.

17

Domain

The range of allowable values for a data attribute.

18

Data Cleansing/Cleanup

The process of detecting and correcting or deleting incomplete or inaccurate records in a database.
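
As a rough illustration of data cleansing, the sketch below corrects formatting problems and drops incomplete or inaccurate records. The sample records, field names, and the 0 to 120 age range are all assumptions made for this example.

```python
# Hypothetical customer records; field names and rules are assumptions.
records = [
    {"id": 1, "email": "ada@example.com", "age": 36},
    {"id": 2, "email": "  GRACE@EXAMPLE.COM ", "age": 45},   # messy formatting
    {"id": 3, "email": "", "age": 29},                       # incomplete: no email
    {"id": 4, "email": "alan@example.com", "age": -5},       # inaccurate: impossible age
]

def cleanse(rows):
    """Return corrected copies of valid rows, dropping the rest."""
    cleaned = []
    for row in rows:
        email = row["email"].strip().lower()   # correct formatting problems
        if not email:                          # delete incomplete records
            continue
        if not 0 <= row["age"] <= 120:         # delete records with impossible values
            continue
        cleaned.append({**row, "email": email})
    return cleaned

clean_records = cleanse(records)
```

Records 1 and 2 survive (record 2 with its email normalized), while records 3 and 4 are removed.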

19

Data Center

A climate-controlled building that houses database servers and systems delivering mission-critical information.

20

Traditional Data Centers

Warehouses filled with server racks and cooling systems.

21

Flat File Database

A simple database program with records that have no relationships to one another.

22

Single User Database

A database that allows only one person to use it at a time (e.g., Access).

23

Multiple User Database

A database that allows many users to access it simultaneously (e.g., SQL Server, Oracle).

24

Schema

A description of the entire database, which can be part of the database or a separate file.

25

DBMS in User View

The DBMS references a schema to find where the requested data is located in relation to other data.

26

Data Definition Language (DDL)

Instructions used to define and describe data and relationships in a database.

27

Data Dictionary

A detailed description of all data used in the database, including data flows and organization.

28

Concurrency Control

Manages situations where multiple users need to access the same record simultaneously.
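
One common way to picture concurrency control is a lock that lets only one user update a record at a time. The sketch below is an analogy using Python threads, not how any particular DBMS implements it; the `seats_available` record and `book_seat` function are hypothetical.

```python
import threading

# Hypothetical shared "record": a seat counter that many users update at once.
record = {"seats_available": 100}
record_lock = threading.Lock()

def book_seat():
    # The lock lets only one thread read-modify-write the record at a time,
    # so no two bookings can overwrite each other's update.
    with record_lock:
        if record["seats_available"] > 0:
            record["seats_available"] -= 1

# Simulate 40 users booking simultaneously.
threads = [threading.Thread(target=book_seat) for _ in range(40)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

After all 40 bookings finish, exactly 60 seats remain, because every update was applied one at a time.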

29

Data Manipulation Language (DML)

A language provided with a DBMS that allows users to access and modify data.

30

Structured Query Language (SQL)

The standard query language for relational databases, adopted by ANSI.
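
A short sketch of SQL in use, run through Python's built-in `sqlite3` module. It shows one DDL statement (`CREATE TABLE`) and three DML statements (`INSERT`, `UPDATE`, `SELECT`); the `employee` table and its columns are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL: define and describe the structure of the data.
conn.execute("CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, name TEXT, dept TEXT)")

# DML: add, modify, and access the data.
conn.executemany(
    "INSERT INTO employee (emp_id, name, dept) VALUES (?, ?, ?)",
    [(1, "Ada", "IT"), (2, "Grace", "IT"), (3, "Alan", "HR")],
)
conn.execute("UPDATE employee SET dept = 'Finance' WHERE emp_id = 3")

# Query: names of everyone in the IT department.
it_names = [
    row[0]
    for row in conn.execute("SELECT name FROM employee WHERE dept = 'IT' ORDER BY emp_id")
]
```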

31

Big Data

Enormous datasets generated by web and mobile applications, often measured in terabytes and petabytes.

32

Data Management

A set of functions defining how data is obtained, stored, secured, and processed to meet user needs.

33

Data Governance

Defines roles and processes for ensuring data trustworthiness and usability across an organization.

34

Data Lifecycle Management (DLM)

A policy-based approach for managing the flow of enterprise data.

35

Data Warehouse

A large database that collects business information from various sources to support decision-making.

36

ETL

Stands for Extract, Transform, Load; a process used in data warehousing.
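
The three ETL steps can be sketched as three small functions. This is a toy pipeline under stated assumptions: the CSV text stands in for an operational source system, an in-memory SQLite database stands in for the warehouse, and the `orders` table and its columns are hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical source data; in practice this would come from operational systems.
source_csv = "order_id,amount,currency\n1,10.50,usd\n2,7.25,usd\n3,,usd\n"

def extract(text):
    """Extract: read raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: drop rows with no amount, convert types, standardize currency codes."""
    out = []
    for row in rows:
        if not row["amount"]:
            continue
        out.append((int(row["order_id"]), float(row["amount"]), row["currency"].upper()))
    return out

def load(rows, conn):
    """Load: write the prepared rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders (order_id INTEGER, amount REAL, currency TEXT)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)

warehouse = sqlite3.connect(":memory:")
load(transform(extract(source_csv)), warehouse)
loaded_count = warehouse.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
total_amount = warehouse.execute("SELECT SUM(amount) FROM orders").fetchone()[0]
```

Order 3 has no amount, so the transform step filters it out before loading.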

37

Data Mart

A subset of a data warehouse used by smaller businesses or departments for decision-making.

38

Non-Relational Databases (NoSQL)

Databases that store and retrieve data using methods other than two-dimensional tables.

39

Key-Value NoSQL

Databases with two columns ("key" and "value") for storing data.
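
The key-value idea can be illustrated with a tiny in-memory store built on a Python dictionary. `KeyValueStore` is a hypothetical class written for this example, not the API of any real NoSQL product.

```python
class KeyValueStore:
    """Toy in-memory key-value store (hypothetical, for illustration only)."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # The value is opaque to the store: it can be any object.
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)

    def delete(self, key):
        self._data.pop(key, None)

store = KeyValueStore()
store.put("session:42", {"user": "ada", "cart": ["book"]})
store.put("session:43", "opaque-token")
store.delete("session:43")
```

All access goes through the key; the store never looks inside the value.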

40

Document NoSQL

Databases for managing document-oriented information, such as social media posts.

41

Graph NoSQL

Databases for analyzing relationships among various data points, suitable for social media data.

42

Column NoSQL

Databases that store data in columns for fast response times with large data volumes.

43

Hadoop

An open-source framework for storing and processing large datasets.

44

Function of data management

  • Without data and the ability to process it, an organization could not successfully complete most business activities.

  • Transforms data into useful information.

45

Database Approach to Data Management

(1) Information systems share a pool of related data. (2) Offers the ability to share data and information resources. (3) A database management system (DBMS) is required.

46

Characteristics of Traditional Approach

Each distinct operational system used data files dedicated to that system.

47

Considerations for Building a Database

Content, access, logical structure, physical organization, and security.

48

Relational Model Structure

Rows represent data entities (records), and columns represent attributes (fields).

49

Data Manipulation Methods

Selecting, projecting, joining, and linking data from tables.
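
Selecting, projecting, and joining can be sketched on plain Python lists of dictionaries, where each dictionary is a row. The `employees` and `departments` tables and the helper function names are assumptions for illustration; a real DBMS would do this through SQL.

```python
# Two hypothetical tables, each a list of rows.
employees = [
    {"emp_id": 1, "name": "Ada", "dept_id": 10},
    {"emp_id": 2, "name": "Grace", "dept_id": 20},
    {"emp_id": 3, "name": "Alan", "dept_id": 10},
]
departments = [
    {"dept_id": 10, "dept_name": "IT"},
    {"dept_id": 20, "dept_name": "HR"},
]

def select(rows, predicate):
    """Selecting: keep only the rows that meet a condition."""
    return [row for row in rows if predicate(row)]

def project(rows, columns):
    """Projecting: keep only the named columns."""
    return [{c: row[c] for c in columns} for row in rows]

def join(left, right, key):
    """Joining: combine rows from two tables that share the same key value."""
    return [{**l, **r} for l in left for r in right if l[key] == r[key]]

it_staff = select(employees, lambda r: r["dept_id"] == 10)
names_only = project(employees, ["name"])
joined = join(employees, departments, "dept_id")
```

The join links the two tables through their common `dept_id` field, so each employee row gains its department name.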

50

Types of databases

Flat file, single user, and multiple user.

51

flat file

Simple database program whose records have no relationship to one another

52

single user

Only one person can use a database at a time.

53

multiple users

Allow dozens or hundreds of people to access the same database system at the same time.

54

How is data stored and retrieved?

When an application program needs data, it requests the data through the DBMS.

55

Characteristics of DaaS

  1. The database is stored on the service provider's servers.

  2. It is accessed by the client over the network.

  3. Data administration is handled by the service provider.

56

characteristics of big data

Velocity, volume, variety, veracity, and value

57

characteristics of big data

Volume: The size of the data. Analyzing very large volumes of data to extract valuable information is one of the major challenges of big data.

Velocity: The speed at which data arrives. Data floods in at very high speed and has to be dealt with in an appropriate time.

Variety: The data is very diverse and comes from different sources with different structures, such as social data, audio, video, unstructured data, and email.

Value: Another challenge is to convert the data into value, that is, to understand it and discover the value hidden in it.

Veracity: How accurate or truthful a data set is. For big data, this covers not just the quality of the data itself but also how trustworthy its source, type, and processing are.

58

challenges of big data

  1. Capturing, storing, visualizing, and analyzing data, and updating it securely.

  2. Analyzing very large volumes of data to extract valuable information.

59

elements of data warehouse

  1. A relational database to store and manage data.

  2. An extraction, loading, and transformation (ELT) solution for preparing the data for analysis.

  3. Analysis tools, reporting, and data mining capabilities.

  4. Client analysis tools for visualizing and presenting data to business users.

60

advantages of NoSQL databases

  • Ability to spread data over multiple servers so that each server contains only a subset of the total data.

  • Do not require a predefined schema.
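
Both advantages can be sketched together: a hash of the key decides which server holds each item, and the stored documents need not share the same fields. This is a toy model; the three-server cluster, the `put`/`get` helpers, and the key names are assumptions, and real NoSQL systems use more elaborate partitioning schemes.

```python
import zlib

NUM_SERVERS = 3  # hypothetical cluster size
servers = [{} for _ in range(NUM_SERVERS)]  # each dict stands in for one server

def server_for(key):
    # crc32 gives the same result on every run, unlike Python's built-in
    # hash() for strings, so a key always maps to the same server.
    return zlib.crc32(key.encode("utf-8")) % NUM_SERVERS

def put(key, document):
    servers[server_for(key)][key] = document

def get(key):
    return servers[server_for(key)].get(key)

# No predefined schema: each document can carry different fields.
put("user:1", {"name": "Ada"})
put("user:2", {"name": "Grace", "languages": ["COBOL"]})
put("post:9", {"text": "hello", "likes": 3})
```

Each server ends up holding only the subset of items whose keys hash to it, yet any item can still be found from its key alone.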

61

primary components of Hadoop

  • A data processing component (MapReduce).

  • A distributed file system (Hadoop Distributed File System, HDFS).
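
The MapReduce idea can be sketched in plain Python with a word count, the customary introductory example. This does not use the Hadoop API; the function names and sample documents are assumptions, and on a real cluster the map and reduce steps would run in parallel on many machines over data stored in HDFS.

```python
from collections import defaultdict

# Hypothetical input; on a cluster each document would live on a different node.
documents = ["big data big ideas", "data moves fast"]

def map_phase(doc):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word, 1) for word in doc.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)

mapped = [pair for doc in documents for pair in map_phase(doc)]
word_counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
```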