Accounting Information Systems Chapter 5: Data Storage & Big Data

52 Terms

1

Consists of facts and statistics about a person or object that are collected for reference or analysis.

Data

2

What can data include?

Numbers, words, measurements, observations, or even just the description of the object it references.

3

What are the six data elements that together comprise an IT system?

1. Bit

2. Byte

3. Field

4. Record

5. File

6. Database

4

The smallest element in a computer system

Bits

5

Are composed of bits

Bytes

6

Are composed of bytes

Fields

7

Are composed of fields

Records

8

Are composed of records

Files

9

Are composed of files

Databases
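
The hierarchy in the cards above (bit, byte, field, record, file, database) can be pictured with a small Python sketch. The customer names, field names, and table names are made up for illustration and are not from the chapter.

```python
# Illustrative only: every name and value here is a made-up example.

name_field = "Ada"                        # a field holds one attribute
name_bytes = name_field.encode("ascii")   # a field is stored as bytes
first_byte_bits = format(name_bytes[0], "08b")  # each byte is 8 bits

# A record groups the fields that describe one thing.
record = {"customer_id": 1, "name": name_field}

# A file (table) groups records of the same kind.
customer_file = [record, {"customer_id": 2, "name": "Grace"}]

# A database groups logically related files.
database = {"customers": customer_file, "orders": []}

print(len(name_bytes), first_byte_bits)  # 3 01000001
```

Reading the sketch from the bottom up mirrors the flashcards: the database contains files, each file contains records, each record contains fields, and each field is stored as bytes made of bits.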

10

What are the two categories of data, based on how the data is stored on a computer?

1. Structured

2. Unstructured

11

Data that is organized and fits nicely into tables

Structured

12

Data that doesn't fit into a traditional table

Unstructured

13

What are some characteristics of structured data?

1. Defined data types like numerical, text, or date

2. Can easily be displayed as a table

3. Require less storage space and are easier to manage

14

What are some characteristics of unstructured data?

1. Not stored in a fixed field of a record or file

2. Images, audio, video, and more

3. Require more storage space and are harder to manage

4. Cannot easily be displayed as a table

15

Refers to user-generated content and user participation on the internet through interactive applications. A read-and-write environment.

Web 2.0

16

True or False: The internet was once a "read-only" environment where users mainly consumed published content

True

17

True or False: Over time, unstructured data has grown much more quickly than structured data

True

18

Data that doesn't change once it's created

Static Data

19

Data that may change after it is recorded and must be updated

Dynamic Data

20

A set of logically related files (tables) that contains an organized collection of data that is accessible for fast searching and retrieval.

Database

21

The most common type of database you'll encounter. It organizes structured data in interrelated tables that are connected through fields the tables have in common.

Relational Database

22

The system that manages a database and retrieves data for a user. Examples include Oracle, MySQL, SQL Server, and Microsoft Access.

Database Management System (DBMS)

23

Requests for data that is stored within a database

Queries

24

Queries are written in a ______ that is unique to the database management system

Query Language
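
To make the relational database, DBMS, and query cards concrete, here is a minimal sketch using Python's built-in `sqlite3` module. SQLite is not one of the systems named in the cards; it is assumed here only because it runs without any setup. The table names, column names, and rows are hypothetical.

```python
import sqlite3

# An in-memory SQLite database stands in for the DBMS (an assumption for
# this sketch; the cards name Oracle, MySQL, SQL Server, and Access).
conn = sqlite3.connect(":memory:")

# Two interrelated tables: orders point back to customers via customer_id.
conn.execute("CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, "
    "customer_id INTEGER REFERENCES customers(customer_id), "
    "amount REAL)"
)

# Hypothetical sample rows.
conn.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(10, 1, 250.0), (11, 1, 100.0), (12, 2, 75.5)],
)

# A query: a request for data, written in the DBMS's query language (SQL).
rows = conn.execute(
    "SELECT c.name, SUM(o.amount) "
    "FROM customers c JOIN orders o ON o.customer_id = c.customer_id "
    "GROUP BY c.name ORDER BY c.name"
).fetchall()

print(rows)  # [('Ada', 350.0), ('Grace', 75.5)]
conn.close()
```

The shared `customer_id` column is what links the two tables, and the `JOIN` uses that link to total each customer's orders.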

25

A logical diagram of a database's structure and organization. It depicts relationships between tables and acts as both a blueprint for database administrators when setting up a new database and a guide for users who access an established database

Schema

26

The capability of a database to manage increasing demands

Database Scalability

27

What are the 2 types of scaling?

1. Vertical Scaling

2. Horizontal Scaling

28

This type of scaling increases the size of the machine

Vertical Scaling

29

This type of scaling increases the number of machines

Horizontal Scaling
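
The difference between the two scaling types can be sketched with simple arithmetic. The capacity figures and the modulo-based placement rule below are assumptions chosen for illustration, not a description of how any particular database scales.

```python
def total_capacity(machine_count, capacity_per_machine):
    """Total requests the setup can handle (hypothetical units)."""
    return machine_count * capacity_per_machine

baseline = total_capacity(1, 1000)    # one machine of the original size
vertical = total_capacity(1, 2000)    # vertical: one bigger machine
horizontal = total_capacity(2, 1000)  # horizontal: more machines, same size

def pick_machine(record_id, machine_count):
    """With several machines, each record needs a home.

    A simple illustrative rule is the record id modulo the machine count.
    """
    return record_id % machine_count

placement = {rid: pick_machine(rid, 2) for rid in [101, 102, 103, 104]}

print(baseline, vertical, horizontal)  # 1000 2000 2000
print(placement)  # {101: 1, 102: 0, 103: 1, 104: 0}
```

Both approaches double capacity in this toy example. Horizontal scaling adds the extra step of deciding which machine holds which data.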

30

A database connected to an information system that is used for transactions in a business process. This database is "live": the data is created and stored in real time as a result of a transaction being processed.

Production Database

31

What are the 2 types of data storage?

1. Data Lake

2. Data Warehouse

32

Is designed specifically for reporting and data analysis and contains relevant data that has already been transformed for reporting use.

Data Warehouse

33

A vast pool of data that acts as a company's central repository and is designed to contain all of the company's data. Stores data that has not yet been cleaned, aggregated, or filtered.

Data Lake

34

A subset of a data warehouse that is designed for a specific business function. A department uses this for its individual reporting needs.

Data Mart
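
The relationship between a data lake, a data warehouse, and a data mart can be sketched as a small pipeline. The record layout, the cleaning rule, and the department names are hypothetical; real pipelines involve far more steps.

```python
# Data lake: raw, mixed data that has not been cleaned or filtered.
data_lake = [
    {"type": "sale", "dept": "east", "amount": "120.50"},
    {"type": "sale", "dept": "west", "amount": "80"},
    {"type": "sale", "dept": "east", "amount": None},  # incomplete raw record
    {"type": "image", "path": "receipt_001.png"},      # unstructured item
]

def to_warehouse(lake):
    """Keep only data relevant to analysis and transform it for reporting."""
    rows = []
    for item in lake:
        if item.get("type") == "sale" and item.get("amount") is not None:
            rows.append({"dept": item["dept"], "amount": float(item["amount"])})
    return rows

# Data warehouse: cleaned, structured rows ready for reporting.
warehouse = to_warehouse(data_lake)

# Data mart: the subset of the warehouse that one department needs.
east_mart = [row for row in warehouse if row["dept"] == "east"]

print(warehouse)  # [{'dept': 'east', 'amount': 120.5}, {'dept': 'west', 'amount': 80.0}]
print(east_mart)  # [{'dept': 'east', 'amount': 120.5}]
```

The lake keeps everything, including the image and the incomplete sale. The warehouse holds only transformed, analysis-ready rows, and the mart narrows those to a single business function.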

35

What type of data do data lakes use?

Unstructured and structured data from across the company (raw data)

36

What type of data do data warehouses use?

Historical data in a structured format designed for a relational database (processed data)

37

What is the purpose of data lakes?

Cost-effective storage of big data

38

What is the purpose of data warehouses?

Aggregated big data for analytics and business decisions

39

Who are the users of data lakes?

Data Scientists

40

Who are the users of data warehouses?

Data Analysts

41

What are the activities of a data lake?

1. Storing big data

2. Big data analytics (data science)

42

What are the activities of a data warehouse?

1. Supporting business analysis

2. Read-only queries for aggregating or extracting data

43

What is the scope of data of a data lake?

All data in a company

44

What is the scope of data of a data warehouse?

Only data relevant to analysis

45

True or False: Data flows through an enterprise data lake to an enterprise data warehouse

True

46

Refers to extremely large and complex data sets that can be analyzed using recent technological innovations to reveal patterns and associations. The data is so large, generated so fast, and so unstructured that it surpasses the limitations of traditional systems and databases.

Big Data

47

What are the 5 characteristics of big data?

1. Volume

2. Velocity

3. Variety

4. Veracity

5. Value

48

The speed at which data is generated.

Velocity

49

The accuracy and truthfulness of the data.

Veracity

50

The diversity of data created or collected.

Variety

51

The quantity and scale of data generated every second.

Volume

52

Arguably the most important of the 5 Vs, because data isn't useful to a business unless it can be converted into valuable information. It is about extracting useful data.

Value