Information Systems Final - first chunk

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/24

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

data

observations, symbols, or representations that are recorded

2
New cards

information

data placed into a meaningful context

3
New cards

knowledge

the application of information to achieve a goal

4
New cards

databases

a collection of files that are organized as tables

5
New cards

tables

made up of records (rows)

6
New cards

rows

Made up of fields (columns)

7
New cards

fields

made up of characters

8
New cards

flat file database

a single table

9
New cards

relational database

two or more tables with relationships between the tables, tables are related to each other based on a common key field, joined tables look like a flat file database

10
New cards

pros of relational databases

reduces errors and necessary input

11
New cards

steps to using a database

  1. understand the business needs

  2. define entities

  3. define properties/fields

  4. define relationships

12
New cards

data quality definition

the degree to which the data is fit for its intended use (does the data make sense?)

13
New cards

data consistency

what if the data is in different formats?, inconsistent data makes it difficult to combine data into a single file for analysis, How do we decide which standard to use?

14
New cards

common data issues

redundant data across the organization, some data elements stored in different formats, different naming conventions, different unique identifiers

15
New cards

data agency problem

the data creator is usually not the data consumer, the entire organization/project incurs the bad cost of data, and it must be cleaned up at some point

16
New cards

productivity tax

it takes 10 times as much to complete a unit of work when the data is flawed in any way as it does when the data is good, the cost of “non-value-added’ work

17
New cards

dealing with problematic data (outliers) options

  • remove it

  • use average of other data points

  • guessing at the correct values

18
New cards

dealing with problematic data (inconsistent values) possible options:

fix for consistency so analysis can be done (state names, dates, currencies)

19
New cards

connecting diverse data sets: joining

combines tables for analysis, same thing as merging

20
New cards

connecting diverse data sets: appending

emphasizes enriching data by adding value-added information into existing records from external sources

21
New cards

metadata:

data about data, such as title, description, or a data type (not the data itself)

22
New cards

importance of metadata

helped us resolve inconsistencies because it can be an answer to why data is organized in a certain way (fields/descriptions)

23
New cards

data lake

where you put stuff that is not organized yet

24
New cards

type of file pivot tables use

flat file with all related data in the same columns, which allows you to aggregate data by column name

25
New cards

pivot table operations

  • select measures

  • select the dimensions (columns)

  • select granularity

  • select aggregation operation

  • additional (select filter conditions and create calculated fields)