AIS Chapter 5 and 7

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/79

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:20 PM on 6/25/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

80 Terms

1
New cards

How is big data changing business and accounting?

Big data allows organizations to analyze massive amounts of information to improve decision-making, forecasting, auditing, tax planning, risk assessment, customer service, and operational efficiency.

Examples:

  • Auditors test entire populations instead of samples.

  • Tax professionals estimate tax impacts in real time.

  • Airlines predict arrival times more accurately.

2
New cards

What are the benefits of big data?

  • Better decision-making

  • Improved forecasting

  • Increased efficiency

  • Better customer insights

  • Competitive advantage

  • Reduced costs

3
New cards

What are challenges of big data?

  • Data quality issues

  • Privacy concerns

  • Data security risks

  • Storage costs

  • Complex analysis requirements

  • Data becoming dark data or a data swamp

4
New cards

What are the 4 V's of big data?

  1. Volume

  2. Velocity

  3. Variety

  4. Veracity

5
New cards

What is Volume?

The amount of data collected and stored.

Example: Amazon storing billions of customer transactions.

6
New cards

What is Velocity?

The speed at which data are generated and processed.

Example: Credit card transactions occurring every second.

7
New cards

What is Variety?

Different forms of data.

Examples:

  • Text

  • Images

  • Videos

  • Emails

  • Social media posts

8
New cards

What is Veracity?

The quality, accuracy, and trustworthiness of data.

Example: Removing duplicate customer records.

9
New cards

How can each V impact the data processing cycle?

V

Impact

Volume

More storage and processing needed

Velocity

Faster data processing required

Variety

More complex ETL process

Veracity

More validation and cleaning required

10
New cards

What are the four steps of the analytics mindset?

  1. Ask the right question

  2. Extract, Transform, Load (ETL)

  3. Apply analytics

  4. Interpret and share results

11
New cards

What makes a good data analytics question?

SMART

  • Specific

  • Measurable

  • Achievable

  • Relevant

  • Timely

12
New cards

Example of a SMART analytics question?

"What factors increased sales revenue during Q1 2026?"

Specific, measurable, relevant, and time-bound.

13
New cards

What does ETL stand for?

Extract
Transform
Load

14
New cards

What are the three steps in data extraction?

Understand data needs

Extract data

Verify and document extraction

15
New cards

What is metadata?

Data about data.

Examples:

  • Data type

  • Field length

  • Date format

16
New cards

What is a data dictionary?

Documentation describing the structure and meaning of data fields.

17
New cards

What is a primary key?

A field that uniquely identifies each record.

Example: Employee ID

18
New cards

What occurs during transformation?

  • Standardizing

  • Cleaning

  • Structuring

  • Formatting data

19
New cards

Why is data transformation often the most time-consuming step?

Data from different systems frequently have different formats and structures.

20
New cards

What happens during loading?

Transformed data are imported into a database, software application, or analysis tool.

21
New cards

What is structured data?

Highly organized data stored in fixed fields.

Examples:

  • Relational databases

  • Spreadsheets

22
New cards

What is unstructured data?

Data with no predefined format.

Examples:

  • Images

  • Videos

  • Emails

  • Social media posts

23
New cards

What is semi-structured data?

Data that has some organization but not enough for a relational database.

Examples:

  • XML

  • JSON

  • CSV files

24
New cards

What is a data warehouse?

A repository of structured data from many sources.

25
New cards

What is a data mart?

A smaller subset of a data warehouse.

Example: Sales department data mart.

26
New cards

What is a data lake?

Storage for structured, semi-structured, and unstructured data.

27
New cards

What is dark data?

Data collected but never analyzed.

28
New cards

What is a data swamp?

Poorly documented data that cannot be effectively identified or analyzed.

29
New cards

How can companies avoid dark data and data swamps?

  • Maintain accurate data dictionaries

  • Document metadata

  • Regularly analyze data

  • Invest in data governance

30
New cards

What question does descriptive analytics answer?

"What happened?"

31
New cards

Examples of descriptive analytics?

  • Profit margin

  • Inventory turnover

  • Current ratio

  • Budget vs actual

32
New cards

What question does diagnostic analytics answer?

"Why did it happen?"

33
New cards

Example of diagnostic analytics?

Investigating why gross profit decreased.

34
New cards

What is the 5 Whys technique?

Continuously asking "Why?" to find the root cause.

35
New cards

What question does predictive analytics answer?

"What will likely happen?"

36
New cards

Examples of predictive analytics?

  • Forecasting sales

  • Predicting customer churn

  • Fraud prediction

37
New cards

What question does prescriptive analytics answer?

"What should we do?"

38
New cards

Examples of prescriptive analytics?

  • Loan approval systems

  • Inventory reorder recommendations

  • Route optimization

39
New cards

What is the mean?

Arithmetic average.

40
New cards

What is the median?

Middle value.

41
New cards

Why compare mean and median?

Large differences may indicate outliers.

42
New cards

What is an outlier?

An observation far from most other observations.

43
New cards

What measures spread?

  • Range

  • Standard deviation

  • Quartiles

44
New cards

What is a correlation coefficient?

Measure of relationship strength between variables.

Range:
-1 to +1

45
New cards

Does correlation imply causation?

No.

46
New cards

Example of correlation without causation?

Snow gear sales and advertising both increase before winter. Winter may be causing both.

47
New cards

What is a null hypothesis (H₀)?

No relationship exists.

48
New cards

What is an alternative hypothesis (H₁)?

A relationship exists.

49
New cards

What is a Type I Error?

Rejecting a true null hypothesis.

False positive.

50
New cards

What is a Type II Error?

Failing to reject a false null hypothesis.

False negative.

51
New cards

Typical significance level?

0.05

52
New cards

Why visualize data?

  • Easier to understand

  • Faster processing

  • Better communication

  • Supports decision-making

53
New cards

What is data storytelling?

Translating analytics into understandable information for stakeholders.

54
New cards

Best chart for comparisons?

Bar chart

55
New cards

Best chart for showing progress toward goals?

Bullet chart

56
New cards

Best chart for correlation?

Scatterplot

57
New cards

Best chart for correlation using color intensity?

Heatmap

58
New cards

Best chart for distributions?

  • Histogram

  • Boxplot

59
New cards

Best chart for trends over time?

  • Line chart

  • Area chart

60
New cards

Best chart for part-to-whole relationships?

  • Pie chart

  • Treemap

61
New cards

What are the three major visualization design principles?

   Simplification

   Emphasis

Ethical presentation

62
New cards

Four simplification techniques?

Quantity

Distance

Orientation

Color

63
New cards

What is quantity?

Using only the necessary amount of information.

64
New cards

What is distance?

Keeping related information close together.

65
New cards

What is orientation?

Presenting information horizontally when possible and sorting logically.

66
New cards

Four emphasis techniques?

  1. Highlighting

  2. Weighting

  3. Ordering

  4. Color

67
New cards

Examples of highlighting?

  • Arrows

  • Labels

  • Contrast

  • Color

68
New cards

What increases visual weight?

  • Larger size

  • Darker color

  • Greater contrast

  • More density

69
New cards

Why is ordering important?

It helps emphasize important values and improves interpretation.

70
New cards

What is confirmation bias?

Interpreting information in a way that supports existing beliefs.

71
New cards

How can visualizations misrepresent data?

  • Truncated axes

  • Misleading scales

  • Distorted proportions

  • Excessive emphasis

72
New cards

How do you avoid data misrepresentation?

  • Use honest scales

  • Label clearly

  • Present complete information

  • Avoid deceptive formatting

  • Maintain objectivity

73
New cards

An auditor calculates inventory turnover and profit margin. What type of analytics?

Descriptive

74
New cards

A company investigates why profits decreased. What type?

Diagnostic

75
New cards

A bank predicts whether a customer will default on a loan. What type?

Predictive

76
New cards

A system automatically approves or rejects a loan application. What type?

Prescriptive

77
New cards

A company stores emails, videos, and transaction data together. What storage structure?

Data Lake

78
New cards

Which visualization would best show sales trends over five years?

Line Chart

79
New cards

Which visualization would best show relationship between training hours and performance?

Scatterplot

80
New cards

Which visualization would best show percentage of expenses by category?

Pie Chart or Treemap