ACCT 3130 - Chapter 3: Performing the Test Plan and Analyzing the Results

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/40

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

41 Terms

1
New cards

What are the four main categories of data analytics?

  1. Descriptive Analytics

  2. Diagnostic Analytics

  3. Predictive Analytics

  4. Prescriptive Analytics

2
New cards

Descriptive Analytics

Are procedures that summarize existing data to determine what has happened in the past

3
New cards

Diagnostic Analytics

Are procedures that explore the current data to determine why something has happened the way it has, typically comparing the data to a benchmark

4
New cards

Predictive Analytics

Are procedures used to generate a model that can be used to determine that is likely to happen in the future

5
New cards

Prescriptive Analytics

Are procedures that model data to enable recommendations for what should be done in the future

6
New cards

From lowest of highest of value and difficulty?

  • Descriptive Analytics

  • Diagnostic analytics

  • Predictive Analytics

  • Prescriptive Analytics

7
New cards

Descriptive analytics examples:

  • Summary statistics

  • Data reduction or filtering

8
New cards

Summary Statistics

Describe a set of data in terms of their location (mean, median), range (standard deviation, min, max), shape (quartile), and size (count)

9
New cards

Data Reduction or Filterting

Is used to reduce the amount of observations to focus on relevant items (that is, highest cost, highest risk, largest impact, etc.)

10
New cards

How is data reduction/filtering done?

It does this by taking a large set of data (perhaps the population) and reducing it to a smaller set that has the vast majority of the critical information of the larger set

11
New cards

Diagnostic Analytics examples:

  • Profiling

  • Clustering

  • Similarity Matching

  • Co-occurence grouping

12
New cards

Profiling

Identifies the “typical” behavior of an individual, group, or population by compiling summary statistics about the data (including mean, standard deviations, etc.) and comparing individuals to the population

  • used to discover patterns of behavior

13
New cards

Clustering

Helps identify groups (or clusters) of individuals (such as customers) that share common underlying characteristics - identifying groups of similar data elements and the underlying drivers of those groups

14
New cards

Similarity Matching

Is a grouping technique used to identify similar individuals based on data known about them

15
New cards

Co-occurence grouping

Discovers associations between individuals based on common events, such as transactions they are involved in

16
New cards

Predictive analytics example:

  • Regression

  • Classification

  • Link Prediction

17
New cards

Regression

Estimates or predicts the numerical value of a dependent variable based on the slope and interest of a line and the value of an independent variable

18
New cards

Classification

Predicts a class or category for a new observation based on the manual identification of classes from previous observations

19
New cards

Link Prediction

Predicts a relationship between two data items, such as members of a social media platform

20
New cards

Prescriptive analytics examples:

  • Decision support systems

  • Machine Learning and Artificial Intelligence

21
New cards

Decision Support Systems

Are rule-based systems that gather data and recommend actions based on the input

22
New cards

Machine Learning and Artificial Intelligence

Are learning models or intelligent agents that adapt to new external data to recommend a course of action

23
New cards

Summary Statistics

Describe the location, spread, shape, and dependence of a set of observations

24
New cards

Data reduction involves the following steps:

  • Identify the attribute you would like to reduce on or focus on

  • Filter the results

  • Interpret the results

  • Follow up on results

25
New cards

Fuzzy Matching

Locates approximate matches

  • Useful for identifying relationships in imperfect data

26
New cards

What can diagnostic analytics provide?

Insight into why things happened or how individual data values relate to the general population

27
New cards

What type of data is profiling primarily using?

Structured data - data that are stored in a database or spreadsheet and are readily searchable

28
New cards

Profiling relies on gathering summary statistics and identifying outliers:

  • Identify the objects or activity you want to profile

  • Determine the types of profiling you want to perform

  • Set boundaries or threshold for the activity

  • Interpret the results and monitor the activity and/or generate a list of exceptions

  • Follow up on exceptions

29
New cards

What can help show spread and outliers?

Z-scores and box plots

30
New cards

What is an example of data profiling?

Variance Analysis

31
New cards

Benford’s law

Is a diagnostic analytics that compares actual to expected values

32
New cards

What does regression help with?

helps predict expected outcomes

33
New cards

Classification: Training data

Are existing data that have been manually evaluated and assigned a class

34
New cards

Classification: Test Data

Are existing data used to evaluate the modelClassification:

35
New cards

Classification: Decision Trees

Are used to divide data into similar groups

36
New cards

Classification: Decision Boundaries

Mark the split between one class and another

37
New cards

Classification: Pruning

Removes branches from a decision tree to avoid overfitting the model

38
New cards

Classification: Linear Classifiers

Are useful for ranking items rather than simply predicting class probability

  • Useful for determining the important values

39
New cards

Classification: Support Vector Machine

Is a discriminating classifier that is define by a separating hyperplane that work first to find the widest margin and then works to find the middle line

40
New cards

How do we evaluate classifiers?

  • Try to avoid overfitting, or model that are too accurate. They are bad at predicting a future observation

  • Look for the sweet spot where we maximize the accuracy of the testing data

41
New cards

What do we do once other diagnostics and predictive analyses have been performed?

The decision process can be aided by rules-based decision support systems, machine learning models, or added to an existing artificial intelligence model to improve future predictions