Ch 5- Types of Data Analysis

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/50

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

51 Terms

1
New cards

descriptive analytics

characterizes, summarizes, and organizes features and properties of the data to facilitate understanding of the results and the underlying data

what happened/what is happening

2
New cards

sources for descriptive analysis

financial statements

internal information systems

external sources

3
New cards

statistical and summarization tools for descriptive analysis

counts

totals, sums, averages, subtotals

min, max, medians, standard deviation

graphs, bar charts, histograms

horizontal and vertical analysis

ratio analysis

4
New cards

diagnostic analysis

performed to investigate the underlying cause that cannot be answered by simply looing at the descriptive data

why did it happen

what are the reasons for the past result

can we explain why it happened

5
New cards

identify anomalies/outliers

unusual/unexpected results or transactions

Benford’s law

6
New cards

Benford’s law

in any large, randomly produced set of natural numbers, there is an expected distribution of the first digit

if a distribution departs from the expectation it is considered an anomaly and should be investigated

7
New cards

finding previously unknown linkages, patterns, or relationships

drill-down analysis

determine relations/patterns/linkages between varaibles

8
New cards

drill-down analysis

used to uncover details in the data by looking at difference levels of the data to understand why something happened

look for patterns to identify potential correlations

9
New cards

determine relations/patterns/linkages between varaibles

correlation

regression

hypothesis testing

10
New cards

predictive analysis

performed to foresight by identifying patterns in historical data and assessing the likelihood or probability

will it happen in the future

what is the probability something will happen

is it forecastable

11
New cards

3 broad categories of predictive analysis

classification

regression

forecasting using time series analysis

12
New cards

prescriptive analysis

performed to identify the best possible options given constraints or changing conditions

what should we do based on what we expect to happen

how do we optimize our performance based on potential contraints

13
New cards

5 broad categories of prescriptive analysis

sensitivity

capital

marginal

goal-seeking

what-if scenario

14
New cards

sensitivity analysis

evaluate outcomes based on uncertainty regarding the inputs

15
New cards

capital analysis

evaluating future cash flows for potential investments

16
New cards

marginal analysis

used to determine the change in profit association with the cost or benefit of the next unit produced

17
New cards

goal-seeking analysis

what-if analysis that tells what needs to be done to reach a desired outcome

18
New cards

what-if scenario analysis

analysis of potential future events by considering potential outcomes

19
New cards

population

group of phenomenon having something in common

20
New cards

sample

a subset of members of a population selected to represent that population

21
New cards

parameter

characteristics of a population

22
New cards

statistic

characteristic of a sample

23
New cards

mean

sum of all data points divided by the number of data points

24
New cards

median

midpoint of the data in a sorted array

25
New cards

mode

observation that occurs most frequently

26
New cards

range

differences between the maximum and minimum values

27
New cards

variance

average of squared differences from the mean

28
New cards

standard deviation

square root of the variance

29
New cards

probability distributions

statistical property that describes the possible values of random variables and the likelihood that a random variable will be within a given range

30
New cards

3 primary probability distributions

normal

uniform

poisson

31
New cards

normal distribution

bell-shaped probability distribution that is symmetric about the mean with data points closer to the mean frequent than those farther from the mean

32
New cards

how much data is included when data is within 1 sd

68%

33
New cards

how much data is included when data is within 2 sd

95%

34
New cards

how much data is included when data is within 3 sd

99.7%

35
New cards

z-score

tells how many standard deviations a data point is from the mean

36
New cards

uniform distribution

probability distribution where all outcomes are equally likely

37
New cards

poisson distribution

distribution characterized as the mean number of events per interval of space or time

38
New cards

hypothesis

assumption of theory based on an understanding of the data

39
New cards

null hypothesis

assumes that the hypothesized relationship does not exist; there is no significant difference between two samples or populations

40
New cards

H0

grades for students who study are less than or equal to the grades for students who do not study

41
New cards

alternate hypothesis

the case that is believed to be true; opposite of the null hypothesis or a result that is expected

42
New cards

HA

grades for students who study are greater than grades for students who do not study

43
New cards

p-value

result of a test that either rejects or fails to reject the null hypothesis

determine statistical significance by comparing the p-value to a threshold value

44
New cards

p-value > threshold

fail to reject the null hypothesis; not significant result

45
New cards

p-value less than or equal to threshold

reject the null hypothesis; significant result

46
New cards

confidence interval

measures the probability that a population parameter will fall between two set values

47
New cards

sample t-test

used to compare the means of two sets of data observations

48
New cards

paired t-test

compares the same population but at a different time

49
New cards

statistical output from a regression

used to measure the relationship between one output variable and various inputs

50
New cards

output variables

dependent variable

51
New cards

input variable

independent variable