MSIS 3223

0.0(0)
studied byStudied by 1 person
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/78

flashcard set

Earn XP

Description and Tags

Oklahoma State University Business Data Analytics Exam 1

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

79 Terms

1
New cards

Relative address

just row & column label in cell reference ex) A4

2
New cards

absolute address

uses $ before either row or column or both to keep the reference fixed ex) $F$4 OR $F4 OR F$4

3
New cards

MIN(range)

excel function finding smallest value in range of cells

4
New cards

MAX(range)

finds largest value in range of cells

5
New cards

SUM(range)

finds sum of values in range of cells

6
New cards

AVERAGE(range)

finds average of values in range of cells, also known as mean

7
New cards

COUNT(range)

find # of cells in a range that contains numbers

8
New cards

COUNTIF(range, criteria)

finds # of cells within a range that meet a specified criterion ex) COUNTIF(A1:A20, >20) OR COUNTIF(A1:A20, “Jim”)

9
New cards

descriptive analytics

most commonly used, summarizes data into meaningful charts/reports

10
New cards

predictive analytics

seeks to predict future by examining historical data and detecting patterns/relationships then extrapolating relationships forward in time

11
New cards

prescriptive analytics

uses optimization to identify best alternatives to minimize or maximize an objective, to prescribe the best solution/ course of action in order to accomplish a goal

12
New cards

data reliability

data is accurate and consistent

13
New cards

data validity

data measures what it’s supposed to measure

14
New cards

Decision Models

3 Types of Input

  • Data (assumed constant for model purposes)

  • uncontrollable inputs

  • decision options

3 types

  • descriptive models

  • predictive models

  • prescriptive models

15
New cards

uncontrollable inputs

quantities that can change but can’t be directly controlled by decision maker ex) FED controls interest rates

16
New cards

decision options (decision variables)

controllable & can be selected at discretion of decision maker ex) how many employees needed to maximize profit

17
New cards

Descriptive decision model

explain behavior, allow users to evaluate potential decisions by asking “what-if?”

18
New cards

predictive decision model

focus on what’ll happen in the future

19
New cards

prescriptive decision model

help decision makers identify best solution to decision problems by using optimization

20
New cards

optimization

process of finding set of values for decision options that minimize/maximize some quantity of interest

21
New cards

phases in problem solving

1) recognizing the problem

2) defining the problem

3) structuring the problem

4) analyzing the problem

5) interpreting results & making decision

6) implementing solution

22
New cards

Recognizing the problem

problem exists when gap btwn what’s happening & what we think should be happening

23
New cards

defining the problem

important to involve all ppl who make decisions or who may be affected by them

24
New cards

structuring the problem

formal model often developed in this phase

25
New cards

analyzing the problem

experimentation/ solution process, evaluating diff scenarios or analyzing risks associated w various decision alternatives

26
New cards

interpreting results & making decision

models can’t capture every detail of the real problem & managers must understand the limitations

27
New cards

implementing solution

making solution work in the org or translating the results back to the real world

28
New cards

logical functions

depend on whether 1 or more conditions are true or false

  • =IF

  • =AND

  • =OR

29
New cards

condition

statement about the value of a cell (numeric or text)

30
New cards

IF(condition, value if true, value if false)

logical function that returns 1 value if condition true and another if value is false ex)=IF(A2>B2, “Over Budget”, “Within budget”)

31
New cards

AND(condition1, condition2…)

logical function that returns TRUE if all conditions are true and FALSE if not ex)=AND(B2>750, C2>750 would return true if both cells actually exceed 750

32
New cards

OR(condition 1, condition 2)

logical function that returns TRUE if any of the conditions is true (even if 1 is not true) and FALSE if not ex)=OR(1>2,1>0) returns TRUE but OR(1>2, 1>3) returns FALSE

33
New cards

Lookup functions

  • VLOOKUP

  • HLOOKUP

  • INDEX

  • MATCH

  • CHOOSE

34
New cards

VLOOKUP

looks up value in leftmost column of a table and returns value in same row from a column you specify

35
New cards

HLOOKUP

looks up value in top row of a table & returns a value in same column from a row you specify

36
New cards

INDEX

lookup function which returns value or reference of the cell at the intersection of a particular row & column in a given range

37
New cards

MATCH

lookup function which returns relative position of an item in an array that matches a specified value in a specified order

38
New cards

CHOOSE

lookup function which returns value from a list based on position in the list, specified by index_num

39
New cards

PivotTables

allows you to create custom summaries/charts of key info and drill down into data

dragging a field into filters area in a PivotTable field list allows you to add 3rd dimension

can express data in various % views ex) % of grand total, % of column total, etc.

40
New cards

excel charts

to create: highlight range of data, click insert tab, click chart type, click chart subtype you wanna use

distinguishes btwn vertical & horizontal charts (column & bar charts)

  • clustered column

  • stacked column

  • line charts

  • area chart

  • scatter chart

  • bubble chart

  • combo chart

  • radar chart

  • stock chart

41
New cards
<p>clustered column chart</p>

clustered column chart

compares values across categories using vertical rectangles

42
New cards
<p>stacked column chart</p>

stacked column chart

displays contribution of each value to the total by stacking the rectangles

43
New cards
<p>line charts</p>

line charts

provide useful means for displaying data over time

44
New cards
<p>pie chart</p>

pie chart

displays relative proportion of each data source by partitioning circle into pie-shaped areas

45
New cards
<p>area chart</p>

area chart

combines features of pie chart & line chart, useful for displaying proportions over time

46
New cards
<p>scatter chart</p>

scatter chart

shows relationship btwn 2 variables and consists of observations of pairs of variable data

  • orbit chart: scatter chart where points are connected in sequence over time

47
New cards
<p>bubble chart</p>

bubble chart

type of scatter chart in which size of data corresponds to value of a 3rd variable, a way to plot 3 variables in 2-D

48
New cards
<p>combo chart</p>

combo chart

displays multiple data series on the same chart using diff chart types such as a column chart & a line chart

49
New cards
<p>radar chart</p>

radar chart

shows multiple metrics on a spider web, allowing plotting of multiple dimensions of several data series

50
New cards
<p>stock chart</p>

stock chart

allows you to plot stock prices ex) daily high, low, close values

51
New cards

dashboards

visual representation of set of key business measures, important summaries of key business info to help manage business process/function

52
New cards

categorical (nominal data)

sorted into categories according to specified characteristics

53
New cards

ordinal data

can be ordered or ranked according to some relationship to eachother

54
New cards

interval data

are ordinal but have constant differences btwn observations and have arbitrary zero points, allow meaningful comparison of ranges/averages & other stats

55
New cards

ratio data

continuous & have natural zero point, most business/ economic data ($ or time) fall into this category

56
New cards

frequency distribution

  • table showing # of observations in each of several non-overlapping groups

  • graphical depiction in form of a column chart is called a histogram

  • categorical data naturally define the groups

  • to make: use COUNTIF to count # of observations in each category

  • can be expressed as fraction or proportion of total (relative frequency)

57
New cards

relative frequency

frequencies expressed as fraction or proportion of total

to make: # of observations in each category divided by total # of observations in all categories

58
New cards

cross tabulation

tabular method displaying # of observations in a data set in diff subcategories of 2 categorical variables

often called a contingency table

subcategories of variables must be mutually exclusive & exhaustive (each observation only fit in 1 subcategory & all observations will be in a category)

59
New cards

population 

all items of interest for a particular decision/investigation

60
New cards

sample/sampling

subset of population, is necessary as most populations are too large to deal with- this allows easier analyzing to obtain sufficient info to draw a valid inference about a population

61
New cards

measures of location

provide estimates of a single value that represents the “centering” of a set of data

  • average

  • median

  • mode

  • midrange

62
New cards

average/arithmetic mean

most common measure of location, sum of observations divided by # of observations

is affected by outliers

63
New cards

median

measure of location that specifies middle value when data arranged least-greatest

not affected by outliers

64
New cards

mode

measure of location, observation that occurs most frequently, useful for data sets which relatively small # of unique values

65
New cards

midrange

measure of location used occasionally, the average of the largest & smallest values

highly affected by outliers

66
New cards

dispersion

degree of variation in the data/ numerical spread (or compactness) of the data

  • range

  • interquartile range

  • variance

  • standard deviation

67
New cards

range

easiest to compute, difference btwn max value and min value of a data set

68
New cards

interquartile range (mid spread)

diff btwn first & third quartiles

69
New cards

variance

more commonly used, the larger the variance the more the data is spread out from the mean, average of the squared deviations of the observations from the mean

70
New cards

standard deviation

square root of the variance, popular measure of risk especially in financial analysis

71
New cards

histogram

graphical form of frequency distribution, can take many diff shapes

  • skewness

  • coefficient of skewness

  • kurtosis

72
New cards

Skewness

lack of symmetry of data

  • Positively skewed (right-skewed)- mass of data on left with tail to right (picture)

  • negatively skewed (left-skewed)- mass of data on right and tail to left

  • used on histogram

<p>lack of symmetry of data</p><ul><li><p>Positively skewed (right-skewed)- mass of data on left with tail to right (picture)</p></li><li><p>negatively skewed (left-skewed)- mass of data on right and tail to left</p></li><li><p>used on histogram</p></li></ul><p></p>
73
New cards

coefficient of skewness

measures degree of asymmetry of observations around the mean of a histogram

74
New cards

kurtosis

peakedness or flatness of a histogram

75
New cards

proportion

formal statistical measure of a fraction of data that has certain characteristics

76
New cards

strong statistical relationship

when 2 variables appear to move together

77
New cards

covariance

measure of the linear association btwn 2 variables

78
New cards

correlation

measure of the linear relationship btwn 2 variables, not dependent on units of measurement, measured by correlation coefficient

79
New cards

correlation coefficient

  • scaled btwn -1 & 1, used to measure correlation

  • positive correlation coefficient (close to 1) indicates positive linear relationship btwn the variables

  • negative correlation coefficient (close to -1) indicates negative linear relationship exists btwn the variables

  • correlation coefficient close to 0 indicates no linear relationship btwn variables