Stats Exam 2

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/89

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

90 Terms

1
New cards

trimmed mean

deleting percentage of smallest and largest values from data set and computing mean of the rest

2
New cards

bimodel

data set with 2 modes

3
New cards

multimodal

data set with more than 2 modes

4
New cards

weighted mean

(weight of obs x value of obs) / weight of obs

5
New cards

geometric mean definition

used in analyzing growth rates in financial data; applied anytime want to determine mean rate of change over successive periods

6
New cards

geometric mean formula

xg = nth root of [(x1)(x2)…(xn)]

7
New cards

percentile definition

at least p% of items take on this value or less, at least (100-p)% of items take on this value or more

8
New cards

location of pth percentile

Lp = (p/100)(n-1)

9
New cards

1st quartile

25th percentile

10
New cards

2nd quartile

50th percentile

11
New cards

3rd quartile

75th percentile

12
New cards

measures of variability

range, interquartile range, variance, standard deviation

13
New cards

interquartile range formula

IQR = Q3 - Q1

14
New cards

variance definition

difference between xi and the mean

15
New cards

sample variance formula

s² = (sum of [xi-x]²) / (n-1)

16
New cards

population variance formula

S² = (sum of [xi-u]²) / (N)

17
New cards

standard deviation definition

measures the dispersion of a dataset relative to its mean

18
New cards

standard deviation formula

square root of the variance (s for sample, omega for population)

19
New cards

coefficient of variation definition

indicates how large the standard deviation is in relation to the mean

20
New cards

coefficient of variation formula

CV = (population standard deviation) / (population mean)

21
New cards

left skewness

skewness is negative, mean < median, tail to the left

22
New cards

right skewness

skewness is positive, mean > median, tail is to the right

23
New cards

highly skewed right

skewness is often above 1.0

24
New cards

z-scores definition

number of standard deviations a data value is from the mean

25
New cards

z-score formula

zi = (xi-x) / (s)

26
New cards

negative z-score

data value < sample mean

27
New cards

positive z-score

data value > sample mean

28
New cards

Chebyshev’s Theorem

at least 1 - (1/z²) of items will be within z standard deviations of the mean

29
New cards

z=2

at least 75% data points

30
New cards

z=3

at least 89% data points

31
New cards

z=4

at least 94% data points

32
New cards

empirical rule definition

when data approximates a bell-shaped distribution, can determine % of values within specified number of standard deviations

33
New cards

empirical rule ± 1 standard deviation

68%

34
New cards

empirical rule ± 2 standard deviations

95%

35
New cards

empirical rule ± 3 standard deviations

99.7%

36
New cards

outlier characteristics

± 3 standard deviations away, incorrectly recorded, included, or just unusual

37
New cards

boxplot requirements

smallest value, first quartile, median, third quartile, largest value

38
New cards

lower limit formula

Q1 = 1.5(IQR) below Q1

39
New cards

upper limit

Q3 = 1.5(IQR) above Q3

40
New cards

covariance definition

linear association between two variables

41
New cards

sample covariance formula

sxy = (sum of [(xi-x)(yi-y)]) / (n-1)

42
New cards

population covariance formula

sigmaxy = (sum of [(xi-Mx)(yi-Yy)]) / (N)

43
New cards

correlation coefficient definition

linear association, not necessarily causation, can take on values between -1 and 1

44
New cards

sample correlation coefficient formula

rxy = (sxy) / [(sx)(sy)]

45
New cards

population correlation coefficient formula

Pxy = (sigmaxy) / [(sigmax)(sigmay)]

46
New cards

strong negative correlation

-1

47
New cards

strong positive correlation

1

48
New cards

data dashboard characteristics

graphical and numerical

49
New cards

drilling down (data dashboards)

functionality in interactive dashboards that allow user to access information at a detailed level

50
New cards

probability definition

likelihood an event will occur, scale from 0-1

51
New cards

stats experiment

generates a well-defined outcome

52
New cards

sample space

set of all experimental outcomes

53
New cards

sample point

experimental outcome

54
New cards

coin toss experiment

the actual toss

55
New cards

coin toss sample space

heads, tails

56
New cards

counting rule for multiple step experiment definition

sequence of k steps in which there are n1 possible results for first step, n2 for second step…

57
New cards

total number of experimental outcomes formula

(n1)(n2)k(nk)

58
New cards

combinations formula

CNn = (N!) / (n!)(N-n)!

59
New cards

permutations formula

PNn = (n!) [(N!) / (N-n)!]

60
New cards

event

collection of sample points

61
New cards

probability

sum of probabilities of sample points in the event

62
New cards

complement of event definition

event consisting of all sample points NOT in A; P(A) = P(Ac)

63
New cards

Union of two events definition

event containing all sample points IN A OR B OR BOTH; A U B

64
New cards

intersection of two events definition

set of all sample points in BOTH A AND B

65
New cards

mutually exclusive events definition

events have no sample points in common, when one event occurs the other can’t occur

66
New cards

conditional probability definition

probability of event given that another event has occurred

67
New cards

(independent) P(A and B)=

P(A)P(B)

68
New cards

(dependent) P(A and B)=

P(A)P(B|A)

69
New cards

independent events definition

if probability of A is not changed by the existence of B

70
New cards

if one mutually exclusive event is known to occur,

the other event’s probability is 0

71
New cards

2 events that are not mutually exclusive,

might or might not be independent

72
New cards

Bayes’ Theorem

prior probability, likelihood of the evidence, and posterior probability

73
New cards

Bayes’ Theorem formula

P(A|B) = [P(B|A)P(A)] / P(B)

74
New cards

probability distribution definition

describes how probabilities are distributed over values of random variables

75
New cards

discrete uniform probability distribution

f(x) = 1/n

76
New cards

discrete uniform expected value

E(x) = xf(x)

77
New cards

discrete uniform variance

(x-u)²f(x)

78
New cards

bivariate distribution definition

2 random variables being looked at, interested in relationship between them

79
New cards

binomial probability distribution characteristics

experiment with n identical trials, 2 outcomes success and failure on each, prob of success is p, does not change from trial to trial, trials are independent

80
New cards

binomial prob dist formula

f(x) = (n!) / [x!(n-x)!] p^x (1-p)^(n-x)

81
New cards

binomial prob dist variance formula

var(x) = np(1-p)

82
New cards

poisson probability distribution characteristics

prob of occurrence is same for any 2 intervals of equal length, occurrence or non in interval is independent of occurrence in any other interval, number of occurrences per interval

83
New cards

poisson formula

f(x) = [(M^x)(e^-M)] / x!

84
New cards

hypergeometric prob dist

trials not independent, prob of success does change from trial to trial

85
New cards

uniform prob dist expected value

E(x) = (a+b)/n

86
New cards

uniform prob dist variance

(b-a) / n

87
New cards

normal prob dist characteristics

skewness is 0, mean median mode same

88
New cards

z-score formula

z = (x-M) / standard dev

89
New cards

exponential prob dist characteristics

time takes to complete a task, mean and standard dev are same, length of interval between occurrences

90
New cards

exponential prob formula

p(x-x0) = 1+e^(-x0/M)