MIS 301 Quiz 3 Shaul SDSU

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/87

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

88 Terms

1
New cards

ANOVA

Analysis of variance. A statistical procedure examining variations between two or more sets of interval or ratio data.

2
New cards

for an ANOVA test what is the numerator of the variance

the basis of the test

3
New cards

with each ANOVA test

our probability of making a type 1 error increases

4
New cards

type one error

rejecting null hypothesis when it is actually true - a false positive

5
New cards

type 2 error

failing to reject a false null hypothesis

6
New cards

TS > CV

reject null

7
New cards

TS < CV

fail to reject null

8
New cards

assumptions of ANOVA Test

the null is true

at least interval level data

the Central Limit Theorem is Satisfied

random independent samples

the variances are equal

9
New cards

ANOVA factor/classification/treatment

overarching title on how our data is organized

10
New cards

ANOVA categories

nominal level data that identifies a group

11
New cards

criterion variable

the actual numerical values

12
New cards

ANOVA data variation

within and betweem

13
New cards

ANOVA Data Variation : Within

due to chance, randomness and error

14
New cards

ANOVA Data Variation: Between

due to factor/classification/treatment

15
New cards

ANOVA: Distance from x to x double bar is

sum of squares total or total variation

16
New cards

ANOVA: distance from x to x bar is

sum of squares between or variation due to factor

17
New cards

ANOVA: distance from x bar to x double bar

sum of the squares within or variation due to chance

18
New cards

Underlying theory of ANOVA Test

total variation can be portioned into 2 parts- within and between, and those two components can be compared to determine which is affecting the data at a greater degree

19
New cards

ANOVA formula for test stat

Between Term/ Within Term

20
New cards

ANOVA: large test stat means

more likely to reject the null because the factor is affecting the data

21
New cards

ANOVA: small test stat means

more likely to fail to reject because the variation is due more to chance than the factor

22
New cards

bivariate data

Data with two variables

23
New cards

multivariate data

More than two variables are measured on a single experimental unit.

24
New cards

time series

a time-ordered sequence of observations taken at regular intervals

25
New cards

cross section

variation across units of observation during one point of time

26
New cards

panel data

information collected from a group of consumers, organized into panels, over time

27
New cards

r

sample correlation coefficient

28
New cards

The value of r is always between

-1 and 1

29
New cards

the size of the correlation r indicates...

the strength of the linear relationship of x1 and x2

30
New cards

values of r close to 1 or -1 indicate

strong correlation

31
New cards

if r is 0

there is no correlation

32
New cards

if r is 1

there is a perfect positive correlation- when x1 increase, x2 increase

33
New cards

if r is -1

there is perfect negative correlation- when x1 increase x2 decrease

34
New cards

for correlation, data is compared to ______ then to ______

compared to their means then to each other

35
New cards

sample size needed for correlation

10 data points for x and 10 for y

36
New cards

relationships change

over time, outside the data, and across space

37
New cards

correlation leads to

causation then liability, opportunity, or beneficiary

38
New cards

a high r does

not always mean we need to reject the null

39
New cards

sample size effect

as sample size increase, r significance goes down

40
New cards

if r is significant and strong correlation

it is useful

41
New cards

if r is not significant, and weak correlation

it is not useful

42
New cards

if r is not significant and the correlation is weak

it is not useful

43
New cards

if r is significant but weak correlation

it is not useful

44
New cards

if r is significant and moderate correlated

it could be useful

45
New cards

simple linear regression

regression analysis involving one independent variable and one dependent variable in which the relationship between the variables is approximated by a straight line

46
New cards

error for SLR is

actual value- predicted value

47
New cards

total variation in y can be broken into two variables

residual term and regression term

48
New cards

residual term

y's relationship with the x variable

49
New cards

regression term

random factors not in the model- error

50
New cards

four concepts of slr

1. the coefficient of determination

2. isolating the slope: effect of marginal inputs on the outputs

3. over and underperforming the model (y above the line, positive e, over performing; y below the line, negative e, under performing)

4. Restricted model: some predictor variables

51
New cards

alpha > p value

reject the null

52
New cards

alpha < p value

fail to reject null

53
New cards

line of best fit

the regression line that best fits the observed data and minimizes the error in prediction

54
New cards

properties of residual error

if r = +/-1, e= 0, ordinarily, least squares regression pulls down total error

55
New cards

df

degrees of freedom

56
New cards

SS

sum of squares

57
New cards

MS

mean square

58
New cards

F

test stat

59
New cards

F=

MS regression/ MS residual

60
New cards

as the predictor variable increase

the adjusted r squared drops, so you can't just keep adding more predictor variables to drive up multiple r because r square will be penalized

61
New cards

if the general rule regarding sample size is not met...

adjusted r squared is a better measurement of regression strength

62
New cards

as shared variation between the x variable and the y variable increases

r approaches it's upper and lower limit respectfully

63
New cards

significance f =

p-value

64
New cards

dummy variable

A variable for which all cases falling into a specific category assume the value of 1, and all cases not falling into that category assume a value of 0.

65
New cards

collinearity check

if r is greater than or equal to .6, be concerned

if r is greater than or equal to ,8, there is a collinearity error

66
New cards

for a multivariate regression

R is always positive, does not suggest the direction of the relationship, R is greater than or equal to any single X-Y relationship, R is a single value representing the strength of a simultaneous relationship between the x variables and the y variables

67
New cards

Multiple Regression population notation

β0

68
New cards

Multiple Regression partial correlation coefficient sample notation

β1

69
New cards

Collinearity

when 2 or more x variables are highly correlated with each other

70
New cards

Y hat

predicted value

71
New cards

y bar

chance model

72
New cards

DFregression

k-1

73
New cards

DFresidual

nt-k

74
New cards

DFtotal

nt-1

75
New cards

high MS regression/ low MS residual

F is high, probs gonna reject null

76
New cards

low ms regression/ high ms residual

low f, prob gonna fail to reject

77
New cards

ANOVA table tells us

whether or not the model is a good one- reject= a good model, FTR = a bad model

78
New cards

regression stats tell us

multiple r which tells us the level of correlation between the x variable and y

79
New cards

coefficient table

tell us the significance of each component- FTR = the x variable is not a good predictor, reject= the x value is a good predictor

80
New cards

the ANOVA table is a ____ question

stat question

81
New cards

the coefficient table is a _____ question

stat question

82
New cards

as collinearity decreases, there is an increase in

each predictor variable's unique position of the variability within the y variable

83
New cards

y below the line

negative e, under performing

84
New cards

y above the line

positive e, over performing

85
New cards

when p is low

reject Ho

86
New cards

regression stat table is a _____ question

judgement

87
New cards

when n goes up

ms residual goes down

88
New cards

when ms residual goes down

the test stat goes up