PLCY 460 Midtern

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/79

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

80 Terms

1
New cards

*

comment line

2
New cards

use command

load the stata data file

3
New cards

summarize command

summarize X1 X2 and Y

4
New cards

drop command

delete a variable

5
New cards

regress command

makes a regression

6
New cards

independent variable

variable that possibly influences the value of the dependent variable, X

7
New cards

dependent variable

the outcome of interest dependent on the independent variable, Y

8
New cards

slope coefficient

coefficient that reflects how much the dependent variable increases when the independent variable increases by one, B1

9
New cards

constant/intercept

the point at which a regression line crosses the Y-axis, B0

10
New cards

error term

term associated with unmeasured factors in a regression model, E (epsilon)

11
New cards

i

the observation number

12
New cards

population regression equation

Yi = B0 + B1Xi + Ei

13
New cards

endogenous

when changes in the independent variable are related to factors in the error term (outside factors)

14
New cards

exogenous

when changes in the independent variable are NOT related to factors in the error term

15
New cards

why exogenous > endogenous?

you want to focus on exogenous variables instead of endogenous variables to make reasonable inferences

16
New cards

correlation

measures the extent to which two variables are linearly related to each other

17
New cards

positive correlation

high values of one variable are associated with high values of the other, normally upward trends

18
New cards

negative correlation

high values of one variable are associated with low values of the other, normally downward trends

19
New cards

two challenges of econometrics

randomness and endogeneity

20
New cards

randomization

the process where the value of the independent variable is determined by a rando process, like chance, making the IV uncorrelated with everything

21
New cards

randomized controlled trial

experiment where the treatment of interest is randomized; good because they create exogeneity

22
New cards

treatment group

group that receives treatment of interest

23
New cards

control group

group that does not receive treatment of interest

24
New cards

internal validity

when research findings are not biased

25
New cards

external validity

when research findings can be applied outside of the context in which the experiment was conducted

26
New cards

standard deviation

measured how widely dispersed the values of the observation are

27
New cards

replication

research that can be duplicated based on the information provided at the time of publication using replication files

28
New cards

how to open a Do file

click on window, do file editor, new do-file editor, safe as “assignment.do”

29
New cards

how to load Stata data files

file, open, select file, save command in syntax file "C

30
New cards

tabulate command

produces a frequency table when prompted, must include variable name after

31
New cards

list command

lists out observations, must include variable name after

32
New cards

equal to command

==

33
New cards

not equal to command

!=

34
New cards

if command

limits the data used in analyses

35
New cards

scatter x y command

plotting scatterplot of two variables

36
New cards

fitted value

value of Y predicted by the estimated equation, Yi hat = B0 hat + B1 hat Xi

37
New cards

what do hats mean in equations

hats are estimates

38
New cards

regression line

fitted line from regression

39
New cards

residual

distance between fitted value and actual observed value, E hat = Yi – B0 hat – B1 hat Xi

40
New cards

OLS estimation strategy

minimizing the sum of the squared distances of each data point from the regression line, using calculus to find the solution

41
New cards

OLS formula

(i=1)^N▒〖ϵhat= 〗 ∑(i=1)^N▒〖(Yi-B0 hat=B1hat Xi)^2〗

42
New cards

Sampling randomness

variations in estimates seen in a subset of an entire population

43
New cards

Modeled randomness

variation that exits even when observing an entire population

44
New cards

central limit theorem

the average or any random variable follows a normal distribution; histogram should look like a normal distribution

45
New cards

variance

a measure of how much a random variable varied

46
New cards

standard error

the square root of variance

47
New cards

variance of the regression

the variance of the regression measures how well the model explains variation in the dependent variable

48
New cards

homoscedastic

when a random variable has the same variance for all observations

49
New cards

heteroscedastic

when some observations are on average closer to the predicted value than others

50
New cards

goodness of fit

how well a model fits the data, r^2

51
New cards

standard error of regression/goodness of fit

a measure of goodness of fit as the square root of the variance of the regression; σ hat

52
New cards

outliers

observations that are extremely different from the rest of the sample

53
New cards

sample size and outliers

when sample sizes are small, a single outlier can exert considerable influence on OLS coefficient estimates

54
New cards

hypothesis testing

process assessing whether the observed data is or is not consistent with a claim of interest

55
New cards

null hypothesis

a hypothesis of no effect; H0 = B1 = 0

56
New cards

alternative hypothesis

the outcome that is accepted if null hypothesis is rejected

57
New cards

reject null hypothesis

there is sufficient evidence that says that the null hypothesis is false, and the alternative hypothesis is true

58
New cards

fail to reject null hypothesis

there is not enough evidence to prove the null hypothesis is false

59
New cards

type 1 error

rejecting a null hypothesis that is really true

60
New cards

type 2 error

failing to reject a null hypothesis that is actually false

61
New cards

significance level

probability of committing a type 1 error; common is α=0.05

62
New cards

trade off between type 1 and 2 errors

lowering the significance level decreases the probability of making a type 1 error while increasing the probability of making a type 2 error

63
New cards

steps to do a hypothesis test

choose one-sided or two-sided alternative hypothesis; set a significance level α; find a critical value based on the t distribution; use OLS to estimate parameters

64
New cards

confidence interval

range of true values that are most consistent with the observed coefficient estimate

65
New cards

stata commands for hypothesis test

  1. display invttail (number of observations – number of parameters, significance level(/2 if two-tailed)) 2. display invnormal (number of observations – number of parameters, significance level(/2 if two-tailed)) 3. Display2*ttail(degrees of freedom, observed t statistics) 4. set obs (possible values) 5. graph twoway

66
New cards

multivariate OLS

OLS with multiple independent variables

67
New cards

multivariate OLS and endogeneity

fights endogeneity by pulling variables from the error term into the estimated equation

68
New cards

ceteris paribus

all else being equal

69
New cards

auxiliary regression

a regression that is not directly the one of interest but yields information helpful in analyzing the equation we really care about

70
New cards

omitted variable bias

bias that results from leaving out a variable that affects the dependent variable and is correlated with the independent variable

71
New cards

measurement error

when a variable is measured inaccurately; has greater effect on the independent variable

72
New cards

attenuation bias

consequence of the omission of the measurement error from the estimated model; grows larger with larger amount of measurement error

73
New cards

multicollinearity

when there are strong linear relationships between independent variables

74
New cards

factors that influence the variance of multivariate estimates

model fit, sample size, variation, multicollinearity

75
New cards

standardized coefficients

the coefficient of an independent variable that has been standardized

76
New cards

standardizing variable formula

〖Variable〗^Standardized=(Variable- (Variable) ̅)/(sd(Variable))

77
New cards

difference of means test

comparing the mean of Y for one group against the mean of Y for another group

78
New cards

dummy variable

either 0 or 1

79
New cards

categorical variables

have two or more categories with no intrinsic ordering

80
New cards

ordinal variables

express rank but not necessarily relative size