UCSB Political Science 15 Final

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/81

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

82 Terms

1
New cards

selection effect/bias

selection of sample that is not randomized, so is it biased

2
New cards

treatment groups

those who get some treatment of interest in an experiment

3
New cards

control group

those who do not get the treatment of interest

4
New cards

observational study

research where you don't get to randomize who gets the treatment. just observing some relationship in the world

5
New cards

experimental study/randomized control

research designs in which you can randomize who gets the treatment

6
New cards

quasi-experimental research

research in which you have observational data, but you find ways to ensure that the treatment was effectively randomly distributed

7
New cards

internal validity

is the experiment well designed? free from confounders or bias?

8
New cards

external validity

is the finding applicable to other populations, situations, or cases? does it apply outside the context of the research?

9
New cards

Yi

dependent variable, the thing we want to predict

10
New cards

Xi

independent variable, the thing that predicts the DV

11
New cards

Ei

error term, the part the DV or IV doesn't explain, everything NOT in our model. not directly observable

12
New cards

endogeneity

the IV is correlated with the error term

13
New cards

confounder

another unmeasured variable that affects the IV and then also affects the DV

14
New cards

randomness

noise in the data, could go away with larger sample sizes

15
New cards

randomization

randomizing to create treatment and control groups, creates exogeneity

16
New cards

distribution of outcome

the idea that you could observe different outcomes with different probabilities, even when you only observe one outcome

17
New cards

population

overall collection of individuals, beyond just the sample

18
New cards

sample

collection of individuals on which statistical analyses are performed, and from which general trends for the population are inferred

19
New cards

individual

object or unit, single data point contributing to the sample

20
New cards

random variable

the thing being measured in any random experiment

21
New cards

expectation

the bets guess about what number will be drawn from the distribution, loosely thought of as "average" of distribution

22
New cards

variance

how far the numbers you draw tend to be from that best guess

23
New cards

central limit theorem (CLT)

: the distribution of the mean

tends toward a normal distribution.

24
New cards

regression model

Yi = Bo + B1Xi + Ei

25
New cards

B1

slope coefficient, relationship between X and Y

26
New cards

B0

constant, value of Y when X is zero (intercept)

27
New cards

covariance

measures how much two random variables vary together

28
New cards

positive correlation

when X is higher, expect Y is higher

29
New cards

negative correlation

when X is higher, we expect Y is lower

30
New cards

not associated

when X is higher, doesn't tell us anything about Y

31
New cards

correlation

measures the extent to which two variables are linearly related to each other,

32
New cards

bivariate regression

technique to estimate a model with two variables (DV and IV), allows us to quantify the degree to which X and Y move together

33
New cards

omitted variable bias

specific form of endogeneity, often why estimates change if the model changes, X is correlated with something that influences Y (error term is correlated with Y)

34
New cards

unbiased estimate

on average, our estimate is equal to the true parameter

35
New cards

biased estimate

our coefficient is systematically wrong, either too high or too low than the true parameter

36
New cards

robust

whether the model changes or not when the specification changes

37
New cards

homoskedasticity

when random variable X has same variance for all observations of X. not a problem

38
New cards

heteroskedasticity

when the random variable X DOES NOT have the same variance for all observations of X. fixable problem. this means non-constant variance in our errors

39
New cards

outlier

observation that is extremely different from the rest of the observations in the sample. this drags the estimate of the mean/slope towards it.

40
New cards

null distribution

describing how weird our result is if there really is no difference

41
New cards

p-value

probability of observing a difference in mean or a coefficient as big as what we observed, if the null hypothesis were true

42
New cards

critical value

a point on a distribution that defines the boundary between rejecting and not rejecting the null hypothesis in a hypothesis test

43
New cards

significantly different

based on critical value

44
New cards

null hypothesis

no effect, no difference in means, no relationship between X and Y, B1 = 0

45
New cards

alternative hypothesis

likely an effect, there is likely a difference in means and relationship between X and Y, B1 DOES NOT = 0

46
New cards

type 1 error

false positive, when we reject a null hypothesis that is actually true. saying there is a relationship when there isn't.

47
New cards

type 2 error

false negative, when we fail to accept the alternative hypothesis, saying there isn't a relationship when there is. (small sample size, study has low power)

48
New cards

substantive significance

the relationship needs to be large enough to matter

49
New cards

power limitations

larger sample = larger power and vice versa, higher variance = harder to detect relationships. big variance needs larger sample size

50
New cards

statistically significant

reject the null and accept the alternative, based on critical value cutoff, likely a difference in means and relationship between X and Y, B1 = 0

51
New cards

irrelevant variable

adding a variable to regression that doesn't actually explain Y, will not cause bias but will eat up degrees of freedom

52
New cards

model specification

choosing what variables to include in the model

53
New cards

binary/dummy variables

useful, used in experiments to identify the treated (1) and control (0) units, make difference in means across two groups easy to calculate

54
New cards

discrete data

comes in 'bins' or groups. ex. on a scale of 1 to 5, how much do you like this class?

55
New cards

continuous data

can take any value in a sequence. ex. annual income, votes for each candidate

56
New cards

categorical data

descriptive, describes how the world is, comes from qualitiative research, can be ordinal (ordered: low, medium, high) or nominal (cannot be ordered: majors)

57
New cards

cross-sectional data

sample of a population in a given period of time

58
New cards

repeated cross-sectional data

taking different samples of a population over time

59
New cards

panel (time-series) data

seeing the same population repeatedly over time

60
New cards

fixed effects model

control for unit specific effects, time period effects. benefits gives more leverage in identifying causal relationships, look at a single unit over time instead of comparing

61
New cards

case study

intensive study of a single spatial and temproal phenomenon

62
New cards

cross-case study

study of several cases to compare a phenomenon across space and time

63
New cards

process tracing

attempts to identify the intervening causal process between an IV or variables and the outcome of the DV, qualitative method, how X becomes Y

64
New cards

elite interviews

asking people who were involved in the political event or issue about what happened when and why, usually semi-structured and open-ended questions

65
New cards

focus groups

asking a group of people what they think about a given issue

66
New cards

A: attrition

1 problem in experiments: units drop out of experiment, never observe their outcome variable

67
New cards

B: balance

2 problem in experiments: do the covariates (control variables) have the same mean across the two groups

68
New cards

C: compliance

3 problem in experiments: whether units actually receive the treatment they were assigned to

69
New cards

natural experiments

when a researcher identifies a situation in which values of the independent variable have been determined by a random process

70
New cards

goodness of fit

how much of Y does X explain, related to r-squared

71
New cards

residual

part of Y that X doesn't explain, tells us what isn't there

72
New cards

qualitative research

what mechanisms/processes lead to this outcome vs. another, outliers

73
New cards

quantitative research

effects, population-level relationship, what is the relationship on average between two variables? (ex. does poverty predict)

74
New cards

difference of means test

comparing the mean of Y for one group against the mean of Y for another

75
New cards

categorical variable

has two or more categories but no ordering

76
New cards

ordinal variable

expresses rank but not relative size

77
New cards

reference category

coefficients on all the included dummy variables indicate how much higher or lower the DV is relative to this

78
New cards

blocking

picking treatment and control groups in advance so that they are equal in covariates

79
New cards

intention to treat analysis

addresses potential endogeneity that arises from non-compliance

80
New cards

balance table

a method to compare the characteristics of a treatment group and a control group, typically used in experimental or matching studies

81
New cards

P-value

If the p-value is less than the significance level (typically 0.05), the evidence against the null hypothesis is considered strong, and the null hypothesis is often rejected in favor of the alternative hypothesis.

82
New cards

T-Test

analyzes the significant differences between the means of two sample groups (the mean of a reference with a known reference mean)