selection effect/bias
selection of a sample that is not randomized, so the sample is biased
treatment groups
those who get some treatment of interest in an experiment
control group
those who do not get the treatment of interest
observational study
research where you don't get to randomize who gets the treatment. just observing some relationship in the world
experimental study/randomized control
research designs in which you can randomize who gets the treatment
quasi-experimental research
research in which you have observational data, but you find ways to ensure that the treatment was effectively randomly distributed
internal validity
is the experiment well designed? free from confounders or bias?
external validity
is the finding applicable to other populations, situations, or cases? does it apply outside the context of the research?
Yi
dependent variable, the thing we want to predict
Xi
independent variable, the thing that predicts the DV
Ei
error term, the part of the DV that the IV doesn't explain; everything NOT in our model. not directly observable
endogeneity
the IV is correlated with the error term
confounder
an unmeasured variable that affects both the IV and the DV
randomness
noise in the data, could go away with larger sample sizes
randomization
randomizing to create treatment and control groups, creates exogeneity
distribution of outcome
the idea that you could observe different outcomes with different probabilities, even when you only observe one outcome
population
overall collection of individuals, beyond just the sample
sample
collection of individuals on which statistical analyses are performed, and from which general trends for the population are inferred
individual
object or unit, single data point contributing to the sample
random variable
the thing being measured in any random experiment
expectation
the best guess about what number will be drawn from the distribution, loosely thought of as the "average" of the distribution
variance
how far the numbers you draw tend to be from that best guess
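In symbols (a notational sketch; the sum form assumes a discrete random variable):

$$E[X] = \sum_x x \, P(X = x), \qquad \mathrm{Var}(X) = E\big[(X - E[X])^2\big]$$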
central limit theorem (CLT)
the distribution of the sample mean tends toward a normal distribution as the sample size grows, regardless of the shape of the underlying distribution
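A minimal simulation sketch of this idea in Python (illustrative numbers, not from the cards): repeated sample means from a skewed distribution still end up looking roughly normal.

```python
# CLT sketch: sample means from a skewed (exponential) distribution
import numpy as np

rng = np.random.default_rng(0)

# 10,000 samples of size n = 50; take the mean of each sample
sample_means = rng.exponential(scale=1.0, size=(10_000, 50)).mean(axis=1)

# The means cluster around the true mean (1.0) with spread near 1/sqrt(50),
# and a histogram of them looks approximately normal.
print(sample_means.mean(), sample_means.std())
```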
regression model
Yi = B0 + B1Xi + Ei
B1
slope coefficient, relationship between X and Y
B0
constant, value of Y when X is zero (intercept)
covariance
measures how much two random variables vary together
positive correlation
when X is higher, we expect Y to be higher
negative correlation
when X is higher, we expect Y to be lower
not associated
when X is higher, it tells us nothing about Y
correlation
measures the extent to which two variables are linearly related to each other
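Written out (a notational sketch consistent with the two cards above):

$$\mathrm{Cov}(X, Y) = E\big[(X - E[X])(Y - E[Y])\big], \qquad \mathrm{Corr}(X, Y) = \frac{\mathrm{Cov}(X, Y)}{\sigma_X \, \sigma_Y}$$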
bivariate regression
technique to estimate a model with two variables (DV and IV), allows us to quantify the degree to which X and Y move together
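A minimal sketch of a bivariate regression in Python (made-up data; the variable names and true coefficients are illustrative assumptions):

```python
# Bivariate OLS sketch: estimate Yi = B0 + B1*Xi + Ei on simulated data
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = 2.0 + 0.5 * x + rng.normal(size=200)   # true B0 = 2.0, B1 = 0.5

X = sm.add_constant(x)                     # adds the intercept (B0) column
results = sm.OLS(y, X).fit()
print(results.params)                      # estimates of B0 and B1
print(results.pvalues)                     # p-values for H0: coefficient = 0
```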
omitted variable bias
specific form of endogeneity, often why estimates change when the model changes: X is correlated with an omitted variable that also influences Y (so the error term is correlated with X)
unbiased estimate
on average, our estimate is equal to the true parameter
biased estimate
our coefficient is systematically wrong, either too high or too low relative to the true parameter
robust
whether the estimate stays roughly the same when the model specification changes
homoskedasticity
when the error term has the same variance for all observations (constant error variance). not a problem
heteroskedasticity
when the error term DOES NOT have the same variance for all observations. fixable problem. this means non-constant variance in our errors
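A minimal sketch of one common fix (heteroskedasticity-robust standard errors); the data-generating process here is a made-up assumption where the error variance grows with X:

```python
# Robust standard errors sketch: error variance depends on x (heteroskedasticity)
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
x = rng.normal(size=200)
y = 2.0 + 0.5 * x + rng.normal(scale=np.abs(x) + 0.1, size=200)  # non-constant error variance

X = sm.add_constant(x)
robust = sm.OLS(y, X).fit(cov_type="HC1")  # coefficients unchanged, SEs made robust
print(robust.bse)                          # robust standard errors for B0, B1
```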
outlier
observation that is extremely different from the rest of the observations in the sample. this drags the estimate of the mean/slope towards it.
null distribution
the distribution of the test statistic we would expect if there really is no difference; used to judge how unusual our result is
p-value
probability of observing a difference in mean or a coefficient as big as what we observed, if the null hypothesis were true
critical value
a point on a distribution that defines the boundary between rejecting and not rejecting the null hypothesis in a hypothesis test
significantly different
a difference large enough that the test statistic falls beyond the critical value
null hypothesis
no effect, no difference in means, no relationship between X and Y, B1 = 0
alternative hypothesis
likely an effect, there is likely a difference in means and relationship between X and Y, B1 DOES NOT = 0
type 1 error
false positive, when we reject a null hypothesis that is actually true. saying there is a relationship when there isn't.
type 2 error
false negative, when we fail to reject a null hypothesis that is actually false, saying there isn't a relationship when there is. (often due to small sample size, low-power study)
substantive significance
the relationship needs to be large enough to matter
power limitations
larger sample = more power (and vice versa); higher variance = harder to detect relationships, so high-variance outcomes need a larger sample size
statistically significant
reject the null and accept the alternative, based on the critical value cutoff; likely a difference in means and a relationship between X and Y, B1 DOES NOT = 0
irrelevant variable
adding a variable to regression that doesn't actually explain Y, will not cause bias but will eat up degrees of freedom
model specification
choosing what variables to include in the model
binary/dummy variables
useful, used in experiments to identify the treated (1) and control (0) units, make difference in means across two groups easy to calculate
discrete data
comes in 'bins' or groups. ex. on a scale of 1 to 5, how much do you like this class?
continuous data
can take any value within a range. ex. annual income, votes for each candidate
categorical data
descriptive, describes how the world is, comes from qualitative research, can be ordinal (ordered: low, medium, high) or nominal (cannot be ordered: majors)
cross-sectional data
sample of a population in a given period of time
repeated cross-sectional data
taking different samples of a population over time
panel (time-series) data
observing the same units (the same individuals) repeatedly over time
fixed effects model
controls for unit-specific effects and time-period effects. benefit: gives more leverage in identifying causal relationships by looking at changes within a single unit over time instead of comparing across units
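A minimal sketch of a two-way fixed effects model in Python (synthetic panel; the column names unit, year, x, y are illustrative assumptions):

```python
# Fixed effects sketch: unit and year dummies absorb unit- and time-specific effects
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
df = pd.DataFrame([(u, t) for u in range(10) for t in range(5)], columns=["unit", "year"])
df["x"] = rng.normal(size=len(df))
df["y"] = 0.5 * df["x"] + 0.3 * df["unit"] + 0.2 * df["year"] + rng.normal(size=len(df))

fe = smf.ols("y ~ x + C(unit) + C(year)", data=df).fit()
print(fe.params["x"])   # within-unit estimate of the effect of x on y
```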
case study
intensive study of a single spatial and temporal phenomenon
cross-case study
study of several cases to compare a phenomenon across space and time
process tracing
attempts to identify the intervening causal process between an IV (or several IVs) and the outcome of the DV; qualitative method, how X becomes Y
elite interviews
asking people who were involved in the political event or issue about what happened when and why, usually semi-structured and open-ended questions
focus groups
asking a group of people what they think about a given issue
A: attrition
experiment problem 1: units drop out of the experiment, so we never observe their outcome variable
B: balance
experiment problem 2: do the covariates (control variables) have the same mean across the two groups?
C: compliance
experiment problem 3: whether units actually receive the treatment they were assigned to
natural experiments
when a researcher identifies a situation in which values of the independent variable have been determined by a random process
goodness of fit
how much of Y does X explain, related to r-squared
residual
the part of Y that X doesn't explain; tells us what the model misses
qualitative research
asks what mechanisms/processes lead to this outcome rather than another; well suited to studying outliers
quantitative research
effects, population-level relationship, what is the relationship on average between two variables? (ex. does poverty predict)
difference of means test
comparing the mean of Y for one group against the mean of Y for another
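A minimal sketch of a difference-of-means test in Python (made-up treated and control outcomes):

```python
# Difference-of-means sketch: compare mean outcomes across two groups
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
treated = rng.normal(loc=1.2, scale=1.0, size=100)   # hypothetical treated outcomes
control = rng.normal(loc=1.0, scale=1.0, size=100)   # hypothetical control outcomes

t_stat, p_value = stats.ttest_ind(treated, control)  # two-sample t-test
print(t_stat, p_value)   # small p-value -> evidence against equal means
```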
categorical variable
has two or more categories but no ordering
ordinal variable
expresses rank but not relative size
reference category
the omitted category; coefficients on all the included dummy variables indicate how much higher or lower the DV is relative to this
blocking
grouping units by their covariates before randomizing so that treatment and control groups are balanced on those covariates
intention to treat analysis
analyzes units according to the treatment they were assigned, not the treatment they actually received; addresses potential endogeneity that arises from non-compliance
balance table
a method to compare the characteristics of a treatment group and a control group, typically used in experimental or matching studies
P-value
If the p-value is less than the significance level (typically 0.05), the evidence against the null hypothesis is considered strong, and the null hypothesis is often rejected in favor of the alternative hypothesis.
T-Test
analyzes whether the difference between the means of two sample groups is significant (or compares a sample mean against a known reference mean)
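And a minimal sketch of the reference-mean version (a one-sample t-test against a known value; the sample and the reference mean 5.0 are made up):

```python
# One-sample t-test sketch: is the sample mean different from a known reference mean?
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
sample = rng.normal(loc=5.3, scale=2.0, size=50)

t_stat, p_value = stats.ttest_1samp(sample, popmean=5.0)  # H0: true mean = 5.0
print(t_stat, p_value)
```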