stats test: Exam 2: Scatterplots, Correlation, Regression, Experimental Design

0.0(0)

Studied by 10 people

0.0(0)

Call with Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/56

Earn XP

Description and Tags

Mon Nov. 10

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No study sessions yet.

57 Terms

New cards

scatterplot

relationship between two quantative variables, one on x and other on y axis

New cards

correlation coeff represents what?

New cards

what is r?

r is the strength of a linear relationship between two variables

ranges from -1 to 1

the abs value of the r is how strong it is

the negative/positive sign is which direction the linear relationship is going in

0 = weakest

1 = strongest

New cards

what reps coef of determination

r²

New cards

what is r²

(r² x 100) % of the variation in the response variable can be explained by its linear relationship with the explanatory variable

New cards

what is association

association is when a change in one variable changes the other variable

association can be positive negative linear nonlinear strong weak

association does not mean causation

New cards

does association prove causation and why?

NO because association does not prove direct change in another variable - for that you need a good randomized experiment with a control group - cannot be conducted from correlation alone

New cards

association can be…

pos, neg, nonlinear (montonic - ½ of a semi circle looking graph, quadratic - semi circle), linear, no correlation, weak, strong

New cards

does an experiment prove causation?

only a well-designed experiment can prove causation

New cards

correlation

measures the strength of the linear relationship

how close data points follow a linear pattern

does not imply one change in a variable directly causes a change in the other

New cards

what is a residual in words?

the difference between the predicted value by the LSRL and the actual data point on the scatterplot

real value (y) - predicted value (y with v on top)

New cards

if a residual is positive then?

lsrl underpredicted the value

New cards

if a residual is negative then?

lsrl overpredicted value

New cards

least squares regression line (LSRL) - whats formula?

y with a v on top = b0 + b1(x)

New cards

what is b1

slope

New cards

what is b0

when x = 0, which is the y intercept

New cards

what is the least squares regression line (LSRL)? how to find on graph (manually)?

so as we said its y with v on top = b0 + b1x

its the line “best fit” for predicting the response variable from the explanatory

so to reiterate we use the explanatory variable to predict the response variable

the LSRL tries to minimize the sum of the least squared residuals

New cards

how to find slope

r multiplied by the standard deviation of y values over the standard deviation of x values

sy = stan dev of y values

sx = stan dev of x values

r(sy)/sx

r (standard deviation of y values)

/ standard deviation of x values

New cards

as said previously what does the LSRL try to do?

the LSRL tries to minimize the sum of the least squared residuals

New cards

what is the sum of squared residuals

the vertical distance between observed y value and predicted y value

New cards

what key info does b0 tell us

what the response variable will be when our explanatory variable = 0

New cards

how to find b0

b0 = ybar - slopexbar

ybar = mean of all y values

slope = b1 = r x (sy/sx)

x bar = mean of all x values

mean of all y values - mean of all x values (r (SDy/SDx)

New cards

what are scatterplots vs residual plots?

scatter = LSRL + o.g. values

residual = x axis is the explanatory variable in o.g. values, and the y axis is the residuals (so that value the y value - y with a v on top value, which is the predicted value by the LSRL)

New cards

what is an influential point?

any point if removed would change EITHER the y intercept or the correlation

New cards

what happens if you remove/add an influential point?

affects r and/or affects y intercept

New cards

what is homeoscadastic

ms. weatherspoon def: when the varaibility (spread) of residulas is approx constant across all values of explanatory variable

simplified: so as you move along the x axis (as explanatory varaible increases) there isn’t a pattern for the residuals increasing or decreasing it remains random - like the residuals dont increase or decrease as the explanatory increases

New cards

what’s a good residual plot?

when the points are spreed = homeoscadastic

New cards

observational study def

record what happens naturally w/o intervention

New cards

experiment def

actively impose treatments on experimental units and observe the results

New cards

confounding variable def

a factor related to the explanatory varaiable that will influence the response variable

New cards

so like if you want to have a good experiment what do you have to make sure about your explanatory variable?

that confounding varaibles that are related to the explanatory varaibles which may influence the response variable are accounted for through some type of system (ex rbd, blocking, control)

New cards

experimental units def

people receiving treatment

New cards

treatment

what is done/not done to the exp. units

New cards

what are the steps for designing a good experiment?

comparison
RA (a) label, (b) randomize, © assign
replication
control

New cards

what is replication?

Replication means having many subjects in each treatment so your conclusion is more trustworthy.

me explaining: replication means like making sure you have a lot of test units for your experiment - bc the more people tested on the more legitimate your conclusion - so amount of test subjects/ exp. units

New cards

what is control?

control means controlling for other variables: so having a control group and/or blocking possible confounding variables

New cards

what is comparison?

being able to compare to sets of data (ie control group vs received treatment group)

New cards

RA meaning:

randomly assigning exp. units to treatment groups

New cards

placebo effect def

when a fake treatment “works” - conciousness infiltrated in

New cards

how do we prevent placebo

double/single blind study

New cards

single blind study def

exp. units dont know which treatment they are receiving

New cards

double blind study def

exp. units and researchers/ppl adminstering the treatments dont know which treatment they are receiving

New cards

what is RBD? (randomized block design)

grouping subjects into blocks based on a shared characteristic (which is a confounding variable) that may affect the response. RA exp. units treatments within each block

New cards

so if ms. weatherspoon asks how to draw diagram of completely randomized - what do you do?

name population - RA - exp. units r put into either control group or treatment group - compare the results

New cards

so if ms. weatherspoon asks how to draw diagram of RBD - what do you do?

name population (exp units) - split into confounding variables or their characteristic groups - RA in each block - the RA leads into control or treatment - compare the control and treament within each block - compare the two blocks data

ex is attached