Review-Ch 11 statistics


Inference for regression

23 Terms

1

T statistic (for regression line)

  • tests whether the slope of the true regression line is 0

    • if rejecting that the slope of the true regression line is 0 → regression line will be useful in PREDICTING y given x

    • if not rejecting → plausible that the positive/negative trend seen is solely due to the CHANCE variation that ALWAYS arises when you have ONLY 1 sample

2

Confidence Interval(regression line)

  • helps to DECIDE whether the linear relationship is statistically significant & practically significant

    • ___the average increase/decrease in___population

  • If the claimed slope is not in the interval → the claim is not supported

If the Confidence Interval has…

  • All positive values → evidence for positive association

  • All negative values → evidence for negative association

  • BOTH positive and negative values (interval contains 0) → no evidence for an association
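
The sign rule above can be sketched as a small helper. This is only an illustration: the function name and the example intervals are made up, not taken from the card.

```python
def interpret_slope_interval(lower, upper):
    """Classify a confidence interval for the slope by the signs of its endpoints."""
    if lower > 0:
        # Every plausible slope is positive.
        return "evidence for positive association"
    if upper < 0:
        # Every plausible slope is negative.
        return "evidence for negative association"
    # The interval contains 0, so a slope of 0 is still plausible.
    return "no evidence for an association"


# Hypothetical intervals, for illustration only.
print(interpret_slope_interval(0.4, 1.9))
print(interpret_slope_interval(-2.1, -0.3))
print(interpret_slope_interval(-0.3, 1.5))
```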

3

β₁

the slope of the true (population) regression line

4

β₀

the y-intercept of the true (population) regression line

5

β₁₀

Hypothesized slope (the value of the true slope stated in H0, usually 0)

6

b1

slope of the sample regression line (an estimate of the true slope, computed from a sample)

7

df(regression line)

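n − 2, because two parameters (the intercept and the slope) are estimated from the data; the standardized slope follows a t-distribution with n − 2 degrees of freedom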

8

S: Standard error (regression)

  • spread of the points around the regression line

  • the typical difference BETWEEN the predicted (estimated) and the actual y-values is measured by this residual standard deviation, s = √(SSE/(n − 2))

9

SSE

Sum of squared errors (squared residuals): SSE = Σ(y − ŷ)²
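
A minimal sketch of how SSE and the residual standard deviation s fit together, using only the Python standard library. The five (x, y) points are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import math

# Hypothetical data, made up for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)

x_bar = sum(x) / n
y_bar = sum(y) / n

# Least-squares slope b1 and intercept b0.
sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

# Residual = actual y minus predicted y.
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]

# SSE adds up the squared residuals; s divides by df = n - 2 before the square root.
sse = sum(e ** 2 for e in residuals)
s = math.sqrt(sse / (n - 2))

print(f"b0 = {b0:.3f}, b1 = {b1:.3f}, SSE = {sse:.3f}, s = {s:.3f}")
```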

10

Slope(Interpretation)

  • for every 1-unit increase in x, y is predicted to increase/decrease by b1 units (increase if b1 > 0, decrease if b1 < 0)

  • The sample slope b1 is in the middle of the Confidence Interval for the true slope (* the correlation sign is the same as the slope sign)

11

r

Correlation coefficient, can be from -1 to 1

12

r²

coefficient of determination

  • “__% of the variation in y can be attributed to/accounted for by the linear relationship with x”
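
As a rough check on the interpretation above, r² can be computed two ways that should agree: by squaring r, and as 1 − SSE/SST. The data below are hypothetical and the variable names are illustrative.

```python
import math

# Hypothetical data, made up for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)

x_bar = sum(x) / n
y_bar = sum(y) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)
syy = sum((yi - y_bar) ** 2 for yi in y)  # also SST, the total variation in y
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

# Correlation coefficient and its square.
r = sxy / math.sqrt(sxx * syy)
r_squared = r ** 2

# Same quantity from the residuals of the least-squares line.
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
sst = syy
r_squared_from_sse = 1 - sse / sst

print(f"r = {r:.4f}, r^2 = {r_squared:.2f}, 1 - SSE/SST = {r_squared_from_sse:.2f}")
```

With these made-up numbers, 60% of the variation in y is accounted for by the linear relationship with x.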

13

Significance Test(Regression line)

CONDITIONS

  1. approximately normal distribution of y for each fixed value of x…

    • The means lie on a line

    • Standard deviation is CONSTANT across ALL x values

  2. ONE of below

    a. SRS from a bivariate (2-variable) population

      OR

    b. independent random sample with (x, y) values given

  3. roughly linear scatterplot

  4. Residual plot has no pattern/curvature

  5. Residual distribution looks approximately normal or uniform (residuals graphed on the x-axis ALONE)

STEPS

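name test: t-test for the slope of a population regression line

1) Conditions

2) Hypotheses

let β represent the true slope of ___ between x and y

H0: β = 0

HA: β ≠ 0, β < 0, or β > 0 (choose ONE)

3) Test stat, p-value

t = (b1 − β₁₀)/s_b1, with df = n − 2 (β₁₀ = hypothesized slope, 0 under H0; s_b1 = standard error of the slope)

4) Conclusion

  • smaller p-value → STRONGER evidence against the null hypothesis (the farther below α, the stronger); p-value < α → reject H0 → “sufficient evidence”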
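
Step 3 can be sketched in plain Python as follows. The data are hypothetical, the function names are illustrative, and the p-value is approximated by numerically integrating the t density (Simpson's rule) rather than read from a calculator or table, so treat it as an approximation.

```python
import math

def t_pdf(t, df):
    """Density of the t-distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + t * t / df) ** (-(df + 1) / 2)

def two_sided_p_value(t, df, steps=2000):
    """Approximate P(|T| >= |t|) with Simpson's rule on [0, |t|] (steps must be even)."""
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 == 1 else 2) * t_pdf(i * h, df)
    central_area = 2 * (h / 3) * total  # area between -|t| and |t|
    return max(0.0, 1.0 - central_area)

def slope_t_test(x, y, hypothesized_slope=0.0):
    """Return (t statistic, df, two-sided p-value) for the slope test."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    s = math.sqrt(sse / (n - 2))   # residual standard deviation
    se_b1 = s / math.sqrt(sxx)     # standard error of the slope
    t_stat = (b1 - hypothesized_slope) / se_b1
    return t_stat, n - 2, two_sided_p_value(t_stat, n - 2)

# Hypothetical data, made up for illustration only.
t_stat, df, p = slope_t_test([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(f"t = {t_stat:.3f}, df = {df}, two-sided p-value = {p:.3f}")
```

For these made-up points the p-value is larger than α = 0.05, so H0: β = 0 would not be rejected.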

14

Confidence Interval(Regression Line)

CONDITIONS

  1. approximately normal distribution of y for each fixed value of x…

    • The means lie on a line

    • Standard deviation is CONSTANT across ALL x values

  2. ONE of below

    a. SRS from a bivariate (2-variable) population

      OR

    b. independent random sample with (x, y) values given

  3. roughly linear scatterplot

  4. Residual plot has no pattern/curvature

  5. Residual distribution looks approximately normal or uniform (residuals graphed on the x-axis ALONE)

STEPS

1) Conditions

2) Computations

CI = b1 ± t* · s_b1

df = n − 2

t* = invT(area to the left, df), where area = (1 + confidence level)/2 (e.g. 0.975 for 95%)

3) Interpret in Context

“We are __% confident that the true slope of the regression line between x var and y var lies in the interval ( , )”

“If 100 such Confidence Intervals were constructed from random samples, we would expect the true slope to be in about # (the confidence level as a number) of them”
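
The computation step can be sketched like this. The data are hypothetical, and the critical value t* = 3.182 is the standard table value for 95% confidence with df = 3; it is hard-coded here as an assumption rather than computed with invT.

```python
import math

# Hypothetical data, made up for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)

x_bar, y_bar = sum(x) / n, sum(y) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

# Residual standard deviation and standard error of the slope.
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
s = math.sqrt(sse / (n - 2))
se_b1 = s / math.sqrt(sxx)

# Assumed table value: t* for 95% confidence with df = n - 2 = 3.
t_star = 3.182

margin = t_star * se_b1
lower, upper = b1 - margin, b1 + margin
print(f"95% CI for the slope: ({lower:.3f}, {upper:.3f})")
```

Here b1 sits exactly in the middle of the interval, and because the interval contains 0, these made-up data give no evidence of an association.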

15

line of means/averages

μy = β₀ + β₁x (the true regression line; β₀ = true y-intercept, β₁ = true slope)

16

Variability

size of on depends on…

  1. Sample Size(n)

  2. Variability in y

17

Standard error

The slope varies less (smaller standard error of b1) when…

  1. Sample size is larger

  2. values of y tend to be closer to the regression line

  3. values of x are more spread out
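
All three points follow from the standard formula for the standard error of the slope, SE_b1 = s / √Σ(x − x̄)², which equals s / (s_x · √(n − 1)). A minimal sketch with hypothetical x-values and an assumed residual standard deviation s:

```python
import math

def slope_standard_error(s, x):
    """SE of the slope: s / sqrt(sum of squared deviations of x from its mean)."""
    x_bar = sum(x) / len(x)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    return s / math.sqrt(sxx)

# Hypothetical x-values; s is an assumed residual standard deviation.
narrow_x = [4, 5, 6]
wide_x = [1, 5, 9]              # same n, x more spread out
bigger_n_x = [1, 5, 9, 1, 5, 9]  # same spread pattern, larger sample

se_narrow = slope_standard_error(1.0, narrow_x)
se_wide = slope_standard_error(1.0, wide_x)
se_bigger_n = slope_standard_error(1.0, bigger_n_x)
se_smaller_s = slope_standard_error(0.5, wide_x)  # y-values closer to the line

print(f"narrow x: {se_narrow:.4f}, wide x: {se_wide:.4f}")
print(f"larger n: {se_bigger_n:.4f}, smaller s: {se_smaller_s:.4f}")
```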

18

Power transformation

y = ax^b

the base (x) is what changes

linearize with (log x, log y)

19

Exponential model

y = ab^x

the exponent (x) is what changes

linearize with (x, log y)
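
A small sketch of why (x, log y) straightens an exponential model: if y = ab^x, then log y = log a + (log b)·x, which is a line in x. The data below are generated from a hypothetical model y = 3·2^x, and `fit_line` is an illustrative helper, not a library function.

```python
import math

def fit_line(u, v):
    """Least-squares line v = intercept + slope * u; returns (intercept, slope)."""
    n = len(u)
    u_bar, v_bar = sum(u) / n, sum(v) / n
    suu = sum((ui - u_bar) ** 2 for ui in u)
    suv = sum((ui - u_bar) * (vi - v_bar) for ui, vi in zip(u, v))
    slope = suv / suu
    return v_bar - slope * u_bar, slope

# Hypothetical data following y = 3 * 2^x exactly.
x = [0, 1, 2, 3, 4]
y = [3 * 2 ** xi for xi in x]

# Fit a line to (x, log y), then undo the log to recover a and b.
intercept, slope = fit_line(x, [math.log10(yi) for yi in y])
a = 10 ** intercept
b = 10 ** slope

print(f"log y = {intercept:.4f} + {slope:.4f} x  ->  y = {a:.2f} * {b:.2f}^x")
```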

20

Ln transformations

(ln x, ln y)

  • the LSRL is ln(y var) = a + b·ln(x var)

    which also EQUALS y var = e^a · (x var)^b
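
The back-transformation can be checked on made-up data. The points below follow a hypothetical power model y = 2x³ exactly, so the line fitted to (ln x, ln y) should have slope 3 and intercept ln 2. `fit_line` is an illustrative helper.

```python
import math

def fit_line(u, v):
    """Least-squares line v = intercept + slope * u; returns (intercept, slope)."""
    n = len(u)
    u_bar, v_bar = sum(u) / n, sum(v) / n
    suu = sum((ui - u_bar) ** 2 for ui in u)
    suv = sum((ui - u_bar) * (vi - v_bar) for ui, vi in zip(u, v))
    slope = suv / suu
    return v_bar - slope * u_bar, slope

# Hypothetical data following y = 2 * x^3 exactly.
x = [1, 2, 3, 4, 5]
y = [2 * xi ** 3 for xi in x]

# LSRL on the transformed data: ln(y) = a + b * ln(x).
a, b = fit_line([math.log(xi) for xi in x], [math.log(yi) for yi in y])

# Back-transform: y = e^a * x^b.
def predict(x_value):
    return math.exp(a) * x_value ** b

print(f"ln(y) = {a:.4f} + {b:.4f} ln(x)  ->  y = {math.exp(a):.2f} * x^{b:.2f}")
print(f"prediction at x = 6: {predict(6):.1f}")
```

The slope of the transformed line is the power in the original model, which is why the slope is the quantity to examine when asking whether a relationship is cubic.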

21

“Cubic or more” transformation

  • see if the Confidence Interval for the slope (the estimated power) captures 3 (or n) or not

  • CI could potentially be too big/small

22

General conclusions

  • if the true slope were actually β₁₀ (the hypothesized slope), there would be only a (p-value number) chance of getting a sample slope as far or FARTHER from β₁₀ than b1 is, for an SRS of __ units

  • if a transformation is made → include it in the LSRL equation & conclusion

23

Graph Analysis

Scatterplot

  • Gaps/empty space in the middle suggest 2 clusters → if the clusters are analyzed separately, the results could differ

Graphing calculator

  • if HA is 1-sided → divide the (2-sided) p-value on the calculator by 2 (when b1 falls in the direction of HA)

  • S = standard deviation of the residuals

  • constant (intercept) coef = b0

  • x variable coef = b1
