Review-Ch 11 statistics

studied byStudied by 3 people
0.0(0)
Get a hint
Hint

T statistic (for regression line)

1 / 22

flashcard set

Earn XP

Description and Tags

Inference for regression

23 Terms

1

T statistic (for regression line)

  • tests whether the slope of the true regression line is 0

    • if rejecting that the slope of the true regression line is 0 ā†’ regression line will be useful in PREDICTING y given x

    • if not rejecting ā†’ plausible that the positive/negative trend seen is solely due to CHANCE variation that ALWAYS results when you have ONLY 1 sample

New cards
2

Confidence Interval(regression line)

  • helps to DECIDE whether the linear relationship is statistically significant & practical significance

    • ___the average increase/decrease in___population

  • If claim is not in the interval ā†’not supported claim

If Confidence interval hasā€¦

  • All positive values ā†’ evidence for positive association

  • All negative values ā†’ evidence for negative association

  • BOTH positive and negative values ā†’ no evidence for an association

New cards
3

B1

the slope of the true regression line

New cards
4

Bo

the y intercept

New cards
5

BIO

Hypothesised slope

New cards
6

b1

slope(estimate from a sample)

New cards
7

df(regression line)

n-2, because a sampling distribution is a t-distribution

New cards
8

S: Standard error(regression)

  • spread around the regression line

  • the difference BETWEEN predicted(estimates) and the actual scores are measured with this residual standard deviation

New cards
9

SSE

Sum of squared errors

New cards
10

Slope(Interpretation)

  • for every 1 increase/decrease in x, there is a predicted increase/decrease in y

  • The slope(* correlation sign is same as slope sign) of a regression line is in the middle of the Confidence Interval

New cards
11

r

Correlation coefficient, can be from -1 to 1

New cards
12

rĀ²

coefficient of determination

  • ā€œ__% of the variation in y can be attributed/accounted for by the variation in x

New cards
13

Significance Test(Regression line)

CONDITIONS

  1. approximately normal distribution of y for a fixed value of xā€¦

  • The means lie on a line

  • Standard deviation is CONSTANT across ALL x values

  1. ONE of below

    1. SRS from a bivariate(2) population

      OR

    2. independent random sample with ( x,y ) values given

  2. roughly linear scatterplot

  3. Residual plot has no pattern/curvature

  4. Residual distribution looks approximately normal or uniform(on x axis ALONE)

STEPS

name test: t-test for the slope of a population regression line

1) Conditions

2)Hypothesis

let B represent the slope of ___between x and y

Ho: B=0

HA: Bā‰ ,<,> 0.

3) Test stat, p-value

t=(b1-BIO)/sb1

4)COnclusion

  • smaller p valueā†’STRONGER evidence against the null hypothesis bc farther from Ī± ā†’ ā€œsufficient evidenceā€

New cards
14

Confidence Interval(Regression Line)

CONDITIONS

  1. approximately normal distribution of y for a fixed value of xā€¦

  • The means lie on a line

  • Standard deviation is CONSTANT across ALL x values

  1. ONE of below

    1. SRS from a bivariate(2) population

      OR

    2. independent random sample with ( x,y ) values given

  2. roughly linear scatterplot

  3. Residual plot has no pattern/curvature

  4. Residual distribution looks approximately normal or uniform(on x axis ALONE)

STEPS

1) Conditions

2) Computations

CI=b1+- t* sb1

df= n-2

t*=invt(% thingy, df)

3) Interpret in Context

ā€œWe are __% sure that the true slope of the line of regression between x var and y var lies BETWEEN the interval ( , )ā€

ā€œOut of 100 such Confidence Interval, when constructed from random samples. The expected true value B1 to be #(as a number) of themā€

New cards
15

line of mean/averages

uy=bo+ b1x

New cards
16

Variability

size of on depends onā€¦

  1. Sample Size(n)

  2. Variability in y

<p> size of on depends onā€¦</p><ol><li><p>Sample Size(n)</p></li><li><p>Variability in <mark data-color="purple">y</mark></p></li></ol>
New cards
17

Standard error

The slope varies less whenā€¦

  1. Sample size larger

  2. values of y tend closer to the regression line

  3. values of x more spread out

<p><strong>The slope</strong> varies <span style="color: red">less </span>whenā€¦</p><ol><li><p>Sample size<span style="color: green"> larger</span></p></li><li><p>values of <mark data-color="purple">y </mark>tend closer to <strong>the regression line</strong> </p></li><li><p>values of <mark data-color="blue">x </mark>more <mark data-color="green">spread out </mark></p></li></ol>
New cards
18

Power transformation

y=axb

the base is what changes

(log x, log y)

New cards
19

Exponential model

y=aby

the exponent is what changes

( x, log y)

New cards
20

Ln transformations

(ln x, ln y)

  • the LSRL is ln(y var)=a+b(ln(y var)

    which also EQUALS y var=ea+xb

New cards
21

ā€œCubic or moreā€ transformation

  • see if Confidence Interval capturers 3(n) or not

  • CI could potentially be too big/small

New cards
22

General conclusions

  • if the slope was actually BIO only a (p-value number) chance of getting a slope as far or FARTHER than b1 is from BIO for an SRS of units

  • if a transformation is madeā†’include it in the LSLR equation & conclusion

New cards
23

Graphing Analyzation

Scatterplot

  • If there are gaps/empty space in the middle suggest 2 clustersā†’ If analyzed separately could result in other answers

Graphing calculator

  • if HA is 1 sidedā†’p value on calculator graph /2

  • S=standard deviation

  • constant(intercept) coef= bo

  • x variable coef =b1$

<p>Scatterplot</p><ul><li><p>If there are gaps/empty space in <sub>the </sub>middle suggest 2 clustersā†’ If analyzed separately could result in other answers</p></li></ul><p>Graphing calculator </p><ul><li><p>if H<sub>A </sub>is 1 sidedā†’p value on calculator graph /2 </p></li><li><p>S=standard deviation </p></li><li><p>constant(intercept) coef= bo </p></li><li><p>x variable coef =b1$</p></li></ul>
New cards

Explore top notes

note Note
studied byStudied by 18 people
... ago
5.0(1)
note Note
studied byStudied by 1712 people
... ago
4.7(13)
note Note
studied byStudied by 3 people
... ago
5.0(1)
note Note
studied byStudied by 26 people
... ago
5.0(1)
note Note
studied byStudied by 24 people
... ago
5.0(1)
note Note
studied byStudied by 13 people
... ago
5.0(1)
note Note
studied byStudied by 12 people
... ago
5.0(1)
note Note
studied byStudied by 10 people
... ago
5.0(1)

Explore top flashcards

flashcards Flashcard (22)
studied byStudied by 12 people
... ago
5.0(1)
flashcards Flashcard (72)
studied byStudied by 12 people
... ago
5.0(1)
flashcards Flashcard (94)
studied byStudied by 13 people
... ago
4.0(1)
flashcards Flashcard (62)
studied byStudied by 1 person
... ago
5.0(1)
flashcards Flashcard (105)
studied byStudied by 28 people
... ago
5.0(1)
flashcards Flashcard (101)
studied byStudied by 3 people
... ago
5.0(1)
flashcards Flashcard (21)
studied byStudied by 26 people
... ago
5.0(1)
flashcards Flashcard (32)
studied byStudied by 21 people
... ago
5.0(1)
robot