Chapter 10: Two Quantitative Variables (Correlation and Regression)

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/27

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

28 Terms

1
New cards

What is a scatterplot and what does it represent?

A scatterplot shows the relationship between two quantitative variables measured on the same individuals, using dots to represent individual pairs. X-axis: explanatory; Y-axis: response.

2
New cards

What are the three main aspects of a scatterplot?

1. Form (linear, curved)

2. Direction (positive, negative, or none)

3. Strength (tightness of the points to a form)

3
New cards

What does a positive, negative, or no linear relationship mean?

Positive: As x increases, y increases

Negative: As x increases, y decreases

No linear relationship: x and y don’t follow a linear trend

4
New cards

 What’s the difference between “no linear” and “no relationship”?

 No linear relationship doesn’t mean no relationship at all—it might be nonlinear.

5
New cards

What is the correlation coefficient (r)?

A numerical summary of the strength and direction of a linear relationship between two quantitative variables.

Range: -1 ≤ r ≤ 1

6
New cards

 What are key properties of the correlation coefficient r?

• Same sign as slope

• Unitless

• Affected by outliers

• Only for quantitative variables

• r =1: perfect linear relationship

7
New cards

 Rule of Thumb for interpreting r?

knowt flashcard image
8
New cards

What are the hypotheses for testing correlation \rho?

𝐻0: There is no linear relationship between 𝑥 and 𝑦

𝐻𝑎: There is a linear relationship between 𝑥 and 𝑦

<p>𝐻0: There is no linear relationship between 𝑥 and 𝑦</p><p>𝐻𝑎: There is a linear relationship between 𝑥 and 𝑦</p>
9
New cards

How is the simulation approach used to test correlation?

 Shuffle the y-values to simulate the null hypothesis and generate a distribution of r-values under H_0.

10
New cards

How is a confidence interval for p estimated via simulation?

knowt flashcard image
11
New cards
<p>What is a deterministic model?</p>

What is a deterministic model?

<p></p>
12
New cards

Why is a deterministic model unrealistic in most real-world data?

Because in practice, random variation (due to uncontrolled factors or measurement error) always exists, so we move to probabilistic models.

13
New cards

What is the least squares regression line?

A line that best fits the data by minimizing the sum of squared residuals

<p>A line that best fits the data by minimizing the <strong>sum of squared residuals</strong></p>
14
New cards

What do b0 and b1 represent?

• b0: y-intercept (when x = 0)

• b1: slope (change in y per unit x)

15
New cards

Straight-Line Linear Models

knowt flashcard image
16
New cards

Methods of least square

knowt flashcard image
17
New cards

What is a residual?

e = y - y(hat)

It’s the vertical distance between an actual data point and the predicted point.

18
New cards

What is R^2 and how is it interpreted?

 Coefficient of determination. It represents the proportion of variance in y explained by x.

<p>&nbsp;<strong>Coefficient of determination</strong>. It represents the <strong>proportion of variance in y explained by x</strong>.</p><p class="p2"></p>
19
New cards

Interpreting R²

knowt flashcard image
20
New cards

Rule of thumb for R²?

knowt flashcard image
21
New cards

What are the hypotheses for testing slope \beta_1?

knowt flashcard image
22
New cards

What is the link between slope and correlation in regression?

Testing B1 = 0 is equivalent to testing P = 0 for linear models.

<p>Testing B1 = 0 is equivalent to testing P = 0 for linear models.</p>
23
New cards

What are the validity conditions for theory-based slope inference?

1. Linearity

2. Symmetric residuals (mean zero)

3. Equal variance along x

<p>1. <strong>Linearity</strong></p><p class="p1">2. <strong>Symmetric residuals</strong> (mean zero)</p><p class="p1">3. <strong>Equal variance</strong> along x</p>
24
New cards

What is the test statistic for slope in regression?

knowt flashcard image
25
New cards

 How do we interpret the confidence interval for B1?

If 0 is not in the CI → significant slope → strong evidence for a linear relationship.

26
New cards

What should we be cautious of in regression?

Extrapolation beyond data range

Outliers that affect line

Influential points far in x-direction that pull the regression line

27
New cards

What is a probabilistic model in regression?

knowt flashcard image
28
New cards

Why must the mean of error \epsilon be zero in regression?

knowt flashcard image