2nd- Chapter 4 The sample Correlation Coefficient.

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/5

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

6 Terms

New cards

Rxy

The sample correlation coefficient.

Book Def: measures the strength of the linear relationship between X and Y. It gives the average change in standard deviations of Y for every 1 standard deviation increase in X.

Ex: rxy= +0.72 means that the Y-variable rose an average of 0.72 std deviations for every 1 std deviation increase in X.

Gpt:

A number that measures how strong and what direction the relationship between X and Y is.
It always falls between –1 and +1.
- +1 = Perfect positive relationship (X goes up, Y goes up exactly).
- –1 = Perfect negative relationship (X goes up, Y goes down exactly).
- 0 = No linear relationship.
Interpretation:
rₓᵧ = +0.72 → On average, for every 1 standard deviation increase in X, Y increases by 0.72 standard deviations.
Example:
- X = Hours studied, Y = Exam scores
- rₓᵧ = 0.72 → Strong positive relationship (more hours, higher scores).

New cards

(The sample correlation coefficient) rxy equation:

Book Def:

Think of (yi-y ̅) as Y’s deviation of the mean and (xi-x ̅ ) as X’s deviation from the mean.

(yi-y ̅)(xi-x ̅) is the product of Y’s deviation from its mean and X’s deviation from its mean.

∑(yi-y ̅)^2 and ∑(xi-x ̅)^2 are the total variation of X and Y.

Gpt:

The formula measures how strongly two things (X and Y) move together.

If both go up and down together, the correlation is positive (close to +1).
If one goes up when the other goes down, the correlation is negative (close to –1).
If they move randomly, the correlation is near 0.

Part	Meaning	In simple words
xi	Each value of X	One data point for X
yi	Each value of Y	One data point for Y
xˉ	Mean (average) of all X’s	The center of X values
yˉ	Mean (average) of all Y’s	The center of Y values
)(xi−xˉ)	X’s deviation from its mean	How far each X is from average
(yi−yˉ)	Y’s deviation from its mean	How far each Y is from average
Numerator ∑(xi−xˉ)(yi−yˉ)\	Sum of products of deviations	Shows how X and Y move together
Denominator Squared ∑(xi−xˉ)² ∑(yi−yˉ)²	Scale adjustment	Makes result between –1 and +1

See how far each X and Y is from their average.
Multiply those differences together for each pair.
- If both are above or below average → positive product.
- If one is above and the other below → negative product.
Add up all those products.
Divide by the total variation (the denominator).
The result tells you how related X and Y are.

Example intuition

r= +1: Perfect upward trend — when X increases, Y always increases.
r= −1: Perfect downward trend — when X increases, Y always decreases.
r= 0r: No consistent relationship between X and Y.

$ Book Def:Think of (yi-y ̅) as Y’s deviation of the mean and (xi-x ̅ ) as X’s deviation from the mean.(yi-y ̅)(xi-x ̅) is the product of Y’s deviation from its mean and X’s deviation from its mean.∑(yi-y ̅)^2 and ∑(xi-x ̅)^2 are the total variation of X and Y.Gpt: The formula measures how strongly two things (X and Y) move together.<ul><li>If both go up and down together, the correlation is positive (close to +1).</li><li>If one goes up when the other goes down, the correlation is negative (close to –1).</li><li>If they move randomly, the correlation is near 0.</li></ul><table style="min-width: 75px;"><colgroup><col style="min-width: 25px;"><col style="min-width: 25px;"><col style="min-width: 25px;"></colgroup><tbody><tr><th colspan="1" rowspan="1">Part</th><th colspan="1" rowspan="1">Meaning</th><th colspan="1" rowspan="1">In simple words</th></tr><tr><td colspan="1" rowspan="1">xi</td><td colspan="1" rowspan="1">Each value of X</td><td colspan="1" rowspan="1">One data point for X</td></tr><tr><td colspan="1" rowspan="1">yi</td><td colspan="1" rowspan="1">Each value of Y</td><td colspan="1" rowspan="1">One data point for Y</td></tr><tr><td colspan="1" rowspan="1">xˉ</td><td colspan="1" rowspan="1">Mean (average) of all X’s</td><td colspan="1" rowspan="1">The center of X values</td></tr><tr><td colspan="1" rowspan="1">yˉ</td><td colspan="1" rowspan="1">Mean (average) of all Y’s</td><td colspan="1" rowspan="1">The center of Y values</td></tr><tr><td colspan="1" rowspan="1">)(xi−xˉ)</td><td colspan="1" rowspan="1">X’s deviation from its mean</td><td colspan="1" rowspan="1">How far each X is from average</td></tr><tr><td colspan="1" rowspan="1">(yi−yˉ)</td><td colspan="1" rowspan="1">Y’s deviation from its mean</td><td colspan="1" rowspan="1">How far each Y is from average</td></tr><tr><td colspan="1" rowspan="1">Numerator ∑(xi−xˉ)(yi−yˉ)\</td><td colspan="1" rowspan="1">Sum of products of deviations</td><td colspan="1" rowspan="1">Shows how X and Y move together</td></tr><tr><td colspan="1" rowspan="1">Denominator Squared ∑(xi−xˉ)² ∑(yi−yˉ)²</td><td colspan="1" rowspan="1">Scale adjustment</td><td colspan="1" rowspan="1">Makes result between –1 and +1</td></tr></tbody></table><ol><li>See how far each X and Y is from their average.</li><li>Multiply those differences together for each pair.<ul><li>If both are above or below average → positive product.</li><li>If one is above and the other below → negative product.</li></ul></li><li>Add up all those products.</li><li>Divide by the total variation (the denominator).</li><li>The result tells you how related X and Y are.</li></ol>Example intuition<ul><li>r=  +1: Perfect upward trend — when X increases, Y always increases.</li><li>r= −1: Perfect downward trend — when X increases, Y always decreases.</li><li>r= 0r: No consistent relationship between X and Y.</li></ul>$

New cards

Bood Def:

Dependent Variable/ output

-What you’re trying to explain, predict, or measure.

-Think of it as the “outcome or result”

Ex:

Y= House price

Y= Infant mortality rate

Y= Exam score

Y is the Dependent Variable because it should somewhat depend on the model.

Gpt:

Y (Dependent Variable): the thing that changes as a result.
→ It “depends” on X.

In short

X → given or controlled → causes changes in Y.
Y → responds → depends on X.
X is “independent” not because it’s totally separate from Y in real life,
but because we treat it as something that isn’t affected by Y in our model.

New cards

Independent Variable/ input/ determined independently.

-The factor you think influences or explain Y.

Ex:

X= Study time (affects exam score)

X= Income per person (Affects infant mortality rate)

X=Square per footage of a house (affects price)

X is the Independent Variable, NOT because it’s independent of Y, but because it should determined outside of the model, called exogeneity.

Gpt:

X (Independent Variable): the thing we control or choose to see how it affects Y.
→ It’s “independent” because it’s decided outside the system we’re studying.

In short

X → given or controlled → causes changes in Y.
Y → responds → depends on X.
X is “independent” not because it’s totally separate from Y in real life,
but because we treat it as something that isn’t affected by Y in our model.

New cards

X & Y Examples:

Let’s say we’re studying:

X = hours studied
Y = exam score

Your exam score (Y) depends on how much you study (X).
You choose how many hours to study — that’s outside the exam system.
That’s why X is independent.
The exam score responds to how much you studied — that’s why Y is dependent.

Example 1

Y = Exam Scores

X = Mystatlab Averages

Example 2

Y = Infant Mortality Rates

X = Income per Capita

Example 3

Y = Sale Price of a Residential Home

X = Square Feet of Interior Space

Example 4

Y = Exam Scores

X = Student ID number (qualitative data, lol)

New cards

What we’re trying to figure out:

How strong is the relationship between X and Y? In what direction? How much evidence is there that X and Y are actually related?