2nd- Chapter 4 The sample Correlation Coefficient.

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/5

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

6 Terms

1
New cards

Rxy

The sample correlation coefficient.

Book Def: measures the strength of the linear relationship between X and Y. It gives the average change in standard deviations of Y for every 1 standard deviation increase in X.

Ex: rxy= +0.72 means that the Y-variable rose an average of 0.72 std deviations for every 1 std deviation increase in X.

Gpt:

  • A number that measures how strong and what direction the relationship between X and Y is.

  • It always falls between –1 and +1.

    • +1 = Perfect positive relationship (X goes up, Y goes up exactly).

    • –1 = Perfect negative relationship (X goes up, Y goes down exactly).

    • 0 = No linear relationship.

  • Interpretation:
    rₓᵧ = +0.72 → On average, for every 1 standard deviation increase in X, Y increases by 0.72 standard deviations.

  • Example:

    • X = Hours studied, Y = Exam scores

    • rₓᵧ = 0.72 → Strong positive relationship (more hours, higher scores).

2
New cards

(The sample correlation coefficient) rxy  equation:

Book Def:

Think of (yi-y ̅) as Y’s deviation of the mean and (xi-x ̅ ) as X’s deviation from the mean.

(yi-y ̅)(xi-x ̅) is the product of Y’s deviation from its mean and X’s deviation from its mean.

∑(yi-y ̅)^2 and ∑(xi-x ̅)^2 are the total variation of X and Y.

Gpt: 

The formula measures how strongly two things (X and Y) move together.

  • If both go up and down together, the correlation is positive (close to +1).

  • If one goes up when the other goes down, the correlation is negative (close to –1).

  • If they move randomly, the correlation is near 0.

Part

Meaning

In simple words

xi​

Each value of X

One data point for X

yi

Each value of Y

One data point for Y

Mean (average) of all X’s

The center of X values

Mean (average) of all Y’s

The center of Y values

)(xi​−xˉ)

X’s deviation from its mean

How far each X is from average

(yi​−yˉ​)

Y’s deviation from its mean

How far each Y is from average

Numerator ∑(xi−xˉ)(yi−yˉ)\

Sum of products of deviations

Shows how X and Y move together

Denominator

Squared ∑(xi−xˉ)² ∑(yi−yˉ)²

Scale adjustment

Makes result between –1 and +1

  1. See how far each X and Y is from their average.

  2. Multiply those differences together for each pair.

    • If both are above or below average → positive product.

    • If one is above and the other below → negative product.

  3. Add up all those products.

  4. Divide by the total variation (the denominator).

  5. The result tells you how related X and Y are.

Example intuition

  • r=  +1: Perfect upward trend — when X increases, Y always increases.

  • r= −1: Perfect downward trend — when X increases, Y always decreases.

  • r= 0r: No consistent relationship between X and Y.

<p> </p><p><em>Book Def:</em></p><p><em>Think of </em><span style="font-family: &quot;Cambria Math&quot;;">(yi-y&nbsp;̅)</span><em> as Y’s deviation of the mean and </em><span style="font-family: &quot;Cambria Math&quot;;">(xi-x&nbsp;̅</span> )<em> as X’s deviation from the mean.</em></p><p></p><p><span style="font-family: &quot;Cambria Math&quot;;">(yi-y&nbsp;̅</span>)(<span style="font-family: &quot;Cambria Math&quot;;">xi-x&nbsp;̅)</span><em> is the product of Y’s deviation from its mean and X’s deviation from its mean.</em></p><p></p><p><span style="font-family: &quot;Cambria Math&quot;;">∑(yi-y&nbsp;̅)^2 </span><em>and </em><span style="font-family: &quot;Cambria Math&quot;;">∑(xi-x&nbsp;̅)</span>^<span style="font-family: &quot;Cambria Math&quot;;">2</span><em> are the total variation of X and Y.</em></p><p></p><p><em>Gpt:&nbsp;</em></p><p>The formula measures <strong>how strongly two things (X and Y)</strong> move <strong>together</strong>.</p><ul><li><p>If both go <strong>up and down together</strong>, the correlation is <strong>positive</strong> (close to +1).</p></li><li><p>If one goes <strong>up when the other goes down</strong>, the correlation is <strong>negative</strong> (close to –1).</p></li><li><p>If they move <strong>randomly</strong>, the correlation is <strong>near 0</strong>.</p></li></ul><p></p><table style="min-width: 75px;"><colgroup><col style="min-width: 25px;"><col style="min-width: 25px;"><col style="min-width: 25px;"></colgroup><tbody><tr><th colspan="1" rowspan="1"><p>Part</p></th><th colspan="1" rowspan="1"><p>Meaning</p></th><th colspan="1" rowspan="1"><p>In simple words</p></th></tr><tr><td colspan="1" rowspan="1"><p>xi​</p></td><td colspan="1" rowspan="1"><p>Each value of X</p></td><td colspan="1" rowspan="1"><p>One data point for X</p></td></tr><tr><td colspan="1" rowspan="1"><p>yi</p></td><td colspan="1" rowspan="1"><p>Each value of Y</p></td><td colspan="1" rowspan="1"><p>One data point for Y</p></td></tr><tr><td colspan="1" rowspan="1"><p>xˉ</p></td><td colspan="1" rowspan="1"><p>Mean (average) of all X’s</p></td><td colspan="1" rowspan="1"><p>The center of X values</p></td></tr><tr><td colspan="1" rowspan="1"><p>yˉ</p></td><td colspan="1" rowspan="1"><p>Mean (average) of all Y’s</p></td><td colspan="1" rowspan="1"><p>The center of Y values</p></td></tr><tr><td colspan="1" rowspan="1"><p>)(xi​−xˉ)</p></td><td colspan="1" rowspan="1"><p>X’s deviation from its mean</p></td><td colspan="1" rowspan="1"><p>How far each X is from average</p></td></tr><tr><td colspan="1" rowspan="1"><p>(yi​−yˉ​)</p></td><td colspan="1" rowspan="1"><p>Y’s deviation from its mean</p></td><td colspan="1" rowspan="1"><p>How far each Y is from average</p></td></tr><tr><td colspan="1" rowspan="1"><p>Numerator ∑(xi−xˉ)(yi−yˉ)\</p></td><td colspan="1" rowspan="1"><p>Sum of products of deviations</p></td><td colspan="1" rowspan="1"><p>Shows how X and Y move together</p></td></tr><tr><td colspan="1" rowspan="1"><p>Denominator </p><p><span style="font-family: &quot;Cambria Math&quot;;">Squared </span>∑(xi−xˉ)² ∑(yi−yˉ)²</p></td><td colspan="1" rowspan="1"><p>Scale adjustment</p></td><td colspan="1" rowspan="1"><p>Makes result between –1 and +1</p></td></tr></tbody></table><ol><li><p>See how far each X and Y is from their average.</p></li><li><p>Multiply those differences together for each pair.</p><ul><li><p>If both are above or below average → positive product.</p></li><li><p>If one is above and the other below → negative product.</p></li></ul></li><li><p>Add up all those products.</p></li><li><p>Divide by the total variation (the denominator).</p></li><li><p>The result tells you <strong>how related X and Y are</strong>.</p></li></ol><p></p><p><strong>Example intuition</strong></p><ul><li><p>r=&nbsp; +1: Perfect upward trend — when X increases, Y always increases.</p></li><li><p>r= −1: Perfect downward trend — when X increases, Y always decreases.</p></li><li><p>r= 0r: No consistent relationship between X and Y.</p></li></ul><p></p>
3
New cards

Y=

Bood Def:

Dependent Variable/ output

-What you’re trying to explain, predict, or measure. 

-Think of it as the “outcome or result” 

Ex: 

Y= House price 

Y= Infant mortality rate

Y= Exam score 

Y is the Dependent Variable because it should somewhat depend on the model.

Gpt: 

Y (Dependent Variable): the thing that changes as a result.
→ It “depends” on X.

In short

  • X → given or controlled → causes changes in Y.

  • Y → responds → depends on X.

  • X is “independent” not because it’s totally separate from Y in real life,
    but because we treat it as something that isn’t affected by Y in our model.

4
New cards

X=

Independent Variable/ input/ determined independently.

-The factor you think influences or explain Y. 

Ex:

X= Study time (affects exam score)

X= Income per person (Affects infant mortality rate)

X=Square per footage of a house (affects price)

X is the Independent Variable, NOT because it’s independent of Y, but because it should determined outside of the model, called exogeneity.

Gpt: 

X (Independent Variable): the thing we control or choose to see how it affects Y.
→ It’s “independent” because it’s decided outside the system we’re studying.

In short

  • X → given or controlled → causes changes in Y.

  • Y → responds → depends on X.

  • X is “independent” not because it’s totally separate from Y in real life,
    but because we treat it as something that isn’t affected by Y in our model.

5
New cards

X & Y Examples:

Let’s say we’re studying:

X = hours studied
Y = exam score

  • Your exam score (Y) depends on how much you study (X).

  • You choose how many hours to study — that’s outside the exam system.
    That’s why X is independent.

  • The exam score responds to how much you studied — that’s why Y is dependent.

Example 1

Y = Exam Scores

X = Mystatlab Averages

Example 2

Y = Infant Mortality Rates

X = Income per Capita

Example 3

Y = Sale Price of a Residential Home

X = Square Feet of Interior Space

Example 4

Y = Exam Scores

X = Student ID number (qualitative data, lol)

6
New cards

What we’re trying to figure out: 

How strong is the relationship between X and Y? In what direction? How much evidence is there that X and Y are actually related?