PSYC3290 FINAL!!!!!

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 196

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

197 Terms

1

**What is Standard Error of the Mean (SEOM):

it is the standard deviation of the sampling distribution of the sample mean (MEAN HEAP) that tells us how precise our estimate of the population mean is (measure of sampling variability)

New cards
2

How do you calculate SEOM with a true population SD?

SEM = pop SD/sqrt N

<p><strong><span class="bgY">SEM = pop SD/sqrt N</span></strong></p>
New cards
3

When the sample size increases, what happens to the standard error of the mean?

a) It decreases

b) It increases

c) It remains constant

d) It fluctuates randomly

a) it decreases

New cards
4

How do you calculate SEOM without the true population SD?

take the sample standard deviation (s) instead of the population standard deviation and divide by the square root of the sample size

New cards
5

**What is the central Limit Theorem:

states that the sum, or the mean, of a number of independent variables has, approximately, a normal distribution, almost whatever the distributions of those variables larger N, closer the distribution of T and curve is to normal. (distributions produce a roughly normal sampling distribution of sample means)

New cards
6

Which of the following is a correct implication of the Central Limit Theorem for hypothesis testing?
a) It allows us to use the normal distribution when the sample size is large enough, even if the population distribution is not normal
b) It requires that the population distribution is always normal
c) It only applies to studies with small sample sizes
d) It invalidates the use of t-tests for large samples

a) It allows us to use the normal distribution when the sample size is large enough, even if the population distribution is not normal

New cards
7

According to the Central Limit Theorem, what happens as the sample size increases?
a) The sample means become less variable
b) The sample means become more variable
c) The sample means tend to form a uniform distribution
d) The sample means become identical to the population mean

a) The sample means become less variable (and approach a normal distribution! - large sample size means more ability to infer population characteristics from sample)

New cards
8

**what is the t-distribution:

very similar to normal distribution but with HIGHER TAILS, we cant use 1.96 anymore and here we have 2.145 as our new limits for 95% CI, this is bc not only does our estimate of µ vary from sample to sample, but our estimate of σ does too!(The larger the df (sample) the closer to the normal distribution your t distribution will be)

<p>very similar to normal distribution but with <strong>HIGHER TAILS, we cant use 1.96 anymore and here we have 2.145 as our new limits for 95% CI, this is bc not only does our estimate of µ vary from sample to sample, but our estimate of </strong>σ<strong> does too!<span class="bgY">(The larger the df (sample) the closer to the normal distribution your t distribution will be)</span></strong></p>
New cards
9

cats eye picture tells us:

how to interpret CI, says not all values in the range are likely to be population, if they are in the middle they are more likely than on outside! as you move away from it, it becomes less likely to true mean

(a CI has C% likelihood of capturing µ, graphically illustrated in a cats eye)

<p>how to interpret CI,  says not all values in the range are likely to be population, <strong>if they are in the middle they are more likely than on outside!</strong> <strong>as you move away from it, it becomes less likely to true mean</strong> </p><p>(a CI has C% likelihood of capturing µ, graphically illustrated in a cats eye)</p>
New cards
10

what does the t distribution represent:

the probability of different outcomes for a sample mean when the true population mean is unknown! (especially for SMALL sample sizes)

New cards
11

how do you calculate a t-score?

T = M-µ / s /√N (sample mean-population mean divided by sample standard deviation / square root of N)

<p>T = M-µ <strong>/</strong> s /√N (sample mean-population mean divided by sample standard deviation / square root of N)</p>
New cards
12

**What is power:

the chance of CORRECTLY REJECTING the null hypothesis if it is in fact FALSE (corresponds to beta 1 - β - the probability that we can detect said effect and reject the null)

New cards
13

what is power influenced by?

1) alpha (larger alpha = greater power needed to detect an effect, increases type I error)
2) Effect size (larger effect = greater power to detect it)
3) sample size (N)

New cards
14

What does a power of .08 represent?

the likelihood that a study will correctly reject the null if the alternative hypothesis is true. 80% chance of avoiding Type II error!

New cards
15

** What is sampling variability:

the variability in results caused by using different samples (if you repeatedly take samples of the sample size from a population and calculate the mean of each sample, the means will differ!)

New cards
16

post hoc analysis:

(after the fact) is not specified in advance. It risks merely telling us about sampling variability, but may provide valuable hints for further investigation

New cards
17

**What is the dance of the means:

sampling variability creates the visual effect of the means dancing around the population mean (Distribution created by the means of many samples the little dots below the X value are the sample Green dots = sample means)

<p>sampling variability creates the visual effect of the means dancing around the population mean <strong>(Distribution created by the means of many samples</strong> the little dots below the X value are the sample <strong>Green dots = sample means)</strong></p>
New cards
18

**What is Type I Error:

Rejecting the null hypothesis when its true (false positive)

New cards
19

**What is Type II Error:

Failing to reject the null hypothesis when its false (false negative)

New cards
20

Independent group design has the ability to:

estimate the p-values based on OVERLAP of CIs (utilizes a p value, graph overlap - inter-individual variability)

New cards
21

Paired design has the ability to:

have a no overlap rule (intra-individual variability)

New cards
22

crtitical t vs obtained t value is:

what you get using a table that tells us which t values contain 95% of the scores middle VS. a t value you get from a calculated sample

New cards
23

**Gap between CIs (Independent groups):

the p-value is less than .01 (significant)

New cards
24

just touching CIs (independent groups):

the p value is -.01 (significant)

New cards
25

moderate overlapping CIs

the p value is -.05 (insiginificant)

New cards
26

large overlapping CIs

the p value is greater than .05 (insignificant)

New cards
27

we use CI to evaluate the null by:

comparing the null hypothesis value to the CI (If the hypothesized value falls within the interval, we fail to reject the null hypothesis. If the hypothesized value falls outside the interval, we reject the null hypothesis)

New cards
28

**What is cohen's D:

a standardized effect size measure which is expressure in standard deviation units (assesses the standardized difference between two means in terms of standard deviation)

New cards
29

how to calculate cohen's d:

d = effect size in original units / appropriate standard deviation (ES/SD) for independent groups : d = (M1 - M2) / sp

New cards
30

how to interpret cohen's d:

small = .20
medium = .50
large = .80

New cards
31

**CI on the difference between 2 means is:

the difference between the 2 group means (m2-m1) (either laptop or pen group, here the difference is 0 which means it is not a very likely mean difference, so we reject the null hypothesis)

<p>the difference between the 2 group means (m2-m1) (either laptop or pen group, <strong>here the difference is 0</strong> which means it is not a very likely mean difference, so <strong>we reject the null hypothesis</strong>)</p>
New cards
32

MOE for one sample design calculation:

MOE = t.95 (df) X s (1/sqrtN)

<p>MOE = t.95 (df) X s (1/sqrtN)</p>
New cards
33

MOE for CI difference calculation (for more than one sample design):

MOEdiff = t.95 (N1 + N2 - 2) X ...

<p>MOEdiff = t.95 (N1 + N2 - 2) X ...</p>
New cards
34

Calculating the 95% CI on the difference requires a few components which are:

1) the t- and df component
2) the variability component
3) the sample size component(use critical t to calculate MoE)

<p>1) the t- and df component<br>2) the variability component<br>3) the sample size component(use critical t to calculate MoE)</p>
New cards
35

**What is true about sample size:

1) generally, the larger the better (4x the sample size cuts the CI by HALF)
2) when sample size is larger, mean heap is narrow
3) larger the sample, lower the odds of Type II error
4) larger the sample, more likely to detect an effect if it stil exists

New cards
36

small sample size means:

wider distribution of sample means(less precise, longer CI, and bigger MOE)

New cards
37

larger sample size means:

narrower distribution of sample means (more precise, shorter CI, and smaller MOE)

New cards
38

if you have a narrow confidence interval around population mean estimate you will have _______ precision in your estimate

high

New cards
39

what is a paired design?

IV is a within groups variable/repeated measure (all participants experience same conditions) pretest and postest)(compares the means of two measurements taken from the same individual, object, or related units)

- recommended when sample sizes are SMALL

- participants are given a more precise CI on difference than either CI for the means

- no overlap rule is possible

EX: depressed people going to therapy before vs after

New cards
40

What is an independent groups design?

between groups variable, utilizes different participants are used in each condition of the IV

- recommended for LARGE samples

EX: pen vs laptop group

<p>between groups variable, utilizes different participants are used in each condition of the IV</p><p>- recommended for <strong>LARGE</strong> samples</p><p>EX: pen vs laptop group</p>
New cards
41

**scenario question 1 EX- figure out the type of design being used: A researcher is running a study looking at the effects of epidural dosages and time on patients’ comfortability while in labour. Patients received only one of three epidural dosages (dose A, dose B, dose C) and are measured at two time-points (10-minutes and 20-minutes).

Split plot ANOVA

New cards
42

**scenario question 2 EX- figure out the type of design being used:

Interaction between testing and gender

<p>Interaction between testing and gender</p>
New cards
43

**scenario question 3 EX- figure out the type of design being used: A researcher is trying to decide one which design best fits her research study. She’s looking at the effects of exercise on anxiety levels. Her exercise conditions include running, walking, jumping rope, and weights, such that every participant completes every condition

Repeated measures One-Way ANOVA

New cards
44

**scenario question 4 EX- figure out the type of design being used: A researcher is running a study looking at the effects of epidural dosages and time on patients’ comfortability while in labour. Patients received only one of three epidural dosages (dose A, dose B, dose C) and are measured at two time-points (10-minutes and 20-minutes).

a. How many hypotheses will the researcher be testing with this design?

Three

New cards
45

**Scenario question 5 EX- figure out the type of design being used: A researcher is studying the effects of two different teaching methods (traditional vs. interactive) on students' test scores. The researcher also wants to examine whether the effect of teaching method depends on students' prior knowledge level (high vs. low).Each student is assigned to one of the teaching methods based on their prior knowledge, and their test scores are measured at the end of the study.

Factorial ANOVA (between subjects) (explanation: there are 2 IV's both IVs are between-subjects factors, DV is test scores

New cards
46

**Scenario question 5 EX- figure out the type of design being used: A researcher is studying the effects of teaching method (lecture-based, discussion-based) and assessment type (written exam, oral presentation) on student performance. Each student is randomly assigned to one teaching method and completes both assessment types.

Split Plot ANOVA (explanation: there is one between group IV which is teaching method and students are each assigned to a group, and one within-subjects IV which is assessment type, where each student experienced BOTH levels)

New cards
47

**Scenario question 5 EX- figure out the type of design being used: A researcher is studying the effects of teaching method (lecture-based, discussion-based) and assessment type (written exam, oral presentation) on student performance. Each student is randomly assigned to one teaching method and completes both assessment types.

a) How many independent variables are in the design?

two (teaching method (between) and assessment type (within)

New cards
48

**Scenario question 5 EX- figure out the type of design being used: A researcher is studying the effects of teaching method (lecture-based, discussion-based) and assessment type (written exam, oral presentation) on student performance. Each student is randomly assigned to one teaching method and completes both assessment types.

b) if a significant interaction effect is found between teaching method and assessment type, what does this mean?

the effect of teaching method depends on the type of assessment

New cards
49

**Scenario question 5 EX- figure out the type of design being used: A researcher is studying the effects of teaching method (lecture-based, discussion-based) and assessment type (written exam, oral presentation) on student performance. Each student is randomly assigned to one teaching method and completes both assessment types.

c) if the main effect of the teaching method is significant, what conclusion can the researcher draw?

one teaching method outperforms the other across all assessment types

New cards
50

**Scenario question 5 EX- figure out the type of design being used: A researcher is studying the effects of teaching method (lecture-based, discussion-based) and assessment type (written exam, oral presentation) on student performance. Each student is randomly assigned to one teaching method and completes both assessment types.

d) If the researcher finds significant differences in student performance and wants to determine which teaching method or assessment type is driving the results, what type of post hoc test should be conducted?

Bonferroni correction

New cards
51

what is a bonferroni correction

used in within-subject comparisons to adjust for multiple comparisons

New cards
52

What is a tukey HSD:

suitable for post hoc test of between-subjects factors when theres multiple groups

New cards
53

How to determine if its a factorial or One-way ANOVA

factorial ANOVAs: has one or more independent variables one-way ANOVA: always has only one dependent variable

New cards
54

**What is Sum of Squares:

a way to measure how spread out a set of data points are from the mean, the sum of the squared differences between each data point and the mean (In a One-Way ANOVA each variability term is calculated using the sum of squares (SS)

New cards
55

**In a factorial II ANOVA you can calculate for cohen's f using N2p and the two IV's SS's:

f = √n2p/ 1-n2p

<p>f = √n2p/ 1-n2p</p>
New cards
56

**in the case of a one-way between subjects ANOVA, you need the f distribution which is calculated as:

f = between group variability + error variability + variability among population means / error variability

<p>f = between group variability + error variability + variability among population means / error variability</p>
New cards
57

when you run a factorial anova and your testing 3 null hypotheses at the same time what are looking for:

1) main effect of IVa
2) main effect of IVb
3) interaction between A and B
(null is that there is no effect of Iv1 and no effect of IV2, and interaction)

New cards
58

what do the 2 df values in one-way ANOVAs represent?

df 1 : k-1
df 2 : (k-1) (n-1) (repeated/within measures design)

k = number of groups

n = total number of participants

<p>df 1 : k-1<br>df 2 : (k-1) (n-1) (repeated/within measures design)<br><br>k = number of groups<br><br>n = total number of participants</p>
New cards
59

Obtaining SStotal means:

includes the variability associated with SSIV 2 and the interaction between IV1 and IV2 SSIV 1∗IV 2 (when more than one IV is present, η 2 p will always be larger than η 2 )

<p>includes the variability associated with SSIV 2 and the interaction between IV1 and IV2 SSIV 1∗IV 2 (when more than one IV is present, η 2 p will always be larger than η 2 )</p>
New cards
60

**What is the appropriate effect size measure for a one-way ANOVA repeated measures within or between?

eta-squared, or partial eta squared n2 (n2 = SSeffect/SStotal) shown as a percentage

<p>eta-squared, or partial eta squared n2 (n2 = SSeffect/SStotal) shown as a percentage</p>
New cards
61

**what is the appropriate effect size measure for factorial and Split plot ANOVAs (factorial within or factorial between)?

partial eta-squared, η 2 p (n2p = SSIV/ SSIV + SSerror)

<p>partial eta-squared, η 2 p (n2p = SSIV/ SSIV + SSerror)</p>
New cards
62

Effect size for T tests:

cohen's D

New cards
63

effect size for correlations:

r2 (percentage)

New cards
64

what do sums of squares generally represent:

it measure the total variability in the data, into between-group and within-group variability

New cards
65

How are sum of squares used in calculation of f statistic?

this specific statistic compares the variability explained by the model SS between tp the unexplained variability using the formula: F=MS(mean square) between/MSwithin

New cards
66

**What is partial eta-squared (𝜼 2p):

(more than one independent variable) An effect size estimate which can be computed for each main effect and interaction. Each of these effect size measures expresses SSIV as a proportion of SSIV + SSError

Some benchmarks for 𝜼 2p:

.01 (small)

.06 (medium)

>=.14 (large)

<p> (more than one independent variable) An effect size estimate which can be computed for each main effect and interaction. Each of these effect size measures expresses SSIV as a proportion of SSIV + SSError</p><p></p><p></p><p><strong>Some benchmarks for 𝜼 2p:</strong> </p><p>.01 (small)</p><p>.06 (medium)</p><p>&gt;=.14 (large)</p>
New cards
67

**what is eta-squared (n2)?

(one independent variable) name of the measure of a one-way ANOVA effect size (measured by expressing the between-groups variability (SSBetween) as a proportion of the total variability (SSTotal))

Some benchmarks for 𝜼 2:

.01 (small)

.06 (medium)

>=.14 (large)

<p> (one independent variable) name of the measure of a one-way ANOVA effect size (measured by expressing the between-groups variability (SSBetween) as a proportion of the total variability (SSTotal))</p><p></p><p><strong>Some benchmarks for 𝜼 2:</strong></p><p>.01 (small)</p><p>.06 (medium)</p><p>&gt;=.14 (large)</p>
New cards
68

Sphericity Assumption is:

The variability of the difference scores between each pair of conditions must be equal

New cards
69

**basic relation between a 95% CI and the p value is stated as:

- if the null hypothesis value lies outside of the 95% CI, the p value is less than .05, so p is less than the significance level and we reject the null hypothesis
- if the null hypothesis lies inside the 95% CI, the p value is greater than .05, so we dont reject

<p>- if the null hypothesis value lies outside of the 95% CI, the p value is less than .05, so p is less than the significance level and we reject the null hypothesis<br>- if the null hypothesis lies inside the 95% CI, the p value is greater than .05, so we dont reject</p>
New cards
70

z score formula (FOR KNOWLEDGE):

Z = (M - µ) / alpha/ sqrt N)

<p>Z = (M - µ) / alpha/ sqrt N)</p>
New cards
71

**Correlation r is:

measure of the strength of association between 2 variables, such as ratings from -1 through 0 to +1

New cards
72

factorial anova guide if there is an interaction:

analyze the simple main effects

<p>analyze the simple main effects</p>
New cards
73

factorial anova guide if there is NOT an interaction:

if there are no simple main effects your finished... BUT if there is between 2+ levels complete a POSTHOC TEST

<p>if there are no simple main effects your finished... BUT if there is <strong>between 2+ levels complete a POSTHOC TEST</strong></p>
New cards
74

**designs with more than one IV design (factorial 2 X 2) looks like:

chart has 2 IVS: violence and difficulty with 2 levels : easy and hard

<p>chart has 2 IVS: violence and difficulty with 2 levels : easy and hard</p>
New cards
75

**What is between variability:

seeing the difference between each group mean to overall mean (reflects the variability caused by IV) (AKA control, white, music)

New cards
76

**What is within variability:

seeing the difference within each group's data points in comparison each group mean, reflects the variability due to random factors NOT IV (participants)

New cards
77

**What is simple main effect?

when there is an interaction, determines where the difference lies in an interaction by comparing the effect of IV1 at each level of IV2, or conversely, the effect of IV2 at each level of IV1

EXAMPLE: difference between red triangle and another red triangle

<p>when there is an interaction, determines where the difference lies in an interaction by comparing the effect of IV1 at each level of IV2, or conversely, the effect of IV2 at each level of IV1</p><p><strong>EXAMPLE: difference between red triangle and another red triangle</strong></p>
New cards
78

what does a significant interaction mean?

that the effect of one IV depends on the level of the other IV (existence of NON parallel lines)

EXAMPLE: comparing 3 circles or 3 triangles, (if one isnt and other are just test for significant one!)

<p>that the effect of one IV depends on the level of the other IV (existence of NON parallel lines)</p><p></p><p>EXAMPLE: comparing 3 circles or 3 triangles, (<strong>if one isnt and other are just test for significant one!)</strong></p>
New cards
79

**when is it appropriate to analyze main effects?

when theres no interaction, focusing one one IV and you found one IV is significant

New cards
80

what is the difference between r = .1 and .9 on a scatterplot??

the strength and pattern of the relationship between the variables X and Y:

Correlation r=.1

Strength: Very weak correlation.

Scatterplot Appearance: The data points will be widely scattered with no clear trend. There’s only a slight tendency for Y to increase (or decrease) as X increases.

Interpretation: A small positive correlation indicates a weak linear relationship. Knowing X gives very little predictive power about Y

Correlation r=0.9

Strength: Very strong positive correlation.

Scatterplot Appearance: The data points will be tightly clustered along a straight line sloping upwards. There’s a clear pattern showing that as X increases, Y also increases in a predictable way.

Interpretation: A large positive correlation means a strong linear relationship. Knowing X provides high predictive power about Y

New cards
81

**how to identify if a scatterplot is showing a negative or positive correlation?

more matched = positive correlation

EX: more sleep hrs are associated with better cognitive test scores

(both variables move in same direction (as one increases so does other)

more mismatched = negative correlation

EX: fewer sleep hrs correlate with lower cognitive scores

(variables move in opposite directions, as one increases other decreases)

(NO RELATION: sleep hrs and cognitive performance are unrelated)

<p>more matched = <strong>positive</strong> correlation</p><p>EX: more sleep hrs are associated with better cognitive test scores</p><p>(both variables move in same direction (as one increases so does other)</p><p></p><p>more mismatched = <strong>negative</strong> correlation</p><p>EX: fewer sleep hrs correlate with lower cognitive scores</p><p>(variables move in opposite directions, as one increases other decreases)</p><p></p><p><strong>(NO RELATION: sleep hrs and cognitive performance are unrelated)</strong></p>
New cards
82

**Dance of r values is measured by:

correlation coefficient

r = +1 perfect positive relationship

r = -1 perfect negative relationship

r = 0 no relationship

New cards
83

**Confidence Intervals around Pearson's r:

r can be used to estimate ρ. r values dance for smaller Ns!!

(if CI overlaps 0 = possible fail to reject null as 0 is possible if its in range)

(the lower the value the more confident you are that true pop is .5 null = 0)

New cards
84

how does this scatterplot showing the 95% CI for r show asymmetry of CIs on r:

r = 0; long and symmetric CI—as r gets further from 0, the CI gets shorter andmore asymmetric, with the side further from 0 as the smaller side.

<p>r = 0; long and symmetric CI—as r gets further from 0, the CI gets shorter andmore asymmetric, with the side further from 0 as the smaller side.</p>
New cards
85

how does this scatterplot showing the 95% CI for r show an effect of N on CIs on r:

Predictably, larger N results in a shorter CI

<p>Predictably, larger N results in a shorter CI</p>
New cards
86

the CI on the difference of r allows us to:

tell the difference if one variable has a stronger correlation than another variable

<p>tell the difference if one variable has a stronger correlation than another variable</p>
New cards
87

**Line of best fit is calculated by:

finding the line that minimizes the squared differences between actual data points + predicted values on line. Ensures the best representation of relationship between variables

New cards
88

**Calculation for linear regression:

is through r2

New cards
89

***what does the formula for linear regression represent:

based on a dataset of (X, Y) pairs, (Y – Ŷ ) is an estimation error/residual, where Y is an observed value for some X, and Ŷ is the estimated value given by the regression line for that X The SD of the residuals is written as sY.X ,and the summation is over all N data points

<p>based on a dataset of (X, Y) pairs, <strong>(Y – Ŷ )</strong> is an <strong>estimation error/residual,</strong> where <strong>Y is an observed value</strong> for <strong>some X</strong>, and <strong>Ŷ is the estimated value given </strong>by the <strong>regression line for that X The SD</strong> of the <strong>residuals</strong> is written as <strong>sY.X</strong> ,and the <strong>summation is over all N</strong> <strong>data points</strong></p>
New cards
90

Well-being and Body Satisfaction linear regression example:

The “regression of Y on X” means that we intend to use X values to predict Y.• X (Body Satisfaction) is the predictor variable and Y (Well-being) is the predicted/criterion variable.

Regression line goes through the means of X and Y, marked by crossed vertical and horizontal lines. In our figure, if a participant scores X = 3.0, what is our best point estimate of their wellbeing score?

our best point estimate of their wellbeing score is Ŷ = 4.44

<p><strong>our best point estimate of their wellbeing score is Ŷ = 4.44</strong></p>
New cards
91

**Calculation for multiple regression:

is through partial R2

<p>is through partial R2</p>
New cards
92

**Complications with multiple regression:

It can be confusing as there are 2 predictor variables being assessed for effects on the main outcome variable, they are both correlated but independent

<p>It can be confusing as there are 2 predictor variables being assessed for effects on the main outcome variable, they are both correlated but independent</p>
New cards
93

***What does the formula for multiple regression represent:

using partial r2: the b's in the formula represent each predictor variable and how they are added to get overall outcome variable in percentage (how much change in the outcome variable results form one-unit of change in the predictor variable adter controlling for the other predictor variable)

FREE THROW EX: after accounting for Age, bMPG = 0.33, which means that FTP increases by 0.33% for every extra minute of game time, regardless of Age. (In other words, bmpg tells us how much FTP increases with each minute of game actionwhen we hold the age of the player constant (i.e., considering only 28-year olds)

<p>using partial r2: the <strong>b's</strong> in the <strong>formula represent each predictor variable</strong> and <strong>how they are added</strong> to get <strong>overall outcome variable in percentage</strong> (how much change in the outcome variable results form one-unit of change in the predictor variable adter controlling for the other predictor variable) </p><p></p><p><strong><span class="bgY">FREE THROW EX:</span></strong>  after accounting for Age, bMPG = 0.33, which means that FTP increases by 0.33% for every extra minute of game time, regardless of Age. (In other words, bmpg tells us how much FTP increases with each minute of game actionwhen we hold the age of the player constant (i.e., considering only 28-year olds)</p>
New cards
94

what influence does an outlier have on r?

The distance of a point from the means of X and Y tells the tale of itsinfluence on r:
• it has a much stronger contribution to r ( in fact, this single data point turned our correlation from .89 to -.15!)

<p>The distance of a point from the means of X and Y tells the tale of itsinfluence on r:<br>• it has a much stronger contribution to r ( in fact, this single data point turned our correlation from .89 to -.15!)</p>
New cards
95

what is a regression based on:

a dataset of (X, Y) pairs.• The two are closely linked: large correlations mean X predicts Y more strongly; small correlations mean X only gives part of the story.

<p>a dataset of (X, Y) pairs.• The two are closely linked: <strong><span class="bgY">large correlations mean</span></strong> X predicts Y more strongly; <strong><span class="bgY">small correlations mean</span></strong> X only gives part of the story.</p>
New cards
96

what is an estimation error/residual

is (Y – Ŷ ) where Y is anobserved value for some X, and Ŷ is the estimated value givenby the regression line for that X (regression line minimizes this error!!)

New cards
97

how to find the slope of regression line:

b = r X (Sy/Sx) indicates how strongly the x and y variables are related

<p>b = r X  (Sy/Sx) indicates <strong>how strongly the x and y variables are related</strong></p>
New cards
98

How to find the intercept of regression line:

b0 = My - b X Mx

<p>b0 = My - b X Mx</p>
New cards
99

final equation of r correlation:

Ŷ = b0 +𝑏𝑋

<p>Ŷ = b0 +𝑏𝑋</p>
New cards
100

what is the Nifty test?

a good predictor test demonstrating the relationship between 2 variables EX:The correlation coefficient (r) between predictor X ("Nifty test score") and outcome Y (GPA)

Two extreme cases of correlation:

r=0: No correlation; the regression line is flat (mean line), and knowing X provides no information about Y.

r=1: Perfect positive correlation; all data points lie on the regression line, and knowing X gives exact knowledge of Y

New cards
robot