1/9
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Describe the uses of correlation analysisÂ
Correlations = Used to evaluate exploratory data, to evaluate the relationship between two measured variablesÂ
Measures of correlation ask “what is the relationship between A and B?” or “Does A increase with variable B?”Â
Can be applied to paired observations on two different variables OR applied to one variable measured on two occasions Â
Ex: heart rate and level of exertion OR intelligence of a parent and childÂ
Pairs of observations (X (independent) and Y (dependent)) are examined to see if they go “together”Â
Taller people tend to weight more than shorter peopleÂ
Strong correlation = can infer something about the second value by knowing the first Â
Know how dependent variable is going to behave based on the independent Â
Does NOT equal causationÂ
Could be a 3rd variable affecting resultsÂ
Ex: number of churches in a city and crime & random correlation websiteÂ
Perfect positive = X increases in the same amount as Y increasesÂ
Perfect negative = X increases in the same amount as Y decreasesÂ
*Can estimate correlation by looking at whether graph is going up/down (positive/negative) or are close or spread apart (1.00 - 0)Â
Define the correlation coefficientÂ
Correlation coefficient (r) = provides an index that is a quantitative measure of the relationship between 2 variables (also known as Pearson’s Product Moment Correlation Coefficient)Â
Ranges from –1.00 to +1.00Â
Magnitude of the correlation coefficient indicates the STRENGTHÂ
Closer to 1 = STRONGER relationshipÂ
Closer to 0 = WEAK relationship (no correlation)Â
Closer to 0.5 = low correlationÂ
Gives the magnitude of a relationshipÂ
The sign (+/-) indicates the direction of the relationshipÂ
Tells you that two sets of data are related in a proportionalÂ
DOES NOT PROVIDE INFORMATION RELATIVE TO THE DIFFERENCE BETWEEN SETS OF DATA (just that it’s stat. Diff. From 0)Â
Use t-test to infer similarities/differences Â
Only works with linear relationshipsÂ
Integrate the value of the correlation coefficient with the shape and spread of a scatter plotÂ
Scatter plots = each point represents the intersection of a pair of related observationsÂ
Have an X and Y axisÂ
The closer the points are to the “straight line” = the higher the correlation (STRONGER RELATIONSHIP)Â
Ranges from –1.00 to +1.00Â
Magnitude of the correlation coefficient indicates the STRENGTHÂ
Closer to 1 = STRONGER relationshipÂ
Closer to 0 = WEAK relationship (no correlation)Â
Closer to 0.5 = low correlationÂ
Gives the magnitude of a relationshipÂ
The sign (+/-) indicates the direction of the relationshipÂ
Can estimate correlation by looking at whether graph is going up/down (positive/negative) or are close or spread apart (1.00 - 0)Â
Â
There can be curvilinear relationships Â
Ex: anxiety level and performanceÂ
Linear correlation will be nearly 0 but the scatterplot will show a clear relationship (curved)Â
Differentiate between significance of a correlation and strength of a correlationÂ
Significance of correlation (p-value) = Indicates that observed correlation is unlikely/likely due to chanceÂ
Is the correlation coefficient significantly different from 0?Â
Is the observed correlation a random effect or is it a good estimate of the population correlation?Â
DOES NOT mean there is a strong relationshipÂ
Can only indicate that it is NOT due to chance at the mostÂ
Sensitive to sample sizeÂ
Large sample sizes = r of 0.20 will be considered significantÂ
Small sample sizes = r of 0.45 will be considered significantÂ
Less useful in a correlation analysis than the actual magnitude of the correlationÂ
Look more at r value (than p) because all p-value says that it is statistically significant from 0Â
Does not provide information
relative to the difference between sets of data
Strength of correlation (r ) = tell you that two sets of data are related in a proportional way
Significance DOESN’T tell you anything about the difference between the sets of data
Describe the uses of linear regression analysisÂ
Linear regression = an inferential procedure that draws conclusions about a population based on samples taken from that populationÂ
Uses relationship between variables as a basis for predictionÂ
Draws the line of best fit (regression line)Â
Variable X (independent) = predictor variableÂ
Variable Y (dependent) = criterion variableÂ
Used for correlations less than 1.00Â
Interpret the parts of the straight line equation (define outliers and residuals)
Outliers = points that do not seem to fit with the rest of the scores Â
Lies outside the obvious cluster of scoresÂ
1 outlier alone can significantly alter the statistical description of the dataÂ
Don't remove from data UNLESS there’s a good reasonÂ
Measurement errorÂ
Linear regression equation: Y = mx + bÂ
Y = predicted value of Y (dependent variable)Â
m = slope (rate of change)Â
x = value of independent variable XÂ
b = Y-intercept (value of Y when X = 0)Â
Residuals = degree of error in regression line (residual value = difference between actual Y and predicted Y)Â
Points above/below the regression line (how far away are they from the line of best fit)Â
Define the coefficient of determination and compare it to the correlation coefficientÂ
Coefficient of determination (r^2) = gives the percentage of the total change in Y scores that can be explained by the X scoresÂ
Measure of proportionÂ
Ranges from 0.00 – 1.00 (NO NEGATIVE)Â
0 = no correlationÂ
1 = high correlationÂ
The amount of times you can account for Y based on XÂ
Ex: 76% of change in blood pressure can be accounted by knowing the ageÂ
Therefore, 24% of the time, the change in blood pressure is due to an unknown/unidentified variableÂ
 correlation coefficient (r ) = measures the strength and direction of the linear relationship between two variables
CAN BE NEGATIVE
Define “extrapolation”Â
Extrapolation = using linear regression to predict Y beyond the range of your data setÂ
SHOULD NOT DO THIS ON DATAÂ
Because don’t know where the data will continue to goÂ
Using linear regression to predict Y when you know X (“interpolation”) if it’s within the range of your data set (sometimes ok)Â
Explain the use of linear regression for a comparison of methods study (what is the comparison method)
Comparison of methods = uses linear regression to evaluate new procedures or equipment in clinical setting Â
Obtain data from both procedures, determine correlation, linear regression lineÂ
Old method = X-axisÂ
New method = Y-axisÂ
A “perfect” method agreement is:Â
Y= XÂ
m = 1Â
b =0Â
r >0.99Â
Define the perfect method agreement for the straight line equationÂ
Implies that the observed data points perfectly align with the calculated regression line.
A “perfect” method agreement is:Â
Y= XÂ
m = 1Â
b =0Â
r >0.99Â