Study Notes on R Scores and Statistical Analysis
R Score and Hypothesis Testing
Introduction to Statistical Concepts
Focus on calculating an R score and understanding its significance.
Introduction to T-tests and ANOVA for testing significance, recognizing prior exposure in course 2106.
Key Statistical Terms
R Score (Pearson's R): Used to describe the relationship strength and direction between two variables.
Null Hypothesis: Assumes no relationship exists between the variables (Rho = 0).
Alternative Hypothesis: Assumes some relationship exists between the variables (Rho ≠ 0).
Sample Data Example: Maternal Behavior in Rats
Data collected on the maternal behavior of female rats categorized as high, medium, and low performers.
Discussion of correlation based on a sample of 5 rats, emphasizing the small sample size as a limitation.
Understanding R Score
Correlation values range from -1 to 1:
Values close to 1 indicate a strong positive correlation.
Values close to -1 indicate a strong negative correlation.
Values around 0 indicate no correlation.
Highlighting that with just two points, a perfect correlation could appear due to bias in small sample sizes.
Adjusted R Score
Adjusted R formula: where n is sample size.
Significance of checking adjusted R when sample size is small to ensure reliable results.
Importance of ensuring adjusted R and R are closely aligned when analyzing data.
Importance of Sample Size
Discussion on how small sample sizes can skew results and correlations.
Need for awareness of sampling errors and their impact on the data representation.
Significance Testing
Significance tests determine if an observed R value could appear purely through random chance.
Use of hypothesis testing:
Null hypothesis (H0): Rho = 0 (no correlation).
Alternative hypothesis (H1): Rho ≠ 0 (there is a correlation).
Degrees of freedom calculated as n - 2 for correlation tests.
Conducting Statistical Tests
Reference to the Howell textbook for statistical tables based on R values and degrees of freedom.
While both T-tests and correlation are used, T-tests focus on the relationship between variables with linear predictability.
SPSS and Output Analysis
SPSS generates correlation matrices with redundancy and significant indicators marked clearly.
Discussion on sample size definition and the significance of findings related to correlation analysis.
Understanding that p-values provide context for correlation significance and true relationship likelihood.
Factors Affecting Correlation Coefficients
Factors influencing the reliability of the R score:
Linearity: Whether data follow a linear relationship.
Restricted Range: The impact of analyzing a limited range of data points.
Extreme Observations: Influence of outliers that may distort results.
Heterogeneous Subgroups: Combining disparate groups can obscure true relationships.
Types of Correlation Coefficients
Importance of recognizing variations of Pearson's R in statistical analysis:
Spearman's R for ranked data.
Point-Biserial for dichotomous and interval data analysis.
Regression Analysis Introduction
Transition to regression focusing on predictive relationships rather than simply correlation.
Importance of regression as a method for predicting one variable based on another through a linear relationship.
Summary of Key Differences: Correlation vs. Regression
Correlation: Focus on degree of relationship.
Regression: Focus on predicting one variable from another based on established relationships.