Data
Numerical info (scores, measurements).
Variables
Characteristics that vary (e.g., gender, age).
Case
Entity from which data is collected (e.g., people, cities).
Descriptive Stats
Describe and summarize data (univariate or multivariate).
Inferential Stats
Generalize from sample to population; includes hypothesis testing and cause-effect relationships.
Discrete
Countable values with no fractional steps (e.g., yes/no responses, category counts).
Continuous
Can take infinitely many fractional values, limited only by instrument precision (e.g., age).
Independent Variable
The cause or predictor (e.g., number of drinks).
Dependent Variable
The effect or outcome (e.g., blood alcohol level).
Correlation
Relationship between variables without cause/effect.
Regression
Predict outcome (Y) from predictor(s) (X); can be single or multiple.
Population
Entire group (e.g., all Canadians 65+).
Sample
Subset of the population (e.g., 50 Guelph students).
Parameters
Describe population.
Statistics
Describe sample.
Random Sampling
Every member of the population has an equal chance of selection; ideally unbiased, but can be difficult and ethically complex in practice.
Scientific Method
Steps: Observation → Question → Hypothesis → Experiment → Analyze → Conclusion → Replicate.
Deductive Reasoning
General → Specific.
Inductive Reasoning
Specific → General.
Quantitative Research
Numeric, statistical.
Qualitative Research
Descriptive, open-ended.
Nominal Scale
Categories (e.g., yes/no).
Ordinal Scale
Ranked order (e.g., income levels).
Interval Scale
Equal intervals, no true zero (e.g., temperature).
Ratio Scale
True zero (e.g., age, salary).
Mean
Average, affected by outliers.
Median
Middle value, unaffected by outliers.
Mode
Most frequent score.
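A minimal sketch of these three measures of central tendency using Python's standard-library statistics module; the scores are hypothetical, chosen so an outlier shows the mean/median contrast:

```python
import statistics

scores = [2, 3, 3, 5, 7, 21]  # hypothetical scores; 21 is an outlier

print(statistics.mean(scores))    # ~6.83, pulled upward by the outlier
print(statistics.median(scores))  # 4.0, unaffected by the outlier
print(statistics.mode(scores))    # 3, the most frequent score
```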
Normal Distribution
Symmetrical bell curve; mean = median = mode.
Positive Skew
Tail extends to the right; a few unusually high scores.
Negative Skew
Tail extends to the left; a few unusually low scores.
Kurtosis
Leptokurtic: Tall and thin; Platykurtic: Flat; Mesokurtic: Normal.
Range
Difference between highest and lowest scores.
Variance
The average of squared deviations from the mean.
Standard Deviation
Square root of variance; measures spread of scores.
Average Deviation
Mean of the absolute deviations of scores from the mean (without absolute values, the deviations sum to zero).
Coefficient of Variation
Standard deviation ÷ mean; useful for comparing variability across different units.
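The spread measures above can be computed directly; a minimal sketch on a hypothetical sample (variance here uses the average-of-squared-deviations form given above, i.e., dividing by N):

```python
import math

scores = [4, 8, 6, 5, 3, 7]  # hypothetical sample
n = len(scores)
mean = sum(scores) / n

rng = max(scores) - min(scores)                      # range
variance = sum((x - mean) ** 2 for x in scores) / n  # avg squared deviation
sd = math.sqrt(variance)                             # standard deviation
avg_dev = sum(abs(x - mean) for x in scores) / n     # average deviation
cv = sd / mean                                       # coefficient of variation

print(rng, variance, sd, avg_dev, cv)
```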
68% Rule
68% of scores fall within ±1 SD of the mean.
Z-Scores
Standardized score showing how far a value is from the mean in SD units; useful for comparing across different distributions.
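A short sketch of why z-scores make different distributions comparable; the means and SDs are assumed values for two hypothetical tests:

```python
def z_score(x, mean, sd):
    """Distance of x from the mean, in SD units."""
    return (x - mean) / sd

# Hypothetical tests on different scales
print(z_score(85, 70, 10))     # 1.5 on test A (mean 70, SD 10)
print(z_score(620, 500, 100))  # 1.2 on test B (mean 500, SD 100)
# The 85 is relatively higher, even though 620 is the bigger raw score.
```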
Sample Space
All possible outcomes.
Event
A specific outcome.
Probability Function
Assigns each event a probability between 0 and 1; the probabilities across the sample space sum to 1.
Mutually Exclusive
Events that cannot occur together; P(A or B) = P(A) + P(B).
Not Mutually Exclusive
Events can overlap, so P(A or B) = P(A) + P(B) - P(A and B) (e.g., drawing an ace or a diamond overlaps at the ace of diamonds).
Bernoulli Distribution
Models a single success/failure trial; sampling with replacement keeps each trial's success probability constant.
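A quick simulation of independent Bernoulli trials; p is a hypothetical success probability, and sampling with replacement means p stays the same on every trial:

```python
import random

p = 0.3            # hypothetical probability of success
trials = 10_000
successes = sum(random.random() < p for _ in range(trials))
print(successes / trials)  # observed proportion, close to 0.3
```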
Null Hypothesis (H₀)
No difference or effect.
Alternative Hypothesis (H₁)
Predicts a difference or effect.
Type I Error (α)
False positive - rejecting H₀ when it's true.
Type II Error (β)
False negative - failing to reject H₀ when it's false.
Power
Probability of correctly rejecting a false H₀ (1 - β).
Z-Test
Used when population standard deviation is known.
T-Test
Used when the population standard deviation is unknown.
Independent T-Test
Used for two separate groups.
Paired T-Test
Used when one group is measured twice.
Degrees of Freedom (df)
One sample: N - 1; independent t-test: N - 2 (N = total participants across both groups).
Critical T-Value
Defines the rejection region; if calculated t > critical t, reject H₀.
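A minimal one-sample t-test sketch following that rule; the data and H₀ mean are hypothetical, and the critical value lookup assumes SciPy is available:

```python
from math import sqrt
from statistics import mean, stdev
from scipy import stats  # assumes SciPy is installed

sample = [102, 98, 110, 105, 95, 108, 101, 99]  # hypothetical scores
mu0 = 100                 # H0: population mean = 100
n = len(sample)
df = n - 1                # one-sample df = N - 1

# t = (sample mean - mu0) / (s / sqrt(n)); stdev() uses the N-1 form
t_obtained = (mean(sample) - mu0) / (stdev(sample) / sqrt(n))
t_critical = stats.t.ppf(1 - 0.05 / 2, df)  # two-tailed, alpha = .05

print(t_obtained, t_critical)
print(abs(t_obtained) > t_critical)  # reject H0 only if this is True
```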
Confidence Intervals
Range around a sample mean where the population mean likely falls.
Point Estimate
Exact value from sample data.
Interval Estimate
Range around the point estimate with a confidence level (e.g., 95%).
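A sketch of a t-based 95% interval estimate around a sample mean; the data are hypothetical and SciPy is assumed for the critical value:

```python
from math import sqrt
from statistics import mean, stdev
from scipy import stats  # assumes SciPy is installed

sample = [23, 19, 25, 22, 21, 24, 20, 26]  # hypothetical data
n = len(sample)
point_estimate = mean(sample)       # exact value from the sample
se = stdev(sample) / sqrt(n)        # standard error of the mean

t_crit = stats.t.ppf(0.975, n - 1)  # 95% confidence, df = N - 1
lower = point_estimate - t_crit * se
upper = point_estimate + t_crit * se
print(f"95% CI: ({lower:.2f}, {upper:.2f})")
```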
ANOVA (Analysis of Variance)
A parametric test used when comparing more than 2 means.
Between-group variance
Variation due to differences between group means.
Within-group variance
Variation within each group.
One-factor ANOVA
Different participants in each group (e.g., comparing schools).
Repeated-measures ANOVA
Same participants measured multiple times (e.g., before/after study).
Two-factor / Three-factor ANOVA
Tests for interactions between two or more independent variables (e.g., school type × region × income level).
Bonferroni-Dunn correction
Divide α by the number of comparisons to keep overall error rate at 0.05.
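The correction itself is a single division; a tiny sketch with an illustrative number of comparisons:

```python
alpha = 0.05
n_comparisons = 3          # e.g., 3 pairwise tests after a 3-group ANOVA
alpha_per_test = alpha / n_comparisons
print(alpha_per_test)      # ~0.0167; each comparison must beat this
```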
Assumptions of ANOVA
Normal distribution, homogeneity of variance, independence of observations, interval or ratio data.
F-Ratio
F = MSbetween / MSwithin; F > 1 indicates more variability between groups than within, suggesting potential statistical significance.
Degrees of Freedom
df between = k - 1 (where k = number of groups); df within = N - k (where N = total sample size); df total = N - 1.
ANOVA Steps
1. Calculate sum of squares (SS): total, between, and within. 2. Calculate degrees of freedom (df). 3. Calculate mean square (MS = SS / df). 4. Calculate the F-ratio (MSbetween / MSwithin). 5. Compare Fobtained to Fcritical (use F-table). 6. If Fobtained > Fcritical, reject H₀.
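A minimal one-way ANOVA that walks through those six steps by hand; the three groups are hypothetical, and SciPy is assumed only for the F-critical lookup:

```python
from scipy import stats  # assumes SciPy is installed

groups = [[4, 5, 6, 5], [7, 8, 6, 7], [10, 9, 11, 10]]  # hypothetical k = 3 groups
all_scores = [x for g in groups for x in g]
N, k = len(all_scores), len(groups)
grand_mean = sum(all_scores) / N

# Step 1: sums of squares
ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)

# Step 2: degrees of freedom
df_between, df_within = k - 1, N - k

# Step 3: mean squares (MS = SS / df)
ms_between, ms_within = ss_between / df_between, ss_within / df_within

# Step 4: F-ratio
f_obtained = ms_between / ms_within

# Steps 5-6: compare to F-critical at alpha = .05; reject H0 if larger
f_critical = stats.f.ppf(0.95, df_between, df_within)
print(f_obtained, f_critical, f_obtained > f_critical)
```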
Post-Hoc Tests
Used only if ANOVA is significant to determine which groups are significantly different; Tukey's HSD is a common post-hoc test.
Correlational Studies
Predictor variable (X-axis) is the independent variable; Criterion variable (Y-axis) is the dependent variable.
Correlation Analysis
Measures the strength and direction of the relationship between two variables (X and Y).
Scatterplots
Show how closely data points fit the regression line (line of best fit).
Outliers
Vertical outliers affect the relationship, while horizontal outliers are called leverage points.
PPMC (Pearson Product-Moment Correlation; Pearson's r)
The correlation coefficient (r) ranges from -1 to 1, showing strength and direction (positive/negative) of the relationship.
Strength of r
0.00-0.25: Little to no correlation; 0.25-0.50: Fair correlation; 0.50-0.75: Moderate to good correlation; 0.75-1.00: Good to excellent correlation.
R² (coefficient of determination)
Indicates the percentage of variability in one variable explained by the other variable; e.g., R² = 0.49 means 49% of variability in Y is explained by X.
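Pearson's r and R² computed from their definitions, on hypothetical paired data:

```python
from math import sqrt

x = [1, 2, 3, 4, 5, 6]  # hypothetical predictor scores
y = [2, 1, 4, 3, 7, 5]  # hypothetical criterion scores
n = len(x)
mx, my = sum(x) / n, sum(y) / n

# r = sum of cross-products / sqrt(SSx * SSy)
sp = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
r = sp / sqrt(sum((xi - mx) ** 2 for xi in x) * sum((yi - my) ** 2 for yi in y))

r_squared = r ** 2      # share of Y's variability explained by X
print(r, r_squared)
```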
Limitations of PPMC
A high r-value doesn't prove causation; range and extreme data points can affect the results; assumes a linear relationship between variables.
Regression Analysis
Models the relationship between a dependent variable (Y) and an independent variable (X) using a straight line.
Linear Regression Equation
Y = bX + a (where b is the slope and a is the Y-intercept).
Slope
Indicates the direction of the relationship (positive or negative).
Y-intercept
Value of Y when X = 0.
Prediction in Regression
The line of best fit helps predict Y from X, and error is the difference between the predicted and actual values.
Coefficient of Determination (R²)
Measures the proportion of variability in Y explained by the variability in X; e.g., if R² = 0.495, 49.5% of the variance in Y is accounted for by X.
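A least-squares sketch of Y = bX + a with hypothetical data, including a prediction for a new X:

```python
x = [1, 2, 3, 4, 5]            # hypothetical predictor values
y = [2.1, 3.9, 6.2, 8.1, 9.8]  # hypothetical outcomes
n = len(x)
mx, my = sum(x) / n, sum(y) / n

# Slope b = sum((X - mean X)(Y - mean Y)) / sum((X - mean X)^2)
b = (sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
     / sum((xi - mx) ** 2 for xi in x))
a = my - b * mx                # Y-intercept: predicted Y when X = 0

def predict(new_x):
    """Predicted Y from the line of best fit."""
    return b * new_x + a

print(b, a, predict(6))        # error = actual Y - predicted Y
```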
RCT (Randomized Controlled Trials)
Establish cause-and-effect relationships; considered the gold standard for drug trials.
Historical Data
Can show correlation but doesn't establish causality.
Statistical Testing
Evaluates whether an observed relationship or effect is likely due to chance, starting from the null hypothesis (H₀) of no relationship or effect between variables.
Non-Directional Hypothesis
Open to the possibility of either a positive or negative relationship.
T-tests & ANOVA
Used to compare means between two or more groups; T-test compares two groups, ANOVA compares more than two groups.
Example: Covid-19 Case Study
Analyzed the relationship between vaccination rates and new case counts, showing statistical significance (p < 0.05).
Conclusion of Correlation Studies
In correlation studies, r shows the relationship, R² quantifies the explanation, and regression predicts outcomes.
Statistical Significance
Statistical significance (e.g., p-value < 0.05) confirms whether a relationship exists without concluding causality.