Pre-lab notes: mood data (MOOD) and split reliability
JASP pre-lab notes: mood dataset (MOOD) and reliability (SPLIT)
Purpose and mindset
- JASP is a statistical program designed for psychology; aims to show where buttons are and explain what they mean, helping you learn by doing.
- Statistics are a tool to investigate what’s going on in people's heads and test ideas beyond intuition, which can be misleading.
- If you have a preferred learning style, consider using the recommended textbook for clearer explanations.
Datasets used in the session
- MOOD: used for descriptive statistics and correlation analysis.
- SPLIT: used to learn about reliability of a measure via Cronbach Alpha.
- Two data types appear in MOOD: a categorical variable (gender, labeled as sex) and several continuous variables (happy, pos, n).
Mood file: variables and interpretation
- Gender variable: sex (binary male/female); categorical; means are not meaningful. Use frequencies in reports (e.g., gender counts).
- Continuous variables:
- happy: life happiness (general happiness)
- pos: positive mood (mood in the moment)
- n: neuroticism (personality trait, not clinical; tendency to negative emotions and emotional volatility)
- Neuroticism notes:
- Part of BFI; often 12 items (6 positive, 6 negative).
- Negative items should be reverse-scored and aggregated into a single neuroticism score.
- Data structure:
- Each row = a participant; each column = a variable (e.g., participant 1 has a neuroticism score of 37; happiness and mood scores follow similarly).
- Visual intuition:
- Mood tends to vary; may show greater variance than trait-like happiness.
Quick pre-lab exercise: hypothesis generation
- Think of possible relationships between mood, happiness, and neuroticism; relationships may be positive or negative.
- Discuss the idea that relationships could be linear and that there is a spectrum of individuals.
Descriptive statistics in MOOD (descriptives)
- In JASP, open MOOD, select variables, and add them to the Descriptives analysis box; you can drag individually or select all at once.
- Output on the right shows descriptors; means and standard deviations (SD) for continuous variables are highlighted in green.
- Gender is treated as categorical; report frequencies instead of means.
- Interpreting the descriptive stats:
- Mean = average score across participants.
- SD = how much scores vary around the mean.
- Practical note: raw average scores don’t imply high/low meaning without population norms; avoid claiming that “neuroticism levels are very high/low” without norms.
- Reporting tip: present means and SDs for happy, pos, and n; report gender frequencies.
Understanding and interpreting scatter plots
- Scatter plots visualize relationships, not tests; eyeballing is useful but unreliable.
- Example concepts explained in the video:
- Temperature vs drinks sold: positive association (as temperature rises, drink sales rise).
- Temperature vs coffee sales: negative association (as temperature rises, coffee sales fall).
- Mood data visuals:
- happy vs pos: positive association (high life happiness tends to coincide with higher positive mood).
- n (neuroticism) vs happy: negative association (higher life satisfaction tends to relate to lower neuroticism).
- n vs pos: downward tendency (more neuroticism tends to be associated with lower positive mood, though with more variance).
- Important caveat: scatter plots suggest patterns but do not establish significance or causation.
Pearson correlation: purpose and interpretation
- Purpose: statistically test for a linear relationship between two continuous variables.
- Notation and range:
- Correlation coefficient r ∈ [-1, 1].
- Positive r indicates a positive relationship; negative r indicates a negative relationship.
- Larger |r| indicates a stronger relationship.
- Mathematical expression (conceptual):
r = \frac{\mathrm{cov}(X,Y)}{\sigmaX \sigmaY} - Significance: report the p-value to indicate whether the observed correlation is statistically significant (often p < 0.05 is used as a threshold).
- Two-tailed test: the test assesses the possibility of both positive and negative associations.
- Reading the JASP output: a two-by-two matrix with the reported r value, the sample size n, and the p-value for the pair of variables.
- Example from MOOD data:
- Life satisfaction (happy) and neuroticism (n): r = -0.413 \quad p < 0.05
- Interpretation: a significant negative relationship; higher life satisfaction tends to be associated with lower neuroticism; n indicates the sample size used for this correlation.
- Effect size interpretation (Cohen’s guidelines, context-dependent):
- Roughly, small ~ |r| ≈ 0.10, medium ~ |r| ≈ 0.30, large ~ |r| ≈ 0.50.
- The example r = -0.413 is described as a medium effect size in the session.
- Practical takeaway: correlations give you an effect size (r) and significance (p); do not rely on p-values alone; consider the magnitude of the effect and theoretical justification.
Spurious correlations: why significance isn’t enough
- It’s possible to obtain statistically significant correlations that are meaningless if there is no theoretical basis.
- Example described:
- Per capita mozzarella cheese consumption and number of lawyers in Hawaii showed a very high correlation (r ≈ 0.97) with no theoretical link.
- Takeaway: avoid “fishing” for significant correlations; rely on theory or prior data to justify the expected relationship.
- In psychology: test hypotheses grounded in theory or prior literature (e.g., personality and self-determination theory) to avoid spurious findings.
Reliability and Cronbach Alpha (SPLIT dataset)
- Reliability in this context means internal consistency: do items that are supposed to measure the same construct yield similarly shaped responses?
- Internal consistency is about the measurement instrument, not about the participants.
- Why reliability matters:
- Many scales are multi-item; we want those items to tap into the same underlying construct (e.g., extraversion) so that aggregation makes sense.
- If some items diverge, it may indicate multiple constructs are being measured or items don’t fit the construct.
- SPLIT data structure:
- Data include items related to different splitting domains: self (s), family (f), others (o).
- Example items show how items map to subscales (e.g., family-splitting items 2 and 4). Sorting helps identify which items belong to which subscale.
- Cronbach Alpha (concept): an index of internal consistency across items in a subscale.
- How to run Cronbach Alpha in JASP (SPLIT file):
- Open the SPLIT file; switch to Reliability (blue top bar).
- Choose classical reliability (Cronbach Alpha).
- Alphabetize variables (A to Z) to find subscale items easily.
- Select items belonging to the subscale (e.g., all family-splitting items starting with 'f').
- Ensure Cronbach Alpha is selected (not McDonald’s omega or other options).
- Example result:
- Family-splitting subscale Cronbach Alpha ≈ 0.83.
- Interpreting Cronbach Alpha (George & Mallory guidelines, used in the session):
- > 0.9: excellent
- 0.8 – 0.9: good
- < 0.5: unacceptable
- (Note: other categories exist in literature; these guidelines are simplified for teaching purposes.)
- Interpretation for the SPLIT example:
- α ≈ 0.83 for family-splitting items → good reliability; the subscale is likely tapping into the same underlying construct of family-splitting.
- Additional reporting notes:
- Reliability is typically reported in the method section and pertains to the measures, not the participants.
How to report findings and integrate theory
- When reporting, combine descriptive stats, correlations, and reliability with a theoretical frame (e.g., literature on personality and mood).
- Typical reporting structure:
- Descriptives: means and SDs for continuous variables; gender frequencies.
- Correlations: report r, p-values, and n; interpret direction and magnitude; discuss significance in context.
- Reliability: report Cronbach Alpha for subscales; interpret using guidelines.
- Emphasize that correlations are evidence for relationships but do not imply causation; use theory to justify expected directions.
- Learn to distinguish between aggregated mood scores (MOOD) and item-level data (SPLIT) when performing analyses.
Practical tips for the lab session
- Practice by pausing the video after performing each step in JASP to match the instructor’s actions.
- Write down specific problems and questions to bring to tutors in the next lab.
- Keep in mind the difference between descriptive statistics, visualization, correlation, and reliability analyses; each serves a different purpose.
Quick recap of key concepts and terms
- Descriptives: mean, standard deviation; reporting for continuous variables; frequencies for categorical variables.
- Scatter plots: visualization of relationships; not a substitute for statistical tests.
- Pearson correlation (r): strength and direction of a linear relationship; range [-1, 1].
- p-value: probability of observing the data (or more extreme) under the null hypothesis; two-tailed test used in this session.
- Cronbach Alpha (α): measure of internal consistency reliability for a set of items; higher values indicate more coherent measurement of a single construct.
- Internal consistency vs construct validity: reliability is about measurement consistency, while validity concerns whether the measure captures the intended construct.
Final notes
- The pre-lab videos are designed to prepare you for the lab by outlining how to use JASP and how to interpret outputs.
- If you had trouble, identify concrete questions and discuss them with tutors or in the next lab.
References to the workflow mentioned in the video (for quick recall)
- Load MOOD data and run Descriptives; add scatter plots; interpret means and SDs.
- Run Pearson correlation for selected pairs (e.g., happy-n; happy-n, etc.); report r, p, and n; classify effect size.
- Load SPLIT data; sort items by subscale; run Cronbach Alpha for a chosen subscale; interpret α against guidelines.
Notation and formulas used
- Pearson correlation coefficient:
r = \frac{\mathrm{cov}(X,Y)}{\sigmaX \sigmaY} - Cronbach Alpha (conceptual formula):
\alpha = \frac{N \bar{c}}{\bar{v} + (N-1) \bar{c}} - Relationship interpretation: from the MOOD example, the reported correlation was
r = -0.413, \quad p < 0.05,
indicating a significant negative relationship with a medium effect size per Cohen’s guidelines.
- Pearson correlation coefficient: