Module 1 – Introduction to Statistics (Psychological Statistics)
Module Context
- Module 1: Introduction to Statistics (Psychological Statistics / PSYSTAT)
- Delivery mode: Synchronous; 5 hrs-week, scheduled for one week.
- Instructor: Mrs. Gina T. Montalla – College of Arts & Sciences, San Mateo Municipal College.
- Timeline: Initiated July 14 2025, completed July 19 2025.
Learning Objectives
- Enumerate the four philosophical/scientific methods of studying truth.
- Define & contrast key pairs:
• Observational vs Experimental research
• Descriptive vs Inferential statistics - Define foundational terms: population, sample, variable, statistic, parameter.
- Explain the importance of random sampling.
- Differentiate the four measurement scales: nominal, ordinal, interval, ratio.
- Differentiate & exemplify continuous vs discrete variables.
Statistics, Science & Observation
- Psychological statistics = application of mathematical formulas, theorems, numbers & laws to psychological data.
- Provides tools for modelling the science of mind & behavior; aids in discovering what is typical or normal for a group.
- Visual displays (graphs, pie charts, frequency distributions, scatterplots) help spot patterns otherwise missed.
- General purposes:
• Organize & summarize information so researchers “see what happened.”
• Provide a basis for justified conclusions.
Core Definitions
- Statistics (discipline): body of methods for collecting, organising, presenting, analysing & interpreting data.
- Statistics (data): facts/figures such as average income, crime rate, birth rate, etc.
- Four functional stages:
• Collection – gathering information.
• Organisation/Presentation – textual, tabular, graphical summarising.
• Analysis – application of statistical techniques.
• Interpretation – drawing conclusions.
Population & Sample
- Population (N): totality of elements/individuals of interest.
- Sample (n): subset chosen to represent the population.
- Relationship: sample is studied ➔ inference about population.
Parameter vs Statistic
- Parameter: numerical/nominal characteristic of a population (usually unknown, fixed). Example: population mean \mu.
- Statistic: numerical characteristic computed from a sample; estimator of the parameter. Example: sample mean \bar{x}.
Variables & Data
- Variable: attribute with varying values across individuals (age, gender, temperature).
- Data/Datum: measurements or observations on variables.
Variable Classifications
- Discrete: separate, indivisible categories; no values between. Ex: number of students, gender.
- Continuous: infinite possible values between any two points; divisible into fractional parts. Ex: height, weight, time.
Data Classifications
- Metric (Measurement) Data – obtained via measuring; yields numbers on a scale (height, scores).
- Enumeration (Count) Data – obtained by counting; yields whole numbers (number of households).
- Categorical (Qualitative) Data – grouped by categories/qualities (attitudes, SES).
Scales of Measurement
- Nominal
• Labels only; mutually exclusive categories; no quantitative value.
• Examples: religion, civil status, nationality. - Ordinal
• Rank order; intervals undefined.
• Examples: class standing, clothing sizes (S/M/L), Likert-type positions if treated strictly. - Interval
• Ordered, equal intervals, arbitrary zero.
• Can speak of how much more but not meaningful ratios.
• Examples: IQ scores, °C/°F temperature, standardised test scores. - Ratio
• Ordered, equal intervals, absolute (true) zero; allows ratios (“twice as heavy”).
• Examples: age, height, weight, reaction time.
Choosing a scale:
- Fit must match variable nature & desired precision.
- When several scales fit, prefer the highest level – affords richer info & more powerful statistical tests.
Descriptive vs Inferential Statistics
- Descriptive: techniques that summarise/organise raw scores (tables, graphs, mean, SD).
- Inferential: techniques that use sample data to make generalisations/decisions about populations (hypothesis tests, confidence intervals).
Error Concepts
- Sampling Error: natural, random discrepancy between a sample statistic and its corresponding population parameter; size unknown, bounded by margin of error in random sampling.
- Non-Sampling Error: errors from data-collection process other than sampling itself (selection bias, questionnaire wording, recording mistakes, non-response). Potentially introduces systematic bias.
Sampling & Sample-Selection Techniques
- Sampling = process of selecting a sample.
- Sampling Technique = specific procedure for choosing sample members.
Probability Sampling (each element has known, non-zero chance)
- Simple Random Sampling (SRS)
- Every element has equal chance; requires complete population list.
- Tools: lotteries, random-digit tables, RNG.
- Systematic Sampling
- Select every k^{th} element where k = \frac{N}{n}; start point random.
- Stratified Sampling
- Divide population into homogeneous strata; perform SRS within each.
- Allocation options: Equal vs Proportional.
- Cluster Sampling
- Population split into heterogeneous clusters (often geographically); randomly select clusters & study all elements within chosen clusters.
- Multi-Stage Sampling
- Combine stages (e.g., pick clusters, then households within clusters) until final units obtained.
Uses/Advantages of Probability Sampling
- Minimises selection bias; supports valid generalisation & error estimation.
- Ensures diversity & representativeness, especially in large/heterogeneous populations.
Non-Probability Sampling (chance of selection not equal/known)
- Convenience Sampling – sample units easiest to access (mall intercepts).
- Purposive/Judgemental Sampling – researcher selects units possessing specific characteristics (e.g., master’s-degree aspirants).
- Snowball Sampling – existing subjects recruit future subjects; useful for hidden or hard-to-reach populations (e.g., undocumented migrants, HIV patients).
- Quota Sampling – researcher sets target quotas for categories & samples until quotas filled (mimics stratification but without random selection).
Uses/Advantages of Non-Probability Sampling
- Exploratory or pilot work where speed, cost, or limited frames matter.
- Hypothesis generation when no prior info exists.
- Acceptable under tight budget/time or where random access impossible.
Limitations
- Potentially large bias; weak generalisability; absence of calculable sampling error.
Slovin’s Formula & Sample-Size Determination
- Formula: n = \frac{N}{1 + N\alpha^{2}} where
• n = required sample size
• N = population size
• \alpha = permissible error (level of significance) - Illustrative problem (given): N = 5000, \; \alpha = 0.05\; \Rightarrow\; n = 370.
- Same method applied for varying populations (2 000; 10 000; 8 000) and different error tolerances (10 %, 7.5 %, 5 %, 2.5 %, 1 %).
- Smallest possible sample is that which still yields acceptable error.
Learning Activities (Google Form)
- Classify data as discrete/continuous.
- Classify data as metric/enumeration/categorical.
- Identify scales of measurement (nominal, ordinal, interval, ratio).
- Decide if situations require descriptive or inferential procedures.
- Compute sample sizes using Slovin’s formula for various scenarios.
Assignment Preview – Presentation of Data
- Define & differentiate frequency distribution tables:
• Simple Frequency
• Complete Frequency
• Relative Frequency
• Cumulative Frequency
• Cumulative Percentage - Illustrate graphs:
• Bar Chart
• Histogram
• Frequency Polygon
• Pie Chart
• Ogive (cumulative frequency curve)
Ethical, Practical & Philosophical Notes
- Random selection underpins unbiased inference; ethical obligation to represent population faithfully.
- Non-sampling errors (question wording, low response) can invalidate findings despite perfect randomisation.
- Choice of measurement scale affects both ethical clarity (do labels mislead?) and analytic power.
Connections & Real-World Relevance
- Psychological researchers rely on stats to interpret test scores, treatment effects, or prevalence of behaviors.
- Sampling methods replicate in market research, public-health studies, political polling.
- Data-visualisation skills transfer to reporting insights for stakeholders.
References & Further Reading
- Ferguson & Takane (1989) Statistical Analysis in Psychology and Education – foundational text on educational stats.
- Gravetter & Wallnau (2012) Statistics for the Behavioral Sciences – widely used psychology stats guide.
- Montero–Galliguez et al. (2016) Fundamentals of Statistical Analysis – local Philippine perspective.
- Online: video resources on Prim’s algorithm, cryptocurrency, banking loans, stock-vs-bond differences.