Lecture 1 - Introduction to Statistics (WJC)

Lecture 1: Introduction to Statistics for the Behavioural Sciences

Overview of the Lecture

Presented by David Butler
Key areas of focus include definitions, types, and applications of statistics in behavioural sciences.

Objectives

Definition of Statistics: Understanding the discipline that handles data collection, organization, analysis, and interpretation.
Data & Variables: Learn the differences between various types of data and the variables that represent them.
Types of Variables: Explore qualitative vs. quantitative variables.
Measurement Levels: Different levels at which variables can be measured and their implications.
Population vs. Sample: Understand the concepts of population and sample in research.
Types of Statistics: Differentiate between descriptive and inferential statistics.
Sampling Techniques: Introduction to different methods used to collect samples for analysis.

Statistics: Definition and Purpose

Statistics: The study of collecting, organizing, analyzing, and interpreting numerical data.
Emphasizes how to systematically handle data to draw meaningful conclusions.

Individuals and Variables

Individuals

Definition: Individuals refer to the entities (people or objects) included in a study.
Also known as data units, these are the subjects from which data is collected.

Variables

Definition: Characteristics of individuals being measured or observed.
Referred to as data items, they can vary among subjects and are crucial for analysis.

Sources of Data

Primary Sources

Collected firsthand by the researcher.
Methods include:
- Questionnaires
- Surveys
- Tests & Exams
- Interviews
- Observation

Secondary Sources

Utilization of existing data obtained from:
- Newspapers
- Previous research studies
- Census data
- Academic journals

Types of Variables

Quantitative Variables

Definition: Numerical; meaningful mathematical functions can be performed.
Transformations: All data points can undergo the same unit change.

Qualitative Variables

Definition: Categorical; mathematical operations do not yield meaningful results.
Transformations: Must preserve uniqueness (nominal) or rank (ordinal).

Levels of Measurement (LOM)

Categorical Variables

Nominal: Non-rankable categories (e.g., names, religion, gender).
Ordinal: Rankable categories without numerical differences (e.g., grades, drink sizes).

Numerical Variables

Interval: Rankable, but with no true zero (e.g., temperature scales).
Ratio: Rankable with a true zero point (e.g., weight, height).

Types of Data

Discrete Data

Countable values (e.g., number of cars).

Continuous Data

Uncountable values; can include fractions (e.g., weight, time).

Target Groups

Population

All individuals within the study’s target group.

Sample

A subset of the population selected for the research.

Types of Statistics

Descriptive Statistics

Techniques to organize and summarize data (e.g., averages, tables).
Describe characteristics of the sample or population.

Inferential Statistics

Techniques that allow inferences about a population based on a sample.
Involve making generalized conclusions.

Units of Analysis

Individual Units

Most common in social research (e.g., surveying individual student opinions).

Group Units

Focuses on group behavior rather than individual entities (e.g., gangs in different environments).

Organizational Units

Evaluation of behavior across organizations (e.g., school performance).

Sampling Concepts

Population vs. Sample

Population: Entire group of interest.
Sample: Subset selected for practical research constraints.

Accessible Population

Elements of the population available for selection as a sample.

Sampling Frame

A comprehensive list of all elements to choose from in a population.

Sampling Strategies

Representative Samples

Samples should ideally represent broader populations.
Law of Large Numbers: Larger sample sizes yield results closer to actual population behaviors.

Sampling Methods

Sampling with Replacement

Samples drawn are returned; probabilities remain the same.

Sampling without Replacement

Drawn samples are not returned; affects remaining selection probabilities.

Probability Sampling

Characteristics

Each element has a defined chance of selection.
Involves randomness, reducing biases.

Methods of Probability Sampling

Random Sampling

Every population element has an equal chance of selection.
Best method for minimizing bias.

Systematic Random Sampling

Involves a defined starting point and a set interval for selection.

Stratified Random Sampling

Divides population into subgroups; samples are selected from each group.

Cluster Sampling

Involves selecting entire clusters or groups; useful for large populations.

Non-Probability Sampling

General Characteristics

No defined probability for selection; more practical with no sampling frame available.

Methods of Non-Probability Sampling

Quota Sampling

Participants chosen based on representation rates within the population.

Snowball Sampling

Current participants help identify future subjects, useful for hidden populations.

Convenience Sampling

Selecting based on ease of access at a specific time and place.

Purposive Sampling

Researcher uses judgment to select representative units based on variation within the population.