Lecture 1 - Introduction to Statistics (WJC)
Lecture 1: Introduction to Statistics for the Behavioural Sciences
Overview of the Lecture
Presented by David Butler
Key areas of focus include definitions, types, and applications of statistics in behavioural sciences.
Objectives
Definition of Statistics: Understanding the discipline that handles data collection, organization, analysis, and interpretation.
Data & Variables: Learn the differences between various types of data and the variables that represent them.
Types of Variables: Explore qualitative vs. quantitative variables.
Measurement Levels: Different levels at which variables can be measured and their implications.
Population vs. Sample: Understand the concepts of population and sample in research.
Types of Statistics: Differentiate between descriptive and inferential statistics.
Sampling Techniques: Introduction to different methods used to collect samples for analysis.
Statistics: Definition and Purpose
Statistics: The study of collecting, organizing, analyzing, and interpreting numerical data.
Emphasizes how to systematically handle data to draw meaningful conclusions.
Individuals and Variables
Individuals
Definition: Individuals refer to the entities (people or objects) included in a study.
Also known as data units, these are the subjects from which data is collected.
Variables
Definition: Characteristics of individuals being measured or observed.
Referred to as data items, they can vary among subjects and are crucial for analysis.
Sources of Data
Primary Sources
Collected firsthand by the researcher.
Methods include:
Questionnaires
Surveys
Tests & Exams
Interviews
Observation
Secondary Sources
Utilization of existing data obtained from:
Newspapers
Previous research studies
Census data
Academic journals
Types of Variables
Quantitative Variables
Definition: Numerical; meaningful mathematical functions can be performed.
Transformations: All data points can undergo the same unit change.
Qualitative Variables
Definition: Categorical; mathematical operations do not yield meaningful results.
Transformations: Must preserve uniqueness (nominal) or rank (ordinal).
Levels of Measurement (LOM)
Categorical Variables
Nominal: Non-rankable categories (e.g., names, religion, gender).
Ordinal: Rankable categories without numerical differences (e.g., grades, drink sizes).
Numerical Variables
Interval: Rankable, but with no true zero (e.g., temperature scales).
Ratio: Rankable with a true zero point (e.g., weight, height).
Types of Data
Discrete Data
Countable values (e.g., number of cars).
Continuous Data
Uncountable values; can include fractions (e.g., weight, time).
Target Groups
Population
All individuals within the study’s target group.
Sample
A subset of the population selected for the research.
Types of Statistics
Descriptive Statistics
Techniques to organize and summarize data (e.g., averages, tables).
Describe characteristics of the sample or population.
Inferential Statistics
Techniques that allow inferences about a population based on a sample.
Involve making generalized conclusions.
Units of Analysis
Individual Units
Most common in social research (e.g., surveying individual student opinions).
Group Units
Focuses on group behavior rather than individual entities (e.g., gangs in different environments).
Organizational Units
Evaluation of behavior across organizations (e.g., school performance).
Sampling Concepts
Population vs. Sample
Population: Entire group of interest.
Sample: Subset selected for practical research constraints.
Accessible Population
Elements of the population available for selection as a sample.
Sampling Frame
A comprehensive list of all elements to choose from in a population.
Sampling Strategies
Representative Samples
Samples should ideally represent broader populations.
Law of Large Numbers: Larger sample sizes yield results closer to actual population behaviors.
Sampling Methods
Sampling with Replacement
Samples drawn are returned; probabilities remain the same.
Sampling without Replacement
Drawn samples are not returned; affects remaining selection probabilities.
Probability Sampling
Characteristics
Each element has a defined chance of selection.
Involves randomness, reducing biases.
Methods of Probability Sampling
Random Sampling
Every population element has an equal chance of selection.
Best method for minimizing bias.
Systematic Random Sampling
Involves a defined starting point and a set interval for selection.
Stratified Random Sampling
Divides population into subgroups; samples are selected from each group.
Cluster Sampling
Involves selecting entire clusters or groups; useful for large populations.
Non-Probability Sampling
General Characteristics
No defined probability for selection; more practical with no sampling frame available.
Methods of Non-Probability Sampling
Quota Sampling
Participants chosen based on representation rates within the population.
Snowball Sampling
Current participants help identify future subjects, useful for hidden populations.
Convenience Sampling
Selecting based on ease of access at a specific time and place.
Purposive Sampling
Researcher uses judgment to select representative units based on variation within the population.