Module 2: Sampling and Introduction to Bivariate Regression

0.0(0)
studied byStudied by 1 person
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/52

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

53 Terms

1
New cards

Samples provides estimates of

Population parameters

2
New cards

Whilst samples can be used to make inferences about the population of interest, they can also give…

  • Biased estimations

  • Different estimates

3
New cards

Bias

  • Refers to accuracy

  • Aggregated means from samples should equal the population mean

  • Suggests that the sampling approach tends to zero in on the true underlying parameters

  • Important for external validity

Think: In a perfect world, if we measured the mean reading abilities of year 6 classes in a primary school, the school mean should be equal to the population mean of Australian year 6 students.

4
New cards

Bias is a function of _________.

Sampling

  • Ensure to get a sample that best represents the population of interest

5
New cards

Precision

  • Refers to spread of scores (i.e how much they deviate from the mean)

  • Measured by standard deviation

  • High precision means that on average, estimates from each sampling occasion are not far from the mean.

6
New cards

High standard deviation score suggests…

Low precision

7
New cards

Low standard deviation score suggests…

High precision

8
New cards

Precision is a function of…

Sample size

  • The larger the sample size, the more precise the estimate

9
New cards

Describe the concepts of precision and bias through a target practice analogy. What do the targets look like as precision and bias vary?

knowt flashcard image
10
New cards

Bivariate Regression

  • Looks at the relationship between one predictor variable and one outcome variable

    • Think: CorrelationPRO

  • When conducted, a correlation statistic is obtained along with additional information inc. but not limited to

    • the slope

    • the intercept

11
New cards

Equation for a straight line

Y = a + bX

a = the intercept

b = the gradient

12
New cards

Formula for Line of Best Fit (Regression Line)

Y’ = predicted value of Y

b0= INTERCEPT of the regression line (predicted Y value when x = 0)

b1 = SLOPE of regression line

<p>Y’ = predicted value of Y</p><p><em>b</em><sub>0</sub>= INTERCEPT of the regression line (predicted Y value when x = 0)</p><p>b<sub>1</sub> = SLOPE of regression line</p>
13
New cards
<p>Y’</p>

Y’

Predicted value of Y

14
New cards

b0

INTERCEPT of the regression line (predicted Y value when x = 0)

15
New cards

b1

SLOPE of regression line

Think: Are you thinking what I’m thinking B1? = slippery slope to anarchy

16
New cards

Formula for the gradient

b = (difference between y scores) / (difference between x scores)

<p><em>b </em>= (difference between y scores) / (difference between x scores)</p>
17
New cards

Bivariate Regression: The Slope

The rate at which Y changes with each 1-unit change in X

  • Denoted as b1

18
New cards

Bivariate Regression: The Intercept

Where the line cross the y axis

  • represented as a

19
New cards

Why is the straight line equation modified in bivariate regressions?

To include a systematic and random component

  • Called ‘line of best fit’

20
New cards

Systematic Component of line of best fit in bivariate regressions

Shows relationship with predictor variable

21
New cards

Random component of line of best fit in bivariate regressions

Unrelated to predictor

22
New cards

Where do you find the b0 and b1 values in SPSS output?

knowt flashcard image
23
New cards

What value in SPSS output from a bivariate regression is the same as the correlation value?

Beta coefficient in the coefficient table

  • Located next to the b0 and b1 values

24
New cards

Syntax for Bivariate Regression in SPSS

regression var = variable1 variable2
/dep = responsevar
/enter = predictorvar.
25
New cards

How can you use the slope and intercept values from SPSS to predict scores?

  1. Find your slope and intercept

  2. Substitute them into the Y’ = b0 + b1X1 equation

  3. Put your x value (predictor value) into X1

  4. Y’ will be your predicted score, which is a plotted point on the line of best fit

<ol><li><p>Find your slope and intercept</p></li><li><p>Substitute them into the Y’ = b0 + b1X1 equation</p></li><li><p>Put your x value (predictor value) into X1</p></li><li><p>Y’ will be your predicted score, which is a plotted point on the line of best fit</p></li></ol>
26
New cards

The basic purpose of regression is…

Prediction

Think: A regression is basically a correlation, but if that’s the case, why don’t we just use correlation? Regressions give us the correlation score and a bit more, like the intercept and slope. We can plot a line of best fit on a regression, which gives predictive qualities where there are no data points (not infallible though)

27
New cards

When can errors in prediction arise?

When the residual is not ± 1

r ≠ ± 1

28
New cards

Residual

The difference between the actual score and predicted score

  • AKA error

  • Predicted scores are susceptible to error when the residual is more than ± 1

<p>The difference between the actual score and predicted score</p><ul><li><p>AKA error</p></li><li><p>Predicted scores are susceptible to error when the residual is more than ± 1</p></li></ul>
29
New cards

When we are sampling, we are selecting a ______ of a population.

Subset

30
New cards

Using different sampling plans or approaches will ensure….

Accuracy to a certain level in the long run if it is representative.

31
New cards

Define and explain universalistic sampling approaches*

Testing specific theories or hypotheses about the relationships between variables

  • Research questions are:

    • How does this work?

    • What is the mechanism?

Think: The ones we usually do where we are trying to find causation

32
New cards

Define and explain particularistic sampling approaches*

  • Primary goal is statistical estimation

  • Try to capture the range of the underlying parameters

  • Descriptive hypotheses

  • Research questions include:

    • How many people think like this?

    • How many people have this disorder?

    • How many people would benefit from this treatment?

Think: Trying to find information about the variable, not the relationship between them.

33
New cards

Probability Sampling

Based on random sampling approaches such as:

  • Simple Random Sampling

  • Stratified sampling

  • Cluster sampling

34
New cards

Simple Random Sampling (SRS)

  • Every member of the population has an equal and known chance of being selected

  • Gold standard but requires very large samples

  • Can be expensive and not always feasible

Think: Dury Duty is serious (SRS)

Every adult in the population sample frame) can be selected for dury duty and they know this

35
New cards

Stratified Sampling

The population is divided into subpopulations in some meaningful way to ensure all populations are sampled appropriately

  • Strata (subgroups) are created based on particular characteristics

    • Age, gender, profession, etc

  • You should sample proportionally to the population

    • I.e. If 70% of psych students are female and that is your population of interest, in a sample of 100, 70 should be female.

36
New cards

Cluster Sampling

Uses subgroups in the population, but samples the entire subgroup rather than basing it on subgroup proportions

Think: If you were interested in Year 9 attitudes towards university, you would sample the subgroup of ‘year 9 students’ from a range of schools.

37
New cards

Advantages of Probability Sampling

  • We have a known or estimated probability of inclusion for each sample element which can be factored into final sample

  • Lower risk of sampling bias

  • Greater external validity

38
New cards

Disdvantages of Probability Sampling

  • Expensive

  • Often not feasible

39
New cards

Non-probability sampling approaches*

Uses non-random approaches and therefore not every person in the population of interest has an equal and known chance of being included.

Includes:

  • Convenience sampling

  • Purposive sampling

  • Snowball sampling

  • Quota sampling

40
New cards

Convenience Sampling

Involves recruiting from a sample of participants that the researcher has access to

  • Recruiting whoever we can get

  • Most widely used type of sampling in psychology

  • Unknown bias can be controlled by increasing precision through sample size

41
New cards

Purposive Sampling

Judgement sampling or known groups sampling

  • Researcher makes a judgement based on their expertise to select the best sample

42
New cards

Snowball Sampling

Involves having participants recruit other participants with an increasing number of participants

Think: Pyramid scheme

43
New cards

Quota Sampling

Effectively the same as stratified sampling but using a non-random approach

Think: If 10% of the population in your community were teachers, so you surveyed a bunch of teachers outside of a teacher’s convention. You’d meet your 10% quota but because they travelled to the place, they would not represent the views of teachers in that community.

44
New cards

Advantages of Non-probability samples

  • Cheaper

  • Easier to access

  • Larger samples affords greater precision

45
New cards

Disadvantages of Non-probability samples

  • Probability of inclusion is unknown and cannot be calculated

  • High risk of unknown bias

  • Bias results in abiguity of results

  • limits external validity (generalisability)

46
New cards

Define a sampling frame*

A list of people from a population of interest from which we can draw our sample

  • List should include the entire population and only those in the population

    • Key to probability sampling

  • Serves as a conduit to the population

Think: Phone book, class list, electoral role

47
New cards

Dummy Variables

Categorical variables that have values of 0 and 1

48
New cards

Dummy Variable Conventions

  • Largest category is assigned as value ‘1’

  • Name the dummy variable after the category assigned variable ‘1’

49
New cards

Cronbach’s Alpha

A measure of internal consistency/reliability.

  • a = >0.70 → adequate reliability

  • a = >0.80 → good reliability

  • a = >0.90 → excellent reliability

0.7 is the benchmark! If it’s less; forget it

50
New cards

3 Strategies for making coding easier in a questionnaire

  1. Put answer blocks in left margin

  2. Have respondents put their answers on a separate answer sheet

  3. Have Ps put their responses on a coding sheet that can be marked electronically

51
New cards

In what sampling method do people have an equal chance of being selected?

Random sampling

52
New cards

When determining the required sample size for random sampling methods, what happens to the sample size as the confidence level increases (error margin decreases)?

The required sample size increases

<p>The required sample size increases</p>
53
New cards

Rank the following sampling techniques in order of most representative samples to least

  • Quota sampling

  • Proportionate stratified random sampling

  • Random sampling

  • Convenience sampling

  1. Proportionate stratified random sampling

  2. Random sampling

  3. Quota sampling

  4. Convenience sampling