Bivariate Data

0.0(0)
studied byStudied by 1 person
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/33

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

34 Terms

1
New cards

The Statistical Investigation Process

Identify a problem, Pose a statistical question, Collect or obtain data, Analyse Data, Communicate Results

2
New cards

Univariate Data

Data which involves one variable

3
New cards

Bivariate Data

Data which involves two variables. Bivariate data analysis looks at whether there is a relationship between two variables.

4
New cards

Association

A general term used to describe the relationship between two (or more) variables.

5
New cards

Correlation

Used interchangeably with the term association. Correlation tends to be used when referring to the strength of a linear relationship between two numerical variables.

6
New cards

Explanatory Variable (EV)

Variable used to explain or predict a difference in the RV.

7
New cards

Response Variable (RV)

What happens in response to the EV.

8
New cards

Example of EV and RV

When investigating the relationship between the temperature of a loaf of bread and the time it has spent in the oven, the temperature is the response variable and time is the explanatory variable. 

9
New cards

Displaying Categorical Bivariate Data

Two-way table, Side by side column graph, Segmented column graph

10
New cards

Displaying Numerical Bivariate Data

Scattergraphs

11
New cards

Does EV go on x axis or y axis?

x axis

12
New cards

Does RV go on x axis or y axis

y axis

13
New cards

When do you use a column percentage table?

If the explanatory variable is at the top of the frequency table.

14
New cards

When do you use a row percentage table?

If the explanatory variable is on the side of the frequency table.

15
New cards

Words to use when commenting on trend

In general, tend to, seems to be

16
New cards

Rule for line of best fit

Line that joins as many points as possible but leaves the same amount of unconnected points on either side of the line.

17
New cards

Reliability of prediction can be affected by…

Form of scattergraph, number of points on scattergraph, how closely the points form a straight line, predicting between the given EV values or beyond given EV values.

18
New cards

Interpolation

Predicting between given values (Reliable)

19
New cards

Extrapolation

Predicting beyond given values (Unreliable)

20
New cards

When interpreting a graph comment on

Form, direction, strength of association

21
New cards

Form

Linear, non-linear, no relationship/random

22
New cards

Direction (if linear)

Positive, negative

23
New cards

Positive direction

As the explanatory variable increases, the response variable increases.

24
New cards

Negative direction

As the explanatory variable increases, the response variable decreases.

25
New cards

Strength

Strong relationship, moderate relationship, weak relationship

26
New cards

Equation of least squares regression line

y hat = ax + b

27
New cards

Interpreting the gradient (a)

The RV increases/decreases by a units for every one unit increase in the EV

28
New cards

Interpreting y intercept (b)

The RV is b units when the EV is zero (not always relevant)

29
New cards

Weak range Pearson’s correlation coefficient

0-0.35

30
New cards

Moderate range Pearson’s correlation coefficient

0.35-0.75

31
New cards

Strong range Pearson’s correlation coefficient

0.75-1

32
New cards

Commenting on r

r = xxxx which suggest strong/moderate/weak positive/negative correlation. As the EV increases/decreases, the RV increases/decreases.

33
New cards

Commenting on causation

Although there is a strong/moderate/weak positive/negative correlation between the variables, a correlation does not imply causation. There might be a coincidence or a lurking variable such as…

34
New cards

Commenting on coefficient of determination

xx% of the variation in the RV can be explained by the variation in the EV.