GCSE Statistics

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/84

flashcard set

Earn XP

Description and Tags

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

85 Terms

1
New cards

Population

A collection of all the items

2
New cards

Sample

A selection of the population to use data from

3
New cards

Census

When data is taken from every member in the population

4
New cards

Advantages of a census over a sample

More representative, less biased, includes everyone's opinions

5
New cards

Advantages of a sample over a census

Quicker, cheaper, easier to analyse as less data

6
New cards

Disadvantages of a census over a sample

Time consuming, expensive, difficult to do

7
New cards

Disadvantages of a sample over a census

Less representative, possibly biased

8
New cards

Pilot Study

A small scale replica of the survey to be carried out.

9
New cards

Advantages of a pilot study

Ensures questions can be understood, identify ambiguity, test response rate, identifies likely responses, check methods

10
New cards

Sampling Frame

A list containing data that a sample can be taken from

11
New cards

Examples of a sampling frame

Electoral role, SIMS register, DVLA, telephone directory

12
New cards

Primary Data

Data that has been collected by the person doing the survey

13
New cards

Secondary data

Data that hasn't been collected by the person doing the survey

14
New cards

Advantages of primary data

More reliable, up-to-date, tailored for investigation

15
New cards

Advantages of secondary data

Easier to obtain, cheaper, less time-consuming

16
New cards

Continuous Data

Data that lies on a continuous scale (can be at any point on a number line)

17
New cards

Discrete Data

Data that consists of separate numbers (jumps along the number line)

18
New cards

Quantitative Data

Data that has numerical values

19
New cards

Qualitative Data

Data that is not numerical values

20
New cards

Open Questions

Has no suggested answers and has freeform boxes to reply in

21
New cards

Advantages of open questions

Allows for a range of responses, so can cover all eventualities

22
New cards

Closed Questions

Has a set of answers for the person to choose from

23
New cards

Advantages of closed questions

Easier to analyse as range of responses restricted

24
New cards

Leading Questions

Questions that infer an opinion and promote a certain answer

25
New cards

Convenience Sample

The first so many pieces of data in the list are sampled

26
New cards

Advantages of a convenience sample

Quick and easy

27
New cards

Disadvantages of a convenience sample

Unlikely to be representative

28
New cards

Random Sample

Each person has an equally likely chance to be picked

29
New cards

How to take a random sample

(a) Number everyone in list
(b) Use a random number generator to select numbers
(c) Select the data points corresponding to the numbers picked
(d) If you get a number outside the range or the same number twice you repeat, if you get a decimal round to the nearest number.

30
New cards

Advantages of a random sample

Easy to do

31
New cards

Disadvantages of a random sample

May not be representative

32
New cards

Systematic Sample

Data is chosen at regular intervals (e.g. every 10th person)

33
New cards

How to take a systematic sample

Order population and divide population by sample size to find how often data chosen. Then choose random number to decide where in this interval to start.

34
New cards

Advantages of a systematic sample

Useful for production line - will spot problems over time

35
New cards

Disadvantages of a systematic sample

May not be representative

36
New cards

Quota Sample

The same amount of people from different chosen groups are sampled

37
New cards

How to take a quota sample

Decide on a quota size for each group. Then take a random sample, ignoring any results from a group where the quota has been reached.

38
New cards

Advantages of a quota sample

Makes sure all quota groups are represented, easy to take

39
New cards

Disadvantages of a quota sample

Not likely to be representative, may be difficult to reach quota if numbers limited

40
New cards

Cluster Sample

The population is divided into groups and a group is chosen at random.

41
New cards

Advantages of a cluster sample

Easy to do

42
New cards

Disadvantages of a cluster sample

Unlikely to be representative

43
New cards

Stratified Sample

Where the data sampled in each group is proportional to that of the whole population

44
New cards

How to take a stratified sample

Multiply the fraction of each group in the whole population by the total sample size to decide on the size of the sample in each strata. Then take a random sample.

45
New cards

Advantages of a stratified sample

Representative

46
New cards

Disadvantages of a stratified sample

Harder to collect, more expensive

47
New cards

Features of a good question

Unambiguous, closed, non-overlapping answer boxes, unbiased/not leading, not offensive or personal, easy to analyse

48
New cards

Positive correlation

As one variable increases, so does the other

49
New cards

Negative correlation

As one variable increases, the other decreases

50
New cards

Response variable

The variable being measured or studied

51
New cards

What values does the SRCC lie between and what do they mean?

-1 and 1.
1 = Perfect positive correlation
0 = No correlation
-1 = Perfect negative correlation

52
New cards

What does the symbol x with a line above it mean?

The mean average value of x

53
New cards

How is frequency represented on histograms?

By area

54
New cards

What do we call the height on a histogram?

The frequency density

55
New cards

What does the capital sigma (that looks like an 'E') symbol mean?

Sum

56
New cards

How is the IQR calculated?

UQ - LQ

57
New cards

How much of the data is contained within each quartile?

25%

58
New cards

How do define the median?

The middle value in a dataset

59
New cards

How would we compare two distributions given their median and IQR?

Higher median = higher result on average
Higher IQR = less consistent on average

60
New cards

How would I define outliers?

Low outliers < LQ - 1.5 x IQR
High outliers > UQ + 1.5 x IQR

61
New cards

What does a positive skew look like?

The median is closer to the LQ than the UQ.

62
New cards

What does a negative skew look like?

The median is closer to the UQ than the LQ.

63
New cards

If a line of best fit is given by y = ax + b, what does 'a' mean?

For every unit the 'x' variable increases, the 'y' variable increases by 'a'.

64
New cards

What does a normal distribution look like?

Symmetrical about mean, bell-shaped curve

65
New cards

How much of the data is within 2 s.d. of the mean for a normal distribution?

95%

66
New cards

How much of the data is within 3 s.d. of the mean for a normal distribution?

99.8%

67
New cards

What conditions need to be met for a binomial distribution?

Two outcomes (success or failure), fixed number of independent trials, fixed probability of success

68
New cards

What is a discrete uniform distribution and what would it's graph look like?

The same probability for all events. The graph is a bar chart with each possible outcome going to the same height (the probability of it happening).

69
New cards

How could one compare values from different sets of data?

Use a standardised score.

70
New cards

How is a standardised score calculated?

(score-mean)/s.d.

71
New cards

What is an index number?

A number that shows the rate of change in quantity, value or price of an item over a period of time.

72
New cards

How is an index number calculated?

100*[(quantity in given year)/(quantity in base year)]

73
New cards

What is a chain base index number?

The annual percentage change in quantity, value or price of an item. It is found by using the previous year as the base year.

74
New cards

What is a trend line?

A line of best fit through moving averages

75
New cards

How would you describe a trend line?

As increased or decreasing, not as positive or negative

76
New cards

What is the average seasonal variation?

The mean average difference between the trend line and actual value for a given season

77
New cards

How can one predict values using a trend line?

Read the value from the trend line for the season wanted and add/subtract the average seasonal variation

78
New cards

Why might one not want to predict a value from a scatter graph or trend line?

If the correlation is not strong enough or if the prediction lies outside the range of data (extrapolation)

79
New cards

Mutually Exclusive

Two events that cannot happen at the same time.

80
New cards

Independent events

Two events that have no impact on one another (one happening doesn't affect the probability of the other)

81
New cards

Exhaustive

A set of events that covers all possibilities

82
New cards

What do the probabilities of mutually exclusive exhaustive events sum to?

1

83
New cards

When can we add probabilities?

When they are mutually exclusive.

84
New cards

When can we multiply probabilities?

When they are independent.

85
New cards

What might we use to find probabilities of two events following one another?

A tree diagram.