Stats Maths A-Level Year 1 Edexcel

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/167

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

168 Terms

1
New cards

Population

The whole set of terms that are of interest.

2
New cards

Census

Observes or measures every member of a population

3
New cards

Sample

A selection of observations taken from a subset of the population which is used to find information about the population as a whole

4
New cards

Census advantages

It should give a completely accurate result

5
New cards

Census disadvantages

- Time consuming and expensive

- Cannot be used when the testing process destroys the item

- Hard to process large quantity of data

6
New cards

Sample advantages

- Less time consuming and expensive than a census

- Fewer people have to respond

- Less data to process that a census

7
New cards

Sample disadvantages

- The data may not be as accurate

- The sample may not be large enough to give information about small sub-groups of the population

8
New cards

Sampling units

Individual units of a population

9
New cards

Sampling frame

Individually named or numbered to form a list called a sampling frame

10
New cards

The sample should be

representative of the population

11
New cards

Random sampling helps to remove

bias from a sample

12
New cards

Random sampling methods

- Simple random sampling

- Systematic sampling

- Stratified sampling

13
New cards

Simple Random Sampling

A simple random sample of size n is one where every sample of size n has an equal chance of being selected

14
New cards

Systematic sampling

The required elements are chosen at regular intervals from an ordered list

15
New cards

Stratified sampling

The population is divided into mutually exclusive strata (which are proportional to the population) and a random sample is taken from each.

16
New cards

Stratified sampling equation

The number sampled in a stratum = (number in stratum/number in population) x overall sample size

17
New cards

Simple random sampling advantages

- Free of bias

- Easy and cheap to implement for small populations and small samples

- Each sampling unit has a known and equal chance of selection

18
New cards

Simple random sampling disadvantages

- Not suitable when the population size or the sample size is large

- A sampling frame is needed

19
New cards

Systematic sampling advantages

-simple and quick

-suitable for large samples and populations

20
New cards

Systematic sampling disadvantages

- A sampling frame is needed

- It can introduce bias if the sampling frame is not random

21
New cards

Stratified sampling advantages

- Sample accurately reflects the population structure

- Guarantees proportional representation of groups within a population

22
New cards

Stratified sampling disadvantages

- Population must be clearly classified into distinct strata

- Selection within each stratum suffers from the same disadvantages as simple random sampling

23
New cards

Non-random sampling methods

- Quota sampling

- Opportunity (convenience) sampling

24
New cards

Quota sampling

An interviewer or researcher selects a sample that reflects the characteristics of the whole population

25
New cards

Quota sampling advantages

- Allows a small sample to still be representative of the population

- No sampling frame required

- Quick, easy and inexpensive

- Allows for easy comparison between different groups within a population

26
New cards

Opportunity (convenience) sampling

The sample is taken from people who are available at the time the study is carried out and who fir the criteria you are looking for.

27
New cards

Quota sampling disadvantages

- Non-random sampling can introduce bias

- Population must be divided into groups, which can be costly or inaccurate

- Increasing scope of study increases number of groups, which adds time and expense

- Non-responses are not recorded as such

28
New cards

Opportunity (convenience) sampling advantages

- Easy to carry out

- Inexpensive

29
New cards

Opportunity (convenience) sampling disadvantages

- Unlikely to provide a representative sample

- Highly dependent on individual researcher

30
New cards

Quantitative

Quantitative data is variables that are associated with numerical observations, e.g. height

31
New cards

Qualitative

Qualitative data is variables that are associated with non-numerical observations, e.g. hair colour

32
New cards

Continuous

Any value in a given range

33
New cards

Discrete

Only specific values in a given range

34
New cards

Classes

When data is presented in a grouped frequency table, the specific data values are not shown. The groups are more commonly known as classes.

35
New cards

Class boundaries

The class boundaries tell you the maximum and minimum values that belong in each class

36
New cards

Midpoint

The midpoint is the average of the class boundaries

37
New cards

Class width

The class width is the difference between the class boundaries

38
New cards

The large data set

The weather data provided is set over two periods of time (May to October in 1987 and in 2015) and is for five UK and three overseas weather stations.

39
New cards

Leuchars

UK, Northern, Coastal

40
New cards

Leeming

UK, Northern, Inland

41
New cards

Heathrow

UK, Southern, Inland

42
New cards

Hurn

UK, Southern, Coastal

43
New cards

Camborne

UK, Southern, Coastal

44
New cards

Jacksonville

Worldwide, Northern Hemisphere, Coastal

45
New cards

Beijing

Worldwide, Northern Hemisphere, Inland

46
New cards

Perth

Worldwide, Southern Hemisphere, Coastal

47
New cards

Daily mean temperature

in °C - this is the average of the hourly temperature readings during a 24-hour period.

48
New cards

Daily total rainfall

Including solid precipitations such as snow and hail, which is melted before being included in any measurements.

49
New cards

tr

rainfall amounts less than 0.05 mm are recorded as 'tr' or 'trace'

50
New cards

Daily total sunshine

recorded to the nearest tenth of an hour

51
New cards

Daily mean wind direction

Mean wind directions are given as bearings and as cardinal directions.

52
New cards

Daily mean windspeed

in knots, averaged over 24 hours from midnight to midnight. The data for mean windspeed is also categorised according to Beaufort scale.

53
New cards

Beaufort scale

1: Calm, less than 1 knot

1-3: Light, 1 to 10 knots

4: Moderate, 11 to 16 knots

5: Fresh, 17 to 21 knots

54
New cards

Daily maximum gust

in knots - this is the highest instantaneous windspeed recorded. The direction from which the maximum gust was blowing is also recorded.

55
New cards

Knot

A knot (kn) is a 'nautical mile per hour'. 1 kn = 1.15 mph

56
New cards

Daily maximum relative humidity

Given as a percentage of air saturation with water vapour. Relative humidities above 95% give rise to misty and foggy conditions.

57
New cards

Daily mean cloud cover

Measured in oktas or eighths of the sky covered by cloud

58
New cards

Daily mean visibility

Measured in decametres (Dm). This is the greatest horizontal distance at which an object can be seen in daylight.

59
New cards

Daily mean pressure

Measured in hectopascals (hPa)

60
New cards

If you need to calculations on the large data set in your exam,

the relevant extract from the data set will be provided

61
New cards

Countif

Command in a spreadsheet to work out the frequency in each class

62
New cards

Measure of location

A single value which describes a position in a data set.

63
New cards

Measure of central tendency

If the single value describes the centre of the data.

64
New cards

Mode

The mode is the value or class that occurs most often

65
New cards

Median

The middle value when the data values are put in order

66
New cards

Mean

sum of data values over number of data values

67
New cards

X bar

represents the mean of the data

68
New cards

Frequency table mean

sum of the products of the data values and their frequencies/sum of their frequencies

69
New cards

Quartiles

- Lower Quartile

- Upper Quartile

70
New cards

Percentiles

Split the data set into 100 parts.

71
New cards

Lower quartile

One quarter of the way through the data set

72
New cards

Upper quartile

Three quarters of the way through the data set

73
New cards

To find the lower quartile for discrete data

divide n by 4. If this is a whole number, the lower quartile is halfway between this data point and the one above. If it is not a whole number, round up and pick this data point.

74
New cards

To find the upper quartile of discrete data

find 3/4 of n. If this is a whole number, the upper quartile is halfway between this point and the one above. If it is not a whole number, round up and pick this data point.

75
New cards

Age is always rounded

down

76
New cards

Interpolation

When data are presented in a grouped frequency table you can use interpolation to estimate the median, quartiles and percentiles.

77
New cards

When you use interpolation you are assuming the data values are

evenly distributed within each class.

78
New cards

Quartiles for grouped continuous data

Q1 = n/4th data value

Q2 = n/2th data value

Q3 = 3n/4th data value

79
New cards

For finding quartiles in an ungrouped frequency tabled

use the rules for discrete data

80
New cards

Range

Difference between the largest and smallest values in the data set

81
New cards

Measures of spread names

- Measures of dispersion

- Measures of variation

82
New cards

Interquartile range (IQR)

The difference between the upper quartile and the lower quartile

Q3 - Q1

83
New cards

Interpecentile range

The difference between the values for two given percentiles.

84
New cards

Variance

Each data point deviates from the mean by the amount x - ⨲

85
New cards

Variance =

Σx²/n - (Σx/n)²

86
New cards

Sxx =

Σx² - (Σx)²/n

87
New cards

The standard deviation is

The square root of the variance

88
New cards

σ =

√(Σx²/n - (Σx/n)²)

89
New cards

σ² in a frequency table

Σfx²/Σf - (Σfx/Σf)²

90
New cards

σ in a frequency table

√(Σfx²/Σf - (Σfx/Σf)²)

91
New cards

f in standard deviation

frequency for each group

92
New cards

Σf in standard deviation

total frequency

93
New cards

Calculate estimates for the variance and standard deviation of the data in a grouped frequency table using

the midpoint of each class interval

94
New cards

Sxx

summary statistic

95
New cards

If data is coded using the formula y = x-a/b

- The mean of the coded data is given by Ý = ⨲-a/b

- The standard deviation of the coded data is given by σy = σx/b, where ox is the standard deviation of the original data.

96
New cards

To find the original data from the coded data

rearrange the formulae:

- ⨲ = bÝ + a

- σx = bσy

97
New cards

Outlier

- Either greater than Q3 + k(Q3 - Q1)

- Or less than Q1 - k(Q3 - Q1)

98
New cards

Cleaning the data

The process of removing anomalies from a data set.

99
New cards

Anomalies

Where an outlier should be removed from the data since it is clearly an error and ti would be misleading to keep it in.

100
New cards

Cumulative frequency diagram

If you are given data in a grouped frequency table, you are not able to find the exact values of the median and quartiles. The diagram can help find estimates for the mean, quartiles and percentiles.