STATS OCR A LEVEL MATHS

0.0(0)
Studied by 1 person
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/46

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 1:25 PM on 2/17/24
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

47 Terms

1
New cards

Statistical Sampling:

What is a census and what are the downfalls

A question asked of the whole population

It is expensive, time consuming, and sometimes biased if certain people do not respond

2
New cards

Statistical Sampling:

Discrete data means

Distinct categories

Aka shoe sizes

3
New cards

Statistical sampling:

Continuous data means

Ever increasing levels of accuracy

Aka the exact length of your foot

4
New cards

Statistical sampling

Quantitative data

Has a numerical value

5
New cards

Statistical sampling:

Qualitative data

Is worded

6
New cards

Statistical sampling:

The Simple Random method

Downfalls + positives

-assign numbers to everything in a population

-select random numbers with a raffle or random number generator

-remove any repeats

Ideal for self-contained areas, but time-consuming with large populations

7
New cards

Statistical sampling:

Systematic method

Downfalls and positives

-Assign numbers to everything

-pick every nth person

Has a periodic nature so is useful for finding patterns

Could be biased if every nth item is the same.

8
New cards

Statistical sampling:

Stratified method

Downfalls and positives

Select a sample of n that proportionally represents all categories

Can be costly trying to find populations from different categories

Can focus attention on particular areas to see if that influences results

9
New cards

Statistical sampling:

Quota method

Downfalls and positives

Find people from two categories

Make them do the questionnaires to meet the quota

Not randomly selected, must fill quota before finishing sample

It is cost effective

10
New cards

Statistical sampling:

Opportunity method

Downfalls and positives

Stand in the street with a clipboard

Very biased

Self contained, fast, cost efficient, no quota

11
New cards

Statistical sampling:

Cluster method

Downfalls and positives

Split the samples of people into clusters (group similar items together)

Easy but there could be a key difference in the clusters you are comparing

12
New cards

Statistical sampling:

Self-selection

Volunteered responses

Cost efficient, fast

Can attract a certain type of opinionated people

13
New cards

Probability:

P(AuB)=

(P(A)+P(B))-P(AnB)

14
New cards

Probability:

When drawing a venn diagram, writing in values, start with

The central overlap

<p>The central overlap</p>
15
New cards

Probability:

P(AnB)=

(P(A)+P(B))-P(AuB)

Or use a venn diagram visually

Or P(A) x P(B)

16
New cards

Probability:

If two events are independent

P(AnB)= P(A) x P(B)

P(A|B)=P(A)

17
New cards

Probability:

Two events are mutually exclusive

P(AnB)=0

18
New cards

Probability:

P(A|B)=P(AnB)/P(B)

19
New cards

Probability:

For binomial distribution to be valid

There must be a fixed no. of trials

Two outcomes

Each event must be independent

Probability must be constant

20
New cards

Presentation:

Box plot

Has min, LQ, median, UQ, max

<p>Has min, LQ, median, UQ, max</p>
21
New cards

Graphs:

how to find outliers on a box plot

Upper outliers: UQ+1.5(IQR)

Lower outliers: LQ-1.5(IQR)

22
New cards

Graphs:

Comparing box plots

Compare medians to discuss average

Compare IQR to discuss consistency

Compare difference in min and max to discuss performance

23
New cards

Graphs:

Drawing cumulative frequency curves

Find the cumulative frequency by adding the frequencies going along the table of x values

Plot CF-x and find the curve

24
New cards

Graphs:

Finding the median, LQ and UQ of the cumulative frequency graph

Median: CF/2 so find the x-value at CF/2

LQ: CF/4 so find the x value

UQ: CF x ¾ so find the x value

25
New cards

Graphs:

Drawing a histogram

Find the frequency density for the table

Frequency density= f/class width

Plot fd-x

Area of the boxes= k (frequency)

Often k is 1 but check

26
New cards

Graphs:

Find median, LQ and UQ for a histogram

Find the the sum of the frequencies

find the sum of F/ 2 which equals a constant a

So the a^th value along is where the median lies

So do (X-lower part of inequality)/( upper - lower part of inequality)= (a-lower value of inequality)/(upper- lower value of inequality)

LQ: sum of F/4

UQ: sum of F x ¾

<p>Find the the sum of the frequencies</p><p>find the sum of F/ 2 which equals a constant a</p><p>So the a^th value along is where the median lies</p><p>So do    (X-lower part of inequality)/( upper - lower part of inequality)= (a-lower value of inequality)/(upper- lower value of inequality) </p><p>LQ: sum of F/4</p><p>UQ: sum of F x ¾</p><p></p>
27
New cards

Bivariate data:

Definition and how to plot

Two variables, generally plotted with a scatter graph, with the independent variable horizontal and the dependent variable vertical

28
New cards

Bivariate data:

Correlation

-1<=r<=1

Where -1 is strong negative

1 is strong positive

0 is no correlation

29
New cards

Bivariate data

How to find PMCC

Use given formula in booklet

Sxy/ (sqrt Sxx Syy)

Or

Calculator:

Stats, x in left column, y in right column, CALC, REG, X, a+bx, r=…

Or

Guess between -1<=r<=1 looking at correlation

30
New cards

Bivariate data:

Regression line is .. and how to find it

An optimal line of best fit

y-y_=b(x-x_)

find x_ and y_ therefore the regression line must go through (x_,y_)

Find b on calculator using STATS, x first column, y second column, CALC, REG, X, a+bx, b=…

Draw the regression line using y=b(x-x_)+y_

31
New cards

Bivariate data:

Interpolation and extrapolation meaning

Interpolation: anything inside the region is valid

Extrapolation: extending the region to guess a value

32
New cards

Discrete data:

Find the mean, mode, median, variance from discrete data

Mean= sum of x . Frequency/ n

Calc: STATS, first column x, second column F, CALC, SET, 1 var freq: List 2, exit, 1-VAR, x_=…

33
New cards

Discrete data:

Find the mode from discrete data

Look at the highest frequency

34
New cards

Discrete data:

Find the median, LQ, UQ

Find the sum of the frequencies

(Sum+1)/2= position of the median

Find the CF column

Identify where the median is and find its value

LQ: (sum/2 +1)/2= position of LQ

UQ: sum/2 + position of LQ

35
New cards

Grouped data:

Estimate the mean

Estimate the mean is in the middle of the group

So find the midpoint for each inequality, use STATS, to find mean with midpoints in column 1 and frequency in column 2

If the inequalities aren’t continuous, rewrite so they are

With age always change the inequality aka 10<=x<=15 should be 10<=x<16

<p>Estimate the mean is in the middle of the group</p><p>So find the midpoint for each inequality, use STATS, to find mean with midpoints in column 1 and frequency in column 2</p><p></p><p>If the inequalities aren’t continuous, rewrite so they are </p><p>With age always change the inequality aka 10&lt;=x&lt;=15 should be 10&lt;=x&lt;16</p>
36
New cards

Grouped data:

Estimate the median, LQ, UQ

Find the cumulative frequency

Divide the cumulative frequency by 2 to find the position of the median

Find the median using linear interpolation

LQ: cumulative frequency/4

UQ: cumulative frequency x ¾

37
New cards

Comparing data sets:

What to say about median, IQR, variance

Median: discuss their average

IQR: discuss consistency

Standard deviation: discuss consistency

38
New cards

Variance and standard deviation:

Find the variance and s.d from a table of x values

Find the mean

Find the distance of each value from the mean

Square the distances

Find the sum of the squares of the distances

Divide by n=variance

So sqrt the variance= standard deviation

Use calc or given formula or remember this method

39
New cards

Standard deviation and variance:

Difference between ōx and sx

ōx is for population

sx is for a sample

40
New cards

linear coding:

Convert the mean and standard deviation from celsius to F’

Find the mean and standard deviation in terms of celsius

The mean should be effected by the multiplication and addition in the conversion formula

The standard deviation should only be multiplied or divided by the value in the conversion formula

41
New cards

Outliers:

Find outliers using quartiles

Find outliers using mean and standard deviation

Quartiles:

LQ-1.5(IQR)=outlier

UQ+1.5(IQR)=outlier

Mean and s.d:

mean-2(s.d)= outlier

mean+2(s.d)=outlier

Omit them if they are an outlier

42
New cards

Discrete random variable:

Tabulate data

X (possible outcomes)

P(X=x) (probabilities) sum should= 1

43
New cards

Discrete random variables:

Stick graph

Tabulate data

Y-axis is P(X=x)

x-axis is X (outcomes)

<p>Tabulate data</p><p>Y-axis is P(X=x)</p><p>x-axis is X (outcomes)</p>
44
New cards

Discrete random variables:

Find data from tabulated data

knowt flashcard image
45
New cards

Discrete random variables:

Notation P(X=x)=x/15, x=1,2,3,4,5

Probability X=x is x/15 when x=1,2,3,4,5

46
New cards

Cumulative distribution function:

Tabulate

x (outcomes)

P(X=x) (probabilities)

F(x) (cumulative probabilities) with the last value in the row being 1

x must be whole for discrete data!!

47
New cards