Unit 3 Representations of data

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 10

flashcard set

Earn XP

Description and Tags

Terminology and formulae for AS stats Pearson Edexcel textbook CH3.

11 Terms

1

Common definition for any outlier

Any value that is:

  • greater than Q3 +k(Q3-Q1)

  • less than Q1 - k(Q3-Q1)

k not always used

New cards
2

What is ‘cleaning data’?

The process of removing anomalies from data. Must justify why values are being removed.

New cards
3

Why (when to) use a histogram to represent data?

when the data is grouped, continuous data.

New cards
4

Frequency density equation

frequency density = frequency/class width

New cards
5

How do you form a frequency polygon?

joining the middle of the top of each bar in a histogram with equal class widths

New cards
6

What two things do you comment on when comparing data?

Measure of spread, measure of location.

New cards
7

Which two pairs of measure of spread and location can you comment on when comparing data?

  • mean and standard deviation

OR

  • median and interquartile range

New cards
8

Which pair for comparison is more suitable for a set of data with extreme values?

Median and interquartile range

New cards
9

Adv of box plot?

  • It helps us to see the spread of the data easily.

  • The plot is clear and easy to understand.

  • It uses the range and the median values.

  • It is easy to compare the stratified data.

New cards
10

Disadv of box plot?

  • Original data is not clearly shown in the box plot.

  • Mean and mode cannot be identified using the box plot.

  • It can be easily misinterpreted.

  • If large outliers are present, the box plot is more likely to give an incorrect representation.

New cards
11

Which pair of measure of location/spread to use for box plot comparison?

median and interquartile range

New cards
robot