CH3 - Representations of Data

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/5

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

6 Terms

1
New cards

What is an Outlier?

An outlier is a value that deviates significantly from the other observations in a dataset, potentially indicating variability in the measurement or an anomaly.

M1 - An outlier is a value that is:

Greater than Q3 + K x IQR

Less than Q1 - K x IQR

M2 - Is 2 Standard Deviations away from the mean

K often = 1.5

Which method to use will be specified.

2
New cards

What is a Boxplot?

A representation of key features of a data set featuring: quartiles, min/max points and outliers.

<p>A representation of key features of a data set featuring: quartiles, min/max points and outliers.</p>
3
New cards

What is Cumulative Frequency?

4
New cards

What are Histrograms?

Histograms are graphical representations of the distribution of numerical data, using bars to show the frequency of data points within specified intervals.

The Area of a Bar = K x Frequency

When K = 1, FD = Frequency / Class Width

<p>Histograms are graphical representations of the distribution of numerical data, using bars to show the frequency of data points within specified intervals.</p><p>The Area of a Bar = K x Frequency</p><p>When K = 1, FD = Frequency / Class Width</p>
5
New cards

What is a Frequency Polygon?

A graphical method for displaying data distributions, created by connecting the midpoints of the tops of the bars in a histogram with straight lines.

<p>A graphical method for displaying data distributions, created by connecting the midpoints of the tops of the bars in a histogram with straight lines. </p>
6
New cards

How do you Compare Data?

When your comparing data, you should comment on Measure of Location and Spread,

You should either use Mean and Standard Deviation or Median and Interquartile Range, depending on the data (eg Median and IQR for data with extreme values)