CH. 3 DISPLAYING AND SUMMARIZING QUANTITATIVE DATA

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/62

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

63 Terms

1
New cards

What is quantitative data?

It’s data that consists of variables that measure numerical quantities 

2
New cards

What are 2 major ways we can describe quantitative data? Using what?

plots, numerical summaries 

3
New cards

What are the 2 types of plots we can use to describe quantitative data?

histogram, boxplot 

4
New cards

What are the 3 types of numerical summaries we can use to describe quantitative data?

mean, median, range, etc.

5
New cards

What type of categories do histograms create? 

artificial categories for values of quantitative variable

6
New cards

What do you call these artificial categories that these histograms create?

bins or classes

7
New cards

Fill in the blanks.

Pretend variables is categorical & create _______.

bar-plot

8
New cards
<p>What type of graph is this in the image shown?</p>

What type of graph is this in the image shown?

relative frequency histogram

9
New cards

What are the 2 major features that distinguish bar-plot from histogram?

  • Categories (classes)
    are ordered

  • No gaps between
    bars (unless freq.=0)

10
New cards

For any quantitative variable, what are 3 aspects we look at to describe the overall pattern of it’s distribution?

  • shape

  • centre

  • spread

11
New cards

What do you look at when you see the shape of the distribution?

the modality

12
New cards

What does modality refer to?

the number of modes or peaks in the data set distribution 

13
New cards
<p>What are the 3 different types of modality (shape of distribution)?</p>

What are the 3 different types of modality (shape of distribution)?

  1. unimodel

  2. bimodel

  3. uniform

<ol><li><p>unimodel </p></li><li><p>bimodel</p></li><li><p>uniform</p></li></ol><p></p>
14
New cards

What are the 4 factors you must consider when looking at the shape of the distribution?

  1. modality

  2. symmetry

  3. skeweness

  4. outliers

15
New cards

Looking at the shape of the distribution, when do we call a distribution “symmetric”?

if, when we draw a vertical line down its center, the two sides are similar in shape and size

16
New cards

When do we call a shape of distribution towards being “skewed”?

When the graph is a unimodal distribution with one
tail longer than the other.

17
New cards
<p>How would you describe the shape of the distribution in the provided image?</p>

How would you describe the shape of the distribution in the provided image?

skewed to the left (negatively skewed)

18
New cards
<p>How would you describe the shape of the distribution in the provided image?</p>

How would you describe the shape of the distribution in the provided image?

skewed to the right (positively skewed)

19
New cards

Looking at any data set provided on a graph (in general) , what would be considered an “outlier”?

Any data that lie far off the main body of the distribution

20
New cards

True or False.

Extreme values are known as “outliers”.

True.

21
New cards
<p><span style="color: rgb(255, 255, 255);">How would you describe the shape of the distribution of the variable, infections?</span></p>

How would you describe the shape of the distribution of the variable, infections?

unimodel, skewed to the right & one outlier present

22
New cards

What are 2 different types of centres that an describe the pattern of distribution?

mean and median

23
New cards

What value is considered to be the “mean”?

A value that is the sum of all values of the quantitative variable divided by the number of values

24
New cards
<p>What is this formula (seen in the image) calculating for?</p>

What is this formula (seen in the image) calculating for?

the mean

25
New cards

Fill in the blanks.

Mean is “_______” of data

center of gravity

26
New cards

True or false.

Mean is not sensitive to outliers.

False.

27
New cards

What value is considered to be the “median”?

A value that is the midpoint of all values, after they are
ordered from the smallest to the largest.

28
New cards

True or false.

Median is considered to be the value expressing 50% of values above & 50% below the median.

True.

29
New cards

For an even (#) number data set, how would you calculate the median?

You would take the mean of the two middle numbers.

30
New cards

True or false.

Median is robust to outliers.

True.

31
New cards

What does the term ‘robust’ mean?

less influenced by

32
New cards

Fill in the blanks. 

For symmetric distributions: mean __ median

For skewed distributions: mean __ median

For symmetric distributions: mean = median
For skewed distributions: mean >< median

33
New cards

Fill in the blank

When does a graph become left-skewed:

when mean __ median.

mean < median

34
New cards

Fill in the blank

When does a graph become right skewed:

when mean __ median.

mean > median

35
New cards
<p>Answer the following questions on the provided image.</p>

Answer the following questions on the provided image.

  1. Median 

  2. T

  3. F

36
New cards

What are the 4 different types of spreads of distributions to consider when describing the pattern of distribution of a graph/chart?

range, standard deviation, IQR, variance

37
New cards

What does spread describe?

how much data vary about their centre

38
New cards

What is the formula used to calacuate the range of a data set?

range = max - min

39
New cards

True or false.

Range is not very sensitive to outliers.

False.

40
New cards

If your data set clearly has outliers present and you want to figure out a good measure of spread distribution for that particular data set provided. What spread of measure would you use?

IQR (interquartile range)

41
New cards

Fill in the blanks.

Quartiles are ___ values that divide distribution
into ____ parts, each containing ¼ of data.

3, 4

42
New cards

Fill in the blanks.
1st quartile (Q1) has __ of data below &
__ above

¼ , ¾

43
New cards

Fill in the blanks.
2nd quartile (Q2) has __ of data below & __ above

½, ½

44
New cards

Fill in the blanks.

3 rd quartile (Q3) has __ of data below & __ above.

¾, ¼

45
New cards

Given a data set, how would you find Q1, Q2 and Q3?

Split the data into two halves, and the middle value would be the Q2. The first half of the data's middle value would be the Q1, and the second half's middle value would be the Q3. *NOTE: (if two values are present, take the mean of both values to obtain one value)

46
New cards

What is the formula to caluclate the IQR?

IQR = Q3 - Q1

47
New cards

What does IQR (interquartile range) mean?

It’s the distance between Q1 and Q3

48
New cards

Fill in the blanks.

IQR is _________ to outliers.

resistant

49
New cards

What measure is variance (s²)?

A measure that measures the average squared deviation of individual data from their mean.

50
New cards

What measure of units is variance always?

squared units

51
New cards

How would you calculate standard deviation?

s = √s² = √variance

52
New cards

What do both variance and standard deviation both measure?

the spread around the mean (distance)

53
New cards

Are variance and standard deviation both sensitive to outliers? Explain.

Yes, as they look at the measure from each data point to the sample mean (which, is sensitive to outliers)

54
New cards

What do both variance and standard deviation work best for?

symmetric distribution

55
New cards

What kind of measure is standard deviation?

measure of the amount of variation in a set of data 

56
New cards

What is a 5-number summary?

A 5-number summary is set of five measures giving quick summary of a distribution.

57
New cards

What does the five-number summary consist of?

  1. Minimum

  2. Q1

  3. Q2 - Median

  4. Q3

  5. Maximum

58
New cards

What would be a visual display of 5-number summary?

a boxplot

59
New cards

Why do boxplots aid in identifying outliers?

by calculating both the upper and lower fences, and drawing those fences on the boxplot - those points outside these fences are suspected the outliers

60
New cards

How would you calculate upper & lower fences?

upper fence = Q3 + 1.5 * IQR

lower fence = Q1 - 1.5 * IQR

61
New cards

Fill in the blanks.

Five-number summary is better for describing
__________________________.

skewed distributions w/ outliers

62
New cards

Fill in the blank.

___________ and __________ work well for reasonably symmetric distributions w/o outliers

mean, standard deviation

63
New cards
<p>Fill in the blanks on this box-plot diagram?</p>

Fill in the blanks on this box-plot diagram?

knowt flashcard image