Chapter 2: Exploring Data with Graphs and Numerical Summaries

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/34

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

variable

any characteristic of an individual

2
New cards

categorical variable (w/ examples)

a variable that places an individual into one of several groups or categories (fav color, yes/no questions, phone #s, zip codes, ID #s)

3
New cards

quantitative variable (w/ examples)

a variable that takes numerical values for which arithmatic operations such as adding and averaging make sense (length, time, age, wage)

4
New cards

distribution

the overall pattern of variation of a variable, showing how often each value occurs

5
New cards

What are the key features to describe a quantitative variable (3)?

shape, center, variability/spread

6
New cards

modal category

a key feature of a categorical variable, notes which category has the largest frequency

7
New cards

nominal

a type of categorical variable that does not have an inherent ordering

8
New cards

ordinal

a type of categorical variable that can be ordered

9
New cards

discrete variable

a quantitative variable that can assume only a finite/countable number of variables (often whole numbers)

<p>a quantitative variable that can assume only a finite/countable number of variables (often whole numbers)</p>
10
New cards

continuous variable

a quantitative variable that can assume an infinite number of values in one or more intervals

<p>a quantitative variable that can assume an infinite number of values in one or more intervals</p>
11
New cards

frequency table

a listing of the possible values for a variable, together with the number of observations for each value

12
New cards

What is the difference between using pie charts/bar graphs and histograms?

Use pie charts/bar graphs for categorical data. Use a histogram for quantitative data. The bars/sections are different categories, and therefore don’t touch (in a bar graph). The bars touch because the different bars are intervals.

13
New cards

outlier

an individual observation that falls outside the overall pattern of the graph

14
New cards

How can you describe a distribution?

unimodal, bimmodal, skewed right, skewed left, normal

15
New cards

unimodal distribution

a distribution with one major peak

16
New cards

bimodal

a distribution with two major peaks

17
New cards

skewed to the right

the right side of the graph extends much further than the left (the majority of the data is on the left, with outliers on the right)

18
New cards

skewed to the left

the left side of the graph extends much further than the right (the majority of the data is on the right, with outliers on the left)

19
New cards

mode

value/property that occurs most frequently in the data set

20
New cards

What are the different graphs presented in this unit?

pie charts, bar graphs, histograms, stem-and-leaf plots, dot plots, time plots, box plots

21
New cards

mean (and how it is measured)

balance point of the distribution (sum of measurements/#of measurements

22
New cards

median (and how to calculate it)

the middle number of an ordered data set ((n+1)/2)

23
New cards

resistant measure of center

less sensitive to extremely small/large measurements

24
New cards

standard deviation

a statistical measure of data spread, indicating how much individual data points deviate from the mean

25
New cards

range

difference between the highest and lowest values in the data set

26
New cards

sample variance equation

knowt flashcard image
27
New cards

standard variation equation

knowt flashcard image
28
New cards

Empirical Rule

app. 68% of the data fall w/in 1 standard deviation of the mean
app. 95% of the data fall w/in 2 standard deviations of the mean
app 99.7% of the data fall w/in 3 standard deviations of the mean

29
New cards

What does Q1 mean?

25th percentile, lower quartile, median of the lower half of the data

30
New cards

What does Q2 mean?

50th percentile, median

31
New cards

What does Q3 mean?

75th percentile, upper quartile, median of the upper half of the data

32
New cards

interquartile range (and equation)

describes the spread of the middle half of the data (IQR=Q3-Q1)

33
New cards

When is an observation a potential outlier?

if it falls more than 1.5xIQR below the first quartile or above the third quartile

34
New cards

What is included in the five-number summary?

min, Q1, median, Q3, max

35
New cards

z-score (and equation)

distance between a given measurement x and the mean, expressed in standard deviations (z=(observation-mean)/standard deviation)