Describing and Displaying Data

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/34

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

What are the three main ways to summarize empirical distributions?

Tables (frequency distributions), graphs (bar graphs, histograms), and descriptive statistics.

2
New cards

What does "location" refer to in a distribution?

Where the distribution lies on a number line (e.g., x-axis of a histogram).

3
New cards

What is central tendency?

It refers to the "middle" of a distribution.

4
New cards

What is the mode?

The most frequently occurring value(s) in a distribution.

5
New cards

Is the mode affected by outliers?

No, it is robust to outliers.

6
New cards

What do unimodal, bimodal, and multimodal mean?

They describe the number of modes (1, 2, 3+, respectively).

7
New cards

What is the median?

The 50th percentile — the value that splits the distribution in half by frequency.

8
New cards

How do you calculate the median?

Rank the values and use the middle score, or average the two middle scores if N is even.

9
New cards

Is the median affected by outliers?

No, it is relatively robust to outliers.

10
New cards

What is the median locator formula?

(N + 1) ÷ 2

11
New cards

What is the mean?

The arithmetic average of all values.

12
New cards

What does the mean minimize?

The sum of squared deviations.

13
New cards

Is the mean sensitive to outliers?

Yes, outliers can significantly affect the mean.

14
New cards

What does dispersion measure?

How spread out the values are from the center (mean).

15
New cards

What is the range formula?

Max value − Min value

16
New cards

What is the IQR formula?

IQR = 75th percentile − 25th percentile

17
New cards

What does IQR represent?

The spread of the middle 50% of data.

18
New cards

What is variance?

The average of the squared deviations from the mean.

19
New cards

What’s the formula for sample variance (data analysis)?

20
New cards

What’s the formula for sample variance (inference)?

s2=∑(Xi​−Xˉ)2​
——————-

N

21
New cards

Why do we square the deviations?

To prevent them from canceling out (∑ deviations = 0).

22
New cards

What is standard deviation?

The square root of the variance.

23
New cards

Why is standard deviation preferred over variance?

It’s in the same metric as the original data.

24
New cards

What does skewness describe?

The symmetry of a distribution.

25
New cards

What does g1=0g1​=0 indicate?

Symmetric distribution.

26
New cards

What does g1>0g1​>0 mean?

Positive skew (tail to the right).

27
New cards

What does g1<0g1​<0 mean?

Negative skew (tail to the left).

28
New cards

What does kurtosis describe?

The peakedness of a distribution.

29
New cards

What does g2=0g2​=0 indicate?

Mesokurtic (normal distribution shape).

30
New cards

What does g2>0g2​>0 indicate?

Leptokurtic (tall/narrow peak).

31
New cards

What does g2<0g2​<0 indicate?

Platykurtic (flat/wide peak).

32
New cards

What do boxplots visualize?

Dispersion and outliers in a distribution.

33
New cards

What is the max length of whiskers in a boxplot?

Up to 1.5 × IQR from Q1 or Q3.

34
New cards

What are aggregate-type propositions?

Claims about groups as a whole.

35
New cards

Why should we be cautious with individual-level interpretations in statistics?

Because statistics summarize groups, not individuals.