Ch2/3Descriptive Statistics & Data Analysis – Key Vocabulary

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/46

flashcard set

Earn XP

Description and Tags

A comprehensive set of vocabulary flashcards covering major concepts, graphical tools, and numerical measures introduced in the lecture on descriptive statistics and exploratory data analysis.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

47 Terms

1
New cards

Descriptive Statistics

Methods that summarize or describe the important characteristics of a known data set.

2
New cards

Inferential Statistics

Techniques that use sample data to make generalizations or inferences about a population.

3
New cards

Center (of a data set)

A representative value, such as the mean or median, indicating where the middle of the data lies.

4
New cards

Variation

A measure of how much the data values differ from one another.

5
New cards

Distribution (shape)

The overall pattern or form of how data values are spread, e.g., bell-shaped, uniform, or skewed.

6
New cards

Outlier

A data value that lies far from the majority of other observations.

7
New cards

Frequency Table

A table that lists classes (categories) of values with the count of observations in each class.

8
New cards

Lower Class Limit

The smallest value that can belong to a particular class in a frequency table.

9
New cards

Upper Class Limit

The largest value that can belong to a particular class in a frequency table.

10
New cards

Class Boundary

The value that separates adjacent classes; halfway between upper limit of one class and lower limit of the next.

11
New cards

Class Midpoint

The average of the upper and lower class limits; used as a representative value for the class.

12
New cards

Class Width

The difference between two consecutive lower class limits (or boundaries).

13
New cards

Relative Frequency

Class frequency divided by the sum of all frequencies, usually expressed as a percentage.

14
New cards

Cumulative Frequency

Running total of frequencies for classes up to a given point.

15
New cards

Histogram

A bar graph of class frequencies where bars touch, depicting the distribution of quantitative data.

16
New cards

Relative Frequency Histogram

A histogram whose vertical axis shows relative frequencies instead of raw counts.

17
New cards

Dot Plot

A simple graph placing a dot for each data value above a number line.

18
New cards

Stem-and-Leaf Plot

A display that separates each data value into a stem (leading digits) and leaf (trailing digit).

19
New cards

Scatter Diagram

A graph of paired (x, y) data points used to assess relationships between two variables.

20
New cards

Mean (Arithmetic Mean)

The sum of all data values divided by the number of values; the average.

21
New cards

Population Mean (µ)

The mean of all values in an entire population, denoted by the Greek letter mu (µ).

22
New cards

Sample Mean (x̄)

The mean of values in a sample, denoted by x-bar (x̄).

23
New cards

Σ (Sigma)

Mathematical symbol indicating the summation of a set of numbers.

24
New cards

n

The number of data values in a sample.

25
New cards

N

The number of data values in a population.

26
New cards

Median

The middle value when data are ordered; 50th percentile, denoted by x͂ (x-tilde).

27
New cards

Mode

The data value that occurs most frequently in a set.

28
New cards

Bimodal

A distribution with exactly two modes.

29
New cards

Multimodal

A distribution with more than two modes.

30
New cards

Midrange

The value halfway between the highest and lowest data values; (max + min)/2.

31
New cards

Symmetric Distribution

A distribution whose left half mirrors its right half.

32
New cards

Skewed Distribution

A distribution that stretches further on one side; can be skewed left (negative) or right (positive).

33
New cards

Range

The difference between the highest and lowest data values.

34
New cards

Standard Deviation

A measure of how far data values typically deviate from the mean.

35
New cards

Sample Standard Deviation (s)

Standard deviation of sample data; denominator uses (n − 1).

36
New cards

Population Standard Deviation (σ)

Standard deviation of all population data; denominator uses N.

37
New cards

Variance

The square of the standard deviation (s² for a sample, σ² for a population).

38
New cards

Range Rule of Thumb

Approximation that range ≈ 4 × standard deviation for many data sets.

39
New cards

Empirical Rule

For bell-shaped data: ~68 % within 1 s, 95 % within 2 s, 99.7 % within 3 s of the mean.

40
New cards

Chebyshev’s Theorem

In any data set, at least 1 − 1/K² of values lie within K standard deviations (K > 1) of the mean.

41
New cards

z-Score

Number of standard deviations a value x is above or below the mean; z = (x − mean)/s.d.

42
New cards

Quartiles (Q1, Q2, Q3)

Values that divide ordered data into four equal parts; Q2 is the median.

43
New cards

Deciles

Cut points that divide ordered data into ten equal parts (D1–D9).

44
New cards

Percentile

A score indicating the percentage of data values below it; e.g., P25 = Q1.

45
New cards

Five-Number Summary

Minimum, Q1, Median (Q2), Q3, Maximum.

46
New cards

Boxplot (Box-and-Whisker)

Graph of the five-number summary that highlights center, spread, and outliers.

47
New cards

Exploratory Data Analysis (EDA)

Use of graphical and numerical tools to understand data characteristics before formal inference.