Unit One: Exploring One-Variable Data- essential knowledge

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/53

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

54 Terms

1
New cards

What does it mean when numbers are placed in context?

Numbers may convey meaningful information when placed in context.

2
New cards

What is a variable in statistics?

A variable is a characteristic that changes from one individual to another.

3
New cards

What is a categorical variable?

A categorical variable takes on values that are category names or group labels.

4
New cards

What is a quantitative variable?

A quantitative variable takes on numerical values for a measured or counted quantity.

5
New cards

What does a frequency table show?

A frequency table gives the number of cases falling into each category.

6
New cards

What is a relative frequency table?

A relative frequency table gives the proportion of cases falling into each category.

7
New cards

How are percentages, relative frequencies, and rates related to proportions?

Percentages, relative frequencies, and rates all provide the same information as proportions.

8
New cards

How can counts and relative frequencies be used in categorical data?

Counts and relative frequencies of categorical data reveal information that can be used to justify claims about the data in context.

9
New cards

What are bar charts used for?

Bar charts (or bar graphs) are used to display frequencies (counts) or relative frequencies (proportions) for categorical data.

10
New cards

What does the height or length of each bar in a bar graph represent?

The height or length of each bar corresponds to the number or proportion of observations in each category.

11
New cards

Are there additional ways to represent frequencies or relative frequencies for categorical data?

Yes, there are many additional ways to represent frequencies (counts) or relative frequencies (proportions) for categorical data.

12
New cards

What can graphical representations of a categorical variable reveal?

Graphical representations of a categorical variable reveal information that can be used to justify claims about the data in context.

13
New cards

How can frequency tables or bar graphs be used for comparison?

Frequency tables, bar graphs, or other representations can be used to compare two or more data sets in terms of the same categorical variable.

14
New cards

What is a discrete variable?

A discrete variable can take on a countable number of values, either finite or countably infinite.

15
New cards

What is a continuous variable?

A continuous variable can take on infinitely many values, with the possibility of determining another value between any two values.

16
New cards

What does the height of each bar in a histogram represent?

The height of each bar shows the number or proportion of observations in the interval corresponding to that bar.

17
New cards

How does altering interval widths affect a histogram?

Altering the interval widths can change the appearance of the histogram.

18
New cards

How is a stem and leaf plot structured?

In a stem and leaf plot, each data value is split into a “stem” (the first digit or digits) and a “leaf” (usually the last digit).

19
New cards

What does a dotplot show?

A dotplot represents each observation by a dot, with positions on the horizontal axis corresponding to data values, and similar values stacked on top of each other.

20
New cards

What does a cumulative graph represent?

A cumulative graph represents the number or proportion of a data set less than or equal to a given number.

21
New cards

Are there other ways to graphically represent quantitative data distributions?

Yes, there are many additional ways to graphically represent distributions of quantitative data.

22
New cards

What are key features to describe in a distribution of quantitative data?

Descriptions include shape, center, variability (spread), and unusual features like outliers, gaps, clusters, or multiple peaks.

23
New cards

What is an outlier in one-variable data?

An outlier is a data point that is unusually small or large relative to the rest of the data.

24
New cards

When is a distribution skewed to the right?

A distribution is skewed to the right (positively skewed) if the right tail is longer than the left.

25
New cards

When is a distribution skewed to the left?

A distribution is skewed to the left (negatively skewed) if the left tail is longer than the right.

26
New cards

What does it mean for a distribution to be symmetric?

A distribution is symmetric if the left half is a mirror image of the right half.

27
New cards

What is a unimodal distribution?

A unimodal distribution has one main peak.

28
New cards

What is a bimodal distribution?

A bimodal distribution has two prominent peaks.

29
New cards

What is an approximately uniform distribution?

An approximately uniform distribution has bars of similar height with no prominent peaks.

30
New cards

What is a gap in a distribution?

A gap is a region in a distribution between data values where there are no observed data.

31
New cards

What are clusters in a distribution?

Clusters are concentrations of data points usually separated by gaps.

32
New cards

What is the role of descriptive statistics?

Descriptive statistics do not attribute properties of a data set to a larger population but may provide the basis for conjectures for further testing.

33
New cards

What is a statistic in statistics?

A statistic is a numerical summary of sample data.

34
New cards

How is the mean calculated for a sample?

The mean is the sum of all data values divided by the number of values.

35
New cards

What is the median of a data set?

The median is the middle value when data are ordered. If there is an even number of data points, the median is the average of the two middle values.

36
New cards

What are the first and third quartiles (Q1 and Q3)?

Q1 is the median of the lower half of the ordered data, and Q3 is the median of the upper half.

37
New cards

What is the interpretation of the pth percentile?

The pth percentile is the value that has p% of the data less than or equal to it.

38
New cards

What are three common measures of variability?

Range, interquartile range (IQR), and standard deviation are commonly used measures of variability.

39
New cards

How is the range calculated?

The range is the difference between the maximum and minimum data values.

40
New cards

How is the interquartile range (IQR) calculated?

The IQR is the difference between the third quartile (Q3) and the first quartile (Q1).

41
New cards

What does the standard deviation measure?

The standard deviation measures the variability of a quantitative variable’s distribution.

42
New cards

What is a five-number summary?

A five-number summary consists of the minimum, first quartile (Q1), median, third quartile (Q3), and maximum data values.

43
New cards

What is a boxplot?

A boxplot is a graphical representation of the five-number summary with whiskers extending to the most extreme non-outlier data points and outliers represented separately.

44
New cards

How does the mean compare to the median in different distributions?

In a symmetric distribution, the mean and median are close. If skewed right, the mean is usually to the right of the median; if skewed left, the mean is usually to the left of the median.

45
New cards

How can you compare two or more data sets using graphical representations?

Graphical representations like histograms or side-by-side boxplots can be used to compare center, variability, clusters, gaps, outliers, and more.

46
New cards

How can you compare data sets using numerical summaries?

Numerical summaries such as mean, standard deviation, and relative frequency can be used to compare different data sets.

47
New cards

What is a parameter?

A parameter is a numerical summary of a population.

48
New cards

What does a normal distribution look like?

A normal distribution is mound-shaped and symmetric.

49
New cards

What is the empirical rule for a normal distribution?

The empirical rule states that 68% of observations are within 1 standard deviation of the mean, 95% are within 2, and 99.7% are within 3 standard deviations.

50
New cards

How do you calculate a standardized score?

standardized score is calculated as (data value – mean) / (standard deviation), indicating how many standard deviations a data value is from the mean.

51
New cards

What is a z-score?

A z-score measures how many standard deviations a data value is from the mean.

52
New cards

How can technology be used in normal distributions?

Technology like calculators or software can be used to find proportions of data values in intervals of a normally distributed random variable.

53
New cards

How can you estimate population parameters given the area under a normal curve?

Technology can be used to estimate parameters for populations based on the area under a normal distribution curve.

54
New cards

How can percentiles and z-scores be used for comparisons?

Percentiles and z-scores compare the relative positions of points within a data set or between data sets.