MAT251

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/75

flashcard set

Earn XP

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

76 Terms

1
New cards

Data is defined as __________.

individual facts

2
New cards

Information is processed, organized, and __________ data that provides useful knowledge.

structured

3
New cards

Qualitative data can be categorized, while __________ data takes numerical values.

Quantitative

4
New cards

______________ data can take values in categories such as country, gender, favorite color.

Qualitative

5
New cards

The difference between the largest and smallest values in a dataset is known as the __________.

Range

6
New cards

A dataset can be divided into __________, which split it into four equal parts.

quartiles

7
New cards

In statistics, the __________ is the value that occurs most frequently in a dataset.

Mode

8
New cards

The __________ is the average value calculated by summing all data points and dividing by the count.

Mean

9
New cards

The measure of __________ helps to describe the variability of data points around the mean.

dispersion

10
New cards

A __________ distribution exhibits a longer tail on one side, indicating skewness.

skewed

11
New cards

Confidence intervals provide an estimate of the __________ characteristics using samples.

population

12
New cards

Pearson’s median skewness is calculated using the formula __________.

3 * (Mean - Median) / Standard Deviation

13
New cards

Inferential statistics help make predictions about a __________ based on sample data.

population

14
New cards

The interquartile range (IQR) is used to measure the __________ of a dataset, focusing on the middle 50%.

dispersion

15
New cards

The _____ value represents the central point in a dataset identified after sorting.

Median

16
New cards

A standard score, also known as a z-score, indicates how many __________ a certain score is from the mean.

standard deviations

17
New cards

__________ statistics summarize and describe characteristics of a dataset.

Descriptive

18
New cards

What is dispersion in statistics?

Dispersion refers to the extent to which data points differ from the mean.

19
New cards

Name a common measure of dispersion.

Range, variance, and standard deviation are common measures of dispersion.

20
New cards

What does a small standard deviation indicate?

A small standard deviation indicates that data points are close to the mean.

21
New cards

What does a large standard deviation signify?

A large standard deviation shows that data points are spread out over a wider range of values.

22
New cards

How is variance different from standard deviation?

Variance is the average of the squared differences from the mean; standard deviation is the square root of variance.

23
New cards

When is the mode a useful measure?

The mode is useful for categorical data where we want to know the most common category.

24
New cards

Can there be more than one mode in a dataset?

Yes, a dataset can be unimodal (one mode), bimodal (two modes), or multimodal (more than two modes).

25
New cards

What is the formula for calculating the mean?

Mean = (Sum of all data points) / (Number of data points).

26
New cards

What is the primary measure of central tendency?

The mean is often referred to as the primary measure of central tendency.

27
New cards

What is the median and how is it calculated?

The median is the middle value in a sorted dataset. If the dataset has an even number of points, it is the average of the two middle points.

28
New cards

What does it mean if the mode is higher than the mean?

It may indicate a left-skewed distribution.

29
New cards

What is a skewed distribution?

A skewed distribution is one in which data points are not symmetrically distributed around the mean.

30
New cards

What can affect the mode of a dataset?

The mode can change with the frequency of certain values in the dataset.

31
New cards

What is the relationship between mean and median in a symmetric distribution?

In a symmetric distribution, the mean and median are equal.

32
New cards

How do you find the mode in a frequency distribution?

Identify the value with the highest frequency in the distribution.

33
New cards

Which measure of central tendency is most affected by outliers?

The mean is most affected by outliers.

34
New cards

What is the interquartile range (IQR)?

The IQR is the difference between the third quartile (Q3) and the first quartile (Q1).

35
New cards

How does the IQR help in data analysis?

The IQR is used to measure the spread of the central 50% of the data, reducing the effect of outliers.

36
New cards

What does a higher IQR indicate?

A higher IQR indicates greater variability and dispersion among the middle values.

37
New cards

In what situations is the median preferred over the mean?

The median is preferred when dealing with skewed distributions or when there are outliers.

38
New cards

Can the mean be a non-existent value in some datasets?

Yes, if all values are undefined or if all numbers are 0, the mean may also be undefined.

39
New cards

What indicates a zero standard deviation?

A zero standard deviation indicates that all values in the dataset are identical.

40
New cards

What is a cumulative frequency?

Cumulative frequency is the sum of the frequencies of all data points up to a certain value.

41
New cards

How can data be visually displayed to represent dispersion?

Box plots and histograms can visually display the dispersion of data.

42
New cards

What is a uniform distribution?

A uniform distribution is one where all outcomes are equally likely.

43
New cards

What role does mode play in a bimodal distribution?

Both modes in a bimodal distribution represent the two most frequently occurring values.

44
New cards

What is the purpose of calculating the range?

Calculating the range provides a measure of the spread of values in a dataset.

45
New cards

How does one calculate the interquartile range?

IQR = Q3 - Q1.

46
New cards

What does it mean if the mode is equal to the mean and median?

It indicates a symmetric distribution.

47
New cards

When analyzing data, why is it important to consider dispersion?

Considering dispersion helps understand data variability and consistency.

48
New cards

What is the difference between absolute and relative dispersion?

Absolute dispersion deals with measures like standard deviation, while relative dispersion is represented as a fraction or percentage of the mean.

49
New cards

How do outliers affect the mean?

Outliers can significantly distort the mean, making it less representative of the dataset.

50
New cards

What is a probability distribution?

A probability distribution is a mathematical function that provides the probabilities of occurrence of different possible outcomes.

51
New cards

What does a box plot display?

A box plot displays the distribution of data based on a five-number summary: minimum, first quartile, median, third quartile, and maximum.

52
New cards

What is normalized data?

Normalized data adjusts values measured on different scales to a common scale.

53
New cards

What statistical methods help deal with outliers?

Winsorizing, trimming, and using median-based methods help deal with outliers.

54
New cards

What is the purpose of confidence intervals?

Confidence intervals estimate the uncertainty around a sample statistic.

55
New cards

When describing a dataset, why use the mode?

The mode identifies the most common value, which can highlight trends in the data.

56
New cards

How can measures of central tendency mislead?

If examined in isolation without measures of dispersion, they may not provide a complete picture of the data.

57
New cards

What is a histogram?

A histogram is a graphical representation of the distribution of numerical data.

58
New cards

How does the mean change when a constant is added to all values in a dataset?

The mean increases by that constant.

59
New cards

What does it indicate if the mean is less than the median?

It may indicate a right-skewed distribution.

60
New cards

What is the role of standard scores (z-scores)?

Z-scores indicate how many standard deviations a data point is from the mean.

61
New cards

What type of datasets are best represented by the mean?

Datasets that are normally distributed are best represented by the mean.

62
New cards

How does one identify an outlier?

An outlier can be identified if it lies beyond 1.5 times the IQR above Q3 or below Q1.

63
New cards

What is the mean of a dataset with equal values?

The mean will be the same as any value in the dataset.

64
New cards

Why is the mode not appropriate for numerical average calculations?

The mode reflects frequency, not the average value of a dataset.

65
New cards

What concepts should be compared with measures of central tendency?

Dispersion and variability should be compared with measures of central tendency.

66
New cards

What are the characteristics of a normal distribution?

It is symmetric, bell-shaped, and has mean = median = mode.

67
New cards

How can understanding mean, median, and mode help in real-world applications?

It aids in summarizing data, making decisions, and performing statistical analyses.

68
New cards

In analyzing healthcare data, which measure is often preferred?

In healthcare data, the median is often preferred to minimize the influence of outliers.

69
New cards

What are dummy variables?

Dummy variables are used in regression analysis to represent categories with binary values.

70
New cards

What is a scatter plot?

A scatter plot is a graph that shows the relationship between two quantitative variables.

71
New cards

What does a diagonal line on a scatter plot indicate?

A diagonal line indicates a positive or negative correlation between the variables.

72
New cards

How can one interpret high positive correlation?

A high positive correlation means that as one variable increases, so does the other.

73
New cards

What does a negative correlation in a scatter plot imply?

A negative correlation implies that as one variable increases, the other decreases.

74
New cards

What are the essential components of statistical inference?

Statistical inference includes point estimation, confidence intervals, and hypothesis testing.

75
New cards

When is the mean less informative than the median?

The mean may be less informative when dealing with skewed data or outliers.

76
New cards

Why is it critical to understand the shape of distribution?

Understanding the shape helps in choosing the appropriate statistical methods for analysis.