MAT251


76 Terms

1

Data is defined as __________.

individual facts

2

Information is processed, organized, and __________ data that provides useful knowledge.

structured

3

Qualitative data can be categorized, while __________ data takes numerical values.

Quantitative

4

______________ data can take values in categories such as country, gender, or favorite color.

Qualitative

5

The difference between the largest and smallest values in a dataset is known as the __________.

Range

6

A dataset can be divided into __________, which split it into four equal parts.

quartiles

7

In statistics, the __________ is the value that occurs most frequently in a dataset.

Mode

8

The __________ is the average value calculated by summing all data points and dividing by the count.

Mean

9

The measure of __________ helps to describe the variability of data points around the mean.

dispersion

10

A __________ distribution exhibits a longer tail on one side, indicating skewness.

skewed

11

Confidence intervals provide an estimate of __________ characteristics using samples.

population

12

Pearson’s median skewness is calculated using the formula __________.

3 * (Mean - Median) / Standard Deviation
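
As a quick check of the formula above, here is a minimal sketch using Python's standard `statistics` module. The function name `pearson_median_skewness` and the sample values are illustrative, and using the sample (rather than population) standard deviation is an assumption.

```python
import statistics

def pearson_median_skewness(data):
    """Pearson's median skewness: 3 * (mean - median) / standard deviation."""
    mean = statistics.mean(data)
    median = statistics.median(data)
    stdev = statistics.stdev(data)  # sample standard deviation (assumption)
    return 3 * (mean - median) / stdev

# Illustrative data: one large value pulls the mean above the median.
sample = [1, 2, 3, 4, 10]
skew = pearson_median_skewness(sample)
print(skew)  # positive, so the sample is right-skewed
```

A positive result means the mean sits above the median (right skew); a negative result means the mean sits below it (left skew).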

13

Inferential statistics help make predictions about a __________ based on sample data.

population

14

The interquartile range (IQR) is used to measure the __________ of a dataset, focusing on the middle 50%.

dispersion

15

The _____ is the central value of a dataset once the values have been sorted.

Median

16

A standard score, also known as a z-score, indicates how many __________ a certain score is from the mean.

standard deviations
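
The z-score can be sketched in a few lines of Python. The helper name `z_score` and the scores are made up for illustration, and dividing by the population standard deviation is an assumption (a course may use the sample version instead).

```python
import statistics

def z_score(value, data):
    """Number of standard deviations that `value` lies from the mean of `data`."""
    mean = statistics.mean(data)
    stdev = statistics.pstdev(data)  # population standard deviation (assumption)
    return (value - mean) / stdev

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # mean 5, population standard deviation 2
z = z_score(9, scores)
print(z)  # 2.0: the score 9 is two standard deviations above the mean
```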

17

__________ statistics summarize and describe characteristics of a dataset.

Descriptive

18

What is dispersion in statistics?

Dispersion refers to the extent to which data points differ from the mean.

19

Name a common measure of dispersion.

Range, variance, and standard deviation are common measures of dispersion.

20

What does a small standard deviation indicate?

A small standard deviation indicates that data points are close to the mean.

21

What does a large standard deviation signify?

A large standard deviation shows that data points are spread out over a wider range of values.

22

How is variance different from standard deviation?

Variance is the average of the squared differences from the mean; standard deviation is the square root of variance.
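
The relationship can be worked through step by step. This sketch computes the population variance (dividing by the number of points, which is an assumption; the sample variance divides by one less) and then takes its square root. The data values are illustrative.

```python
import math

data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)                   # 5.0
squared_diffs = [(x - mean) ** 2 for x in data]
variance = sum(squared_diffs) / len(data)      # average squared difference: 4.0
std_dev = math.sqrt(variance)                  # square root of variance: 2.0

print(variance, std_dev)
```

Because the standard deviation is the square root of the variance, it is expressed in the same units as the data, while the variance is in squared units.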

23

When is the mode a useful measure?

The mode is useful for categorical data where we want to know the most common category.

24

Can there be more than one mode in a dataset?

Yes, a dataset can be unimodal (one mode), bimodal (two modes), or multimodal (more than two modes).
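
A small sketch of how the number of modes could be checked, using `statistics.multimode` (available in Python 3.8 and later). The helper `describe_modality` is a hypothetical name, and it assumes a non-empty dataset in which at least one value repeats.

```python
import statistics

def describe_modality(data):
    """Return the list of modes and a label for how many there are."""
    modes = statistics.multimode(data)  # every value tied for the highest frequency
    labels = {1: "unimodal", 2: "bimodal"}
    return modes, labels.get(len(modes), "multimodal")

modes, label = describe_modality([1, 2, 2, 3, 3, 4])
print(modes, label)  # [2, 3] bimodal
```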

25

What is the formula for calculating the mean?

Mean = (Sum of all data points) / (Number of data points).

26

What is the primary measure of central tendency?

The mean is often referred to as the primary measure of central tendency.

27

What is the median and how is it calculated?

The median is the middle value in a sorted dataset. If the dataset has an even number of points, it is the average of the two middle points.
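
The odd and even cases described above can be sketched as follows. The function is written out by hand for illustration (Python's `statistics.median` does the same job), and the sample values are made up.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if the count is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # odd count: single middle value
    return (ordered[mid - 1] + ordered[mid]) / 2  # even count: average the middle pair

odd_median = median([7, 1, 5])      # sorted: 1, 5, 7
even_median = median([7, 1, 5, 3])  # sorted: 1, 3, 5, 7
print(odd_median, even_median)      # 5 4.0
```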

28

What does it mean if the mode is higher than the mean?

It may indicate a left-skewed distribution.

29

What is a skewed distribution?

A skewed distribution is one in which data points are not symmetrically distributed around the mean.

30

What can affect the mode of a dataset?

The mode changes when the frequencies of values change, for example when observations are added or removed.

31

What is the relationship between mean and median in a symmetric distribution?

In a symmetric distribution, the mean and median are equal.

32

How do you find the mode in a frequency distribution?

Identify the value with the highest frequency in the distribution.

33

Which measure of central tendency is most affected by outliers?

The mean is most affected by outliers.

34

What is the interquartile range (IQR)?

The IQR is the difference between the third quartile (Q3) and the first quartile (Q1).

35

How does the IQR help in data analysis?

The IQR is used to measure the spread of the central 50% of the data, reducing the effect of outliers.

36

What does a higher IQR indicate?

A higher IQR indicates greater variability and dispersion among the middle values.

37

In what situations is the median preferred over the mean?

The median is preferred when dealing with skewed distributions or when there are outliers.

38

Can the mean be a non-existent value in some datasets?

Yes, the mean is undefined when the dataset is empty or its values are undefined or missing. If all values are 0, the mean is defined and equals 0.

39

What indicates a zero standard deviation?

A zero standard deviation indicates that all values in the dataset are identical.

40

What is a cumulative frequency?

Cumulative frequency is the sum of the frequencies of all data points up to a certain value.
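
One way to build a cumulative frequency table is to count each value and keep a running total. This is a minimal sketch; the function name `cumulative_frequencies` and the data are illustrative.

```python
from collections import Counter
from itertools import accumulate

def cumulative_frequencies(data):
    """Return (value, cumulative frequency) pairs in ascending order of value."""
    counts = Counter(data)                            # frequency of each distinct value
    values = sorted(counts)
    running = accumulate(counts[v] for v in values)   # running total of frequencies
    return list(zip(values, running))

table = cumulative_frequencies([1, 2, 2, 3, 3, 3])
print(table)  # [(1, 1), (2, 3), (3, 6)]
```

The final cumulative frequency always equals the total number of data points.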

41

How can data be visually displayed to represent dispersion?

Box plots and histograms can visually display the dispersion of data.

42

What is a uniform distribution?

A uniform distribution is one where all outcomes are equally likely.

43

What role does mode play in a bimodal distribution?

In a bimodal distribution, the two modes mark the two most frequently occurring values (the two peaks).

44

What is the purpose of calculating the range?

Calculating the range provides a measure of the spread of values in a dataset.

45

How does one calculate the interquartile range?

IQR = Q3 - Q1.

46

What does it mean if the mode is equal to the mean and median?

It suggests a symmetric distribution, such as the normal distribution.

47

When analyzing data, why is it important to consider dispersion?

Considering dispersion helps understand data variability and consistency.

48

What is the difference between absolute and relative dispersion?

Absolute dispersion is expressed in the data's own units (for example, the range or standard deviation), while relative dispersion is expressed as a fraction or percentage of the mean.
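
A common relative measure is the coefficient of variation, the standard deviation divided by the mean. The card does not name it, so treat this as one illustrative example; the use of the population standard deviation and the data values are assumptions.

```python
import statistics

def coefficient_of_variation(data):
    """Relative dispersion: standard deviation as a fraction of the mean."""
    return statistics.pstdev(data) / statistics.mean(data)

data = [2, 4, 4, 4, 5, 5, 7, 9]        # mean 5, population standard deviation 2
absolute = statistics.pstdev(data)      # absolute dispersion, in the data's units
relative = coefficient_of_variation(data)
print(absolute, relative)               # 2.0 and 0.4, i.e. 40% of the mean
```

Because it has no units, the relative measure lets you compare the spread of datasets measured on different scales.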

49

How do outliers affect the mean?

Outliers can significantly distort the mean, making it less representative of the dataset.

50

What is a probability distribution?

A probability distribution is a mathematical function that provides the probabilities of occurrence of different possible outcomes.

51

What does a box plot display?

A box plot displays the distribution of data based on a five-number summary: minimum, first quartile, median, third quartile, and maximum.
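
The five-number summary behind a box plot can be computed with the standard library. This is a sketch: `five_number_summary` is a hypothetical helper, and quartile values depend on the convention used (here the default "exclusive" method of `statistics.quantiles`), so other tools may report slightly different Q1 and Q3.

```python
import statistics

def five_number_summary(data):
    """Minimum, first quartile, median, third quartile, and maximum."""
    q1, median, q3 = statistics.quantiles(data, n=4)  # three cut points for four parts
    return {"min": min(data), "q1": q1, "median": median, "q3": q3, "max": max(data)}

summary = five_number_summary([1, 2, 3, 4, 5, 6, 7, 8])
print(summary)
```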

52

What is normalized data?

Normalized data adjusts values measured on different scales to a common scale.
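
Min-max scaling is one common way to put values on a shared 0-to-1 scale. The card does not say which method is meant, so this sketch is only one possibility; the function name and the choice to map constant data to 0.0 are assumptions.

```python
def min_max_normalize(values):
    """Rescale values linearly so the smallest becomes 0.0 and the largest 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: no spread to rescale (returning zeros is an assumption).
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

scaled = min_max_normalize([10, 20, 30])
print(scaled)  # [0.0, 0.5, 1.0]
```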

53

What statistical methods help deal with outliers?

Winsorizing, trimming, and using median-based methods help deal with outliers.
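
Trimming drops the most extreme values, while winsorizing replaces them with the nearest remaining value. A minimal sketch of both, assuming `k` values are handled at each end and that `2 * k` is smaller than the number of data points (the function names and data are illustrative):

```python
def trimmed_mean(data, k):
    """Mean after discarding the k smallest and k largest values."""
    ordered = sorted(data)
    kept = ordered[k:len(ordered) - k]
    return sum(kept) / len(kept)

def winsorize(data, k):
    """Clamp the k smallest and k largest values to the nearest remaining value."""
    ordered = sorted(data)
    lo, hi = ordered[k], ordered[-k - 1]
    return [min(max(x, lo), hi) for x in data]

data = [1, 2, 3, 4, 100]
trimmed = trimmed_mean(data, 1)  # mean of [2, 3, 4]
clamped = winsorize(data, 1)     # 1 becomes 2, 100 becomes 4
print(trimmed, clamped)          # 3.0 [2, 2, 3, 4, 4]
```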

54

What is the purpose of confidence intervals?

Confidence intervals give a range of plausible values for a population parameter, quantifying the uncertainty in a sample estimate.
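
As a rough illustration, a confidence interval for a mean can be sketched with a normal approximation. This assumes a reasonably large sample; for small samples a t-distribution is normally used instead. The function name and the measurements are made up.

```python
import math
import statistics

def mean_confidence_interval(sample, confidence=0.95):
    """Approximate confidence interval for the population mean (normal approximation)."""
    n = len(sample)
    mean = statistics.mean(sample)
    sem = statistics.stdev(sample) / math.sqrt(n)              # standard error of the mean
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    return mean - z * sem, mean + z * sem

sample = [4.8, 5.1, 5.0, 4.9, 5.2]
low, high = mean_confidence_interval(sample)
print(low, high)
```

A wider interval signals more uncertainty about the population mean; larger samples shrink the standard error and narrow the interval.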

55

When describing a dataset, why use the mode?

The mode identifies the most common value, which can highlight trends in the data.

56

How can measures of central tendency mislead?

If examined in isolation without measures of dispersion, they may not provide a complete picture of the data.

57

What is a histogram?

A histogram is a graphical representation of the distribution of numerical data.

58

How does the mean change when a constant is added to all values in a dataset?

The mean increases by that constant.
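
A quick numerical check of this property, with illustrative values:

```python
import statistics

data = [3, 5, 7]
constant = 10
shifted = [x + constant for x in data]   # add the same constant to every value

mean_before = statistics.mean(data)      # 5
mean_after = statistics.mean(shifted)    # 15
print(mean_after - mean_before)          # 10, the constant that was added
```

The spread is unchanged by the shift, since every value moves by the same amount.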

59

What does it indicate if the mean is less than the median?

It may indicate a left-skewed (negatively skewed) distribution, because low values in the left tail pull the mean below the median.

60

What is the role of standard scores (z-scores)?

Z-scores indicate how many standard deviations a data point is from the mean.

61

What type of datasets are best represented by the mean?

Datasets that are normally distributed are best represented by the mean.

62

How does one identify an outlier?

A value is commonly flagged as an outlier if it lies more than 1.5 times the IQR above Q3 or below Q1.
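
The 1.5 × IQR rule can be sketched as follows. The function name `iqr_outliers` and the data are illustrative, and the quartiles come from the default method of `statistics.quantiles`; other quartile conventions can move the fences slightly.

```python
import statistics

def iqr_outliers(data):
    """Values lying more than 1.5 * IQR below Q1 or above Q3."""
    q1, _, q3 = statistics.quantiles(data, n=4)
    iqr = q3 - q1                 # interquartile range: Q3 - Q1
    lower = q1 - 1.5 * iqr        # lower fence
    upper = q3 + 1.5 * iqr        # upper fence
    return [x for x in data if x < lower or x > upper]

outliers = iqr_outliers([10, 12, 11, 13, 12, 11, 14, 100])
print(outliers)  # [100]
```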

63

What is the mean of a dataset with equal values?

The mean will be the same as any value in the dataset.

64

Why is the mode not appropriate for numerical average calculations?

The mode reflects frequency, not the average value of a dataset.

65

What concepts should be compared with measures of central tendency?

Dispersion and variability should be compared with measures of central tendency.

66

What are the characteristics of a normal distribution?

It is symmetric, bell-shaped, and has mean = median = mode.

67

How can understanding mean, median, and mode help in real-world applications?

It aids in summarizing data, making decisions, and performing statistical analyses.

68

In analyzing healthcare data, which measure is often preferred?

In healthcare data, the median is often preferred to minimize the influence of outliers.

69

What are dummy variables?

Dummy variables are used in regression analysis to represent categories with binary values.
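
A minimal sketch of dummy coding, turning one categorical column into 0/1 indicator columns. The helper `dummy_encode` and the color values are hypothetical. In a regression, one category is usually dropped as the baseline to avoid redundant columns; this sketch keeps all of them for clarity.

```python
def dummy_encode(values):
    """Map each category to a dict of 0/1 indicators, one key per distinct category."""
    categories = sorted(set(values))
    return [{c: int(v == c) for c in categories} for v in values]

rows = dummy_encode(["red", "blue", "red"])
print(rows)
# [{'blue': 0, 'red': 1}, {'blue': 1, 'red': 0}, {'blue': 0, 'red': 1}]
```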

70

What is a scatter plot?

A scatter plot is a graph that shows the relationship between two quantitative variables.

71

What does a diagonal line on a scatter plot indicate?

Points clustered along an upward-sloping diagonal line indicate a positive correlation; points along a downward-sloping line indicate a negative correlation.

72

How can one interpret high positive correlation?

A high positive correlation means that as one variable increases, the other tends to increase as well.

73

What does a negative correlation in a scatter plot imply?

A negative correlation implies that as one variable increases, the other tends to decrease.
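
The direction of a correlation can be checked numerically with Pearson's correlation coefficient, which ranges from -1 to 1. The cards do not name a specific coefficient, so this is an illustrative choice; the function is written out by hand and the data are made up.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

r_pos = pearson_r([1, 2, 3, 4], [2, 4, 6, 8])  # y rises with x: close to +1
r_neg = pearson_r([1, 2, 3, 4], [8, 6, 4, 2])  # y falls as x rises: close to -1
print(r_pos, r_neg)
```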

74

What are the essential components of statistical inference?

Statistical inference includes point estimation, confidence intervals, and hypothesis testing.

75

When is the mean less informative than the median?

The mean may be less informative when dealing with skewed data or outliers.

76

Why is it critical to understand the shape of distribution?

Understanding the shape helps in choosing the appropriate statistical methods for analysis.
