chapter1_What-is-statistics

24 Terms

1

What is statistics?

The field of statistics is the practice and study of collecting and analyzing data.

2

What is a summary statistic?

A summary statistic is a single value that describes or summarizes some aspect of a data set, such as its mean.

3

What can statistics analyze regarding product purchases?

Statistics can analyze how likely someone is to purchase a product, and whether different payment systems affect purchasing likelihood.

4

What are descriptive statistics?

Descriptive statistics describe and summarize data.

5

What are inferential statistics?

Inferential statistics use a sample of data to make inferences about a larger population.
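
As a rough illustration of inference from a sample, the sketch below estimates a population mean from a random sample. The population values, the sample size of 500, and the fixed seed are all hypothetical choices made for the example.

```python
import random
import statistics

random.seed(42)  # fixed seed so the sketch is repeatable

# Hypothetical population: 10,000 customer order values between 5 and 100.
population = [random.uniform(5, 100) for _ in range(10_000)]

# Draw a random sample without replacement.
sample = random.sample(population, 500)

# The sample mean serves as an estimate of the population mean.
sample_mean = statistics.mean(sample)
population_mean = statistics.mean(population)

print(f"sample mean:     {sample_mean:.2f}")
print(f"population mean: {population_mean:.2f}")
```

In practice the population mean is unknown, which is why the sample estimate is needed; it is computed here only to show how close the estimate lands.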

6

What types of data are there?

There are numeric (quantitative) data, which can be continuous or discrete, and categorical (qualitative) data, which can be nominal or ordinal.

7

How can categorical data be represented?

Categorical data can be encoded as numbers. For nominal data the numbers are only labels with no order, while for ordinal data the numbers preserve the order of the categories.
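
A minimal sketch of numeric encoding, using plain dictionaries. The category names and the code assignments are hypothetical; any consistent mapping would do for the nominal case.

```python
# Ordinal data: the codes must respect the order small < medium < large.
sizes = ["small", "large", "medium", "small"]
size_order = {"small": 0, "medium": 1, "large": 2}
size_codes = [size_order[s] for s in sizes]

# Nominal data: the codes are arbitrary labels, so their order carries no meaning.
colors = ["red", "blue", "red", "green"]
color_labels = {c: i for i, c in enumerate(sorted(set(colors)))}
color_codes = [color_labels[c] for c in colors]

print(size_codes)   # [0, 2, 1, 0]
print(color_codes)  # [2, 0, 2, 1]
```

Comparing size codes (0 < 2) is meaningful; comparing color codes is not.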

8

Why does data type matter in statistics?

Data type informs the methods of analysis, including how to calculate summary statistics and create plots.

9

What is the mean?

The mean is the average of a set of values.

10

What is the median?

The median is the middle value of a dataset when sorted.

11

What is the mode?

The mode is the most frequent value in a dataset.
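
The three measures of center (mean, median, mode) can be compared on one small data set. This sketch uses Python's standard `statistics` module; the order counts are hypothetical.

```python
import statistics

# Hypothetical sample: daily coffee orders, with one unusually busy day.
orders = [12, 15, 15, 18, 20, 22, 95]

mean_orders = statistics.mean(orders)      # sum of values / number of values
median_orders = statistics.median(orders)  # middle value after sorting
mode_orders = statistics.mode(orders)      # most frequent value

print(mean_orders)    # about 28.14
print(median_orders)  # 18
print(mode_orders)    # 15
```

The single large value (95) pulls the mean well above the median, which is why the median is often preferred for data with extreme values.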

12

What is variance?

Variance measures the average squared distance from each data point to the data's mean.
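
To make the definition concrete, here is a step-by-step sketch in plain Python. The data values are hypothetical. It shows both the population form (divide by n) and the sample form (divide by n - 1).

```python
# Hypothetical data set
values = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(values)
mean = sum(values) / n  # 5.0

# Squared distance from each point to the mean
squared_distances = [(x - mean) ** 2 for x in values]

population_variance = sum(squared_distances) / n    # 32 / 8 = 4.0
sample_variance = sum(squared_distances) / (n - 1)  # 32 / 7, about 4.57

print(population_variance, sample_variance)
```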

13

What does standard deviation measure?

Standard deviation measures the amount of variation or dispersion of a set of values. It is the square root of the variance, so it is expressed in the same units as the data.
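
Standard deviation is the square root of variance. A short sketch with the standard `statistics` module, on hypothetical data:

```python
import math
import statistics

# Hypothetical data set
values = [2, 4, 4, 4, 5, 5, 7, 9]

population_sd = statistics.pstdev(values)  # square root of population variance
sample_sd = statistics.stdev(values)       # square root of sample variance

print(population_sd)  # 2.0
print(sample_sd)      # about 2.14

# Cross-check against the variance functions in the same module.
assert math.isclose(population_sd, math.sqrt(statistics.pvariance(values)))
assert math.isclose(sample_sd, math.sqrt(statistics.variance(values)))
```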

14

What is the interquartile range (IQR)?

The IQR is the difference between the third and first quartiles, Q3 - Q1. In a boxplot, it is the height of the box.

15

How do we identify outliers in data?

A data point is an outlier if it is less than Q1 - 1.5 × IQR or greater than Q3 + 1.5 × IQR.
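
The IQR and the 1.5 × IQR outlier rule can be sketched with NumPy. The data are hypothetical, and `np.quantile` is used with its default linear interpolation; other quartile conventions can give slightly different cutoffs.

```python
import numpy as np

# Hypothetical data with one extreme value
data = np.array([12, 15, 15, 18, 20, 22, 95])

q1, q3 = np.quantile(data, [0.25, 0.75])  # first and third quartiles
iqr = q3 - q1

lower = q1 - 1.5 * iqr  # anything below this is an outlier
upper = q3 + 1.5 * iqr  # anything above this is an outlier

outliers = data[(data < lower) | (data > upper)]

print(q1, q3, iqr)   # 15.0 21.0 6.0
print(lower, upper)  # 6.0 30.0
print(outliers)      # [95]
```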

16

What is the purpose of boxplots?

Boxplots are used to visually represent the distribution of a dataset and highlight its quartiles.

17

What is the purpose of histograms in statistics?

Histograms display the distribution of data values across intervals.
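
The counting behind a histogram can be sketched with `np.histogram`, which returns the number of values in each interval without drawing anything. The ages, the four bins, and the 20 to 60 range are hypothetical choices.

```python
import numpy as np

# Hypothetical ages of survey respondents
ages = np.array([21, 22, 25, 27, 31, 34, 35, 38, 42, 47, 51, 58])

# Four equal-width intervals covering 20 to 60
counts, edges = np.histogram(ages, bins=4, range=(20, 60))

# Print a simple text version of the histogram
for left, right, count in zip(edges[:-1], edges[1:], counts):
    print(f"{left:.0f}-{right:.0f}: {'#' * int(count)}")
```

A plotting library would draw the same counts as bars; the intervals and frequencies are what define the histogram.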

18

What does a left-skewed distribution indicate?

A left-skewed distribution has a longer tail on the left side, so the mean is typically less than the median.
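
A small sketch of how a left tail pulls the mean below the median. The exam scores are hypothetical: most are high, and a few low scores form the tail.

```python
import statistics

# Hypothetical exam scores with a tail of low values on the left
scores = [35, 50, 78, 82, 85, 88, 90, 92, 95]

mean_score = statistics.mean(scores)      # dragged down by the low scores
median_score = statistics.median(scores)  # unaffected by how low the tail goes

print(f"mean:   {mean_score:.2f}")  # 77.22
print(f"median: {median_score}")    # 85
```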

19

What is a typical value in a dataset?

A typical value is described by a measure of center, such as the mean, median, or mode.

20

What statistical function would you use to calculate the mean in Python?

You would use np.mean() to calculate the mean.
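
A minimal usage sketch of `np.mean()`, on hypothetical values. The `axis` argument is part of NumPy's API and selects the dimension to average over.

```python
import numpy as np

# Hypothetical prices
prices = np.array([4.5, 5.0, 5.5, 6.0, 9.0])
mean_price = np.mean(prices)
print(mean_price)  # 6.0

# With a 2-D array, axis=0 averages down each column.
table = np.array([[1, 2, 3],
                  [4, 5, 6]])
column_means = np.mean(table, axis=0)
print(column_means)  # [2.5 3.5 4.5]
```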

21

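What function would you use to find the median in Python?

You would use np.median() from NumPy to find the median. NumPy is a third-party library rather than a built-in; the standard library equivalent is statistics.median().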
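
A short sketch comparing NumPy's `np.median()` with the standard library's `statistics.median()`, on hypothetical values. With an even number of values, both average the two middle values.

```python
import statistics

import numpy as np

# Hypothetical values, deliberately unsorted
values = [7, 1, 3, 10]

np_median = np.median(values)            # NumPy (third-party library)
std_median = statistics.median(values)   # Python standard library

# Sorted values are 1, 3, 7, 10, so the median is (3 + 7) / 2.
print(np_median, std_median)  # 5.0 5.0
```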

22

What is the recommended function in Python for calculating variance?

You can use np.var() to calculate variance, setting ddof=1 for sample variance.
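
The effect of `ddof` can be seen by calling `np.var()` both ways on the same hypothetical data. The default, `ddof=0`, divides by n (population variance); `ddof=1` divides by n - 1 (sample variance).

```python
import numpy as np

# Hypothetical data set
values = np.array([2, 4, 4, 4, 5, 5, 7, 9])

population_variance = np.var(values)      # ddof=0 by default: divide by n
sample_variance = np.var(values, ddof=1)  # divide by n - 1

print(population_variance)  # 4.0
print(sample_variance)      # about 4.57
```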

23

How do you calculate the mean absolute deviation?

Mean absolute deviation is calculated as the average of the absolute distances from the mean.
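
A minimal sketch of the calculation with NumPy, on hypothetical data. Unlike variance, the distances are not squared, so large deviations are not penalized more heavily than small ones.

```python
import numpy as np

# Hypothetical data set
values = np.array([2, 4, 4, 4, 5, 5, 7, 9])

distances = values - np.mean(values)  # signed distance from the mean (5.0)
mad = np.mean(np.abs(distances))      # average of the absolute distances

print(mad)  # 1.5
```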

24

What does a histogram show?

It shows the distribution of numerical data by depicting the frequency of data points in intervals.