Descriptive Statistics: Key Concepts and Excel Applications for Data Analysis

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/39

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

40 Terms

1
New cards

Statistics

A branch of mathematics concerning the analysis, interpretation, and presentation of data.

2
New cards

Descriptive Statistics

Summarizing data, usually with estimates of common parameters (e.g. mean, median, range, etc.) or graphs and data visualization techniques.

3
New cards

Inferential Statistics

Inferential statistics help determine whether trends/patterns found in the current dataset (sample) also apply beyond it.

4
New cards

Histogram

A frequency chart that specifies ranges of numbers for each bar called 'bins'.

<p>A frequency chart that specifies ranges of numbers for each bar called 'bins'.</p>
5
New cards

Average/Mean

The balance point of a dataset.

6
New cards

Standard Deviation

Measure of the average distance that points lie from the mean; for a normal distribution, roughly 70% of the data is within ± 1 standard deviation from the mean and 95% of the data is within ± 2 standard deviations from the mean.

7
New cards

Median

The numeric value separating the higher half of a sample, a population, or a probability distribution, from the lower half.

8
New cards

Mode

The most frequently occurring observation in a dataset.

9
New cards

Range

The difference between the maximum observed data point and the minimum observed data point.

10
New cards

Percentile

An observed value of a given variable below which a certain percent of observations fall.

11
New cards

20th Percentile

The value below which 20 percent of the observations may be found.

12
New cards

First Quartile (Q1)

The 25th percentile.

13
New cards

Median (Q2)

The 50th percentile.

14
New cards

Third Quartile (Q3)

The 75th percentile.

15
New cards

Excel Function for Average

=AVERAGE(Data) to calculate the mean.

16
New cards

Excel Function for Standard Deviation

=STDEV.S(Data) to calculate the standard deviation.

17
New cards

Symmetric Distribution

If the distribution is symmetric (not skewed), then the mean and the median are equivalent.

18
New cards

Data Sorting for Median

Sort the data to find the median, which is the dividing data point where half are below and half above.

19
New cards

Tallest Bar on Histogram

Represents the mode in a frequency distribution.

20
New cards

Percentiles/Quartiles Identification

Approximate percentiles/quartiles can be identified using a histogram.

21
New cards

50th Percentile

The median, which is the value below which 50 percent of the observations may be found.

22
New cards

Common Parameters in Descriptive Statistics

Includes mean, median, mode, range, and standard deviation.

23
New cards

Data Visualization Techniques

Methods used to graphically represent data for better understanding.

24
New cards

Sample Size (n)

The number of data points in your sample.

25
New cards

Average vs. Median

The average is commonly used, but the median is preferred for household incomes as it divides households evenly, providing a more accurate representation.

26
New cards

Mean Household Income

$63,000, according to the US Census Bureau.

27
New cards

Median Household Income

$46,000, according to the US Census Bureau.

28
New cards

Example 1: Incomes for 5 households

35k, 35k, 35k, 35k, 100k; Average Income = 48k, Median Income = 35k.

29
New cards

Example 2: Incomes for 5 households

35k, 35k, 35k, 35k, 500k; Average Income = 128k, Median Income = 35k.

30
New cards

Top 1% Wealth Distribution

The top 1% of the wealth distribution has more wealth than the bottom 50%.

<p>The top 1% of the wealth distribution has more wealth than the bottom 50%.</p>
31
New cards

Max Income

4.9 Billion, with a scale of 1 inch = 50k, and distribution stretching 1.5 miles.

32
New cards

When is the Median Preferred?

The median is preferred when the data is not highly discrete.

33
New cards

When are Means Preferred?

Means are preferred when the data is highly discrete.

34
New cards

Quantitative Variables

Observations that take on numerical values representing different magnitudes.

35
New cards

Categorical Variables

Observations that take on values that are names or labels.

36
New cards

Common Statistic for Quantitative Data

Means are the most common statistic.

37
New cards

Common Statistic for Categorical Data

Proportions (or percentages) in each category are the most common statistic.

38
New cards

Census Data

According to the US Census Bureau, the median income is considerably lower than the average.

39
New cards

Second Quartile (Q2)

The 50th percentile, also known as the median.

40
New cards

Extreme Case for Median

The median equals the more common outcome in binary data but gives no information about the relative number of observations.