Descriptive Statistics

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/48

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

49 Terms

1
New cards

Qualatative Data

data in the form of words, characteristics. etc. (ex> fav color, birthday month)

2
New cards

Quantitative Data

data in the form of numerical values (ex>height, weight)

3
New cards

Discrete Quantitative Data

Limited values that we can count

4
New cards

Continous Quantitative Data

Values over an interval (normal curve)

5
New cards

univariate data

1 variable

6
New cards

bivariate data

studies the relationship between 2 variables

7
New cards

dot plot

quantitative

<p>quantitative</p>
8
New cards

boxplot

quantitative

<p>quantitative</p>
9
New cards

histogram

quantitative, bars touch, measures the variable against the frequency

<p>quantitative, bars touch, measures the variable against the frequency</p>
10
New cards

stem and leaf plot

quantitative, needs a key

<p>quantitative, needs a key</p>
11
New cards

Cummulative Frequency Plot (ogive)

quantitative, remember when interpretating - it is to the left (say “or less”), 0% = min, 25% = Q1, 50% = med, 75% = Q3, 100% = max, y-axis adds up to 1 (or 100%)

<p>quantitative, remember when interpretating - it is to the left (say&nbsp;“or less”), 0% = min, 25% = Q1, 50% = med, 75% = Q3, 100% = max, y-axis adds up to 1 (or 100%)</p>
12
New cards

qualitative graphical displays

pie chart, bar chart

13
New cards

distribution definition

a set of data that uses the frequency that each outcome occurs among all possibilities (all of the possible outcomes of your data)

14
New cards

measures of central tendency

where center of distribution of data lies

  • mean

  • median

  • mode (The mode is strictly for qualitative data!! You HAVE to use mean or median for quantitative data !!)

15
New cards

measures of spread

amount of variation in distribution

  • range

  • IQR

  • standard deviation

16
New cards

calculator key

n = sample size

<p>n = sample size</p>
17
New cards

shapes of distribution

  • skew right

  • skew left

  • unimodal

  • bimodal

  • symmetric 

  • uniform

  • multi-modal

18
New cards
term image
  • skew right

  • mean>median>mode

  • tail is to the right

  • few data points to the right pull the mean up

  • unimodal (one peak, one mode)

19
New cards
term image
  • skew left

  • mean<median<mode

  • tail is to the left

  • few data points to the left pull the mean down

  • unimodal (one peak, one mode)

20
New cards
term image
  • symmetric

  • bell shape

  • unimodal (one peak, one mode)

  • mean = median = mode

21
New cards
term image
  • uniform

  • about same heights

22
New cards
term image
  • bimodal

  • symmetric vs. nonsymmetric

23
New cards
term image

multi-modal

24
New cards

description: with skewed data use…

  • median

    • median doesn’t get affected by outliers/skewedness (unlike the mean)

  • IQR

  • boxplot

25
New cards

description: with symmetric data use…

  • mean

  • standard deviation

  • histogram

26
New cards

how to make classes on histogram

(not needed)

<p>(not needed)</p>
27
New cards

5 number summary

min, Q1, med, Q3, max

28
New cards

how to find median point

(n+1)/2 

…then count to that point and that is your median (when listed in order)

29
New cards

when making a boxplot…

  • create number line to space everything out evenly

  • intervals are spaced out in 25%’s (this shows the spread, not the number of points)

  • when it is longer, it just means the points are more spread apart from each other (variability)

<ul><li><p>create number line to space everything out evenly</p></li><li><p>intervals are spaced out in 25%’s (this shows the spread, not the number of points)</p></li><li><p>when it is longer, it just means the points are more spread apart from each other (variability)</p></li></ul><p></p>
30
New cards

calculator steps

knowt flashcard image
31
New cards

standard deviation

the average distance each value lies from the mean

  • The more points you have in the middle, the smaller the standard deviation. The less points you have in the middle, the larger the standard deviation.

32
New cards

calculating outliers

  • outliers fall OUTSIDE of this interval

  • can write it as the “usual interval of points”

<ul><li><p>outliers fall OUTSIDE of this interval</p></li><li><p>can write it as the “usual interval of points”</p></li></ul><p></p>
33
New cards

rules when describing data

  1. center

  2. spread

  3. shape

  4. unusual features (gaps, outliers, clusters)

MUST BE IN CONTEXT

34
New cards

symbols

knowt flashcard image
35
New cards

pie chart question

<p></p>
36
New cards

contingency table

2 way table

<p>2 way table</p>
37
New cards

segmented bar graph

knowt flashcard image
38
New cards

relative frequency vs cumulative relative frequency

  • relative frequency: how frequent this number occurs

  • cumulative relative frequency: the frequencies but added together as the data moves up

<ul><li><p><strong>relative frequency: </strong>how frequent this number occurs</p></li><li><p><strong>cumulative relative frequency: </strong>the frequencies but added together as the data moves up</p></li></ul><p></p>
39
New cards

when talking percents

whenever there is “of the __” - that is the denominator

<p>whenever there is&nbsp;“of the __” - that is the denominator</p>
40
New cards

comparitive statements

“greater than, higher, larger, less than, lower, smaller, equal, the same”

41
New cards

Normal Distribution - properties

  • symmetric about the mean

  • curve approaches the horizontal axis, but never touches or crosses it

  • total area under the curve is always equal to ONE

  • area under the curve = probability

  • The Empirical Rule can be applied for ANY normal distribution

  • it is considered UNUSUAL to be more than 2 SD from the mean in either direction

42
New cards

Normal Distribution - graph

  • 68% of data lies within 1 SD of the mean

  • 95% of data lies within 2 SD of the mean

  • 99.7% of data lies within 3 SD of mean

when describing normal distributions, ALWAYS write “approximately” and always state the direction (above/below) you are describing

<ul><li><p>68% of data lies within 1 SD of the mean</p></li><li><p>95% of data lies within 2 SD of the mean</p></li><li><p>99.7% of data lies within 3 SD of mean</p></li></ul><p></p><p><strong>when describing normal distributions, ALWAYS write&nbsp;“approximately” and always state the direction (above/below) you are describing</strong></p><p></p>
43
New cards

percentile

  • that # and to the left (below)

  • The rth percentile is a value such that r% of the observations in the
    data set fall at or below that value

44
New cards

Comparing 2 data sets

  • Back to back stem and leaf

  • Parallel box plots

  • Side by side histograms

  • Write comparative statements between distributions

  • Include context and suggestions!

45
New cards

z-score

  • A z-score tells you exactly how many SD a data value is above or below the mean.

  • z-score is positive when data is above the mean

  • z-score is negative when data is below the mean

  • can only use z-score with normal distribution

<ul><li><p>A z-score tells you exactly how many SD a data value is <u>above</u> or <u>below</u> the mean.</p></li><li><p>z-score is positive when data is above the mean</p></li><li><p>z-score is negative when data is below the mean</p></li><li><p>can only use z-score with normal distribution</p></li></ul><p></p>
46
New cards

calculating probability with z-score

  • draw a graph to visually see it

  • to the left: refer to formula sheet

  • to the right: refer to the formula shete and subtract that from 1

  • between: larger value in table - smaller value in table

<ul><li><p>draw a graph to visually see it </p></li><li><p><u>to the left:</u> refer to formula sheet</p></li><li><p><u>to the right:</u> refer to the formula shete and subtract that from 1</p></li><li><p><u>between:</u> larger value in table - smaller value in table</p></li></ul><p></p>
47
New cards

steps when handling problem w/ z-score

  1. write the info (or do a diagram)

  2. find z-score (round to 2 decimal places)

  3. find probability using table (must have you P statement)

answer in context

<ol><li><p>write the info (or do a diagram)</p></li><li><p>find z-score (round to 2 decimal places)</p></li><li><p>find probability using table (must have you P statement)</p></li></ol><p><strong>answer in context</strong></p><p></p>
48
New cards

quartiles

  • Q1 = median of 1st half of data

  • Q3 = median of 2nd half of data

49
New cards

How does a shift and multiplier affect the mean (measure of center) or SD (measure of spread)?

  • Measures of spread are ONLY affected by multipliers

  • Measures of center and individual values are affected by BOTH shifts and multipliers