Psych 300A: Midterm 1 Review (Central Tendency & Variability)

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/54

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

55 Terms

1
New cards

Central Tendency

A statistical measure to determine a single score that defines the centre of a distribution

2
New cards

What are the 3 measures of central tedency

  1. Mode

  2. Median

  3. Mean

3
New cards

Mode

The most frequent category/score in a distribution (often the high point in a graph)

4
New cards

In what scales of measurement can mode be used with

Nominal, ordinal and ratio/interval scale (Only scale that can work with nominal)

5
New cards

T or F: You can only have one mode in each dataset

F, there can be multiple if two values have the same number of frequencies

6
New cards

What are the 4 major advantages of the mode

  1. Easy to compute/determine

  2. Value that is observed in dataset (real number from data)

  3. Can be used for all scales of measurement

  4. Not affected by outliers

7
New cards

What are the 2 major disadvantages of the mode

  1. Not used in most statistical computations as it is not useful for making inferences

  2. May not be representative of the entire collection of numbers

8
New cards

What is the measure of variability with the mode

There is none as the variable cannot be quantified

9
New cards

Median

Physical middle of an ordered set of data, also known as the 50th percentile, can be used with ordinal and ratio/interval scale

10
New cards

When is the most common use for the median

When data is extremely skewed

11
New cards

What are the 3 major advantages of the median

  1. Can be computed on data that is ordinal, interval or ratio and is the best measure of central tendency in those ranks

  2. Less biased measure of central tendency when interval/ratio data is skewed

  3. Not affected by outliers or extreme scores

12
New cards

What are the 2 major disadvantages of the median

  1. Not used in statistical computations/can not be used to make inferences

  2. Subject to sampling variation (not stable from sample to sample so we cannot infer anything about population)

13
New cards

What is the measure of variability used with the median

Find the min/max values as well as the range (max value - min value)

14
New cards

Mean

The average value in a dataset, can work with an interval/ratio scale

15
New cards

What is the symbol used for the population mean

Mu (μ)

16
New cards

What is the symbol used for the sample mean

M or x-bar ()

17
New cards

What are the 3 major characteristics of the mean

  1. Changing the value of any score or changing the number of scores will change the mean

  2. Adding or subtracting a constant for each score in a distribution will add or subtract that same constant from the mean

  3. Multiplying or dividing every score in a distribution by a constant will multiply or divide the mean by that same constant

18
New cards

What are the 4 major advantages of the mean

  1. Can be manipulated algebraically

  2. Takes into account quantitative info about each value

  3. Value has the most meaning for interpretation, especially with ratio values

  4. Most common value for inferential stats

19
New cards

What are the 3 major disadvantages of the mean

  1. Applies only to interval/ratio data

  2. Influenced by outliers

  3. Computed value may not reflect any actual value in the dataset

20
New cards

What measure(s) of CT can be found with nominal data

Only the mode

21
New cards

What measure(s) of CT can be found with ordinal data

Median or mode, median is usually the best pick

22
New cards

Variability

Quantitative distance, the measure of the differences between the scores in a distribution

23
New cards

What are the 5 major measures of variability

  1. Range

  2. Semi-interquartile Range (SIQR)

  3. Median Absolute Deviation (MAD)

  4. Variance

  5. Standard Deviation

24
New cards

Range

Distance coved by scores in a distribution from smallest to largest

25
New cards

T or F: Range is calculated differently depending on if the variables are continuous or discrete

T, Discrete values are simply max - min, whereas continuous values are the upper limit of max - upper limit of min

26
New cards

What are the major advantages and disadvantages of range

Advantages: Quick to compute and includes entire distribution of data

Disadvantages: Derived from only two values so spread of data is unknown and sensitive to outliers

27
New cards

Semi-interquartile Range (SIQR)

Half of the range of the middle 50% of observations, calculated by the (third quartile - first quartile) / 2

28
New cards

First quartile

25th percentile, range of the first 25% of data

29
New cards

Third quartile

75th percentile, range between 50% and 75% of data

30
New cards

Interquartile range

third quartile - first quartile

31
New cards

Median Absolute Deviation (MAD)

Absolute measure of how many physical units values deviate from the median, calculated as: MAD = Mdn|X - Mdn| (take the median of the each value - the median of the original data set)

32
New cards

What are the 3 major advantages of the MAD

  1. Takes all scores into account

  2. Less sensitive than SD to extreme scores or skews in data

  3. It is a minimum, we cannot get smaller value even if we take the absolute deviation from another location in the dataset

33
New cards

What are the 2 major disadvantages of the MAD

  1. Provides limited description of variability

  2. Not useful in advanced statistical procedure

34
New cards

Variance

Average squared distance from the mean, one can calculate it by using the formula: V = SD2 = (Σ(x - x̄)2) / N

35
New cards

What is the Sum of Squared Deviations (SS)

Sum of the squared difference between each score and its mean, calculated by: SS = ∑(x −x̄)2

36
New cards

Standard Deviation

Measure of standard/average distance from the mean (how dispersed scores are around the mean), calculated: SD = √SD2 = √variance

37
New cards

What measure of variability can by used with a nominal scale

None, needs quantitative values

38
New cards

What measure of variability can by used with a ordinal scale

Range, SIQR, MAD

39
New cards

What measure of variability can by used with a interval/ratio scale

Range, SIQR, MAD, Variance, SD

40
New cards

What are the two characteristics of standard deviation

  1. Adding or subtracting a constant to every score in a distribution will not change the standard deviation

  2. Multiplying or dividing every score in a distribution by a constant will multiply or divide the standard deviation by the same constant

41
New cards

What are the 3 advantages of SD

  1. Accounts for all scores in a distribution

  2. Good description of variability

  3. Used in many advanced stats

42
New cards

What are the 2 disadvantages of SD

  1. Can only be used with interval/ratio data

  2. Sensitive to extreme scores or outliers and is biased when distributions are skewed

43
New cards

Pearson coefficient of skew (Skewp)

Helps us understand the magnitude of skew, calculated using a skew statistic (in our case the pearson coefficient of skew)

44
New cards

How does one calculate Skewp

Skewp = 3(x̄ - Mdn) / SD

45
New cards

What does the sign of Skewp tell us

The direction of skew (positive or negative)

46
New cards

What does the magnitude Skewp tell us

The degree of skew, no skew and Skewp = 0

47
New cards

What values of Skewp demonstrate a normal distribution and what measure of variability should be used

Between 0 and |.5|, use mean and SD

48
New cards

What values of Skewp demonstrate a mild/moderate skew and what measure of variability should be used

Between |.5| and |1.0|, use mean and SD

49
New cards

What values of Skewp demonstrate a moderate/strong skew and what measure of variability should be used

Between |1.0| and |2.0|, use mean and SD if ≤ |1.5|, else use median and MAD

50
New cards

What values of Skewp demonstrate a severe skew and what measure of variability should be used

Greater than |2.0|, use the median and MAD

51
New cards

T or F: We do not need central tendency and variability to fully grasp the shape of a distribution

F, we need both

52
New cards

Explain what each variable in the formula represents:

X0= Xt ± Xe

X0: represents our observed value

Xt: represents our measure of central tendency

Xe: represents our measure of error

53
New cards

Why is measure of error so important

Allows us to make hypothesis on why something occured

54
New cards

Anscombes quartet

4 datasets with nearly identical summary stats, however the actual datasets are different which can be demonstrated with graphs

55
New cards

What is the overall conclusion we can make about shape, central tendency, variability and data observation

We need all of them if we are to be able to make assumptions about our data