HLSC/PSYC 3450: Descriptive Statistics & Normal (Bell) Curve

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/20

Earn XP

Description and Tags

Flashcards covering descriptive statistics, measures of central tendency, measures of variability, the normal distribution, Z-scores, and shapes of distributions like skewness and kurtosis.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

21 Terms

New cards

Descriptive Statistics

Statistics used to describe data by summarizing and organizing it.

New cards

Inferential Statistics

Statistics used to determine the likelihood of pure randomness explaining data, often for making generalizations about a population based on a sample.

New cards

types of descriptive statistics

measures of central tendency - one number to represent all numbers
measures of variability - how the data varies, also called measure of spread or dispersion
statistics for describing shapes of distributions

New cards

Measures of Central Tendency

Statistics that describe the center of a distribution of values, providing a single number about which a group of numbers cluster, often called 'averages'.
mean, median, mode
tells the most central value and how other values vary around the central value

New cards

Mode

The value that occurs most frequently in a set of data, best for nominal and dichotomous data.
if there are two highest frequently occurring values, that is bimodal
if there are three or more highest frequently occurring values, that is multimodal

New cards

Median

The middle value or score in a sorted set of data (ascending or descending order), with half of the scores falling above and half below; best for ordinal data.
to get this, you need to order the data. if there are two values in the middle, you add them together and divide by 2
you can use the cumulative percentage and find the value where the 50% threshold is at

New cards

Mean

The arithmetic average of a set of scores, calculated by adding all raw scores and dividing by the number of scores; the most commonly used measure of central tendency, best for normal/scale data.
sensitive to every single number in distribution - can be heavily influenced by outliers

New cards

Outliers

Scores far higher or lower than most others in a dataset, which can significantly influence the mean.

New cards

Measures of Variability

Statistics that describe how the values of a variable are spread out or dispersed, or how much the values vary from each other.
the less variability there is, the less spread out the values are, the better the dataset

New cards

Number of Categories

A measure of variability best suited for nominal data.

New cards

Range

A measure of variability that represents the difference between the highest and lowest values in a dataset, best for ordinal data.

New cards

Standard Deviation (SD)

A commonly used measure of the spread or dispersion of scores, representing the 'average distance' between each score and the mean. A smaller SD means values are less spread out. the higher the standard deviation, the more spread out the data is
also called frequency distribution, average distance, average mean

New cards

Z-scores (Standardized Scores)

Scores that tell you the number of standard deviations a raw score or value is from the mean (above or below) - calculated using mean and standard deviation
used to compare values from different distributions - ex comparing income of two employed working in different cities
they have a mean of 0 and a standard deviation of 1.
raw score - group mean, divided by group standard deviation . that gives you the z score
negative z score means actual raw score is below the mean
positive z score means actual raw score is above the mean

New cards

Normal Distribution (Normal Curve, Bell Curve)

A theoretical distribution that is bell-shaped, symmetrical with its peak in the middle, and has most scores in the middle with fewer at the extreme ends. In a normal distribution, the mean, median, and mode are the same.

New cards

Central Limit Theorem

A theorem stating that data are often distributed approximately as the normal curve when the sample size is large - larger the sample, the closer to a normal curve it will be.

New cards

Skewness

A statistic describing the asymmetry of a distribution, indicating how its shape differs from a normal curve.

New cards

Symmetrical Distribution

A distribution where the mean, median, and mode are equal, indicating no skewness.

New cards

Positive Skewness

A distribution where the tail trails off to the right, and the mean is higher than the median, which is also higher than the mode.

New cards

Negative Skewness

A distribution where the tail trails off to the left, and the mean is lower than the median, which is also lower than the mode.

New cards

Kurtosis

A statistic describing how peaked or flat the centre of a distribution is, and how flat or skinny its tails are, relative to a normal distribution.
leptokurtic - thin tails and higher centre
mesokurtic - normal curve
platykurtic - flat tails, middle is too short

New cards

why does normal distribution matter

lots of tests - especially parametric tests - can only be used for normally distributed variables or data
if continuous data are not normally distributed, then you have to use nonparametric (less powerful) tests - nominal and ordinal data are assumed to be skewed