Probability: refers to the likelihood of a particular event of interest occurring
Example: tossing a coin with one desired outcome (heads) and two possible outcomes (heads or tails) = 1 ÷ 2 = 0.5 [in percentage = 0.5 * 100 = 50%]
Conditional Probability: the probability of a particular event happening if another event (or set of conditions) has also happened)
Inferential Statistics: techniques employed to draw conclusions from your data.
Standard Normal Distribution (SND): the distribution of z-scores; normally shaped probability distribution which has a mean (as well as median and mode) of zero and a standard deviation of 1
z-scores: aka standardized scores; you can convert any score from a sample into this by subtracting the sample mean from the score and then dividing by the standard deviation
Example:
mean IQ score = 100
IQ standard deviation = 15
your IQ score = 135
z-score = ?
(your IQ score - mean IQ score) ÷ IQ standard deviation = (135 - 100) ÷ 15 = 2.33
Probability Distribution: a mathematical distribution of scores where we know the probabilities associated with the occurrence of every score in the distribution'; we know that the probability is of randomly selecting a particular score or set of scores from the distribution
Advantage: there is a probability associated with each particular score from the distribution.
An important characteristic of probability distributions is that the area under the curve between any specified point represents the probability of obtaining scores within those specified points.
Example: You decided to take a course in pottery and weightlifting. At the end of the courses, you were graded 65% for pottery and 45% for weightlifting. Suppose that the mean and standard deviation for pottery are 56% and 9% and for weightlifting 40% and 4%. Which career would be better for you?
z-score for pottery = (65-56) ÷ 9 = 1
z-score for weightlifting = (45-40) ÷ 4 = 1.25
Sampling Distribution: a hypothetical distribution. it is where you have selected an infinite number of samples from a population and calculated a particular statistic (e.g., a mean) for each one; when you plot all these calculated statistics as a frequency histogram, you have a sampling distribution
Sampling Distribution of the Mean: if you plotted the sample means of many samples from one particular population
Central Limit Theorem: states that as the size of the samples we select increases, the nearer to the population mean will be the mean of the sample means and the closer to normal will be the distribution of the sample means
Point Estimate: a single figure of an unknown number
Interval Estimate: range within which we think the unknown number will fall
Confidence Interval: a statistically determined interval estimate of a population parameter
Standard Error: refers to the standard deviation of a particular sampling distribution; in the context of the sampling distribution of the mean, it is the standard deviation of all of the sample means
Error Bar Chart: a graphical representation of confidence intervals around the mean
Error bar charts simply display your means as a point on a chart and a vertical line through the mean point that represents the confidence interval. The larger the confidence interval, the longer the line is through the mean.
If there is substantial overlap between the 2 sets of confidence intervals, we cannot be sure whether there is a difference in the population means.
When the confidence intervals do not overlap, we can be 95% confident that both population means fall within the intervals indicated and therefore do not overlap. This would suggest that there is a real difference between the population means.
Examining confidence intervals gives us a fair idea of the pattern of means in the populations.