Stats Exam 2 Terms

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/110

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:03 AM on 3/21/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

111 Terms

1
New cards

Sample design

specific plan or protocol for how individuals will be selected or sampled from a population of interest. Must address the following two questions: 1. Does the order of selecting participants matter? 2. Do we replace each selection before the next draw?

2
New cards

What are the 2 strategies for sampling?

Theoretical sampling and

Experimental sampling

3
New cards

example of sampling without order

If order does not matter, you can sample participants in any order which means sample A, C, D is seen as equivalent/redundant to sample C, A, D

4
New cards

Sampling without replacement

-when sampling, each participant selected is not replaced before the next selection

◦ Most common method used in behavioral research

5
New cards

example of sampling without replacement

Sample without replacement: A, D, C (no repeats) ◦ The odds of selection increase each round of selection ◦ Odds of selecting A in round 1 is ¼ ◦ If not already selected, A has a 1/3 chance in round 2. ◦ If not selected already, A has a ½ chance of selection in round 3.

(Basically once a participant (letter) is selected, it wont be selected again and it heightens the chances of the other participants (letters) being chosen)

6
New cards

Theoretical sampling

-sampling strategy of used in development of statistical theory

-In theoretical sampling, the order of selecting individuals matters and each individual selected is replaced before sampling again To determine the number of samples of any size that can be selected from a population:

◦ Total number of samples possible = N^n

◦ Example--if we had samples of two participants (n = 2) from a population of 3 people (N = 3): 3^2 = 9 samples

7
New cards

Experimental sampling

-most commonly used in experimental research

In experimental sampling, the order of selecting individuals does not matter and each individual selected is not replaced before selecting again To determine the number of samples of any size that can be selected from a population:

◦ Total number of samples possible = N!/n! (N-n)!

Example--if we select as many samples of two participants as possible (n = 2) from a population of three people (N = 3): 3!/2!(3-2)!

8
New cards

What is a Factorial sign? (!)

We are going to take the product of a number by every number between that number and zero (not including 0) EX: 4! = 4 x 3 x 2 x 1 = 24

9
New cards

Sampling distributions

a distribution of all possible sample means or variances that could be obtained in samples of a given size from the same population

◦You can then compare the statistics obtained in the samples to the value of the mean and variance in the hypothetical population

10
New cards

unbiased estimator

The mean of its sampling distribution is equal to the true value of the parameter being estimated

11
New cards

Central Limit Theorem

the distribution of sample averages tends to be normal regardless of the shape of the process distribution

12
New cards

standard errors of the means

the standard deviation of the distribution of sample means It measures the average distance between a sample mean and the population mean Provides a measure of how accurately, on average, a sample mean represents its corresponding population mean.

13
New cards

sampling error

-the extent to which sample means selected from the same population differ from one another

14
New cards

what is a z transformation?

A sampling distribution can be converted to a standard normal distribution by applying the z transformation

-z transformation is used to determine the likelihood of measuring a particular sample mean from a population with a given mean and variance

15
New cards

What decreases standard error?

-As the population standard deviation decreases, standard error decreases

-As sample size increases, standard error decreases (Law of Large numbers)

16
New cards

What are the main characteristics of the sample mean?

1. The sample mean is an unbiased estimator.

2. The distribution of sample means follows the central limit theorem.

3. A distribution of sample means has minimum variance.

17
New cards

standard normal distribution

a normal distribution with a mean of zero and a standard deviation of one

18
New cards

Z transformation formula (population scores)

knowt flashcard image
19
New cards

Z transformation formula (Sample scores)

knowt flashcard image
20
New cards

What is hypothesis testing?

the method for testing a hypothesis about a population parameter, using sample statistics

• The goal is to test questions, claims and ideas about a population

• Specifically, we are predicting the probability (p) that a population parameter, such as the mean, is likely to be true

21
New cards

What is the goal of a hypothesis test?

Determine whether our observed difference - between our sample mean and what we expect from the population - is large enough that we can rule out random chance as the cause

22
New cards

What are the 4 steps of hypothesis testing?

1. State the null and alternative hypothesis

2. Set the criteria for a decision (find the critical test value)

3. Compute the test statistic

4. Make a decision to accept or reject the null hypothesis and interpret)

23
New cards

What is a Null hypothesis?

-H0 = "H naught"

-considered the "status quo"

-what the world already thinks is true

-statement about the population parameter (such as the mean) that is assumed to be true

24
New cards

What is an alternative hypothesis?

-H1

-a challenge to the status quo

-what may actually be true

-statement that contradicts the null hypothesis

25
New cards

Directionality

-indication the direction of the difference of the H1 from the H0

-The word "smaller" (or larger) indicates direction of the difference

26
New cards

Level of significance

Criterion of judgment upon which a decision is made regarding the value stated in a null hypothesis

-the threshold for accepting/rejecting the null hypothesis

-Based on the probability in behavioral research studies of obtaining a statistic measured in a sample if the value stated in the null hypothesis were true

27
New cards

What is the most common level of significance? (traditionally)

5% or .05

-When the probability of obtaining a sample mean is less than 5% if the null were true, we reject the null (p < .05)

28
New cards

What are p-values?

-probability of getting study estimate when there is really no association because of sampling error

-in more recent years there has been a push to lower this number to p < .01 (1%) or even p < .001 (.1%). In other words, we are less likely to mistakenly reject the null hypothesis if we set the level of significance lower.

29
New cards

What is the test statistic?

-A standardized value that is calculated from sample data during a hypothesis test.

◦ The value of the test statistic can be used to make a decision regarding null hypothesis

◦ Helps determine how likely the sample outcome is if the population mean stated in the null is true

◦ The larger the value of the test statistic, the further a sample mean deviates from the population mean stated in null hypothesis

30
New cards

How do you make a decision to reject the null hypothesis?

-the sample mean is associated with low probability of occurrence if the null is true ◦ p value <.05 (or the set level of significance); reached significance

31
New cards

How do you make a decision to retain the null hypothesis?

the sample mean is associated with high probability of occurrence when null is true ◦ p value >.05 (or the set level of significance); significance failed to reach

32
New cards

What are the 4 possible outcomes when conducting the 4th step of a hypothesis test?

1. Decision to retain the null is correct

2. Decision to retain the null is incorrect (Type II or β error)

3. Decision to reject the null is correct

4. Decision to reject the null is incorrect (Type I error)

33
New cards

Type 1 error

rejecting the null hypothesis when it is true (false positive)

-Alpha level (α)--level of significance or criterion of a hypothesis test ◦ Researchers control for Type I error by stating a level of significance (also called an alpha level) ◦ The significance level is the largest probability of committing Type I error that we will allow and still decide to reject the null hypothesis ◦ Criterion is usually set at .05 ◦ α level is compared to the p value in making a decision

34
New cards

Type 2 error (Beta Error)

-Accepting null hypothesis when you should have rejected it (Null IS false but you incorrectly retain the null)

-Power--the probability of rejecting a false null hypothesis

◦ It is the likelihood that we will detect an effect, assuming an effect exists

35
New cards

What is a one-sample z test?

Used to test hypotheses concerning the mean in a single population with known population variance . Can be directional or non-directional.

36
New cards

Non-directional z test

Two-Tailed Tests (H1: ≠): Alternative hypothesis is stated as not equal to (≠) the null

-divide significance level in half for two-tailed test before locating score

-Interested in any alternative from null hypothesis

37
New cards

Directional z test

One-Tailed Tests (H1: >) or (H1: <): Alternative hypothesis is stated as greater (>) or less than (<) null

◦ Upper tail critical test (H1 > H0)

◦ Lower tail critical test (H1 < H0)

38
New cards

How do you compute a z statistic?

Formula is the sample mean minus the population mean stated in the null hypothesis divided by the standard error of the mean

39
New cards

How do you find the standard error of the mean?

The population standard deviation(SD) divided by the square root of the sample size (n)

40
New cards

When is a Z test performed?

We can only do a z test if we know the population mean and error of the mean the standard (which requires the population standard deviation)

41
New cards

William Gossett

-Statistician known for developing the t test.

-We substitute the population variance with the sample variance in the formula for standard error This substitution, called the estimated standard error, is the denominator of the test statistic for a t test This is acceptable because sample variance is an unbiased estimator of the population variance

42
New cards

Estimated standard error

an estimate of the standard deviation of a sampling distribution of sample means selected from a population with an unknown variance.

◦ It is an estimate of the standard error or standard distance that sample means deviate from the value of the population mean stated in the null hypothesis

43
New cards

What is a t-statistic?

Used to determine the number of standard deviations in a t distribution that a sample deviates from the mean value or difference stated in the null

44
New cards

What is the t distribution?

-Using the estimated standard error in the denominator of the test statistic led to a new sampling distribution

-It is like a normal distribution, but with greater variability in the tails

45
New cards

What are degrees of freedom? (df)

the number of independent components in a control system and the number of ways each component can vary

-The df for a t distribution are equal to the df of the sample variance: n - 1. ◦ As the n increases, sample variance more closely resembles population variance ◦ As n increases, the probability of outcomes in the tails become less likely and the tails approach the x-axis faster

46
New cards

What are the steps of a t-test?

Step 1: State the hypotheses

Step 2: Set the criteria for making a decision

Step 3: Compute the test statistic

Step 4: Make a decision

(the same as a hypothesis test)

47
New cards

what is a t-test?

-A statistical procedure used to test hypotheses concerning a single group mean in a population with an unknown variance

-statistical measure, the difference between TWO mean scores

(TIP: "t" stands for two)

48
New cards

What are the 3 assumptions made when conducting a one sample t-test?

Normality, Random sampling, and independence

49
New cards

Normality

assume data in the population being sampled are normally distributed

50
New cards

Random sampling

assume that the data were obtained using a random sampling procedure

51
New cards

independence

assume that probabilities of each observation in a study are independent

52
New cards

what are effect sizes?

Effect sizes are a measure of how important the difference between group means is (or how meaningful a relationship between 2 variables is)

53
New cards

Cohen's D

Cohen's d is a measure of the number of standard deviations an effect shifted above or below the population mean stated in the null hypothesis. We only report Cohen's d when the null hypothesis is rejected .

54
New cards

Estimated Cohen's d

Estimated Cohen's d-measure of effect size in terms of the number of SDs that mean scores shift above or below the population mean stated by the null hypothesis ◦ The larger the value of d, the larger the effect

55
New cards

Cohen's D (z test)

d =sample mean minus population mean, divided by population standard deviation

56
New cards

Estimated Cohen's D (t-test)

d= sample mean minus population mean, divided by sample standard deviation

57
New cards

What are the 3 measures of effect size for T-tests?

-Estimated Cohen's d

◦ Eta-Squared (proportion of variance)

◦ Omega-Squared (proportion of variance)

58
New cards

Proportion of variance

measure of effect size in terms of the proportion or percent of variability in a dependent variable that can be explained or accounted for by a treatment

59
New cards

Treatment

any unique characteristic of a sample or any unique way that a researcher treats a sample

◦ Can change value of a dependent variable

◦ Associated with variability in a study

60
New cards

eta-squared

Eta-squared tends to overestimate proportion of variance explained by treatment

<p>Eta-squared tends to overestimate proportion of variance explained by treatment</p>
61
New cards

omega-squared

-Omega-squared gives a more conservative estimate

-w^2

62
New cards

What are confidence intervals?

The confidence interval (CI) is a range of values that's likely to include a population value with a certain degree of confidence. It is often expressed as a % whereby a population mean lies between an upper and lower interval.

-The confidence is in the method, not in a particular CI . If we repeated the sampling method many times, approximately 95% of the intervals constructed would capture the true population mean. Therefore, as the sample size increases, the range of interval values will narrow, meaning that you know that mean with much more accuracy compared with a smaller sample.

63
New cards

What are the 3 steps to estimate the value of a population mean using a point estimate and interval estimate?

Step 1: Compute the sample mean and standard error ◦ Step 2: Choose the level of confidence and find the critical values at that level of confidence

Step 3: Compute the estimation formula to find the confidence limits

64
New cards

Variance of population data

knowt flashcard image
65
New cards

variance of sample data

knowt flashcard image
66
New cards

standard deviation (population)

knowt flashcard image
67
New cards

standard deviation (sample)

s^2

68
New cards

summation

knowt flashcard image
69
New cards

mean symbol for sample data

or "M"

<p>or "M"</p>
70
New cards

population mean

knowt flashcard image
71
New cards

sample size

n

72
New cards

Population size

N

73
New cards

probability

p

74
New cards

z statistic

z

75
New cards

t statistic

t

76
New cards

Cohen's d (effect size)

d

77
New cards

eta squared (effect size)

knowt flashcard image
78
New cards

omega squared (effect size)

w^2

79
New cards

alpha level (level of significance)

α

80
New cards

Beta (power) (type 2 error)

B

81
New cards

Sum of squares (squared distance between observation and mean)

SS

82
New cards

correlation

statistical procedure used to describe the strength and direction of the linear relationship between two factors. ◦ In a correlation, we treat each factor like a dependent variable and measure the relationship between the pair.

83
New cards

How is correlation used?

◦ (1) describe the pattern of data points for the values of two factors

◦ (2) determine whether the pattern observed in a sample is also present in the population from which the sample was selected

84
New cards

How is correlation illustrated?

Typically illustrated through a scatter plot

◦ illustrate the relationship between two variables (x,y) plotted on the x and y axes of the graph, Pairs of values for x and y are called data points.

◦ Data points plotted to see if a pattern emerges.

85
New cards

Linearity

In behavioral research, we mostly describe the linear (or straight line) relationship between two factors.

86
New cards

Correlation coefficient (r)

-used to measure the strength and direction of the linear relationship, or correlation, between two factors ◦ The value of r ranges from −1.0 to +1.0 • Values closer to ±1.0 indicate stronger correlations • A value of 0 indicates no relationship o The sign of the correlation coefficient (− or +) indicates only the direction or slope of the correlation

87
New cards

positive correlation

A correlation where as one variable increases, the other also increases, or as one decreases so does the other. Both variables move in the same direction.

EX: As you drink more coffee, the number of hours you stay awake increases

<p>A correlation where as one variable increases, the other also increases, or as one decreases so does the other. Both variables move in the same direction.</p><p>EX: As you drink more coffee, the number of hours you stay awake increases</p>
88
New cards

negative correlation

as one variable increases, the other decreases

EX: As snowfall totals increase, the number of people driving decreases

<p>as one variable increases, the other decreases</p><p>EX: As snowfall totals increase, the number of people driving decreases</p>
89
New cards

no correlation

there does not appear to be a relationship between two sets of data

<p>there does not appear to be a relationship between two sets of data</p>
90
New cards

regression line

the best fitting straight line to a set of data points. A best fitting line is the line that minimizes the distance from it of all data points that fall

91
New cards

What are the strength sizes for correlation?

.1 = small

.3 = moderate

.5 = large

*Magnitude of the relation does not depend on the direction (positive or negative)

92
New cards

Pearson correlation coefficient formula

knowt flashcard image
93
New cards

Pearson's correlation coefficient

The Pearson correlation coefficient (r)--used to measure the direction and strength of the linear relationship of two factors in which the data for both factors are measured on an interval or ratio scale of measurement

94
New cards

What is the purpose of the numerator in Pearson's correlation coefficient?

The value in the numerator reflects the extent to which values on the x-axis (X) and y-axis (Y) vary together

95
New cards

What is the purpose of the denominator in Pearson's correlation coefficient?

The value in the numerator reflects the extent to which values on the x-axis (X) and y-axis (Y) vary together

96
New cards

covariance

The extent to which the values of two factors vary together is called

97
New cards

coefficient of determination (r^2)

mathematically equivalent to eta squared, and is used to measure the proportion of variance of one factor (Y) that can be explained by known values of a second factor (X)

-just Person's coefficient squared

98
New cards

Homoscedasticity

assumption that there is an equal ("homo") variance or scatter ("scedasticity") of data points dispersed along the regression line

<p>assumption that there is an equal ("homo") variance or scatter ("scedasticity") of data points dispersed along the regression line</p>
99
New cards

what are limits in interpreting correlation?

casualties, spruious relationships, outliers, and restriction of range

100
New cards

casualty

A significant correlation does not show that one factor causes changes in a second factor.

-We cannot infer causality based on correlation alone

Explore top flashcards

flashcards
Central Nervous System III
81
Updated 389d ago
0.0(0)
flashcards
La Honors lit terms
81
Updated 388d ago
0.0(0)
flashcards
Descubre 1: chapter 8, week 1
38
Updated 1141d ago
0.0(0)
flashcards
Endocrine (test 11)
75
Updated 1204d ago
0.0(0)
flashcards
Modern Architecture Exam 1
66
Updated 1071d ago
0.0(0)
flashcards
gov u4/5
154
Updated 1213d ago
0.0(0)
flashcards
Central Nervous System III
81
Updated 389d ago
0.0(0)
flashcards
La Honors lit terms
81
Updated 388d ago
0.0(0)
flashcards
Descubre 1: chapter 8, week 1
38
Updated 1141d ago
0.0(0)
flashcards
Endocrine (test 11)
75
Updated 1204d ago
0.0(0)
flashcards
Modern Architecture Exam 1
66
Updated 1071d ago
0.0(0)
flashcards
gov u4/5
154
Updated 1213d ago
0.0(0)