Stats Final Exam Dr. Smith Cedarville

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/134

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

135 Terms

1
New cards

Parametric Test

A test that assumes a specific distribution of the populations involved.

2
New cards

Nonparametric Test

A test that is distribution free and does not require that samples come from populations with normal distribution or any other particular distribution.

3
New cards

Advantages of Nonparametric Test

1. Less rigid requirements applied to a variety of solutions. 2. Can be applied to more data types.

4
New cards

Disadvantages of Nonparametric Test

1. Wastes information; numerical data reduced to qualitative form. 2. Not as efficient (need stronger evidence).

5
New cards

Efficiency

Quality of solving the largest amount of work while using as little energy as possible.

6
New cards

Rank

Rank number assignment to an individual sample item according to its order in the sorted list.

7
New cards

Tie in Ranks

In the event of a tie, find the mean of ranks involved in the tie and assign the mean rank to each of the tied items.

8
New cards

Sign Test

A nonparametric test that can test claims about the median of a distribution.

9
New cards

Wilcoxon Signed-Ranks Test

A nonparametric test that can test claims about the differences between paired observations.

10
New cards

Wilcoxon Rank-Sum Test

A nonparametric test that can test claims about the differences between two independent groups.

11
New cards

Kruskal-Wallis Test

A nonparametric test that can test claims about the differences among three or more independent groups.

12
New cards

Rank Correlation Test

A test that assesses the strength and direction of the relationship between two ranked variables.

13
New cards

Random Variable

A variable (normally denoted as x) that has a single numerical value determined by chance for the outcome of a procedure.

14
New cards

Probability Distribution

A description that gives the probability for each value of a random variable.

15
New cards

Example of Ranks with Ties

In the sorted list 4,5,5,5,10,11,12,12, the ranks are assigned as follows: 4=1, 5=2,3,3,4, 10=5, 11=6, 12=7,7.5, 12=8,7.5.

16
New cards

Excel Sign Test

Instructions to conduct a Sign Test in Excel.

17
New cards

SPSS Sign Test

Instructions to conduct a Sign Test in SPSS.

18
New cards

SPSS Wilcoxon Signed-Ranks Test

Instructions to conduct a Wilcoxon Signed-Ranks Test in SPSS.

19
New cards

SPSS Wilcoxon Rank-Sum Test

Instructions to conduct a Wilcoxon Rank-Sum Test in SPSS.

20
New cards

SPSS Kruskal-Wallis Test

Instructions to conduct a Kruskal-Wallis Test in SPSS.

21
New cards

Excel Rank Correlation Test

Instructions to conduct a Rank Correlation Test in Excel.

22
New cards

SPSS Rank Correlation Test

Instructions to conduct a Rank Correlation Test in SPSS.

23
New cards

Significant high number of success

More than .05

24
New cards

Significant low

Fewer than .05

25
New cards

Rare Event Rule

The probability of a particular observed event is extremely small, we conclude that the assumption is probably not correct.

26
New cards

Binomial probability distribution outcomes

The number of possible outcomes is determined by the number of trials, (n). Specifically, there are (n + 1) possible outcomes, ranging from 0 successes to (n) successes.

27
New cards

Example of binomial outcomes

If you have 5 trials, the possible outcomes are 0, 1, 2, 3, 4, and 5 successes, making a total of 6 possible outcomes.

28
New cards

Range rule of thumb

s~R/4 helpful in understanding meaning of mean and standard deviation.

29
New cards

Continuous probability distribution

The area under the curve equals 1, therefore there is a correspondence between area and probability.

30
New cards

Z score

A z-score represents how many standard deviations a raw score is above or below the mean.

31
New cards

Area in standard normal distribution

Area represents the probability under the curve, essentially equal to the probability associated with that range of values.

32
New cards

At most

The maximum possible value that a variable can take.

33
New cards

No more than

Means that a value is less than or equal to a specific limit.

34
New cards

At least

Means the minimum possible value or count of a certain event that can occur.

35
New cards

Z score interpretation

How many standard deviations are away from the mean.

36
New cards

Z score critical values

A specific point on the standard normal distribution that marks the boundary between the region where a test statistic is considered statistically significant and where it is not, based on a chosen significance level (alpha).

37
New cards

Formula 6-2

Used to convert values to Z scores in order to find probabilities.

38
New cards

Significantly high or low values

Determined by comparing the calculated Z score to critical values.

39
New cards

Miu

Represents the mean in probability calculations.

40
New cards

Standard deviation symbol

The symbol at the bottom of the formula represents standard deviation.

41
New cards

Figures 6-16 and 6-17

Refer to specific illustrations on p 277, disregarding the Variances portion.

42
New cards

Central Limit Theorem

A statistical theory that states that the distribution of sample means will be approximately normal if the sample size is large enough.

43
New cards

Z score

A statistical measurement that describes a value's relation to the mean of a group of values, calculated by subtracting the mean and dividing by the standard deviation.

44
New cards

Rare Event Rule for Inferential Statistics

A principle stating that if an event is rare, it is unlikely to occur by random chance alone.

45
New cards

Point Estimate

A single value used to estimate a population parameter, such as the best point estimate of population proportion p.

46
New cards

Confidence Interval

A range of values used to estimate the true value of a population parameter.

47
New cards

Confidence Level

The probability that the confidence interval actually contains the population parameter, assuming the estimation process is repeated a large number of times.

48
New cards

Critical Value

A number on the borderline separating sample statistics that are significantly high or low from those that are not significant.

49
New cards

Margin of Error

The maximum likely amount of error when estimating a population proportion, denoted by E.

50
New cards

Sample Size Requirement

According to the Central Limit Theorem, if the sample size is large enough (typically n > 30), the distribution of the sample mean will be approximately normal.

51
New cards

Student t Distribution

A type of probability distribution that is symmetric and bell-shaped, similar to the normal distribution but with heavier tails.

52
New cards

Degrees of Freedom (df)

Typically the sample size minus one (n - 1), used in statistical calculations.

53
New cards

Confidence Interval Formula

Confidence Interval = x̄ ± t*(s/n^1/2)

54
New cards

Confidence Interval for a Proportion

A method to construct a confidence interval for a proportion using specific statistical formulas.

55
New cards

Normal Quantile Plot

A graphical tool used to evaluate the normality of a dataset.

56
New cards

Sample Mean

The best estimate of the population mean, calculated as the average of sample data.

57
New cards

t-Distribution Table

A table used to find critical values for the t-distribution based on degrees of freedom and confidence level.

58
New cards

Sampling Method

The technique used to select a sample from a population, which affects the quality of poll results.

59
New cards

Simple Random Sample

A sample that is chosen randomly from a population, ensuring that every individual has an equal chance of being selected.

60
New cards

Confidence Level Examples

Common confidence levels are 90%, 95%, and 99%, determining the area in the tails of the distribution.

61
New cards

Four Points in Analyzing Polls

1. Sample should be a simple random sample, not an inappropriate sample. 2. Confidence level should be provided. 3. Sample size should be provided. 4. Quality of the poll results depends on sampling method.

62
New cards

Confidence Interval

Use sample data to construct and interpret a confidence interval estimate of the true value of a population mean µ.

63
New cards

Sample Size

Find the sample size necessary to estimate a population mean.

64
New cards

µ

Population mean.

65
New cards

n

Number of sample values.

66
New cards

Data

Collections of observations, such as measurements, genders, or survey responses.

67
New cards

Statistics

Science of planning studies and experiments; obtaining data; and organizing, summarizing, presenting, analyzing, and interpreting those data and then drawing conclusions based on them.

68
New cards

Population

Complete collection of all measurements or data that are being considered.

69
New cards

Census

Collection of data from every member of the population.

70
New cards

Sample

Subcollection of members selected from a population.

71
New cards

Statistical Significance

Achieved in a study when we get a result that is very unlikely to occur by chance.

72
New cards

Practical Significance

It is possible that some treatment or finding is effective, but common sense might suggest that the treatment or finding does not make enough of a difference to justify its use or to be practical.

73
New cards

Parameter

Numerical measurement describing some characteristic of a population.

74
New cards

Statistic

Numerical measurement describing some characteristic of a sample.

75
New cards

Qualitative (Categorical)

Consists of names or labels.

76
New cards

Quantitative

Consists of numbers representing counts or measurements.

77
New cards

Discrete Data

Results when the data values are quantitative and the number of values is finite, or 'countable.'

78
New cards

Continuous Data

Results from infinitely many possible quantitative values, where the collection of values is not countable.

79
New cards

Nominal Data

Categories only; data cannot be arranged in order.

80
New cards

Ordinal Data

Data can be arranged in order, but differences either can't be found or are meaningless.

81
New cards

Interval Data

Differences are meaningful, but there is no natural zero starting point and ratios are meaningless.

82
New cards

Ratio Data

There is a natural zero starting point and ratios make sense.

83
New cards

Skewed to the Right

Have a longer right tail.

84
New cards

Skewed to the Left

Have a longer left tail.

85
New cards

Mean

One of the three measures of center.

86
New cards

Median

One of the three measures of center.

87
New cards

Mode

One of the three measures of center.

88
New cards

Resistant

Refers to a measure that is not affected by extreme values.

89
New cards

Resistant

The presence of extreme values (outliers) does not cause it to change very much.

90
New cards

Range

Range = (MAXIMUM DATA VALUE) - (MINIMUM DATA VALUE)

91
New cards

Standard Deviation

Set of sample values, denoted by s, is a measure of how much data values deviate away from the mean.

92
New cards

Population Standard Deviation

Denoted by σ.

93
New cards

Sample Standard Deviation

Denoted by s.

94
New cards

Population Mean

Notation for population mean: μ.

95
New cards

Sample Mean

Notation for sample mean: x̄.

96
New cards

Z Score

A z score allows you to compare a data value to the mean of a data set.

97
New cards

Significantly Low Z Score

A data value is significantly low if its z score is less than or equal to -2.

98
New cards

Significantly High Z Score

A data value is significantly high if its z score is greater than or equal to +2.

99
New cards

Percentiles

Percentiles are measures of location that divide a data set into 100 equal parts, with each part representing approximately 1% of the data values.

100
New cards

Quartile

Measures of location which divide a set of data into four groups with about 25% of the values in each group.