Unit 1 AP Stats

0.0(0)
studied byStudied by 2 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/105

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

106 Terms

1
New cards

Data Set

Includes information about individuals, which can be a person, place, object, or animal.

2
New cards

Individual

In a data set, it refers to the entity being measured, which may not always be a person.

3
New cards

Variable

Characteristics that are evaluated or collected in a data set.

4
New cards

Example of Variable

In a zoo, the individual is one tiger and the variable is the number of pounds of meat consumed in one day.

5
New cards

Categorical Variables

Labels or categories into which data are grouped, often displayed using pie charts and bar graphs.

6
New cards

Quantitative Variables

Variables that can be quantified, such as height, as opposed to subjective values.

7
New cards

Caution in Data Classification

Not all numerical data are quantitative; for example, area codes and social security numbers should not be averaged.

8
New cards

Statistical Data Displays

Organized visual representations of data to enhance understanding.

9
New cards

Pie Chart

Displays the distribution of a categorical variable as slices of a whole, sized by percentage.

10
New cards

Bar Graph

Displays each category of data as a bar, allowing for quick visual comparisons.

11
New cards

Limitations of Graphs

Too many categories can make pie charts and bar graphs confusing and unusable.

12
New cards

Two-Way Table

Also known as a 'contingency table', used to display categorical data that require a more complex graph.

13
New cards

Level of Concern Table Example

A two-way table showing the responses of 480 adolescents regarding their concern over privacy on social media.

14
New cards

Row Variable

In a two-way table, it represents one of the categorical variables, such as level of concern.

15
New cards

Column Variable

In a two-way table, it represents another categorical variable, such as gender.

16
New cards

Total Responses

In the level of concern table, the total number of adolescents surveyed is 480.

17
New cards

Not Concerned at All

In the table, 26 adolescents (11 females and 15 males) reported not being concerned at all.

18
New cards

A Little Concerned

In the table, 99 adolescents (45 females and 54 males) reported being a little concerned.

19
New cards

Moderately Concerned

In the table, 115 adolescents (65 females and 50 males) reported being moderately concerned.

20
New cards

Somewhat Concerned

In the table, 155 adolescents (90 females and 65 males) reported being somewhat concerned.

21
New cards

Very Concerned

In the table, 85 adolescents (44 females and 41 males) reported being very concerned.

22
New cards

Marginal Distribution

The distribution of a categorical variable shows how often each outcome occurred, represented in a two-way table.

23
New cards

Conditional Distributions

Distributions that include only members of a specified variable within a table, showing how the variables in a data set relate to one another.

24
New cards

Mosaic Plot

A modified segmented bar graph where the width of the bars is proportionate to the size of the group, allowing comparison of both percentages and sizes among groups.

25
New cards

Statistical Relationships

An association between variables when there appears to be a statistical relationship between the values of each.

26
New cards

Independent Variables

When the distribution of one variable is the same for all categories of another, indicating no association between these variables.

27
New cards

Frequency Distribution Table

A table that converts raw data to percentages or proportions to show how variables relate within a group.

28
New cards

Total Column

The column in a two-way table that shows the total for each row or column as part of the entire sample.

29
New cards

Sample Size

The number of respondents in a survey, which can affect the ability to make equal comparisons using raw data.

30
New cards

Percentage Calculation

The process of converting individual values to a percentage of the whole column to obtain conditional distributions.

31
New cards

No. of Females

The count of female respondents in a survey, used to calculate conditional distributions.

32
New cards

No. of Males

The count of male respondents in a survey, used to calculate conditional distributions.

33
New cards

Survey Results

Data collected from respondents regarding their privacy concerns about social media use.

34
New cards

Adolescents' Privacy Concerns

The focus of the survey, examining how adolescents feel about privacy in relation to social media.

35
New cards

Column Total

The total number of responses in a specific column of a two-way table.

36
New cards

Row Responses

The individual responses recorded in each row of a two-way table.

37
New cards

Group Comparison

The analysis of how individuals responded within their groups and between two groups.

38
New cards

Statistical Independence

A concept where the outcome of one variable does not affect the outcome of another variable.

39
New cards

Alternative hypothesis

the hypothesis that sample observations are influenced by some nonrandom cause

40
New cards

Association

a relationship between two variables where the values of one variable occur in combination with specific values of the other variable

41
New cards

Bias

overestimating or underestimating the desired response in a survey consistently

42
New cards

Binomial distribution

the probability distribution of a binomial random variable

43
New cards

Binomial random variable

the number of successes, x, in repeated trials of a binomial experiment

44
New cards

Bivariate data

quantitative data that has two variables; often represented using a scatterplot

45
New cards

Blinding

the practice of not telling subjects whether they are receiving a treatment or placebo

46
New cards

Blocking

method of dividing subjects into subgroups called blocks, such that the variability within blocks is less than the variability between blocks

47
New cards

Categorical variable

places an individual into a category or group

48
New cards

Causation

cause-and-effect relationship between or among variables

49
New cards

Census

a survey that collects information from every member of a population

50
New cards

Central limit theorem

when n is large, the sampling distribution of the sample mean is approximately Normal

51
New cards

Chi-square for homogeneity

a test to determine whether two or more categorical distributions are equal

52
New cards

Chi-square test for independence

a test to determine whether there is an association between two categorical variables

53
New cards

Conditional distribution

the distribution of values of a categorical variable among one specified group of individuals described in a two-way table; each group will have a separate conditional distribution

54
New cards

Confidence interval

range of values that describes the amount of uncertainty associated with a sample statistic of a population parameter

55
New cards

Confounding variables

variables that affect the response variable under consideration

56
New cards

Continuous random variables

random variables with outcomes that can take on any numeric value within the range of values

57
New cards

Convenience

choosing only those individuals for a survey who are easy to access

58
New cards

Critical value

a factor used to compute margin of error

59
New cards

Degrees of freedom

the number of independent observations in a sample less the number of population parameters that must be estimated from sample data

60
New cards

Density curve

a curve that is on or above the horizontal axis; the total area underneath = 1, representing 100% of observations

61
New cards

Discrete random variables

random variables with a countable number of outcomes

62
New cards

Dotplot display

a graphic display of data for visual comparison of frequency within categories

63
New cards

Event

any outcome or collection of outcomes that is a subset of the sample space

64
New cards

Expected value

the sum of the products of each possible value and the probability that it occurs

65
New cards

Extrapolation

the use of a regression line to predict values that are outside the original interval of the explanatory variable; these predictions are often inaccurate or unrealistic

66
New cards

Geometric random variable

the number of trials, Y, takes to get a success

67
New cards

Independent event

when the occurrence of one event does not change the probability that the other event will happen

68
New cards

Individuals

objects described by a set of data

69
New cards

Inferential statistics

statistical data from a sample that are used to draw conclusions about the entire population

70
New cards

Law of large numbers

as more and more repetitions of any chance process occur, the proportion of times a specific outcome will happen approaches a single value

71
New cards

Least-squares regression line

the line that makes the sum of the squared residuals as small as possible

72
New cards

Lurking variables

variables other than the independent variable and the dependent variable that may affect experimental outcomes

73
New cards

Margin of error

the maximum expected difference between the true population parameter and a sample estimate of that parameter

74
New cards

Matched pairs design

experimental method where subjects are grouped into pairs based on a blocking variable, then randomly assigned to treatment or control

75
New cards

Mode

the number that occurs most frequently in a set

76
New cards

Nonresponse

when an individual chosen for the sample can't be contacted or refuses to participate

77
New cards

Normal distribution

shown by a Normal density curve with the mean, median, and mode at the center of the curve and described in the format N(µ,σ)

78
New cards

Null hypothesis

the hypothesis that sample observations result purely from chance

79
New cards

Observational study

a study that observes individuals and measures variables of interest but does not attempt to influence the outcome

80
New cards

P-value

a value that measures the strength of the evidence in support of the null hypothesis

81
New cards

Percentile

the lowest score that is greater than a certain percentage of the scores; for example, the 30th percentile indicates that 30 percent of the data fall below that number

82
New cards

Placebo effect

a subject's positive response to receiving a placebo when no treatment has actually been applied

83
New cards

Point estimate

the statistic itself, such as the sample mean, sample median, or sample proportion given as an estimate of the population parameter of interest

84
New cards

Pooling

the name given to a technique used to obtain a more precise estimate of the standard deviation of a sample statistic by combining the estimates given by two (or more) independent samples

85
New cards

Population

the entire group of individuals about which we want information

86
New cards

Power of the test

the probability a test will reject the null hypothesis at a chosen significance level α when the specified alternative value of the parameter is true

87
New cards

Probability

the likelihood that an event will occur; the mathematics of chance

88
New cards

Quantitative variable

measures a specific numerical value that can be used for analysis

89
New cards

Random sample

a group or set chosen in a random manner that allows for each member of the population to have an equal chance of being selected

90
New cards

Regression line

a line that describes how the response variable changes as the explanatory variable changes; can be used to make predictions about a relationship

91
New cards

Representative sampling

a group or set chosen to replicate characteristics of a larger population

92
New cards

Residual

the difference between the observed value of the response variable and the value predicted by the regression line

93
New cards

Robust

procedures work even if there is a violation of the condition of Normality

94
New cards

Sample

part of the population from which information is collected; used to draw conclusions about the entire population

95
New cards

Sampling distribution

a distribution of all the proportions (or means, depending on what is being calculated) from all possible samples

96
New cards

Sampling variability

statistical information from a statistic that varies as random sampling is repeated

97
New cards

Simpson's Paradox

a paradox in statistics in which a trend appears in different groups of data but is reversed when those groups are combined

98
New cards

Simulation

a random process of numerous trials used to estimate probability and imitate chance behavior

99
New cards

Standard deviation

the average distance of a value from the mean of the data

100
New cards

Standard Normal table

a table of areas under the Standard Normal curve; the table entry for each value of z is the area under the curve to the left of that z-score