ap stats

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/80

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

81 Terms

1
New cards

describe a distribution

SOCS:

Shape: the shape of the distribution is described. (symmetric, skewed left or right, unimodal, bimodal, uniform)

Outliers: Describe any outliers, gaps, clusters, or unusual features.

Center: what does the “typical” value look like- eyeball or find the median or mean

Spread: How much variability does the data have/range

2
New cards

outlier rule

Lower outliers: <Q1-1.5(IQR/Q3-Q1)

Upper outliers: >Q3+1.5(IQR/Q3-Q1)

3
New cards

IQR formula

Q3-Q1

4
New cards

How can you use a graph to compare the mean and the median?

knowt flashcard image
5
New cards

How do you interpret the standard deviation?

The standard deviation gives the typical distance that the values are away from the mean.

6
New cards

How do you describe the relationship between two variables? (ex-scatterplot)

DUFS:

Direction: positive or negative

Unusual values/outliers

Form: Linear or curved?

Strength: Weak, moderate, or strong?

7
New cards

How do you compare two distributions?

SOCS- remember to use comparison words!

Shape, Outliers, Center, Spread

8
New cards

How do you find the Mean, SD, & 5-Number Summary using your calc?

enter data into L1, leave FreqList blank, Calculate.

9
New cards

How do you calculate a Least Squares Regression Line using a calc?

enter x-values into L1 & y-values into L2; leave FreqList blank & calculate

10
New cards

What is the IQR?

Q1 & Q3 form the boundaries for the middle 50% of values in an ordered data set.

11
New cards

How do you calculate the percentile of a particular value in a data set?

  1. Order the data

  2. count the number of values that are less then or equal to the value of interest.

  3. Count the number of values in the data set

  4. Express it as #/___th percentile

12
New cards

Interpret the y-interest of the Least Squares Regression Line

<p></p>
13
New cards

Interpret the slope of the Least Squares Regression Line

knowt flashcard image
14
New cards

Interpret the coefficient of determination/ r²

the percent of the variation of y-context that is explained by the least squares regression line using x-context

15
New cards

Properties of correlation r

r is unitless.

r is always between -1 and 1.

r is greatly affected by regression outliers.

If the direction is negative, then r<0

If the direction is positive, then r>0.

The closer r is to -1 or 1, the stronger the relationship.

The closer r is to 0, the weaker the relationship.

16
New cards

Regression Outlier

a point that does not follow the general trend shown in the rest of the data and has a large residual.

17
New cards

Correlation r

<p></p>
18
New cards

High-Leverage Point (in regression)

has a substantially larger or smaller x-value than the other observations have.

19
New cards

Influential Point (in regression)

any point that, if removed, changes the relationship substantially. Outliers and high leverage point area often influential.

20
New cards

What is the difference between categorical and quantitative variables?

Categorical variables take on values that are category names or group labels.

Quantitative variables take on numerical values for a measured or counted quantity.

21
New cards

What is the difference between discrete and continuous variables?

a discrete variable can take on a countable number of values. The number of values may be finite or infinite.

ex: number of students, countable.

a continuous variable can take on infinitely many values, but those values cannot be counted.

ex:height, must be measured.

22
New cards

What is a control group and what is its purpose?

a collection of experiment units either not given a treat of interest or given a placebo.

its purpose is to provide a baseline to which the treatment groups can be compared, so it can be determined if the treatments have an effect.

23
New cards

What is single blind & double blind?

in a single blind, subject do not know which treatment they are getting, but the researchers do, or vice-versa.

in a double blind, neither researchers or subjects know which treatment a subject is getting.

24
New cards

What are 2 poor sampling methods?

  1. convenience sampling

  2. voluntary response sampling

    Not random

25
New cards

What’s an explanatory variable? (or factor)

a variable whose levels are manipulated intentionally.

26
New cards

What’s a response variable?

an outcome from the experimental units that is measured after the treatments have been administered.

27
New cards

Nonresponse Bias

Selected people do not respond

28
New cards

Under coverage Bias

systematically excluding people from being able to be selected

29
New cards

Response bias

provided inaccurate responses

30
New cards

wording issues

worded dumb

31
New cards

Bias

the systematic tendency to over or under estimate the true population parameter

32
New cards

Can the results of a survey or experiment be generalized to a larger population?

the results can only be generalized to the population from which the sample/subjects were randomly selected

if not randomly selected, the results can only be generalized to people like the ones in the study.

33
New cards

Observation study vs experiment

Did the researchers impose a treatment?

NO- observation study

YES-experiment

34
New cards

When can we make conclusion about cause and effect?

Did the researchers randomly assign the subject to treatment groups?

YES- you can make conclusions

NO-NO, get fucked. you CANNOT say that the explanatory variable caused the change in the response variable.

35
New cards

SRS

knowt flashcard image
36
New cards

Stratified random sample

knowt flashcard image
37
New cards

systematic random sample

knowt flashcard image
38
New cards

confounding variable

a variable that is related to the explanatory variable and influenced the response variable and makes it challenging to determinate cause and effect

39
New cards

What should a well designed experiment include?

  1. Comparisons of at least two treatment groups, one of which could be a control group.

  2. Random assignment of treatments to experimental units

  3. Replication (enough experimental units in each treatment group to be able to detect a difference)

  4. Control of potential confounding variables where appropriate.

40
New cards

Complete randomized design

knowt flashcard image
41
New cards

What is a randomized block design and what is its purpose?

Treatments are assigned randomly within each block. For each block, indians are similar to eachother with respect to at least one blocking variable.

Its purpose is to reduce the variability of results within each treatment group, and to eliminate the possibility of the blocking variable as a confounding variable.

42
New cards

Matched Pairs design

-a special case of a randomized block design. using a blocking variable, subjects are arranged in pairs matched on one or more relevant factors. Every pair recovers both treatments by randomly assigning one treatment to one, and the other to the other. alternatively, each subject may get both treatments

43
New cards

How would you carry out a random assignment by selecting from a hat

  1. Write each subjects name on an equal sized piece of paper

  2. Put all the slips of paper in a hat. Mix well

  3. Select as many names as needed for each treatment group, without replacement.

44
New cards

Mutually Exclusive events (disjoint)

they cannot occur at the same time. P(A & B) =0

45
New cards

How do you calculate P(A & B)?

= P(A) times P(B|A)

46
New cards

what’s the difference between binopdf & binocdf?

binopdf calculates the probability of x successes

binocdf calculates the cumulative probability of x or fewer successes

47
New cards

Empirical Rule

knowt flashcard image
48
New cards

interpretation of a z-score

how many standard deviation a value falls away from the mean and in what direction

49
New cards

z-score formula

knowt flashcard image
50
New cards

How can you tell if two events are independent?

knowt flashcard image
51
New cards

How do you calculate a conditional probability?

knowt flashcard image
52
New cards

How do I evaluate the probability of “at least 1?”

knowt flashcard image
53
New cards

What is the Law of Large numbers?

knowt flashcard image
54
New cards

What are the conditions for a geometric random variable?

BITS

Binary:two outcomes for each trial (success or failure)

Independent: each trial is independent of the next

Trails UNTIL a success (not a fixed number)

Same probability of success for each trial (p)

keep going until a success

55
New cards

Type I error

null hypothesis (Ho) is true and is rejected (false positive)

56
New cards

Type II error

alt hypothesis (Ha) is true and the null hypothesis is not rejected (false negative)

57
New cards
58
New cards

Power

the probability that a test will correctly reject the null hypothesis, given that the alt hypothesis is true.

P(Type II error)=1-P(power)

59
New cards
term image
knowt flashcard image
60
New cards
term image
knowt flashcard image
61
New cards
term image
knowt flashcard image
62
New cards

Conditions for a binomial random variable

BINS

Binary: two outcomes for each trial (success or failure)

Independent: Each trial is independent of the next

Number of trials is a fixed number (n)

Same probability of success for each trial (p)

Fixed number of trials

63
New cards
term image
knowt flashcard image
64
New cards
term image
knowt flashcard image
65
New cards
term image
knowt flashcard image
66
New cards

What is the difference between a parameter and a statistic?

a parameter is a number that describes the population. Ex- mew, p, standard deviation

a statistic is a number that describes the sample. Ex- x bar, p hat, s

67
New cards

Central Limit Theorem

when the sample size is sufficiently large, a sampling distribution of the mean of a random variable will be approximately normally distributed.

68
New cards

What is the difference between the population distribution, the sample distribution, and the sampling distribution?

the population distribution is the distribution of responses for every indian of the population.

the sample distribution is the distribution of responses for a single sample.

the sampling distribution is the distribution of values for the statistic for all possible samples of a given size from a given population

69
New cards
term image
knowt flashcard image
70
New cards
term image
knowt flashcard image
71
New cards
term image
knowt flashcard image
72
New cards
term image
knowt flashcard image
73
New cards

What decision do i make based on the p-value?

p-value < alpha, reject, there is convincing evidence

p-value > alpha, fail to reject, there is not convincing evidence

74
New cards

How do you calculate expected counts in a chi-square text for homogeneity/indp?

knowt flashcard image
75
New cards
76
New cards
77
New cards
78
New cards
79
New cards
80
New cards
81
New cards