Stats Textbook Terms

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/63

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

64 Terms

1
New cards

Cases

Objects described in a set of data. Ex: customers, companies, study subjects, units

2
New cards

Label

Special variable distinguished in different cases

3
New cards

Variable

Characteristics of a case

4
New cards

Values

Different cases have different values

5
New cards

Categorical variable

Places a case into several groups or categories

6
New cards

Quantitative Variable

takes numerical variables for which arithmetic operations such as adding and averaging make sense

7
New cards

key characteristics of data set

who what and why

8
New cards

Explanatory Data Analysis

Examine data and describe their main features

9
New cards

Distribution of a categorical variable

Lists the categories and gives either the count or the percent (or proportion) of the cases that fall in each category

10
New cards

mean

x1+x2+…./n

11
New cards

median M

(n+1)/2 (to find location of median)

12
New cards

Median versus mean

Median is more resistant to the mean

If shape is symmetric, median = mean

13
New cards

Shape

Skewed where the tail is

14
New cards

Quartiles

25%, 50%, 75%

15
New cards

IQR (Interquartile range)

Q3-Q1

16
New cards

Outlier IQR

1.5 x IQR

Q1- (1.5 x IQR) = lower bound

Q3 + (1.5 x IQR) = upper bound

17
New cards

S (Standard Deviation) Formula

√[ Σ(xi - x̄)² / (n-1) ]

18
New cards

S² (Variance) Formula

∑ (xi - x̄)² / (n - 1)

19
New cards

Degrees of Freedom formula

n-1

20
New cards

What does s measure?

spread about the mean

21
New cards

s=0

No spread

22
New cards

When are variables associated?

If knowing the valuable of one tells you something about the values of the other

23
New cards

Responsive Variable

Measures an outcome of a study

24
New cards

Explanatory Variable

Explains or causes changes in the response variable

25
New cards

Independent Variable

the factor that a researcher manipulates or changes to see how it affects another variable

26
New cards

Dependent Variable

a variable (often denoted by y ) whose value depends on that of another.

27
New cards

Scatterplot

Shows the relationship between two quantitative variables measured on the same cases

28
New cards

Positively Associated

two things are positively associated when above-average values of one tend to accompany above-average values of the other and below-average values tend to occur together

29
New cards

Negatively Associated

Two variables are negatively associated when above-average values of one tend to accompany below-average values of the other, and vice versa

30
New cards

Correlation (r ) formula

31
New cards

Causation

x → y

32
New cards

Common Response

z→x and y

33
New cards

Confounding

x→ y

z→ y

34
New cards

Sample Space S

The set of all possible outcomes

35
New cards

Event

An outcome or a set of outcomes of a random phenomenon. Subset of the sample space

ex: exactly four heads

36
New cards

Probability rules

Rule 1. The probability P(A) of any event A satisfies 0 ≤ P(A) ≤ 1.

Rule 2. If S is the sample space in a probability model, then P(S) = 1.

Rule 3. Two events A and B are disjoint if they have no outcomes in common and so can never occur together. If A and B are disjoint,

  • P (A or B) = P (A) + P (B)

Rule 4. The complement of any event A is the event that A does not occur, written as Ac. The complement rule states that

P (Ac) = 1 − P (A)

Rule 5. Two events A and B are independent if knowing that one occurs does not change the probability that the other occurs. If A and B are independent,

P (A and B) = P (A) P (B)

37
New cards

Disjoint events

knowt flashcard image
38
New cards

Complement A^c

knowt flashcard image
39
New cards

Multiplication rule for independent events

P (A and B) = P (A) P (B)

40
New cards

Independent in probability

The outcome of one event is not influenced by the outcome of another event

41
New cards

Additional Rule

If A and B are disjoint events, then

P (A or B) = P (A) + P (B)

42
New cards

Complement Rule

For any event A,

P (Ac) = 1 − P(A)

43
New cards

RULE FOR UNIONS OF TWO EVENTS

any two events A and B,

P (A or B) = P (A) + P(B) − P (A and B)

44
New cards

Conditional Probability

When P(A) > 0, the conditional probability of B given A is

P(B|A)= P(AandB)/ P(A)

45
New cards

Intersection of events

When P(A) > 0, the conditional probability of B given A is

P(B|A)= P(AandB) P(A)

46
New cards

Independent Events

Two events A and B that both have positive probability are independent if P(B|A) = P(B)

47
New cards

Density Curve

A density curve is a curve that

  • Is always on or above the horizontal axis.

  • Has area exactly 1 underneath it.

A density curve describes the overall pattern of a distribution. The area under the curve and above any range of values is the proportion of all observations that fall in that range.

symmetric normal density, right skewed/left skewed

48
New cards

Normal distribution density curve

knowt flashcard image
49
New cards

Right skewed density curve

knowt flashcard image
50
New cards

Left skewed density curve

knowt flashcard image
51
New cards

What does the standard deviation control in a curve

the spread of the curve

52
New cards

The 68-95-99.7 Rule

In the Normal distribution with mean μ and standard deviation σ:

Approximately 68% of the observations fall within σ of the mean μ. Approximately 95% of the observations fall within 2σ of μ. Approximately 99.7% of the observations fall within 3σ of μ.

53
New cards

N(μ, σ)

mean μ and standard deviation σ

54
New cards

z score

z = x–μ / σ

55
New cards

Random Variable

a variable whose value is a numerical outcome of a random process.

56
New cards
57
New cards
58
New cards
59
New cards
60
New cards
61
New cards
62
New cards
63
New cards
64
New cards