Stats Textbook Terms

64 Terms

New cards

Cases

Objects described in a set of data. Ex: customers, companies, study subjects, units

New cards

Label

A special variable used to distinguish the different cases

New cards

Variable

A characteristic of a case

New cards

Values

What a variable takes on for each case; different cases can have different values of the same variable

New cards

Categorical variable

Places a case into one of several groups or categories

New cards

Quantitative Variable

Takes numerical values for which arithmetic operations such as adding and averaging make sense

New cards

Key characteristics of a data set

Who (the cases), what (the variables), and why (the purpose of the data)

New cards

Exploratory Data Analysis

Examine data and describe their main features

New cards

Distribution of a categorical variable

Lists the categories and gives either the count or the percent (or proportion) of the cases that fall in each category

New cards

mean x̅

x̄ = (x1 + x2 + … + xn) / n

New cards

median M

The midpoint of the ordered observations. Location of the median in the ordered list: (n + 1)/2

New cards

Median versus mean

The median is more resistant than the mean: it is less affected by outliers and skewness

If the distribution is exactly symmetric, median = mean
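A minimal sketch of what "resistant" means here, using Python's standard `statistics` module. The data values are made up for illustration: adding one extreme observation pulls the mean far more than the median.

```python
from statistics import mean, median

# Hypothetical data (not from the flashcards): five similar values
values = [30, 32, 35, 38, 40]
# The same data with one extreme observation added
with_outlier = values + [400]

mean_before, median_before = mean(values), median(values)
mean_after, median_after = mean(with_outlier), median(with_outlier)

print(mean_before, median_before)  # mean and median agree for the symmetric-looking data
print(mean_after, median_after)    # the mean jumps, the median barely moves
```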

New cards

Shape

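A distribution is skewed in the direction of its longer tail: right-skewed if the right tail is longer, left-skewed if the left tail is longer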

New cards

Quartiles

Q1 = 25th percentile, M (median) = 50th percentile, Q3 = 75th percentile

New cards

IQR (Interquartile range)

Q3-Q1

New cards

Outlier IQR

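The 1.5 × IQR rule: an observation is a suspected outlier if it falls below the lower bound or above the upper bound

Q1 − (1.5 × IQR) = lower bound

Q3 + (1.5 × IQR) = upper bound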
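The outlier bounds can be sketched in Python as follows. This assumes one common textbook convention for quartiles (Q1 and Q3 are the medians of the lower and upper halves of the ordered data, leaving out the overall median when n is odd); software packages often use other conventions and give slightly different quartiles. The function names and data are hypothetical.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, Q3) as medians of the lower and upper halves.

    Assumes at least two observations. When n is odd, the overall
    median is left out of both halves.
    """
    xs = sorted(data)
    half = len(xs) // 2
    lower = xs[:half]    # observations below the median location
    upper = xs[-half:]   # observations above the median location
    return median(lower), median(upper)

def outlier_bounds(data):
    """Return (lower bound, upper bound) from the 1.5 x IQR rule."""
    q1, q3 = quartiles(data)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Hypothetical data with one suspiciously large value
data = [2, 4, 5, 7, 8, 9, 10, 12, 30]
low, high = outlier_bounds(data)
suspected = [x for x in data if x < low or x > high]
print(low, high, suspected)
```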

New cards

S (Standard Deviation) Formula

√[ Σ(xi - x̄)² / (n-1) ]

New cards

S² (Variance) Formula

∑ (xi - x̄)² / (n - 1)
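As a rough check of the two formulas above, the sketch below computes s² and s by hand and compares them with `statistics.variance` and `statistics.stdev`, which also divide by n − 1. The data values are made up.

```python
from math import sqrt
from statistics import stdev, variance

# Hypothetical data
data = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(data)
x_bar = sum(data) / n                              # mean
sum_sq_dev = sum((x - x_bar) ** 2 for x in data)   # sum of squared deviations
s2 = sum_sq_dev / (n - 1)                          # variance, n - 1 degrees of freedom
s = sqrt(s2)                                       # standard deviation

print(s2, variance(data))
print(s, stdev(data))
```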

New cards

Degrees of Freedom formula

n-1

New cards

What does s measure?

spread about the mean

New cards

s=0

No spread

New cards

When are variables associated?

If knowing the value of one tells you something about the values of the other

New cards

Response Variable

Measures an outcome of a study

New cards

Explanatory Variable

Explains or causes changes in the response variable

New cards

Independent Variable

the factor that a researcher manipulates or changes to see how it affects another variable

New cards

Dependent Variable

A variable (often denoted by y) whose value depends on that of another

New cards

Scatterplot

Shows the relationship between two quantitative variables measured on the same cases

New cards

Positively Associated

Two variables are positively associated when above-average values of one tend to accompany above-average values of the other, and below-average values also tend to occur together

New cards

Negatively Associated

Two variables are negatively associated when above-average values of one tend to accompany below-average values of the other, and vice versa

New cards

Correlation (r) formula

r = [1/(n − 1)] Σ [(xi − x̄)/sx] [(yi − ȳ)/sy]
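A minimal sketch of the usual textbook definition of r: standardize each x and y with its own mean and standard deviation, multiply the standardized values pairwise, and divide the sum by n − 1. The function name and data are hypothetical.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation r as the 'average' product of standardized values."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)   # sample standard deviations (n - 1)
    products = (
        ((x - x_bar) / s_x) * ((y - y_bar) / s_y)
        for x, y in zip(xs, ys)
    )
    return sum(products) / (n - 1)

# Hypothetical paired measurements on the same five cases
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r = correlation(xs, ys)
print(round(r, 4))
```

Because r uses standardized values, it has no units and always lies between −1 and 1.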

New cards

Causation

x → y

New cards

Common Response

z → x and z → y

New cards

Confounding

x → y

z → y

New cards

Sample Space S

The set of all possible outcomes

New cards

Event

An outcome or a set of outcomes of a random phenomenon. Subset of the sample space

ex: exactly four heads

New cards

Probability rules

Rule 1. The probability P(A) of any event A satisfies 0 ≤ P(A) ≤ 1.

Rule 2. If S is the sample space in a probability model, then P(S) = 1.

Rule 3. Two events A and B are disjoint if they have no outcomes in common and so can never occur together. If A and B are disjoint,

P (A or B) = P (A) + P (B)

Rule 4. The complement of any event A is the event that A does not occur, written as A^c. The complement rule states that

P(A^c) = 1 − P(A)

Rule 5. Two events A and B are independent if knowing that one occurs does not change the probability that the other occurs. If A and B are independent,

P (A and B) = P (A) P (B)

New cards

Disjoint events

Events that have no outcomes in common and so can never occur together

New cards

Complement A^c

The event that A does not occur

New cards

Multiplication rule for independent events

P (A and B) = P (A) P (B)

New cards

Independent in probability

The outcome of one event is not influenced by the outcome of another event

New cards

Addition Rule

If A and B are disjoint events, then

P (A or B) = P (A) + P (B)

New cards

Complement Rule

For any event A,

P(A^c) = 1 − P(A)

New cards

Rule for Unions of Two Events

For any two events A and B,

P (A or B) = P (A) + P(B) − P (A and B)
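The rules above can be checked on a small example. The sketch below assumes a fair six-sided die (every outcome equally likely), so the probability of an event is just the number of outcomes in it divided by 6. The events A, B, and C are chosen for illustration.

```python
from fractions import Fraction

# Assumed model: one roll of a fair six-sided die
sample_space = {1, 2, 3, 4, 5, 6}

def prob(event):
    """P(event) under equally likely outcomes."""
    return Fraction(len(event & sample_space), len(sample_space))

A = {2, 4, 6}   # roll is even
B = {4, 5, 6}   # roll is greater than 3
C = {1}         # roll is 1 (disjoint from A)

p_union = prob(A | B)
p_by_rule = prob(A) + prob(B) - prob(A & B)   # rule for unions of two events
p_disjoint = prob(A | C)                      # addition rule applies: A and C share no outcomes
p_complement = prob(sample_space - A)         # complement rule

print(p_union, p_by_rule)
print(p_disjoint, prob(A) + prob(C))
print(p_complement, 1 - prob(A))
```

Subtracting P(A and B) corrects for the outcomes 4 and 6, which would otherwise be counted twice.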

New cards

Conditional Probability

When P(A) > 0, the conditional probability of B given A is

P(B | A) = P(A and B) / P(A)

New cards

Intersection of events

The event that both A and B occur, written "A and B". Rearranging the definition of conditional probability (for P(A) > 0) gives

P(A and B) = P(A) P(B | A)

New cards

Independent Events

Two events A and B that both have positive probability are independent if P(B|A) = P(B)
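A short sketch of conditional probability and the independence check, again assuming a fair six-sided die. The events are chosen for illustration: B turns out to be independent of A, while C does not.

```python
from fractions import Fraction

# Assumed model: one roll of a fair six-sided die
sample_space = {1, 2, 3, 4, 5, 6}

def prob(event):
    """P(event) under equally likely outcomes."""
    return Fraction(len(event & sample_space), len(sample_space))

def cond_prob(b, a):
    """P(b | a) = P(a and b) / P(a), defined only when P(a) > 0."""
    return prob(a & b) / prob(a)

A = {2, 4, 6}       # roll is even
B = {1, 2, 3, 4}    # roll is at most 4
C = {4, 5, 6}       # roll is greater than 3

p_b_given_a = cond_prob(B, A)   # compare with P(B)
p_c_given_a = cond_prob(C, A)   # compare with P(C)

print(p_b_given_a, prob(B))   # equal: A and B are independent
print(p_c_given_a, prob(C))   # different: A and C are not independent
```

For the independent pair, the multiplication rule P(A and B) = P(A) P(B) also holds, which is the same condition written another way.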

New cards

Density Curve

A density curve is a curve that

Is always on or above the horizontal axis.
Has area exactly 1 underneath it.

A density curve describes the overall pattern of a distribution. The area under the curve and above any range of values is the proportion of all observations that fall in that range.

Common shapes: symmetric (such as the Normal density), right-skewed, and left-skewed

New cards

Normal distribution density curve

Symmetric, single-peaked, bell-shaped curve centered at the mean

New cards

Right skewed density curve

Curve with a long tail extending to the right

New cards

Left skewed density curve

Curve with a long tail extending to the left

New cards

What does the standard deviation control in a Normal curve?

the spread of the curve

New cards

The 68-95-99.7 Rule

In the Normal distribution with mean μ and standard deviation σ:

Approximately 68% of the observations fall within 1σ of the mean μ.
Approximately 95% of the observations fall within 2σ of μ.
Approximately 99.7% of the observations fall within 3σ of μ.
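The percentages in the rule can be checked numerically. This sketch uses `statistics.NormalDist` from the Python standard library with the standard Normal distribution (μ = 0, σ = 1); the same proportions hold for any μ and σ.

```python
from statistics import NormalDist

# Standard Normal distribution: mean 0, standard deviation 1
std_normal = NormalDist(mu=0, sigma=1)

# Proportion of observations within k standard deviations of the mean
within = {k: std_normal.cdf(k) - std_normal.cdf(-k) for k in (1, 2, 3)}

for k, proportion in within.items():
    print(f"within {k} sigma: {proportion:.4f}")
```

The exact proportions are close to, but not exactly, 68%, 95%, and 99.7%, which is why the rule says "approximately".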

New cards

N(μ, σ)

mean μ and standard deviation σ

New cards

z score

z = (x − μ) / σ
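A small worked example of standardizing a value. The numbers are hypothetical: an observation of 72 from a distribution with mean 64 and standard deviation 4. Note that the subtraction happens before the division.

```python
def z_score(x, mu, sigma):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mu) / sigma

# Hypothetical values: x = 72, mean 64, standard deviation 4
z_above = z_score(72, 64, 4)   # 8 above the mean, i.e. 2 standard deviations
z_below = z_score(58, 64, 4)   # 6 below the mean, i.e. 1.5 standard deviations

print(z_above, z_below)
```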

New cards

Random Variable

a variable whose value is a numerical outcome of a random process.

New cards