Stats level 3 (Midterm 1)

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/200

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

201 Terms

1
New cards

Statistical analysis doesn’t eliminate uncertainty, but it allows us to

quantify it so we can decide how much we’re willing to live with

2
New cards

Descriptive Statistics

describe the
properties of a
sample

3
New cards

Inferential statistics


uses properties

of the sample to make conclusions
about the population

4
New cards

Examples of descriptive statistics

Sample statistic mean and we calculate M from our sample data

5
New cards

A representative sample is a useful for

inferring population parameters

6
New cards

Random sampling is the best technique to obtain

Representative samples

7
New cards

We are using data from a sample to estimate the truth about a
much larger population

The Estimation Approach

8
New cards

Samples are not a perfect guide to the population; therefore, we
express our uncertainty by reporting a

point estimate and a
confidence interval

9
New cards

The point estimate is our

best single guess about the population, it is
exactly what we found in the sample

<p><span style="color: rgb(0, 0, 0);">best single guess about the population, it is</span><span style="color: rgb(0, 0, 0);"><br></span><span style="color: rgb(0, 0, 0);">exactly what we found in the sample</span></p>
10
New cards

The Margin of Error (MoE) is the

likely largest error in the point
estimate

<p><span style="color: rgb(0, 0, 0);">likely largest error in the point</span><span style="color: rgb(0, 0, 0);"><br></span><span style="color: rgb(0, 0, 0);">estimate</span></p>
11
New cards

The Confidence Interval (CI) is the

point estimate +/- the MoE

<p><span style="color: rgb(0, 0, 0);">point estimate +/- the MoE</span><span style="color: rgb(0, 0, 0);"><br></span></p>
12
New cards

The Confidence Interval (CI) quantifies our

uncertainty about the population. It gives the range of population values which are plausible or likely given the sample data

13
New cards

When we have a Short CI


-> Precise estimate -> Little uncertainty

14
New cards

When we have a Long CI

-> Not a precise estimate -> Lots of uncertainty

15
New cards

What counts as a short and long CI is a

Matter of judgement

16
New cards

How do we get more certainty in our CI

Collect more data! 4x the sample size typically cuts the CI by ½

17
New cards

If you replicated a poll with a new sample, would you reach
exactly the same result?

Probably not! There is sampling variability—differences due to using
different samples to draw conclusions about the population.


• It will probably be close, though! The most likely result of the
replication is an answer within the CI of the original study!

18
New cards

Sampling variability:

Each sample gives somewhat
different results
The Estimation Approach: A simple
example

<p><span style="color: rgb(0, 0, 0);">Each sample gives somewhat</span><span style="color: rgb(0, 0, 0);"><br></span><span style="color: rgb(0, 0, 0);">different results</span><span style="color: rgb(0, 0, 0);"><br></span><span style="color: rgb(0, 0, 0);">The Estimation Approach: A simple</span><span style="color: rgb(0, 0, 0);"><br></span><span style="color: rgb(0, 0, 0);">example</span></p>
19
New cards

• If we conduct replications, we can conduct a

meta-analysis to synthesize or integrate the results

20
New cards

What does a meta-analysis ensure?

we are examining the whole story, and it gives us an overall estimate more precise than any one study

21
New cards

How are the results of a meta analysis summarized?

in a forest plot and the diamond indicates the overall point estimate and CI.

<p>in a forest plot and the diamond indicates the overall point estimate and CI.</p>
22
New cards

What are the basic principles of a research design

1: Random sampling

2:Experimental vs non-experimental studies

3: Measurement

23
New cards

Random sampling is the 

• Fundamental Assumption in statistical inference

24
New cards

Measurement include

• Operational definitions

• Reliability/Validity

• Levels of Measurement

25
New cards

• Every member of the population has an equal probability of
being chosen.


• All members of the sample are chosen independently.

Random Sampling

26
New cards

• Statistical inference, including the use of Confidence

Intervals, assumes

the data are from a random sample.

27
New cards

In practice, random samples can be difficult to obtain:

• Most psychology research is conducted with young university-

educated adults between 17 and 30 years of age

28
New cards

• Experiments include

• One or more variables are manipulated or assigned by the researcher

• Independent Variable (IV): the variable that is manipulated

• Dependent Variable (DV): the outcome that is observed

• Can justify causal conclusions; confounds controlled (random assignment)

• Describe with causal language: affects/effects, produces, causes, makes

29
New cards

In experiments, the independent variable is the variable that is

being manipulated

30
New cards

In experiments, the dependent variable is the variable that is

the outcome being observed

31
New cards

In experiments, they can justify 

causal conclusions; confounds controlled (random assignment)

32
New cards

• Non-Experiments:

• No manipulation or assignment

• Can not justify causal conclusions; cannot rule out confounds

• Describe with association language: relates to, associated with, correlated with

33
New cards

Non-experiments do not have

manipulation or assignments

34
New cards

Non-experiments cannot

justify causal conclusions; cannot rule out confounds

35
New cards

What is the language used in non-experiments

association language: relates to, associated with, correlated with

36
New cards

What is the language used in experiments

causal language: affects/effects, produces, causes, makes

37
New cards

What is a construct

• The abstract concept we are interested in

38
New cards

What is an operational definition

• A precise procedure for measuring values that represent a construct

39
New cards

What is a measured variable

• The actual variable measured

40
New cards

The measured variable should be

Reliable and Valid

41
New cards

Reliable is

Repeatable;

similar results on repeated

measurement

42
New cards

Valid is

Accurate, true,

measuring what it is meant to

measure

43
New cards

In NOIR, the N is

Nominal

44
New cards

In NOIR, the O is 

Ordinal

45
New cards

In NOIR, the I is

Interval

46
New cards

In NOIR, The R is

Ratio

47
New cards

Nominal is the

Numbers represent characteristics according to a simple

code

a level of measurement in statistics that categorizes data into distinct, non-ordered groupsa level of measurement in statistics that categorizes data into distinct, non-ordered groups

48
New cards

Ordinal is the 

Numbers are assigned based on rank amount of

characteristic:

49
New cards

Interval is the 

• Numbers are assigned based on relative quantity of

characteristic, with no true 0 point

• Examples: year, longitude, latitude, temperature

50
New cards

Ratio is the

• Numbers are assigned based on the absolute

magnitude of a characteristic. The scale has a

true zero point.

51
New cards

What are the two ways a variable can be classified?

Continuous & Discrete

52
New cards

A continuous Variable is a

variable that can take any of

the unlimited number of values in some range.

53
New cards

What are examples of continuous variables 

Height & time spent on a task

54
New cards

A discrete variable is a

variable that can take only distinct or separated values.

55
New cards

What are examples of discrete variables

Number of eggs in a basket, Children in a family,

Toys in a box

56
New cards

Continuous variables can take values

along a smooth curve

57
New cards

discrete variables cannot

take values along a smooth curve

58
New cards

The probability distribution of a continuous variable is a

smooth curve—like this normal distribution you may

remember seeing

<p>smooth curve—like this normal distribution you may</p><p>remember seeing</p>
59
New cards

• Histograms depict

the data sorted into bins – each bin contains

X observations of values contained within that bin

<p>the data sorted into bins – each bin contains</p><p>X observations of values contained within that bin</p>
60
New cards

What is included in the measure of central tendency?

Mean, Median, and Mode

61
New cards

What is included in a spread?

Standard deviation, Variance, Range, Inter-quartile range

62
New cards

What is the symbol for summation

Σ

63
New cards

What is the mean formula

knowt flashcard image
64
New cards

The mean is also known as the

average

65
New cards

M is for the

Sample

66
New cards

μ is for the

population

67
New cards

The mean represents the 

balancing point for the distribution 

68
New cards

Is the mean influenced by outliers?

Yes, especially in small data sets

69
New cards

The median is the

middle point in the distribution, the score at which half the others are above and the other half below (50th
percentile)

70
New cards

Only requires interval scaling; can be used for any type of data

except nominal

Median

71
New cards

The median is not 

as sensitive to outliers as the mean

72
New cards

The mode is

whatever response is most frequent.

73
New cards

Requires only nominal scaling; can be used for any type of

data and is Not very sensitive to outliers

The mode

74
New cards

• Characteristics of the normal distribution:

• Smooth, symmetrical bell shape with most data points near center

• Normal curve defined by two parameters: mean and SD

75
New cards

Normal distributions are not always the same 

• Some are much narrower, some wider

• Mean and SD can vary between distributions

76
New cards

The range is

-The distance from the minimum score to the maximum score

• Easy to understand, but can change dramatically with outliers

or even just with some new data.

77
New cards

The better way then the range is the

SD

78
New cards

What is the formula for the sample SD

knowt flashcard image
79
New cards

For the SD

—Each data point, there is a possible deviation from the mean (X-M)

• Think of the standard deviation as the average or typical deviation from the mean

80
New cards

s is the symbol for the

Sample SD

81
New cards

σ is the symbol for the

Population SD

82
New cards

The SD is sensitive to

outliers

83
New cards

The Standard Deviation is the square root of

the variance,
another measure of spread

84
New cards

What is the formula for the variance

knowt flashcard image
85
New cards

• Variance is useful, but

expressed spread in squared units,

which are a bit difficult to understand.

86
New cards

• Interquartile Range (IQR) only requires

Ordinal scaling and is not very sensitive to outliers 

87
New cards

The z score at the mean is 0, as labeled.

knowt flashcard image
88
New cards

• A z score expresses an individual’s score in

standard

deviation units.

89
New cards

What is the z-score formula

knowt flashcard image
90
New cards

What is the formula for turning a z-score back into a raw score

knowt flashcard image
91
New cards

z scores provide standardization

once expressed in z scores, no

need to worry about range of measurement (1 to 10 scale, 1-7 scale,

etc.) or units (inches, feet, etc.)

92
New cards

• z scores allow comparisons across

different measures with different

scales

93
New cards

Percentile% is the

% of data below a score, X

• Only requires ordinal scaling

94
New cards

If we transform each observation from a population into a z-
score and then examine how these z-scores are distributed, we
get the standard normal distribution always has a mean

of

0 and standard deviation of 1

<p>0 and standard deviation of 1</p>
95
New cards

Mean uses

Interval or Ratio scaling

96
New cards

Median uses

Ordinal, Interval, or ratio scaling

97
New cards

The mode uses

NOIR scaling

98
New cards

The distribution is labeled with z scores and

corresponding percentages—these represent how much of the

distribution falls between any two z scores

knowt flashcard image
99
New cards

We know M and s for our

Sample

100
New cards

• M and s are expected to change

from

Sample to Sample