Elementary Statistics for Psychology

5.0(1)
studied byStudied by 14 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/58

flashcard set

Earn XP

Description and Tags

Flash Cards for PBSI 301

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

59 Terms

1
New cards

Statistics

Statistics is a set of tools and techniques used for describing, organizing, and interpreting information or data.

2
New cards

Why do we need Statistics? (Ch. 1)

To describe, predict, and explain behaviours or trends.

3
New cards

Descriptive Statistics (Ch. 1)

Used to organize and describe data.

Examples: What is the most common major? What is the average age? What is the most popular baseball team? (They use counts, means, percentages, etc…)

4
New cards

Inferential Statistics (Ch. 1)

Inferential statistics allow you to INFER the truth about the larger group (population) based on the information you gather from this smaller group of people (sample).

(Done after description.)

5
New cards

Population (Ch. 1)

The group you are actually interested in drawing some conclusions about.

<p>The group you are actually interested in drawing some conclusions about.</p><p></p>
6
New cards

Sample (Ch. 1)

It is a subset containing the characteristics of our population, the individuals selected to make conclusions for our population.

<p>It is a subset containing the characteristics of our population, the individuals selected to make conclusions for our population.</p>
7
New cards

Variable (Ch. 1)

Something that can change (vary) or have different values for different individuals. Variables represent the concepts we are interested in. (i.e age, height, major, etc…).

8
New cards

Data (Ch. 1)

Information collected from the sample on the variables we are interested in. The actual numbers, measurements, or characteristics that represent the ideas we are interested in.

9
New cards

Continuous Data (Quantitative) (Ch. 1)

Data measured in a continuum like height, weight, age, etc. All numbers between two endpoints are possible scores.

10
New cards

Categorical data (Ch. 1)

Data that sorts people into categories, like major, eye color, ethnicity, etc.

11
New cards

Measures of Central Tendency (Ch. 2)

Mean, median, mode.

12
New cards
<p>The mean  (Ch. 2)</p>

The mean (Ch. 2)

The most familiar measure of central tendency also known as the “average”. The mean is sensitive to extreme scores.

13
New cards
<p>Formula for the mean  (Ch. 2)</p>

Formula for the mean (Ch. 2)

“X bar” is the symbol for the mean

X is each individual score in the group (the data)

“Σ” or “Sigma” is the summation sign

  • When you see a sigma, you should sum whatever follows it n is the sample size. So when you see ΣX, you sum up all individual scores in the data.

14
New cards

The median (Ch. 2)

The median is the midpoint in a set of scores. The median is not affected by extreme scores.

15
New cards

To find the median (Ch. 2)

To find the median you have to list the values in order (either highest to lowest or lowest to highest) then find the middle-most score. If there is no middle score, add the two nearest middle scores and divide by 2.

16
New cards

Percentile points (Interquartile range or IQR) (Ch.2)

Refers to the percentage of cases equal to and below a certain point in a group of scores. The median is always the 50% percentile.

17
New cards

The mode (Ch. 2)

The mode is the value that occurs most frequently in the data set. Not affected by extreme scores, just count how many times each value appears. Used in categorical dats usually.

18
New cards

Bimodal (Ch. 2)

There can be more than 1 mode—this is referred to as a bimodal data set.

19
New cards

When do we use each measure of central tendency (Ch. 2)

1. Use the mode when the data are categorical

2. Use the mean when the data are continuous and you have no (or very few) extreme scores

  • By far the most common measure of central tendency in psychology

3. Use the median when the data are continuous and you think the mean is misleading because of extreme scores

  • When in doubt—report both!

20
New cards

Weighted mean (Ch. 2)

Shortcut for calculating the mean in a large data set in which certain values show up multiple times

  • List all the values in the sample

  • List how often each value occurs

  • Multiply the values by their frequency

  • Sum all the values in the value times the frequency column

  • Divide by total frequency

21
New cards

Variability (Ch.3)

Tell us how different the scores are from each other. Represent the spread or dispersion in the dataset. Variability helps us understand the nature of our sample and the nature of our variables.

<p>Tell us how different the scores are from each other. Represent the spread or dispersion in the dataset. Variability helps us understand the nature of our sample and the nature of our variables.</p>
22
New cards

Why is variability important? (Ch. 3)

Different variables within the same sample may have different variability and tells us how much spread or dispersion (variability) there is in a data set.

<p>Different variables within the same sample may have different variability and tells us how much spread or dispersion (variability) there is in a data set.</p>
23
New cards

Measures of variability (Ch.3)

Range, standard deviation (SD), and variance.

24
New cards

Range (Ch.3)

The most general measure of variability. Gives an idea of how far apart the scores are from each other. Computed by subtracting the lowest score from the highest score.

  • r = h – l

  • r = range

  • h = highest score in the data set

  • l = lowest score in the data set

(Always give the minimum and maximum values when reporting the range. This provides some context for the range.)

25
New cards

Problems with range (Ch.3)

  • These 2 samples would have the same range

  • Ignores the middlemost values

  • Considers only the most extreme scores

26
New cards

Standard deviation (Ch. 3)

The most common measure of variability. Represents the average amount of variability in a set of scores. Its sensitive to extreme scores.

  • A small/low standard deviation value indicates that most data points tend to be close to the mean

  • A large/high standard deviation value indicates that most data points tend to be far away from the mean

27
New cards
<p>Calculating standard deviation  (Ch. 3)</p>

Calculating standard deviation (Ch. 3)

  • s = Standard deviation (also SD)

  • √ = square root (find the square root of what follows)

  • ∑= summation (find the sum of what follows)

  • x = each individual score

  • X bar = mean of all the scores on x

  • n = sample size

1. List all the scores 2. Compute the mean 3. subtract the mean from each score (x-xbar) 4. Square all the differences 5. Sum all the squared deviations 6. Divide the sum of squares by n-1. 7. Compute the square root of what is left.

28
New cards
<p>Variance  (Ch. 3)</p>

Variance (Ch. 3)

The third measure of variability. If you know the standard deviation, then you know the variance. Variance rarely reported as a descriptive statistic.

29
New cards

Outlier (Ch. 3)

A data point that appears to deviate markedly from the other data points in the sample. anything more than 2 standard deviations away from the mean is a potential outlier. A data point more than 3 standard deviations from the mean is a likely outlier.

30
New cards
<p>To find outliers  (Ch. 3)</p>

To find outliers (Ch. 3)

  • X-bar = mean

  • c = cut off value of interest

  • s = standard deviation

31
New cards
<p>Histograms  (Ch. 4)</p>

Histograms (Ch. 4)

Histograms allow us to see the distribution of our data. In a histogram, the height of each bar is the number of times each value occurs in our data set.

Distributions can vary in 4 ways;

  • Central tendency

  • Variability

  • Skewness

  • Kurtosis

(Bar graphs ≠ histograms. Bar graphs show the frequency of categorical responses. Histograms show the distribution of continuous variables).

  • Note that the y and x-axis can be altered to show a misleading graph.

32
New cards
<p>Symmetrical vs. Skewed Distributions  (Ch. 4)</p>

Symmetrical vs. Skewed Distributions (Ch. 4)

  • Symmetrical Distributions: If you fold the graph in half, the two halves look the same. Mean, median, and mode are in the middle.

  • Skewness: Refers to lack of symmetry. Asymmetrical or lopsided. “Piling up” of X values on one side of the distribution. Positive and negative skewness (AKA Right skewed or left skewed). Mean, median, and mode differ.

<ul><li><p>Symmetrical Distributions: If you fold the graph in half, the two halves look the same. Mean, median, and mode are in the middle.</p></li><li><p>Skewness: Refers to lack of symmetry. Asymmetrical or lopsided. “Piling up” of X values on one side of the distribution. Positive and negative skewness (AKA Right skewed or left skewed). Mean, median, and mode differ.</p></li></ul>
33
New cards

Kurtosis (Ch. 4)

Refers to how peaked vs. flat the distribution.

  • Platykurtic = LOW kurtosis. Relatively flat (more variability).

  • Leptokurtic =HIGH kurtosis. Relatively peaked (less variability).

<p>Refers to how peaked vs. flat the distribution.</p><ul><li><p>Platykurtic = LOW kurtosis. Relatively flat (more variability).</p></li><li><p>Leptokurtic =HIGH kurtosis. Relatively peaked (less variability).</p></li></ul>
34
New cards

Correlations (Ch. 5)

AKA what is the relationship between these two variables. We can compute a correlation when we have scores on two variables (x and y) from each member of our sample.

<p>AKA what is the relationship between these two variables. We can compute a correlation when we have scores on two variables (x and y) from each member of our sample. </p>
35
New cards

Correlation coefficients (Ch. 5)

Correlation coefficients assign a single number to describe the relationship between two variables. Abbreviated as r and ranges from -1 to 1. The coefficient tells us direction (positive or negative)and strength ( Strong, moderately strong, and weak.)

<p>Correlation coefficients assign a single number to describe the relationship between two variables. Abbreviated as r and ranges from -1 to 1. The coefficient tells us direction (positive or negative)and strength ( Strong, moderately strong, and weak.)</p>
36
New cards

Direction (Ch. 5)

Positive or Negative

  • + signs indicate positive (or direct) relationships. Scores on the variables move in the same direction.

  • - signs indicate negative (or indirect) relationships  Scores on the variables move in opposite directions.

<p>Positive or Negative </p><ul><li><p>+ signs indicate positive (or direct) relationships. Scores on the variables move in the same direction.</p></li><li><p>- signs indicate negative (or indirect) relationships  Scores on the variables move in opposite directions.</p></li></ul>
37
New cards
<p>Strength  (Ch. 5)</p>

Strength (Ch. 5)

The magnitude of the coefficient (or how far it is from zero) tells us how strong the relationship is.

  • If r = 0, then no relationship

  • If r = -1 or 1, then perfect relationship

  • The closer r gets to an absolute value of 1, the closer it is to a perfect relationship

( Perfect Relationship (r=+/-1) and no correlation r= 0)

<p>The magnitude of the coefficient (or how far it is from zero) tells us how strong the relationship is.</p><ul><li><p>If r = 0, then no relationship</p></li><li><p>If r = -1 or 1, then perfect relationship</p></li><li><p>The closer r gets to an absolute value of 1, the closer it is to a perfect relationship</p></li></ul><p>( Perfect Relationship (r=+/-1) and no correlation r= 0)</p>
38
New cards

Limitations of correlation coefficients (Ch. 5)

  • The correlation coefficient can only be used to identify linear relationships.

  • Can be curvilinear

  • There could be restrictions on the range

    • Occurs when most subjects have similar scores on one of the

    variables being correlated

    • The value of the correlation coefficient is reduced and does

    not capture the true relationship between the variables

  • Lastly, outliers.

39
New cards

Curvilinear (Ch. 5)

The variables below definitely have a relationship, but the correlation coefficient will not find it.

  • Note that r = 0 for both of these pictures.

<p>The variables below definitely have a relationship, but the correlation coefficient will not find it. </p><ul><li><p>Note that r = 0 for both of these pictures.</p></li></ul>
40
New cards

Coefficient of determination (Ch. 5)

The coefficient of determination represents how much variation 2 variables share (in percentages)

  • In other words, how much of the variance in variable X can be accounted for, or is shared, by the variance in Y (and vice-versa)

  • Another way to say how much 2 variables have in common

<p>The coefficient of determination represents how much variation 2 variables share (in percentages)</p><ul><li><p>In other words, how much of the variance in variable X can be accounted for, or is shared, by the variance in Y (and vice-versa)</p></li><li><p>Another way to say how much 2 variables have in common</p></li></ul>
41
New cards

Coefficient of determination pt. 2 (Ch. 5)

The more 2 variables have in common, the more variance they will share.

  • But even strong relationships (like the one we calculated, r = .61) have a lot of variance leftover (100% – 37%)

  • 63% of the variance in Y is accounted for by factors OTHER than X.

  • This variance that is leftover is called the coefficient of alienation.

42
New cards

Correlation v. causality (Ch. 5)

Correlation DOES NOT EQUAL causation. We can never definitively assume causation from a correlational relationship.

43
New cards

Reverse causation (Ch. 5)

The causal direction may be opposite from what has been hypothesized.

<p>The causal direction may be opposite from what has been hypothesized.</p>
44
New cards

Reciprocal causation (Ch. 5)

Two variables cause each other. (“Spiral effect”/cyclical)

<p>Two variables cause each other. (“Spiral effect”/cyclical)</p>
45
New cards

Common casual variables (Third variables) (Ch. 5)

Some unknown third variable is actually influencing the two variables in the correlation.

<p>Some unknown third variable is actually influencing the two variables in the correlation.</p>
46
New cards

Computing a correlation coefficient (Ch. 5)

  • rxy = the correlation between x and y

  • n is the sample size

  • X is each individual's score on the X variable

  • Y is each individual’s score on the Y variable

  • XY is the product of each X score times its corresponding Y score

  • X2 is each individual's X score squared

  • Y2 is each individual’s Y score squared

<ul><li><p>rxy = the correlation between x and y</p></li><li><p>n is the sample size</p></li><li><p>X is each individual's score on the X variable</p></li><li><p>Y is each individual’s score on the Y variable</p></li><li><p>XY is the product of each X score times its corresponding Y score</p></li><li><p>X2 is each individual's X score squared</p></li><li><p>Y2 is each individual’s Y score squared</p></li></ul>
47
New cards

Measurement (Ch. 6)

The act or process of assigning numbers to phenomena according to a rule.

48
New cards

Properties of measurements / scales (Ch. 6)

The type of scale we use has important implications for what kind of statistics we use.

  • Nominal scales

  • Ordinal scales

  • Interval scales

  • Ratio scales

49
New cards

Nominal scales (Ch. 6)

Categorical. The categories must be mutually exclusive. Data is in the form of counts/percentages. Nominal = nameable

50
New cards

Ordinal scales (Ch. 6)

The number is a ranking. Not clear how much “distance” separates the data points on the scale. Ordinal = ordering.

51
New cards

Interval scales (Ch. 6)

Ordered events with equal spacing (based on an underlying continuum). Zero (0) not necessarily meaningful • May not even be part of the potential range. EQUAL INTERVALS.

52
New cards

Ratio scales (Ch. 6)

Similar to interval scale, but 0 has a specific meaning. zerO = ratiO

  • 0 = the complete absence of the attribute

53
New cards
<p>Reliability vs validity  (Ch. 6)</p>

Reliability vs validity (Ch. 6)

  • Reliability: How do I know that the measure I use works consistently?

  • Validity: How do I know that the measure I use measures what it is supposed to (i.e., accuracy)?

54
New cards

Observed v. true scores (Ch. 6)

  • Observed score = the score you actually got

  • True score = the true reflection of what you really know

55
New cards
<p>Error score (Or “measurement error”)  (Ch. 6)</p>

Error score (Or “measurement error”) (Ch. 6)

Discrepancy between observed score and true score. The less error the more reliable/valid the measure can be.

56
New cards

Reliability (Ch. 6)

The consistency or reproducibility of a measure or method.

4 types-

  • Test-retest

  • Parallel forms

  • Internal consistency reliability - assessed by Cronbach’s Alpha (α)

  • Inter-rater

57
New cards

Validity (Ch. 6)

How accurately a method measures what it is intended to measure.

3 types-

  • Content

  • Criterion

  • Construct

58
New cards

Criterion validity (Ch. 6)

2 other sections/ divisions: Examples-

  • Concurrent Validity: Does the measure correlate with

    grades in PSBI 301? Class attendance? Etc.

  • Predictive Validity: Does the measure predict who will

    have a job using stats 10 years from now?

59
New cards

Construct validity (Ch. 6)

2 other sections/ divisions: Examples-

  • Convergent Validity- Does the measure relate to other things it should?

  • Discriminant Validity- Does the measure NOT relate to things it shouldn’t?