psyc011 week 1

chapter 1

part 1

famous statisticians
- karl pearson - chi square to decide among theories
- ronald fisher - experiments w random assignment
- jerzy neyman & egon pearson - NHST
- jacob cohen - effect sizes & power
- geoff cumming - the “new” stats
- john kruschke - bayesian stats
field of stats is a human endeavor designed to help people develop reasoned beliefs about variable processes
descriptive statistics - number that conveys a particular characteristic of a set of data
- ex) mean - arithmetic average; sum of scores divided by number of scores
inferential statistics - method that uses sample evidence and probability to reach conclusions about unmeasurable populations
- AKA smaller size of population too big
null hypothesis statistical testing (NHST) - technique producing probabilities accurate when the null hypothesis is true and certain assumptions are met

part 2

population - all measurements of a specified group
- sample - measurements of a subset of a population
parameter - numerical or nominal characteristic of a population
statistic - numerical or nominal characteristic of a sample
variable - something that exists in more than one amount or in more than one form
quantitative variable - variable whose scores indicate different amounts
- continuous variable - a quantitative variable whose scores indicate different amounts
  - note** these can identified by being able to have half numbers (ex. you can have a 9.5 shoe size or get somewhere in 1.5 hours)
- discrete variable - variable for which intermediate values between scores are not meaningful
  - note** these can be identified by not being able to have a half number (ex. you cant have half a student in a class)
categorical variables - variable whose scores differ in kind, not amount
- ex) race, religion, college major

part 3

scales of measurement
- nominal scale - measurement scale in which numbers serve only as labels and do not indicate quantitative relationship
  - ex) yes/no responses, colors (blue, green, red)
- ordinal scale - measurement scale in which numbers are ranks; equal differences between numbers do not represent equal differences between the things measured
  - ex) bachelor’s, master’s, PhD
- interval scale - measurement scale with meaningful intervals but no true zero point
  - ex) zero in celsius vs fahrehnheit
- ratio scale - measurement scale with meaningful intervals and a true zero point
  - height, weight, age
experimental design
- independent variable - variable controlled by the researcher
- dependent variable - observed variable that is expected to change
- level - one value of the of the independent variable
- treatment - one value/level of the independent variable
  - its basically like the most extreme level
- extraneous variable - variable other than the independent variable that may affect the dependent variable
Epistemology - study or theory of nature of knowledge
- epistemology aims to answer
  - what do we know
  - what does it mean to say that we know something
  - what makes justified beliefs justified
  - how do we know that we know
steps to statistical analysis
- 1) explore data
- 2) answer the question - what are the effects that chance could have on the descriptive stats I calculated?
- 3) write the story explaining the relationships in the data

chapter 2

part 1

simple frequency distribution
- 1) find lowest and highest scores
- 2) in columns, write all numbers from greatest to least
- 3) name the column
- 4) start w number in upper left, draw line under it, tally mark beside number in column
- 5) continue underlining and tall
computational tools
- R, MATLAB, Excel or sheets, python, SAS, SPAA, etc
- in this class, jamovi (free, based on R, similar to SPSS)

part 2

symmetric distributions
- normal distribution (normal curve) - graph of observed scores with particular shape
- rectangular distribution - occurs when frequency of each value on x-axis is the same

chapter 3

part 1

central tendency - descriptive statistics that indicate a typical or representative score
- mean - arithmetic average; sum of scores divided by number of scores
- median - point that divides a distribution of scores into equal halves
- mode - score that occurs most frequently in a distribution
- weighted mean - overall mean calculated from two or more samples with different Ns

part 2

median - point that divides distribution of scores into two equal parts
finding median:
- order scores greatest to least
- median itself should have equal number of scores above and below it
  - if odd # of scores, median in the middle
  - if even, median halfway between middle 2 (add both scores and divide by 2)
mode - most frequently occurring score; score observed most often
scale of measurement
- interval or ratio - mean, median, mode
- ordinal - median or mode
- nominal - mode only
skewed data
- if data very skewed, median may be better than mean
- EX) income - mean $83K, median $59K
open-ended class intervals
- ex) “75 and over” or “$200K or more”
estimating answers
- estimating helps you think the problem through
- estimates help you catch errors when compared to computer output
- catching errors increases your credibility

chapter 4

part 1

variability - having a range of values
- think of it as differences
to summarize group of numbers
- 1) how big or small or the numbers?
- 2) are the numbers mostly close to each other or far apart?
range - highest minus lowest score
- not very informative because only two scores matter in its computation
  - drastically different distributions can give rise to same mean and range
interquartile range - the middle 50% of the observations
- used to avoid instability of the range statistic
- just trimmed off top and bottom 25% of data
to find interquartile range, must find 25th and 75th percentile
percentile - point below which a specified percentage of distribution falls

part 2

standard deviation - descriptive measure of the dispersion of scores around the mean

deviation score - raw score minus the mean of its distribution
- just says how far a score is from mean

part 3

chapter 5

part 1

z score - score expressed in standard deviation units
- they are comparative, dependent on all other scores
- as descriptive stat, vary from -3 to +3
standard score - score expressed in standard deviation units; z is one example
- ex) IQ scores, z scores,

part 2

outlier - extreme score separated from others and at least 1.5 * IQR beyond the 25th or 75th percentile
elements of boxplot
- boxplot - graph that shows a distribution’s range, IQR, skew, median, and sometimes others
- median - line in middle of box
- mean - dot in box
- 25th percentile - bottom or left side of box
- 75th percentile - top or right side of box
- whiskers - extend to lowest and highest scores in distribution
- central tendency - mean or median
- variability
  - IQR - size of box
  - distance between end of whiskers
- skew
  - position of mean relative to median
  - relative length of the whiskers
effect size index - amount of degree or separation between two distributions
- use whatever units make most sense
- when scale is unfamiliar, use standard statistic “d”
descriptive statistics report
- table showing mean, median, percentiles, effect size, etc