Statistics and Data Analysis: Key Concepts and Methods

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/37

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 12:17 AM on 5/18/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

38 Terms

1
New cards

Nominal LOM

categories w no order

- qual and simplest LOM

ex) eye color, major

2
New cards

Ordinal LOM

can be ranked

- qual

ex) class ranks

3
New cards

Interval LOM

# data w no true 0

- quan

ex) temperature

4
New cards

Ratio LOM

# data with a true 0

- quan and most info

ex) height, age, money

5
New cards

Random sampling

everyone has equal chance of being selected

6
New cards

stratified sampling

divide pop into groups and select from each group

ex) students from every grade level

7
New cards

descriptive statistics

describe/ summarize data you already have

8
New cards

QUALitative data

categorical responses

9
New cards

QUANitative data

# data

10
New cards

inferential statistics

uses a sample to make conclusion about a population

11
New cards

systematic sampling

everyone assigned number, every __kth # is picked

ex) every 10th customer

Formula: F= N/n= kth number

12
New cards

cluster sampling

divide pop into clusters & select 1 or more to be included

** at least 1 group left out (diff than stratified)

ex) NYC boroughs but leaving Stat Island out

13
New cards

histogram

bars touching

- used w quan data

ex) weight, age...70-80, 80-90, 90-100

14
New cards

bar graph

space between bars

- used w qual data

ex) major and eye color... green, blue, brown

15
New cards

Scatterplot

shows relationship between 2 variables

- shows correlation/ association; r

16
New cards

boxplot

shows median, quartiles, min & max, spread of data

- can help show outliers

17
New cards

skewed right distribution

data toward left

- mean> median

18
New cards

skewed left distribution

data toward right

- median> mean

19
New cards

mean vs median vs mode

mean: avergae (add all and divide)

median: middle value, 50th percentile, in ordered data (avrg of 2 middle ones if uneven)

mode: mode frequent value

20
New cards

standard deviation

big SD= data spread out

small SD= data close to mean

21
New cards

quartiles and IQR

q1= first quartile 25%

q2= second quartile/ median 50 %

q3= third quartile 75%

IQR= q3-q1

22
New cards

Probability

- probability NEVER (-)

- between 0 & 1 (not bigger than 1)

23
New cards

independent vs dependent events

- one event doesnt affect probability of another event

- one event changes the probability of another event

24
New cards

Empirical Rule

1 sd= 68 % of data

2 sd= 95% of data

3 sd= 99.7% of data

<p>1 sd= 68 % of data</p><p>2 sd= 95% of data</p><p>3 sd= 99.7% of data</p>
25
New cards

Z Score

# of sd's from the mean

+Z = above mean

-Z= below mean

26
New cards

Hypothesis Testing (5 steps)

Z test = po (sigma) sd. t test = 's' sd

- null (no change) and alt (claim) hypothesis

- locate z critical values (table F with alpha & 1 or 2 tails)

- draw distribution w critical values to see rejection region

- test statistic

- decision; reject or not reject

- summary of results

<p>Z test = po (sigma) sd. t test = 's' sd</p><p>- null (no change) and alt (claim) hypothesis</p><p>- locate z critical values (table F with alpha &amp; 1 or 2 tails)</p><p>- draw distribution w critical values to see rejection region</p><p>- test statistic</p><p>- decision; reject or not reject</p><p>- summary of results</p>
27
New cards

p- value

- compare p-val to a to decided to reject or not

- p is low h0 goes (reject)

- p is high h0 stays (dont reject)

ex) if a= .05 and p= .02.... .02<.05 .... Reject H0

if a=.05 and p= .18.... .18>.05 .... Don't Reject H0

28
New cards

Correlation

relationship between2 variables

correlation coefficient (r) = sample strength and direction of correlation

+ correlation = variables same direction

- correlation = variables in opposite directions

29
New cards

when correlation coefficient r is close to 1 or -1, what does it mean?

strong positive or negative correlation

- closer to 0 = weak/ no correlation

30
New cards

In Pearson's Correlation... x = ? and y = ?

x = independent

y = dependent

31
New cards

simple linear regression analysis

should be done is Null is rejected from Correlation Test

Formula = y= a + bx

** can find a and b in calc

<p>should be done is Null is rejected from Correlation Test</p><p>Formula = y= a + bx</p><p>** can find a and b in calc</p>
32
New cards

coefficient of determination

aka Explained Variable

- r^2, then make a %

- tells us how much variability in y can be explained by x

33
New cards

coefficient of non-determination

aka Unexplained Variable

- subtract the explained variable from 100%

- tells us what is unexplained

34
New cards

Chi Square test of Independence

- tests if 2 variables are related or not

. Null = variables are not dependent

. Alt = variables are dependent

. degree of freedom = (rows -1)(columns -1)...table G

. always one positive c.v.

35
New cards

Chi Sqaure: test stat

** need to have the expected counts (observed counts given)

- Formula for E.C. = (Row total)(Column total) / entire total

. E.C. gives # it should be if variables are independent/ not related

<p>** need to have the expected counts (observed counts given)</p><p>- Formula for E.C. = (Row total)(Column total) / entire total</p><p>. E.C. gives # it should be if variables are independent/ not related</p>
36
New cards

not true flashcard

ok

37
New cards

chi sq = .. data

correlation/regression = …. data

  • categorical - use table of counts/ freq

  • numerical - pairs of #

38
New cards