HSCI 190 module 2

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/35

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

36 Terms

1
New cards

bar graphs

summarize categorical data, each bar represents the count and the category

2
New cards

histograms

depict scale data

3
New cards

histograms x axis, y axis

x axis: independent variable is broken into intervals on continuous scale, bars next to eachother

y axis: always start at 0, shows the count of values w/i each interval

4
New cards

width of bar, histogram

represents the size of the interval

5
New cards

area of all bars in a histogram

represents true frequency, because area = 100% of data, absolute and relative freq doesnt matter, it will have the same shape.

6
New cards

unimodial

graph having 1 peak(mode)

<p>graph having 1 peak(mode)</p>
7
New cards

bimodial

graph having 2 clear peaks(modes)

<p>graph having 2 clear peaks(modes)</p>
8
New cards

multimodial

graph having 3+ peaks(modes)

9
New cards

symmetric graph

mean, median, mode all same at peak

10
New cards

positive skewed graph

mode, median, mean(in that order)

11
New cards

negative skewed graph

mean, median, mode (in that order

12
New cards

frequency polygons

visualizing scale data, same axises as histogram (but x axis marks midpoint of each interval rather than the borders of interval), then a line going thru middle of each bar

- good for comparing multiple groups

<p>visualizing scale data, same axises as histogram (but x axis marks midpoint of each interval rather than the borders of interval), then a line going thru middle of each bar</p><p>- good for comparing multiple groups</p>
13
New cards

cumulative frequency polygon

- the sum of counts up to a certain interval

<p>- the sum of counts up to a certain interval</p>
14
New cards

one way scatter plots

- categorical or scale, uses a single axis to display the relative potition of each point in a group

- advantage: all obs are represented individually

- downside: can becomes hard to read if a lot of points are close together

- can be vertical or horizontal

<p>- categorical or scale, uses a single axis to display the relative potition of each point in a group</p><p>- advantage: all obs are represented individually</p><p>- downside: can becomes hard to read if a lot of points are close together</p><p>- can be vertical or horizontal</p>
15
New cards

boxplots

categorical or scale data

- anything beyond adjacent values are considered extreme values and are plotted as individual dots

<p>categorical or scale data</p><p>- anything beyond adjacent values are considered extreme values and are plotted as individual dots</p>
16
New cards

one way scatter or boxplot for single axis data?

scatter: good to show many obs, maybe overwhelming

box: nice summary of iqr, range, easy to interpret, doesnt tell u obs, excludes extreme values, used more often

17
New cards

two way scatter plots

depict relationship between 2 scale variables

each point represents where the x and y axis meet

- multiple can be used to compare groups (need different colours and a legend)

18
New cards

line graphs

represent relationship between 2 scale variables

- each point on the x axis has a corresponding y value

- most commonly the scale of x axis represents time

19
New cards

what constitutes a potential outlier - mean and stdv

values more than 2 stdvs above or below the mean

20
New cards

what constitutes a potential outlier - median and iqr

values more than 1.5 times the iqr above or below the quartiles

21
New cards

which outlier equation to use?

- median and IQR=most stable bc outliers can impact the mean

- mean and stdv=easier to interpret

22
New cards

data entry errors

types, can cause an outlier, double check data, correct error, re analyze

23
New cards

process error

issue when collecting data, if possible re do data collection, if not possible, remove outlier

24
New cards

when to keep outliers

when they appear to be genuinely obtained and might give new insight on a phenomenon or signal another group that should be accounted for

25
New cards

missing factor

reflect on if there is another factor that might impact outcome that should be considered - reflect on if you should remove or not and explain

26
New cards

random chance

the value is theoretically possible but highly unlikely - reflect on if you should remove or not and explain

27
New cards

random sampling

random selection is used to choose observations

each thing has equal chance of being included in sample

28
New cards

non random sampling

the items included in study are selected for a reason (proximity, feasibilty)

may not give you representative sample of population

29
New cards

sampling bias

when each memeber of relevant population does not have equal chance of ending up in sample

30
New cards

response bias

when participants give answered they believe the researcher wants or socially accepted answers

31
New cards

survirorship bias

when inidviduals leave the study and the researcher continues to measure the remaining participants without considering those that left

32
New cards

recall bias

when participants don’t remember past events properly or omit details, especially when measuring data long time after the event

33
New cards

representative samples

when your data represents the entire population

34
New cards

random sampling benefits and downfalls

can mitigate sampling bias, can ensure representative samples, but challening

35
New cards

transparency

sometimes not possible to use random sampling so you have to be transparent with methods so others can find possible mistakes

36
New cards

central tendency for skewed data

median because it takes extreme values into account but not greatly impacted by them