data reduction

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/16

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

17 Terms

1
New cards

data types

  • Nominal e.g. fluvial or answer to yes or no Q

  • Ordinal - e.g. 1,2,3

  • Continuous - interval and ratio e.g. mass, time, concentration, distance

  • scalar - isolated value that represents something e.g. height or weight

2
New cards

internal data

differences between measurements, but no true zero

3
New cards

ratio data

differences between measurements, the true zero exists

4
New cards

raw data

  • can not manage

  • difficult to see patterns

  • need to process it

  • use descriptive stats e.g. most common value

5
New cards

data assessment

min, max, range, middle values ?

plot data

outliers ?

trends/patterns ?

6
New cards

frequency data - histograms

how many of each value / class in the sample

for nominal and ordinal data - counting occurrences

frequency distribution - what is the shape ?

7
New cards

central tendency

middle values

8
New cards

dispersion

how spread are the values

9
New cards

central values

most useful if distribution is peak-shaped - cluster around centre

unimodal distributions are common

most values - central values - some variation on either side e.g. height

10
New cards

measures of central tendency

median, mode, mean

11
New cards

relationship of mean, median and mode

symmetrical unimodal distribution

mean = median = mode

non-symmetrical (skewed - unimodal distribution of values

leptokurtic = more than 0

platykurtic = less than 0

<p>symmetrical unimodal distribution </p><p>mean = median = mode </p><p>non-symmetrical (skewed - unimodal distribution of values </p><p>leptokurtic = more than 0 </p><p>platykurtic = less than 0 </p>
12
New cards

measure of dispersion when using media

interquartile range

13
New cards

measure of dispersion when using mean

standard deviation

interval scale data

unimodal

14
New cards

comparing standard deviations - coefficient of variation

useful if 2 values have 2 different means or 2 different scales

to do this divide standard deviation by mean then multiply by 100 for both samples

see which one has more variation - the one with higher percentage

15
New cards

moments of distribution

moments are indicators if the distribution shape - linked by powers of differenced

the s on the pic = standard deviation

<p>moments are indicators if the distribution shape - linked by powers of differenced </p><p>the s on the pic = standard deviation </p><p></p>
16
New cards

uncertainty

measurements naturally lead to uncertainty

unavoidable

include uncertainty estimated

weighing provides a measure of importance

17
New cards

root mean squared

if values fall above and below zero - will be in the middle of the values

sometimes useful to know absolute magnitude of the values - how far the values are from 0

rms - achieves this by taking the average of the squared values

<p>if values fall above and below zero - will be in the middle of the values </p><p>sometimes useful to know absolute magnitude of the values - how far the values are from 0 </p><p>rms - achieves this by taking the average of the squared values </p>