set of methods for obtaining, organizing, summarizing, presenting, and analyzing data.
data
characteristics measured on individuals or units
population
the totality of individuals or units about which we want information
sample
subset of units in a population
variable
a characteristic or property of an individual; ex: time until a light bulb burns out, distance traveled
categorical data
values of a categorical variable, which places individuals into one of several groups; ex: gender of a newborn, eye colour
qualitative
relating to or involving comparisons based on qualities rather than quantities - categorical - "types": names, symbols, things
categorical and ordinal
categorical variables whose values have an ordering that makes sense
categorical and nominal
categorical variables whose values have no ordering that makes sense
quantitative data
values of quantitative variables, for which arithmetic such as adding and averaging makes sense; ex: height, exam scores, volume, sums
distribution of data
tells us what values a variable takes and how often it takes these values; the values don't have to be quantitative
bar charts
display a variable's values on one axis and their frequencies on the other - spaces between bars imply no continuity - used for categorical variables, so they display categorical data
bar and pie charts
categorical data
stemplots, histogram, timeplot
quantitative data
pie charts
visual representation of the relative frequency of each category
minimum
the smallest value in the data set
maximum
the largest value in the data set
frequency distribution
count of how many of our data values fall into various predetermined classes or intervals
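A minimal sketch of building a frequency distribution, assuming classes of the form [lower, upper) and made-up exam scores; the function name `frequency_distribution` and the class boundaries are illustrative choices, not part of the card:

```python
def frequency_distribution(values, edges):
    """Count how many values fall into each class [edges[i], edges[i+1])."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(edges) - 1):
            if edges[i] <= v < edges[i + 1]:
                counts[i] += 1
                break  # each value belongs to at most one class
    return counts


scores = [52, 67, 71, 75, 78, 83, 88, 91, 95]  # made-up exam scores
edges = [50, 60, 70, 80, 90, 100]              # predetermined class boundaries
counts = frequency_distribution(scores, edges)

for lower, upper, n in zip(edges, edges[1:], counts):
    print(f"[{lower}, {upper}): {n}")
```

The counts add up to the number of data values as long as every value lies inside one of the classes.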
continuous variable
can take any value within a given range - a type of quantitative variable - ex: weight
discrete variable
- can only take a countable number of values; ex: number of children in a family - measured in certain numbers only; ex: number of pets
relative frequency or proportion
found by dividing the number of data values in each class by the total number of data values (the sum of all class counts)
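As a rough illustration of this calculation, the sketch below counts made-up eye colours with `collections.Counter` and divides each count by the total; the data and variable names are assumptions for the example:

```python
from collections import Counter

# Made-up categorical data
eye_colours = ["brown", "blue", "brown", "green", "brown", "blue", "brown", "hazel"]

counts = Counter(eye_colours)   # frequency of each class
total = sum(counts.values())    # total number of data values

# Relative frequency = class count / total count
relative_freq = {colour: n / total for colour, n in counts.items()}

for colour, p in relative_freq.items():
    print(f"{colour}: {p}")
```

Each relative frequency lies between 0 and 1, and together they add up to 1.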
proportion
values between 0 and 1 - inclusive - decimal representation of fractions - the proportions of all intervals must add up to 1
histogram
- used for large amounts of data - a form of bar graph with no spaces between bars - the lack of spaces reflects continuity
symmetric data
data whose histogram, when divided by a vertical line at its centre, splits into two halves that are approximate mirror images
time plots
used for plotting time series data - values measured over time - time plotted on the x-axis - variable values plotted on the y-axis as points, connected to show a trend
measure of centre
-mode -median -mean
measure of spread
- range - standard deviation
location and variability
are the two important features of a data set
mode
the most frequently observed value
median
the middle value of the ordered data set
mean
the average: the sum of the data values divided by the number of data values
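The three measures of centre can be compared on one small data set. This is a sketch using Python's standard `statistics` module on made-up numbers:

```python
import statistics

data = [2, 3, 3, 5, 7, 10]  # made-up data, already in order

mode = statistics.mode(data)      # most frequent value: 3
median = statistics.median(data)  # even count, so average of the two middle values: (3 + 5) / 2
mean = statistics.mean(data)      # sum / count: 30 / 6

print(mode, median, mean)
```

With an even number of values there is no single middle value, so the median is taken as the average of the two middle values.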
outliers
extreme values; they have little or no effect on the value of the median, which is "resistant" to the effect of outliers
mean, range, standard deviation
affected by outliers
median, mode
resistant to outliers
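To see the difference between affected and resistant measures, the sketch below adds one extreme value to a made-up data set and recomputes the mean, median, and range; the numbers are invented for illustration:

```python
import statistics

incomes = [30, 32, 35, 38, 40]   # made-up values
with_outlier = incomes + [400]   # same data plus one extreme value

mean_before = statistics.mean(incomes)
mean_after = statistics.mean(with_outlier)

median_before = statistics.median(incomes)
median_after = statistics.median(with_outlier)

range_before = max(incomes) - min(incomes)
range_after = max(with_outlier) - min(with_outlier)

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
print("range: ", range_before, "->", range_after)
```

In this example the mean and range change a great deal, while the median moves only slightly.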
mean and median
are approximately equal for a symmetric distribution
weighted mean
instead of each of the data points contributing equally to the final average, some data points contribute more than others.
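A minimal sketch of a weighted mean, assuming the usual formula (sum of value times weight, divided by the sum of the weights); the helper name `weighted_mean`, the grades, and the weights are made up for illustration:

```python
def weighted_mean(values, weights):
    """Sum of value * weight, divided by the sum of the weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


grades = [80, 90, 70]   # made-up grades: assignment, midterm, final
weights = [2, 3, 5]     # the final counts most, the assignment least

weighted = weighted_mean(grades, weights)   # (160 + 270 + 350) / 10
unweighted = sum(grades) / len(grades)      # every grade counts equally

print(weighted, unweighted)
```

Here the weighted mean is pulled toward the final grade of 70 because that grade carries the largest weight.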
variability
the difference being exhibited by data points within a data set, as related to each other or as related to the mean
range
the difference between the highest and lowest values.
interquartile range
measures the length of the interval that covers the middle 50 percent of the ordered observations in a data set
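The sketch below computes the range and the interquartile range for made-up data. It assumes the "inclusive" quartile method of Python's `statistics.quantiles` (Python 3.8+); textbooks use several quartile conventions, so a course's hand method may give slightly different quartiles:

```python
import statistics

data = [1, 3, 4, 6, 7, 8, 10, 12, 15]  # made-up ordered observations

data_range = max(data) - min(data)     # highest value minus lowest value

# Quartiles split the ordered data into four parts.
# method="inclusive" is an assumption; other conventions exist.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1                          # length of the middle 50 percent

print("range:", data_range)
print("Q1:", q1, "Q3:", q3, "IQR:", iqr)
```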
percentiles
describe how a score compares with other scores from the same set; roughly, a score at the pth percentile has about p percent of the scores at or below it.
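One way to compare a score with the rest of its set is a percentile rank. The sketch below assumes the "percent of scores at or below" convention, which is only one of several in use; the function name `percentile_rank` and the scores are made up:

```python
def percentile_rank(scores, score):
    """Percentage of scores in the set that are at or below the given score."""
    at_or_below = sum(1 for s in scores if s <= score)
    return 100 * at_or_below / len(scores)


scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]  # made-up test scores
rank = percentile_rank(scores, 85)                  # 7 of the 10 scores are <= 85

print(rank)
```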