Send a link to your students to track their progress
23 Terms
1
New cards
The 6 Data Analysis Process Steps
1. Understanding the nature of the problem: knowing goals and what we hope to answer 2. Deciding what to measure and how to measure it: carefully define variables 3. Data collection: existing/new data 4. Data summarization and preliminary analysis: summarize graphically and numerically 5. Formal data analysis: select and apply statistical methods 6. Interpretation of results: What can we learn? What conclusion can we draw? How can results guide further research?
2
New cards
population
the entire collection of individuals or objects about which information is desired
3
New cards
census method
collecting data about the entire population
4
New cards
sample
subset of the population selected for study
5
New cards
variable
a characteristic whose value may change from one observation to another
6
New cards
data
a collection of observations on one or more variables
7
New cards
independent variable
variable whose variation DOES NOT depend on that of another
8
New cards
dependent
variable whose variation DOES depend on that of another
9
New cards
categorical data set
qualitative data
10
New cards
numerical data set
quantitative data (can be discrete or continuous)
11
New cards
discrete (numerical) variables
possible values of the variable correspond to isolated points on the number line; takes finite or countably infinite values. (EX: number of books, number rolled on a die)
12
New cards
continuous (numerical) variables
set of possible values forms an entire interval on the number line; cannot be accurately precise. (EX: height=1.722222... meters)
13
New cards
descriptive statistics
includes numerical, graphical, and tabular methods for organizing and summarizing data
14
New cards
inferential statistics
involves generalizing a sample to the population and assessing the reliability of such generalizations
15
New cards
univariate data
a data set consisting of observations on a single characteristic
16
New cards
bivariate data
data of two variables (EX: ice cream sales vs. temperature outside)
17
New cards
multivariate data
data of three or more variables; two or more dependent variables
18
New cards
frequency distribution for categorical data
a table that displays the possible categories along with the associated frequencies and/or relative frequencies
19
New cards
frequency
the number of times the category appears in the data set
20
New cards
relative frequency
frequency/number of observations in the data set (shown as a percentage)
The proportion of the observations that belong to that category
21
New cards
relative frequency distribution
a frequency distribution that includes relative frequencies
22
New cards
bar chart
graph of frequency distribution of categorical data. Each category is a bar/rectangle, and the area of each bar is proportional to the corresponding frequency or relative frequency.
23
New cards
dotplot
a simple way to display numerical data when the data set is reasonably small. Each observation is represented by a dot above the location corresponding to its value on a horizontal measurement scale.