Math 183 Part 2 (Descriptive Statistics and Plots)

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/29

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

30 Terms

1
New cards
Nominal variable
A categorical variable with no natural order (e.g., gender, region, waterfront).
2
New cards
Ordinal variable
A categorical variable with a meaningful order or ranking (e.g., grade, condition, view, yr built, date).
3
New cards
Discrete numerical variable
A numerical variable that takes only specific, separate values (e.g., number of bedrooms, number of bathrooms, number of floors).
4
New cards
Continuous numerical variable
A numerical variable that can take any value within an interval (e.g., price, square footage, latitude, longitude).
5
New cards
Bar plot
A graph used to display the counts or proportions of categories for a categorical or discrete variable.
6
New cards
Pie chart
A circular chart divided into sectors, each sector shows the proportion of each category.
7
New cards
Side-by-side bar plot
A bar plot comparing the distribution of a categorical variable across groups.
8
New cards
Segmented bar plot
A bar plot where each bar is divided into colored segments to show subcategories within the main categories.
9
New cards
Contingency table
A table showing the relationship between two categorical variables.
10
New cards
Histogram
A plot of the frequency or proportion of data falling into each of several bins for a numerical variable.
11
New cards
Binning
Grouping continuous or discrete numerical values into intervals for summarization (e.g., bin years by decade).
12
New cards
Strip chart
A plot where each data point is shown as a small mark along a line, good for visualizing outliers and spread.
13
New cards
Box plot
A plot showing the median, quartiles, range, and potential outliers of a numerical variable.
14
New cards
Mean
The arithmetic average of a numerical dataset.
15
New cards
Median
The middle value in a numerical dataset when ordered.
16
New cards
Quantile
A value that divides the data into equal-sized intervals (e.g., quartiles, percentiles).
17
New cards
Interquartile Range (IQR)
The difference between the 75th and 25th percentiles; describes the spread of the central 50% of data.
18
New cards
Range
The difference between the maximum and minimum values in a dataset.
19
New cards
Variance
A measure of how far data values are from the mean, on average.
20
New cards
Standard deviation
The square root of variance; measures typical distance from the mean.
21
New cards
Robust statistic
A statistic (like median or IQR) that is not heavily affected by outliers.
22
New cards
Skewness
A measure of the asymmetry of a distribution (right-skew = longer right tail, left-skew = longer left tail).
23
New cards
Outlier
A data point much higher or lower than the rest; can affect mean and standard deviation.
24
New cards
Transformation
Applying a function (like log) to a variable to reduce skewness or change its scale.
25
New cards
Scatterplot
A plot that displays the relationship between two numerical variables using points.
26
New cards
Covariance
A measure of how two variables change together (positive: increase together, negative: one up, other down).
27
New cards
Correlation
A standardized measure of linear association between two variables, ranges from -1 to 1.
28
New cards
Simultaneous comparison
Comparing distributions or statistics (like mean or median) across different groups.
29
New cards
Side-by-side boxplot
A plot for comparing the distributions of a numerical variable across groups.
30
New cards
Motivation for inference
Statistical analysis helps determine if observed differences (e.g., between groups) are likely to be real or just due to chance.