Data Analysis - ISEE 120

0.0(0)

Studied by 0 people

Call with Kai

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/23

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

24 Terms

New cards

Population

Total set of observations that can be made

The set of data (numerical or otherwise) corresponding to the entire collection of units about which information is sought

New cards

Sample

a set of observations drawn from a population

A set of data collected and/or selected from a population by a defined procedure.
Necessary to use sample for research
- Impractical to study whole population
- Rely on samples to make estimates related to population

New cards

Data

numerical facts

New cards

Statistics

measurement and modeling of random variables
estimates of population parameters

New cards

Continuous

values that can be measured (decimal/fractional)

New cards

Discrete

values that can be counted (whole numbers/distinct)

New cards

Graphical Data

used for quick information relay

New cards

Histograms

visual representation of distribution of quantitative data
x-axis = “bins”, y-axis = count of values

New cards

Line Charts

uses points connected by line segments (left to right) to demonstrate changes in value
X-axis = time, y-axis = reported values
can also be used for comparison of data

New cards

Correlation Graphs (Scatter Plots)

values represented as dots, showing a relationship between two variables, correlation shown by shape of the dots
x-axis = factor 1, y-axis = factor 2

New cards

Pie Charts (% Charts)

shows the total sum of a whole, and what percent each makes up

New cards

Pareto Chart

similar to a bar chart, sorts values highest to lowest

New cards

Bar Chart

compares two values with bars either stemming from the y or x axis, looks similar to Pareto charts or histograms

New cards

Mean

the average of all values

New cards

Median

the middle value in a chronological set of data

New cards

Mode

the most frequent value in a data set

New cards

Range

the difference of the largest and lowest values

New cards

Standard Deviation (SD)

the square root of the variance
used to measure distance away from mean to calculate certain percentiles

New cards

Normal Distribution

SDs from mean in a normal bell curve
1SD ~ 68.26%, 2SD ~ 95.44%, 3SD ~ 99.73%
S=sqrt((E(x₁-x₂)²)/(n-1)

New cards

Normal Bell Curve

the common distribution of values in data sets
resembles a bell

New cards

Variance

the average squared distance each observation is from the mean
S²=(E(x₁-x₂)²)/(n-1)

New cards

Standard Error (SE)

variability across multiple samples of a population
Quantifies uncertainty in the estimate of the mean
As sample size increases, sampling error decreases
SE=S/sqrt(n)

New cards

Confidence Interval (CI)

a range of values we are fairly sure our true value exists in.
CI = X+/-Z(S/sqrt(n))
95% confidence interval marks the bounds within which 95% of the sample means will fall, given certain facts about the population and the sampling process.
- Lower Bound = u - 1.96*SE
- Upper Bound = u + 1.96*SE

New cards

Z-Score

+/- number of SDs a percentile is away from mean
use z-score chart in this course