Data Analysis - ISEE 120

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/23

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

24 Terms

1
New cards

Population

  • Total set of observations that can be made

  • The set of data (numerical or otherwise) corresponding to the entire collection of units about which information is sought

2
New cards

Sample

  • a set of observations drawn from a population

  • A set of data collected and/or selected from a population by a defined procedure. 

  • Necessary to use sample for research 

    • Impractical to study whole population

    • Rely on samples to make estimates related to population

3
New cards

Data

numerical facts

4
New cards

Statistics

  • measurement and modeling of random variables

  • estimates of population parameters

5
New cards

Continuous

values that can be measured (decimal/fractional)

6
New cards

Discrete

values that can be counted (whole numbers/distinct)

7
New cards

Graphical Data

used for quick information relay

8
New cards

Histograms

  • visual representation of distribution of quantitative data

  • x-axis = “bins”, y-axis = count of values

9
New cards

Line Charts

  • uses points connected by line segments (left to right) to demonstrate changes in value

  • X-axis = time, y-axis = reported values

  • can also be used for comparison of data

10
New cards

Correlation Graphs (Scatter Plots)

  • values represented as dots, showing a relationship between two variables, correlation shown by shape of the dots

  • x-axis = factor 1, y-axis = factor 2

11
New cards

Pie Charts (% Charts)

  • shows the total sum of a whole, and what percent each makes up

12
New cards

Pareto Chart

  • similar to a bar chart, sorts values highest to lowest

13
New cards

Bar Chart

  • compares two values with bars either stemming from the y or x axis, looks similar to Pareto charts or histograms

14
New cards

Mean

  • the average of all values

15
New cards

Median

  • the middle value in a chronological set of data

16
New cards

Mode

  • the most frequent value in a data set

17
New cards

Range

  • the difference of the largest and lowest values

18
New cards

Standard Deviation (SD)

  • the square root of the variance

  • used to measure distance away from mean to calculate certain percentiles

19
New cards

Normal Distribution

  • SDs from mean in a normal bell curve

  • 1SD ~ 68.26%, 2SD ~ 95.44%, 3SD ~ 99.73%

  • S=sqrt((E(x1-x2)2)/(n-1)

20
New cards

Normal Bell Curve

  • the common distribution of values in data sets

  • resembles a bell

21
New cards

Variance

  • the average squared distance each observation is from the mean

  • S2=(E(x1-x2)2)/(n-1)

22
New cards

Standard Error (SE)

  • variability across multiple samples of a population

  • Quantifies uncertainty in the estimate of the mean

  • As sample size increases, sampling error decreases

  • SE=S/sqrt(n)

23
New cards

Confidence Interval (CI)

  • a range of values we are fairly sure our true value exists in. 

  • CI = X+/-Z(S/sqrt(n))

  • 95% confidence interval marks the bounds within which 95% of the sample means will fall, given certain facts about the population and the sampling process. 

    • Lower Bound = u - 1.96*SE

    • Upper Bound = u + 1.96*SE

24
New cards

Z-Score

  • +/- number of SDs a percentile is away from mean

  • use z-score chart in this course