Introduction to Data Analysis (Video)

Description and Tags

A set of vocabulary-style flashcards covering key data analysis concepts from the lecture notes.

35 Terms

1

Frequentist probability

Probabilities are objective and interpreted as long-run relative frequencies over repeated trials; prior knowledge is not formally included in the analysis.

2

Bayesian probability

Probabilities can be subjective and represent degrees of belief; prior knowledge is formally included in analyses.
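
Worked example (a minimal sketch, not from the lecture notes): one common way to see how a prior is formally included is a Beta-Binomial update. The prior Beta(2, 2) and the data (7 successes, 3 failures) are hypothetical values chosen for illustration, and the helper names are assumptions.

```python
from fractions import Fraction


def beta_binomial_update(alpha, beta, successes, failures):
    """Return the posterior Beta parameters after observing binomial data."""
    return alpha + successes, beta + failures


def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return Fraction(alpha, alpha + beta)


# Hypothetical prior belief about a proportion: Beta(2, 2), centred on 1/2.
prior = (2, 2)

# Hypothetical data: 7 successes and 3 failures.
successes, failures = 7, 3

# Frequentist point estimate uses the data only.
frequentist_estimate = Fraction(successes, successes + failures)  # 7/10

# Bayesian estimate combines the prior with the data.
posterior = beta_binomial_update(*prior, successes, failures)     # (9, 5)
bayesian_estimate = beta_mean(*posterior)                         # 9/14

print(frequentist_estimate, bayesian_estimate)
```

The Bayesian estimate (9/14) sits between the prior mean (1/2) and the observed proportion (7/10), which is the sense in which prior knowledge enters the analysis.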

3

Statistics

Scientific study of numerical data collected from natural phenomena and the methods used to collect, analyze, and interpret such data.

4

Science

Systematic study of the physical and natural world through observation and experiment.

5

Description (in science)

An adequate description of the things and events investigated.

6

Law or theory (in science)

General laws or theories by which particular events may be explained and predicted.

7

Scientific Method

A sequence: observation, formulating a question/problem, hypothesis, prediction, and experimental design.

8

Population

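In biology, all individuals of the same species in a defined location, sharing a gene pool. In statistics, the complete set of individuals or measurements about which inferences are to be made.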

9

Sample

A subset of the population used to make inferences about the population.

10

Random sampling

A sampling method in which each member of the population has a known, nonzero chance of being selected; in simple random sampling, every member has an equal chance.

11

Random variable

A numerical quantity whose value depends on the outcome of a chance process, such as drawing an individual at random from the population.

12

Parameters

Population characteristics (e.g., μ, σ², N) that describe the population.

13

Statistics (in sampling)

Numerical summaries computed from a sample (e.g., X̄, s², n).

14

Random sampling vs population vs sample

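A random sample drawn from a population yields statistics (e.g., X̄, s²) that estimate the population parameters (e.g., μ, σ²); probability quantifies the uncertainty in those estimates.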
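
Worked example (a rough sketch under stated assumptions): the population below is simulated, its size (1000), mean (50) and SD (10) are hypothetical, and the seed is fixed only so the run is repeatable. It shows a simple random sample producing statistics that approximate the corresponding parameters.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the sketch is repeatable

# Hypothetical population of N = 1000 measurements.
population = [rng.gauss(50, 10) for _ in range(1000)]

# Parameters: computed from the whole population.
mu = statistics.fmean(population)
sigma2 = statistics.pvariance(population)  # divides by N

# Simple random sample of n = 30, drawn without replacement.
sample = rng.sample(population, 30)

# Statistics: computed from the sample only.
x_bar = statistics.fmean(sample)
s2 = statistics.variance(sample)  # divides by n - 1

print(f"mu = {mu:.2f}, x_bar = {x_bar:.2f}")
print(f"sigma2 = {sigma2:.2f}, s2 = {s2:.2f}")
```

Drawing a different sample (for example, by changing the seed) gives different values of X̄ and s², while μ and σ² stay fixed for a given population.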

15

Central tendency

A measure that summarizes the center of a data set (e.g., mean, median).

16

Mean (arithmetic mean)

The sum of values divided by the number of observations.

17

Median

The middle value of an ordered data set; splits data into two halves.
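
Worked example (a minimal sketch with hypothetical data): the mean and median defined above can be compared on a small data set containing one extreme value, using Python's standard `statistics` module.

```python
import statistics

# Hypothetical data set with one extreme value (40).
data = [2, 3, 3, 5, 7, 40]

# Mean: sum of values divided by the number of observations (60 / 6).
mean = statistics.mean(data)

# Median: middle of the ordered data; with an even count,
# the average of the two middle values (3 and 5).
median = statistics.median(data)

print(mean, median)
```

Here the mean is 10 and the median is 4, showing that the mean is pulled toward the extreme value while the median is not.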

18

Variance

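A measure of dispersion; the average squared deviation from the mean. The population variance σ² divides the sum of squared deviations by N; the sample variance s² divides by n − 1.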

19

Standard deviation

The square root of the variance; a measure of dispersion around the mean.

20

Standard error

The standard deviation of the sampling distribution of a statistic; for the sample mean, it is estimated by s/√n.
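
Worked example (a minimal sketch with hypothetical data): variance, standard deviation and the standard error of the mean computed for a small sample. The sample values are made up for illustration.

```python
import math
import statistics

# Hypothetical sample of n = 5 observations; the mean is 6.
sample = [4, 8, 6, 5, 7]
n = len(sample)

# Squared deviations from the mean: 4, 4, 0, 1, 1 (sum = 10).
s2 = statistics.variance(sample)        # sample variance: 10 / (n - 1) = 2.5
sigma2 = statistics.pvariance(sample)   # population formula: 10 / n = 2

s = math.sqrt(s2)                       # sample standard deviation
se = s / math.sqrt(n)                   # standard error of the mean, s / sqrt(n)

print(s2, sigma2, round(s, 3), round(se, 3))
```

The two variance functions differ only in the divisor (n − 1 versus n), which matters most for small samples such as this one.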

21

Range

The difference between the maximum and minimum values in a data set.

22

Quartiles

Values that divide ordered data into four parts with equal numbers of observations: Q1, Q2 (median), Q3.

23

Five-number summary

Min, Q1, Median, Q3, Max; used for box plots.

24

Box plot

A graphical display of the five-number summary showing distribution and potential outliers.

25

IQR (Interquartile Range)

Q3 − Q1; the spread of the middle 50% of the data.

26

Outlier fences (f1, f3)

Lower and upper bounds for outliers: f1 = Q1 − 1.5×IQR, f3 = Q3 + 1.5×IQR.
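
Worked example (a sketch with hypothetical data): the five-number summary, IQR and outlier fences for a small data set. Quartile conventions differ between textbooks and software; the `inclusive` method of `statistics.quantiles` is used here as an assumption and may not match the convention used in the lecture.

```python
import statistics

# Hypothetical data set with one suspiciously large value.
data = sorted([1, 2, 3, 4, 5, 6, 7, 8, 30])

# Quartiles (assumed convention: the 'inclusive' method).
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")

# Five-number summary: Min, Q1, Median, Q3, Max.
five_number = (data[0], q1, q2, q3, data[-1])

# IQR and outlier fences.
iqr = q3 - q1
f1 = q1 - 1.5 * iqr   # lower fence
f3 = q3 + 1.5 * iqr   # upper fence

# Values outside the fences are flagged as potential outliers.
outliers = [x for x in data if x < f1 or x > f3]

print(five_number, iqr, (f1, f3), outliers)
```

With these values the quartiles are 3, 5 and 7, the IQR is 4, the fences are −3 and 13, and only the value 30 is flagged.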

27

Empirical rule

For approximately normal data, approx. 68% of values fall within mean ± SD, approx. 95% within mean ± 2×SD, and approx. 99.7% within mean ± 3×SD.
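
Worked example (a minimal sketch): the percentages in the empirical rule can be checked against the normal distribution using `statistics.NormalDist` from the standard library. The helper name `within` is an assumption introduced for illustration.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, SD 1


def within(k):
    """Proportion of a normal distribution within k SDs of the mean."""
    return z.cdf(k) - z.cdf(-k)


for k in (1, 2, 3):
    print(f"within {k} SD: {within(k):.4f}")
```

The rule applies to approximately normal data; for strongly skewed data the proportions can differ noticeably.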

28

Accuracy

Closeness of a measured value to the true value.

29

Precision

Closeness of repeated measurements to each other (reproducibility).

30

30-300 rule

The range (max − min) should span between 30 and 300 unit steps of the measurement scale; a guideline for judging whether measurements are recorded with suitable precision.
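
Worked example (a rough sketch, assuming the rule counts how many recording units fit between the smallest and largest measurement): the function names and the lengths below are hypothetical.

```python
def unit_steps(values, unit):
    """Number of unit steps between the smallest and largest value."""
    return round((max(values) - min(values)) / unit)


def satisfies_30_300(values, unit):
    """True if the range spans between 30 and 300 unit steps."""
    return 30 <= unit_steps(values, unit) <= 300


# Hypothetical lengths in centimetres.
lengths_cm = [4.2, 5.1, 6.8, 7.9, 9.6]

# Recorded to the nearest 0.1 cm: range 5.4 cm spans 54 steps.
print(unit_steps(lengths_cm, 0.1), satisfies_30_300(lengths_cm, 0.1))

# Recorded to the nearest 1 cm: only about 5 steps, too coarse.
print(unit_steps(lengths_cm, 1), satisfies_30_300(lengths_cm, 1))
```

Under this reading, recording the same lengths to the nearest 0.1 cm satisfies the rule, while recording them to the nearest 1 cm does not.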

31

Quantitative variable

A numeric variable that can be measured; includes continuous and discrete types.

32

Continuous variable

Quantitative variable that can take any value within a range.

33

Discrete variable

Quantitative variable that takes only specific, separate values.

34

Ordinal (ranked) variable

Qualitative variable whose categories have an inherent order, but the intervals between ranks are not necessarily equal.

35

Categorical (nominal) variable

Qualitative variable representing categories without intrinsic order.