bstat exam 1

0.0(0)
studied byStudied by 1 person
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/49

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

50 Terms

1
New cards

Population

All members of group of interest

2
New cards

Element

Single member of the group of interest

3
New cards

Sample

A subset of the population

4
New cards

Census

Collection of all elements in the population

5
New cards

Variable

A characteristic of an element (values will vary)

6
New cards

Descriptive Statistics

Summarize and present- graphs, charts, tables, summaries

7
New cards

Structured Data

Data that is organized in a defined manner, making it easily searchable.

8
New cards

Unstructured Data

Data that does not have a pre-defined format or organization, making it more complex to analyze.

9
New cards

Data Cleanup

The process of preparing data for analysis by correcting errors, translating units, and handling missing values.

10
New cards

Cross-Section

Data collected from many elements at one time period, providing a snapshot.

11
New cards

Time-Series

Data collected from one element over many time periods, showing trends.

12
New cards

Longitudinal Data

Data that combines both time series and cross-sectional data.

13
New cards

Categorical Variables

Also known as qualitative variables, these are labels and names that do not allow for mathematical operations.

14
New cards

Quantitative Variables

Also known as numeric variables, these are numbers and measurements that answer 'how much?' or 'how many?'.

15
New cards

Discrete Variable

A variable that takes separate chunks, with a finite or countably infinite set of values.

16
New cards

Continuous Variable

A variable that can take any value within an interval, including fractions.

17
New cards

Nominal Scale

A scale of measurement that uses labels or names without any order.

18
New cards

Ordinal Scale

A scale of measurement where the ordering of values makes sense.

19
New cards

Interval Scale

A scale of measurement where distances between values are meaningful, but zero does not signify nothingness.

20
New cards

Ratio Scale

A scale of measurement where both distances and ratios are meaningful, and zero signifies nothingness.

21
New cards

Frequency Distribution Table

A table that displays the count of how values are spread out.

22
New cards

Relative Frequency

The proportion of a category count divided by the total.

23
New cards

Histograms

Graphs used for quantitative data, characterized by no spaces between bars.

24
New cards

Cumulative Relative Frequency

The sum of relative frequencies for each category, eventually reaching 1.0.

25
New cards

Ogive

A cumulative frequency polygon that typically has an elongated S shape and always ends at 1.

26
New cards

Scatter Plot

A graph used to display the relationship between two quantitative variables.

27
New cards

Line Graphs

Graphs used to show trends over time.

28
New cards

Mean

The arithmetic average, calculated by adding all values and dividing by the number of values.

29
New cards

Median

The middle value in a data set, determined after arranging values in ascending order.

30
New cards

Skewed Distribution

When the mean is different than the median, indicating asymmetry in the data.

31
New cards

Mode

The most common value in a data set.

32
New cards

Subgroup Mean

Used to find the average of subgroups within a whole data set.

33
New cards

Weighted Mean

Calculated as Mean sales price 1(relative frequency 1) + Mean sales price 2(relative frequency 2) + etc.

34
New cards

Percentiles

The p^th percentile is a value such that p% of the observations are smaller and (100-p)% are larger.

35
New cards

Quartiles

Special percentiles: Q1 is the 25th percentile, Q2 is the 50th percentile (Median), and Q3 is the 75th percentile.

36
New cards

Interquartile Range (IQR)

Calculated as Q3 - Q1, representing the distance of the middle 50% of values.

37
New cards

Lower Fence

Any value smaller than Q1 - 1.5(IQR) could be considered an outlier.

38
New cards

Upper Fence

Any value larger than Q3 + 1.5(IQR) could be considered an outlier.

39
New cards

Range

Calculated as Max - Min, representing the spread of the data.

40
New cards

Variance

Population variance is calculated as 𝞼^2 = (x1 - mu)^2 + (x2 - mu)^2 + …etc/ N.

41
New cards

Sample Variance

Calculated as S^2 = (x1 - x-bar)^2 + (x2 - x-bar)^2/ n-1.

42
New cards

Standard Deviation

Population standard deviation is the square root of variance, 𝞼 = square root of variance.

43
New cards

Coefficient of Variation (C.V)

Measures how much observations vary relative to their mean, calculated as C.V = standard deviation/mean.

44
New cards

Empirical Rule

In a symmetric bell-shaped distribution, 68% of observations are within one standard deviation of the mean, 95% within two, and 99.7% within three.

45
New cards

Z-scores

A measure of how many standard deviations an observation is from the mean, calculated as X - Xbar/s or X - mu/sigma.

46
New cards

Covariance

Indicates how two variables change together, calculated as sample covariance: sxy = (x1-xbar)(y1-ybar) + etc/ n-1.

47
New cards

Population Covariance

Calculated as sigmaxy = (x1-mux)(y1-muy)/N.

48
New cards

Correlation

Describes the strength and direction of the linear relationship between two variables.

49
New cards

Sample Correlation

Calculated as rxy = Sxy/SxSy, which is covariance divided by the product of the standard deviations.

50
New cards

Population Correlation

Calculated as Pxy = sigmaxy/sigmax sigma.