quantite methodologies CO1 (P1)

Description and Tags

quandingle methods ;3

92 Terms

1

Descriptive Statistics

Describes the characteristics and properties of anything that data can be gathered from (people, places, companies). Based on facts or meaningful information. Cannot be used to draw conclusions about larger sets of data.

2

Inferential Statistics

Draws conclusions about a population from data gathered from samples, using descriptive statistics (DS) techniques. Concerned with methods of analyzing smaller groups of data that lead to predictions about larger sets of data. Generalizes about the whole by analyzing a part of it.

3

Population

Totality of all observations from which the dataset is obtained

4

parameter

A numerical measure describing a characteristic of the population (for example, the population mean)

5

Sample

a portion of a population, chosen to include data as diverse as possible so that it represents the entire population.

6

statistic

A numerical measure describing a characteristic of the sample (for example, the sample mean)

7

Variables

characteristics or attributes being studied in statistics, which can take different values from one observation to another

8

Qualitative Variables

nonnumeric data like gender, civil status, and location.

9

Quantitative Variables

numerical data like force, weight, voltage.

10

Continuous Data

Data obtained using measuring tools. Measurable but not countable quantities that can take infinitely many values within a range.

11

Discrete Data

countable quantities that take distinct, separate values, typically whole numbers. This data is obtained by counting rather than by measuring.

12

Independent Variable

A variable that can be altered to see an outcome

13

Dependent Variable

A variable that is observed after altering an independent variable

14

Controlled Variable

A variable kept constant so that it does not influence the outcome

15

Extraneous Variable

An unexpected or unplanned variable that is not the focus of the study but can still influence the outcome if left uncontrolled

16

Nominal

Assigns names or numeric labels to categories that have no inherent order. Analysis is limited to counting how many observations fall into each category.

17

Ordinal Data

Assigns ranks to levels of data. The difference between consecutive ranks is not necessarily constant.

18

Interval

Assigns a constant difference between numeric values, so addition and subtraction are applicable. Zero is an arbitrary point and does not mean “nothing” (for example, 0 °C).

19

Ratio

Has the properties of interval data plus a true zero, so all arithmetic operations are allowed. Zero means none of the quantity is present.

20

Sampling

process of taking portions from the population.

21

Probability Sampling

reduces selection bias by giving every element of the population a known, nonzero chance of being selected. All elements are listed and the sample is drawn by chance.

22

Non-Probability Sampling

Some elements of the population have no chance of being selected, which increases the risk of bias. The sample is not drawn from the whole population.

23

Simple Random Sampling

elements of the population are numbered and the sample is drawn by a randomizing procedure (such as a lottery or a table of random numbers), so every element has an equal chance of being selected.

24

Systematic Sampling

the sample is taken by dividing the ordered population into equal groups of size k and, after a random start, taking every kth element.
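
The simple random and systematic sampling cards above can be sketched in a few lines. This is only an illustration under stated assumptions: the population of IDs 1 to 100, the sample size of 10, and the fixed seed are all made up here.

```python
import random

# Hypothetical population: element IDs 1..100 (an assumption for illustration).
population = list(range(1, 101))
rng = random.Random(42)  # fixed seed so the draw is repeatable

# Simple random sampling: every element has an equal chance of selection.
srs = rng.sample(population, 10)

# Systematic sampling: choose a random start within the first k elements,
# then take every kth element after it.
n = 10
k = len(population) // n      # sampling interval, here 10
start = rng.randrange(k)      # random start index in 0..k-1
systematic = population[start::k]

print(sorted(srs))
print(systematic)
```

Consecutive elements of the systematic sample are always k apart, while the simple random sample has no fixed spacing.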

25

Stratified Sampling

the population is grouped into strata and random sampling is performed within each stratum, with the number taken from each stratum proportional to its share of the population.
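
A minimal sketch of proportional allocation in stratified sampling. The strata names, their sizes, and the sample size of 10 are hypothetical values chosen so that the allocation comes out in whole numbers.

```python
import random

rng = random.Random(0)  # fixed seed so the draw is repeatable

# Hypothetical strata (year levels) and their members; all names are made up.
strata = {
    "first_year": [f"F{i}" for i in range(50)],
    "second_year": [f"S{i}" for i in range(30)],
    "third_year": [f"T{i}" for i in range(20)],
}
total = sum(len(members) for members in strata.values())
sample_size = 10

sample = {}
for name, members in strata.items():
    # Proportional allocation: stratum share of the population times sample size.
    n_stratum = round(sample_size * len(members) / total)
    # Simple random sampling inside the stratum.
    sample[name] = rng.sample(members, n_stratum)

allocation = {name: len(chosen) for name, chosen in sample.items()}
print(allocation)
```

With other stratum sizes the rounded allocations may not add up exactly to the desired sample size, so an adjustment step would be needed.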

26

strata

subpopulations (singular: stratum) whose elements have generally similar characteristics.

27

Cluster Sampling

done by identifying groups known as clusters and randomly selecting whole clusters. The clusters must be similar to each other with respect to the characteristics being examined.

28

cluster

a subpopulation with elements as diverse as possible

29

Convenience Sampling

based primarily on availability of respondents.

30

Quota Sampling

A desired number of respondents (the quota) is set. Respondents are taken as they volunteer to become part of the experiment, until the quota is filled.

31

Purposive Sampling

The sample is obtained based on certain conditions.

32

Textual Form

presenting and describing data in sentences and paragraphs

33

Tabular Form

presenting data with tables arranged by row and column for various parameters

34

Graphical Form

presenting data with graphs, charts, or pictures

35

Ungrouped Data

Data points are treated individually. These are raw, individual data points that are not organized into groups or classes.

36

Grouped Data

Data points are treated in groups rather than individually. This is raw data that has been arranged into classes or intervals.

37

Frequency Distribution Table

A table showing how often each value or range of values appears in a dataset. Used for larger sets of data to ease interpretation and as a basis for graphs.

38

Reason to use Frequency Distribution Table

This procedure is used to lessen work by treating the data by group.

39

Class Limits

smallest and largest values that can fall into a class interval, written with the same number of significant figures as the given data.

40

range

r = highest value - lowest value

41

number of classes

k = 1 + 3.322 log10(n), where n is the number of observations (Sturges' rule)

42

class width

cw = r/k
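
A worked example of the range, number of classes, and class width formulas. The 30 scores are made up, and rounding both k and cw up to whole numbers is an assumption; courses differ on the rounding convention.

```python
import math

# Hypothetical dataset of 30 exam scores (made up for illustration).
scores = [52, 55, 58, 60, 61, 63, 64, 66, 67, 68,
          70, 71, 72, 73, 74, 75, 76, 77, 78, 80,
          81, 82, 84, 85, 87, 88, 90, 92, 95, 98]

n = len(scores)
r = max(scores) - min(scores)      # range: highest value - lowest value
k = 1 + 3.322 * math.log10(n)      # number of classes before rounding
k_rounded = math.ceil(k)           # assumption: round up to a whole number
cw = math.ceil(r / k_rounded)      # assumption: round up so all data fit

print(r, round(k, 3), k_rounded, cw)
```

Here r = 46, k is about 5.9 and rounds up to 6 classes, and the class width is 8, so six classes starting at 52 cover every score.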

43

Class Boundaries (true class limits)

precise expression of class interval, usually one significant digit more than the class limit.

44

Class Boundary formula

(Upper limit of Class A + Lower limit of Class B) / 2, where Class B is the class immediately after Class A

45

Class Mark

midpoint of a class interval

46

Class Mark formula

cm = (Lower Class Limit + Upper Class Limit) / 2
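
A short sketch of the class boundary and class mark formulas. The three class intervals are hypothetical, and the data are assumed to be whole numbers so that adjacent limits differ by 1.

```python
# Hypothetical class limits for whole-number data (made up for illustration).
class_limits = [(52, 59), (60, 67), (68, 75)]

# Gap between one class's upper limit and the next class's lower limit.
gap = class_limits[1][0] - class_limits[0][1]   # 1 for whole-number data

# Each boundary lies halfway across the gap between adjacent classes.
boundaries = [(lower - gap / 2, upper + gap / 2) for lower, upper in class_limits]

# Class mark: midpoint of the lower and upper class limits.
class_marks = [(lower + upper) / 2 for lower, upper in class_limits]

# Same boundary from the card's formula: (upper of A + lower of B) / 2.
shared_boundary = (class_limits[0][1] + class_limits[1][0]) / 2

print(boundaries)
print(class_marks)
print(shared_boundary)
```

The boundary between the first two classes is 59.5, one significant digit more than the limits 59 and 60.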

47

Cumulative Frequency Distribution

derived from the frequency distribution by successively adding the class frequencies.

48

Relative Frequency

proportion of a class's frequency with respect to the total frequency

49

rf formula

rf = f/∑f

50

Relative Frequency Distribution (%rf)

the frequency of each class expressed as a percentage of the total frequency

51

%rf formula

%rf = (f/∑f) × 100
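
A brief illustration of the rf and %rf formulas. The class frequencies are made-up values.

```python
# Hypothetical class frequencies (made up for illustration).
frequencies = [3, 7, 10, 6, 4]
total = sum(frequencies)                           # sum of f

rf = [f / total for f in frequencies]              # rf = f / sum of f
pct_rf = [100 * f / total for f in frequencies]    # %rf = (f / sum of f) x 100

for f, p in zip(frequencies, pct_rf):
    print(f, round(p, 2))
```

The relative frequencies add up to 1 and the percentages add up to 100, apart from floating-point rounding.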

52

Less than cumulative frequency (<cf)

for each class, the number of observations with values below that class's upper class boundary

53

obtaining “<cf”

adding the frequencies from top to bottom

54

Greater than cumulative frequency (>cf)

for each class, the number of observations with values above that class's lower class boundary

55

obtaining “>cf“

adding the frequencies from bottom to top
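
The two cumulative frequency columns can be sketched as running sums. The frequencies are made up and assumed to be listed from the lowest class (top of the table) to the highest (bottom).

```python
from itertools import accumulate

# Hypothetical class frequencies, lowest class first (made up for illustration).
frequencies = [3, 7, 10, 6, 4]

# Less-than cumulative frequency: running total from top to bottom.
less_than_cf = list(accumulate(frequencies))

# Greater-than cumulative frequency: running total from bottom to top.
greater_than_cf = list(accumulate(reversed(frequencies)))[::-1]

print(less_than_cf)      # [3, 10, 20, 26, 30]
print(greater_than_cf)   # [30, 27, 20, 10, 4]
```

The last less-than entry and the first greater-than entry both equal the total frequency.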

56

Frequency Polygon

points are plotted using the midpoint (class mark) and frequency of each class, then connected by line segments

57

Histogram

adjacent bars are drawn for each class, with the class boundaries on the horizontal axis and the height of each bar equal to the class frequency

58

Ogive

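for the less-than ogive, points are plotted using the upper class boundary and the less-than cumulative frequency; for the greater-than ogive, the lower class boundary and the greater-than cumulative frequency.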

59

Pareto Chart

graph used to represent frequency distribution for categorical data and frequencies are displayed by the heights of bars, arranged from highest to lowest.

60

Bar Chart

graph similar to histogram. The height of each rectangle represents the frequency of that category, applicable for categorical data (or nominal level).

61

Pie Chart (Circle Graph)

circle divided into portions representing relative frequencies (or percentage) of the data.

62

Scatter Plot

used to examine possible relationships between two numerical variables. One variable is plotted on the x-axis and the other on the y-axis.

63

Time Series Graph

represents data occurring over a specific period under observation. Shows trend or pattern on the increase or decrease over the period

64

Pictograph

appropriate pictures are arranged in rows (sometimes columns) to present quantities for comparison.

65

Measure of Central Tendency

Statistical values that describe the center or typical value of a dataset and help summarize entire sets of data with a single representative number. The simplest such measure, the midrange, is calculated by adding the highest value and the lowest value, then dividing by 2.

66

The Mean

Most commonly used measure of central tendency for describing ratio data

67

Arithmetic Mean

The only measure of central tendency for which the sum of the deviations of each value from the mean is zero. Affected by abnormally large or small values. Calculated as the sum of all values divided by the number of values.
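
A small check of the two properties on the arithmetic mean card: deviations from the mean sum to zero, and one extreme value pulls the mean noticeably. The numbers are made up.

```python
from statistics import mean

# Hypothetical values (made up for illustration).
values = [4, 8, 6, 5, 7]

m = mean(values)                        # sum of values / number of values
deviations = [x - m for x in values]    # deviation of each value from the mean

# Adding one abnormally large value shifts the mean a lot.
with_outlier = values + [60]
m_outlier = mean(with_outlier)

print(m, sum(deviations), m_outlier)
```

The mean moves from 6 to 15 after the single value 60 is added, even though the other five values are unchanged.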

68

Geometric Mean

Used to average factors that are multiplied together, such as growth rates or ratios

69

Geometric Mean formula

GM = sqrt(ab) for two values a and b; in general, GM = (x1 × x2 × ... × xn)^(1/n) for n values
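
A quick illustration of the geometric mean, first for two values and then for several multiplied factors. The growth factors are hypothetical, and `statistics.geometric_mean` and `math.prod` assume Python 3.8 or later.

```python
import math
from statistics import geometric_mean

# Two values: GM = sqrt(ab).
a, b = 4, 9
gm_two = math.sqrt(a * b)      # sqrt(36) = 6.0

# Hypothetical yearly growth factors (made up): +10%, +20%, -5%.
growth_factors = [1.10, 1.20, 0.95]

# nth root of the product of n values.
gm_manual = math.prod(growth_factors) ** (1 / len(growth_factors))

# Same quantity from the standard library.
gm_library = geometric_mean(growth_factors)

print(gm_two, gm_manual, gm_library)
```

The result is the single constant factor that, applied every year, gives the same overall growth as the three separate factors.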

70

Trimmed Mean

The arithmetic mean obtained after removing the upper and lower values of the distribution. Calculated by trimming a certain percentage of both the largest and smallest values.

71

Trimmed Mean formula

TM = ∑%x/%n, where %x are the values remaining after trimming and %n is the number of remaining values
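
A minimal sketch of a trimmed mean. The helper name `trimmed_mean`, the data, and the choice to round the number of dropped values down are all assumptions made for this example.

```python
def trimmed_mean(values, proportion):
    """Mean after dropping `proportion` of the values from each end.

    Hypothetical helper; `proportion` must be below 0.5 so values remain.
    """
    ordered = sorted(values)
    cut = int(len(ordered) * proportion)       # values dropped per end, rounded down
    kept = ordered[cut:len(ordered) - cut]     # values remaining after trimming
    return sum(kept) / len(kept)

# Hypothetical data with one extreme value (made up for illustration).
data = [2, 4, 5, 5, 6, 6, 7, 7, 8, 50]

plain_mean = sum(data) / len(data)      # 10.0, pulled up by the value 50
tm_10 = trimmed_mean(data, 0.10)        # drops 2 and 50, giving 6.0

print(plain_mean, tm_10)
```

Trimming 10% from each end removes the smallest and largest value, so the result is much closer to the bulk of the data than the plain mean.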

72

The Median

Midpoint of the ordered values, with as many values above it as below it. Unaffected by extremely large or small values. Can be computed for ratio-level, interval-level, and ordinal-level data, and for an open-ended frequency distribution as long as the median does not fall in an open-ended class.

73

The Mode

Value of the observation that appears most frequently. The least reliable of the measures of central tendency, but the only one that can be used for nominal data.

74

Bi-modal

When distribution has 2 modes

75

Tri-modal

If distribution has three modes

76

Multi-modal

The distribution has more than 3 modes
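
The median and mode cards above can be tried out with Python's `statistics` module (`multimode` assumes Python 3.8 or later). All data values here are made up.

```python
from statistics import median, mode, multimode

# Hypothetical data with one extreme value (made up for illustration).
data = [3, 5, 5, 7, 9, 11, 200]

med = median(data)     # middle of the ordered values, not affected by 200
most_common = mode(data)

# The mode also works on nominal data, where a mean or median is undefined.
colors = ["red", "blue", "red", "green"]
color_mode = mode(colors)

# A distribution with two modes (bi-modal).
two_modes = multimode([1, 1, 2, 2, 3])

print(med, most_common, color_mode, two_modes)
```

Replacing 200 with any other value above 7 leaves the median at 7, which is what makes it resistant to extreme values.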

77

Measures of Position

Describes the relative standing of a value in a dataset and indicates where a particular data point lies in relation to the rest of the data. The main measures are quartile (Q), decile (D), percentile (P), and standard score (z).

78

Quantiles (or Fractiles)

points taken at regular intervals from cumulative distribution

79

Quartiles

Values that divide an ordered dataset into 4 equal groups

80

Deciles

Values that divide an ordered dataset into 10 equal groups

81

Percentiles

Values that divide an ordered dataset into 100 equal groups
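
Quartiles, deciles, and percentiles can be sketched with `statistics.quantiles` (Python 3.8 or later). The dataset 1 to 99 is a made-up example chosen so the cut points are whole numbers; the function's default "exclusive" method is only one of several conventions, so textbook answers for other data may differ slightly.

```python
from statistics import quantiles

# Hypothetical dataset: the whole numbers 1..99 (made up for illustration).
data = list(range(1, 100))

quartiles = quantiles(data, n=4)       # 3 cut points: Q1, Q2, Q3
deciles = quantiles(data, n=10)        # 9 cut points: D1..D9
percentiles = quantiles(data, n=100)   # 99 cut points: P1..P99

print(quartiles)           # [25.0, 50.0, 75.0]
print(deciles[4])          # D5
print(percentiles[49])     # P50
```

Q2, D5, and P50 all coincide with the median, here 50.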

82

Measure of Variation (or Dispersion)

describes how spread out or dispersed the values are in a dataset and tells how far the data points are from the center. The measures are range, standard deviation, variance, quartile deviation, interquartile range, and coefficient of variation.

83

Range

Difference between the largest and smallest numbers in the set

84

Variance

Average of the squared deviations from the mean

85

Standard Deviation (SD)

given as positive square root of population/sample variance

86

Coefficient of Variation (CV)

Percentage of the ratio of standard deviation to the mean

87

Mean Absolute Deviation (MAD)

Average of the absolute (unsigned) deviations from the mean

88

Quartile Deviation (QD)

An absolute measure of dispersion equal to half the interquartile range: QD = (Q3 - Q1) / 2

89

Interquartile Range (IQR)

Spread of the middle 50% of the data: IQR = Q3 - Q1
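
The measures of variation above can be computed side by side. This sketch treats the made-up data as a whole population, so it uses `pvariance` and `pstdev` (dividing by N); for a sample, `variance` and `stdev` divide by n - 1 instead. The quartiles use the default method of `statistics.quantiles`, which is one convention among several.

```python
from statistics import mean, pstdev, pvariance, quantiles

# Hypothetical population data (made up for illustration).
data = [2, 4, 4, 4, 5, 5, 7, 9]

m = mean(data)
value_range = max(data) - min(data)      # largest - smallest
var = pvariance(data)                    # average of squared deviations
sd = pstdev(data)                        # positive square root of the variance
cv = 100 * sd / m                        # SD as a percentage of the mean
mad = mean(abs(x - m) for x in data)     # average of absolute deviations

q1, q2, q3 = quantiles(data, n=4)
iqr = q3 - q1                            # spread of the middle 50%
qd = iqr / 2                             # quartile deviation

print(value_range, var, sd, cv, mad, iqr, qd)
```

For this data the mean is 5, the variance is 4, the standard deviation is 2, and the coefficient of variation is 40%.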

90

Measure of Shape

Describes the distribution pattern of data, specifically how the values are spread and whether the distribution is symmetric, skewed, or has peaks of a certain sharpness.

91

Skewness

Degree of asymmetry of a distribution about its mean. Measures how the data depart from symmetry; a distribution can be interpreted as symmetric, positively skewed, or negatively skewed.

92

Kurtosis

Degree of peakedness exhibited by the distribution, computed from the fourth moment about the mean.
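
A minimal sketch of moment-based skewness and kurtosis. The helper names and data are made up, and the formulas are the plain population versions (third and fourth central moments divided by powers of the second); other courses use sample-adjusted versions or report excess kurtosis, which subtracts 3.

```python
from statistics import mean

def central_moment(values, order):
    """Average of (x - mean) raised to `order` (hypothetical helper)."""
    m = mean(values)
    return mean((x - m) ** order for x in values)

def skewness(values):
    # Third central moment divided by the variance to the power 3/2.
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def kurtosis(values):
    # Fourth central moment divided by the squared variance.
    return central_moment(values, 4) / central_moment(values, 2) ** 2

# Hypothetical datasets (made up for illustration).
symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 1, 2, 2, 3, 10]

print(skewness(symmetric))      # 0 for a symmetric dataset
print(skewness(right_skewed))   # positive: long tail to the right
print(kurtosis(symmetric))
```

A negative result from `skewness` would indicate a longer tail to the left (negatively skewed).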