Correlation and regression and parametric and non parametric distributions

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/84

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 10:05 PM on 5/20/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

85 Terms

1
New cards

Parametric distibution

Assumes data follows a specific pattern usually normal distribution with parameters like mean and variance

2
New cards

Parametric distribution

Normal bell shaped, t distribution, f distribution, binomial, poisson, exponential

3
New cards

Parametric distribution

When used, data is continuous, normally distributed, large sample size, meets assumptions, linearity, equal variance, independence

4
New cards

Non parametric distribution

Does not assume any specific distribution, also called distribution free

5
New cards

Non parametric distribution

Works with ordinal or nominal data, handles skewed or small datasets, uses ranks instead of raw values

6
New cards

Non parametric distribution

Spearman correlation, manny whitney u, kruskal wallis, wilcoxon test

7
New cards

Parametric distribution

Strict assumptions, interval or ratio data, large sample size, higher power, used when data is normal

8
New cards

Non parametric distribution

Flexible assumptions, ordinal or norminal data, small sample size, lower power, used when data is skewed

9
New cards

Correlation

Is a statistical measure that shows the relationship between two variables

10
New cards

Correlation

Measures how variables change together, describes relationships but does not indicate cause and effect

11
New cards

Purpose of correlation

Identifies patterns and relationships between variables, measures the strength and direction of relationships, helps in prediction and decision making, simplifies data by identifying important variables, useful in fields like finance, healthcare, and research

12
New cards

Positive correlation

Both variables move in the same direction, when one increases, the other also increases, when one decreases, the other also decreases

13
New cards

Negative correlation

Variables move in opposite directions, when one variable increases, the other decreases, inverse correlation

14
New cards

Zero correlation

No relationship exists between the variables, changes in one variable do not affect the other

15
New cards

Strong correlation

Means the variables have a very close relationship, where changes in one consistently associated with changes in the other

16
New cards

Moderate correlation

Indicates a noticeable relationship, but it is not perfectly consistent and may have some variation

17
New cards

Weak correlation

Shows little to no clear relationship, meaning changes in one variable do not strongly predict changes in the other

18
New cards

Pearson correlation coefficient

Is a statistical measure of the linesr relationship between two variables, it is a descriptive statistic that summarizes data characteristics

19
New cards

Perfect positive relationship

+1 correlation

20
New cards

Perfect negative relationship

-1 correlation

21
New cards

No relationship

0 correlation

22
New cards

Pearson correlation coefficient

Used for continuous data, applied when both variables are measured on a continuous scale, best used when the relationship between variables is linear

23
New cards

Pearson correlation

Assumes that the data are normally distributed, it also assumes a linesr relationship between the two variables, this means the data should follow a bell shaped pattern and change consistently in one direction, if these assumptions are not met, the results may be inaccurate

24
New cards

Spearman rank correlation

Measures the strength and direction of a linear relationship between two variables, commonly used for continuous data such as height, weight, or test scores, assumes normal distribution and a linear relationship between variables

25
New cards

Kendall’s Tau

Nonparametric measure of association between two ranked variables, more accurate when there are many tied ranks, commonly used for small sample sizes

26
New cards

Regression

Is a statistical method used to examine relationships between variables

27
New cards

Regression

Helps determine how one variable can predict another

28
New cards

Regression

Used to identify patterns and trends for analysis and decision-making

29
New cards

Regression

Explains how one variable affects another and is used for prediction

30
New cards

Purpose of regression analysis

Understands relationships between variable, predicts future outcomes, identifies variables that strongly influence an outcome

31
New cards

Regression

Estimates the value of one variable based on another variable, useful for forecasting and decision-making

32
New cards

Regression

Measures how an independent variable affects a dependent variable, shows whether the effect is positive, negative, or no effect, helps determine the strength and direction of the effect between variables

33
New cards

Simple linear regression

Is used when there is one independent variable and one dependent variable

34
New cards

Simple linear regression

The method assumes a straight-line relationship, meaning as the independent variable changes, the dependent variable changes at a constant rate

35
New cards

Multiple regression

Involves two or more independent variables affecting a single dependent variable

36
New cards

Multiple regression

This type helps measure how several factors together influence an outcome and can show which variables have the strongest impact

37
New cards

Logistic regression

It is when the dependent variable is categorical pass or fail, estimates the probability of a certain

outcome

38
New cards

Logistic regression

Used when the dependent variable is categorical, such as yes or no or passor fail outcomes, instead of predicting a numeric value, it estimates the probability of an event occurring

39
New cards

Linear regression

Predicts continuous values, uses best-fit line, solves regression problems

40
New cards

Logistic regression

Predicts categorical classes, uses sigmoid s curve, solves classification problems

41
New cards

Non linear regression

Is a statistical method that models complex, curved relationships between a dependent variable and one or more independent variables

42
New cards

Non linear regression

Uses flexible curves such as exponential, logarithmic, or logistic functions to fit data that does not follow a linear pattern

43
New cards

Parametric distribution

Assume data follows a specific probability distribution, described using parameters such as mean and variance

44
New cards

Parametric distribution

Used for prediction, hypothesis testing, and statistical analysis, accurate only when the data fits the assumed distribution

45
New cards

Normal distribution

Also called the Gaussian or bell-shaped distribution, symmetrical around the mean, defined by mean and standard deviation, commonly used for heights, test scores, and measurement errors

46
New cards

t distribution

Similar to the normal distribution but with wider tails, used for small sample sizes usually below 30

47
New cards

t distribution

Defined by degrees of freedom, commonly used in t-tests for comparing means

48
New cards

F distribution

Positive and right-skewed distribution, based on the ratio of two variances, defined by two degrees of freedom, commonly used in anova to compare group means

49
New cards

Parametric tests

Use when data is continuous, interval or ratio scale, data should be approximately normally distributed bell-shaped, variances of groups should be equal homoscedasticity, observations must be independent, best used with moderate to large sample sizes

50
New cards

Non parametric distribution

Methods do not assume a normal distribution of data, also called distribution-free methods, analyze data using ranks, order, or categories instead of means and standard deviation

51
New cards

Non parametric tests

Does not require normality, can handle skewed data and outliers, suitable for small sample sizes, can be used for ordinal and nominal data

52
New cards

Spearman correlation

Measures the relationship between two ranked variables, used for ordinal or non-normally distributed data

53
New cards

Mann-Whitney U test

Compares two independent groups, non-parametric alternative to the independent samples t-test, uses ranks instead of means

54
New cards

Kruskal-Wallis test

Compares three or more independent groups, non-parametric alternative to one-way anova, determines if significant differences exist among groups

55
New cards

Wilcoxon test

Compares two related samples or repeated measurements, non-parametric alternative to the paired samples t-test, often used for before-and-after measurements

56
New cards

Parametric assumptions

Requires the data to follow certain conditions such as normal distribution, equal variance, and independence of observations, these assumptions allow more accurate and powerful statistical conclusions

57
New cards

Non parametric assumptions

Does not require strict assumptions about the distribution of data, it can be used even when data are skewed, not normally distributed, or contain outliers

58
New cards

Parametric type of data

Used for interval and ratio data, where numerical values have equal intervals, these data allow computation of mean and standard deviation

59
New cards

Non parametric type of data

Used for ordinal and nominal data, where values represent categories or rankings

60
New cards

Parametric sample size

Works best with larger sample sizes because the assumption of normality becomes more reliable, larger samples improve accuracy of results

61
New cards

Non parametric sample size

Can be used with small sample sizes since it does not rely heavily on distribution assumptions, it is useful when there are few participants

62
New cards

Parametric statistical power

Has higher statistical power, meaning it is more likely to detect a true difference or relationship if one exists, this makes parametric tests more sensitive

63
New cards

Non parametric statistical power

Has lower statistical power, meaning it may fail to detect small differences, it is safer when assumptions for parametric tests are violated

64
New cards

One sample t test

Compares one group with a known standard or population mean

65
New cards

Independent samples t test

Compares the means of two independent groups

66
New cards

Paired samples t test

Compares the mean of the same group before and after

67
New cards

One-Way ANOVA

Compares the means of three or more independent groups

68
New cards

Pearson r

Measures the relationship between two variables

69
New cards

Mann-Whitney U test

Alternative to independent t-test, compares 2 independent groups, it calculates a U statistic based on ranks

70
New cards

Wilcoxon signed-ranks test

Alternative to paired t-test, compares 1 group measured twice before to after

71
New cards

Kruskal-Wallis H test

Alternative to anova, compares 3 or more independent groups

72
New cards

Chi-Square test of goodness of fit

Compares observed and expected frequencies

73
New cards

Spearman rho

Alternative to pearson correlation, measures the relationship between 2 ranked variables

74
New cards

Chi-Square test or independence

Determines the association between twocategorical variables

75
New cards

Parametric

Used when data are normally distributed, the sample size is large, data are interval or ratio, you want more powerful results

76
New cards

Non parametric

Used when data are not normally distributed, sample size is small, data are ordinal or nominal, there are outliers or skewed data

77
New cards

Encoding

Determines correct statistical test, prevents wrong conclusions, ensures valid results

78
New cards

Friedman test

Is a non-parametric test used to compare three or more related groups or repeated measurements when data are not normally distributed

79
New cards

Friedman test

It is the non-parametric alternative to repeated measures anova and compares the ranks of scores rather than means

80
New cards

Friedman test

It helps analyze repeated data when parametric assumptions are not met

81
New cards

Repeated measures ANOVA

Is a parametric test used to compare the means of three or more repeated measurements from the same participants

82
New cards

Repeated measures ANOVA

It is used when data are normally distributed, making it suitable for studies with a repeating cycle or repeated observations

83
New cards

Repeated measures ANOVA

It helps determine whether significant changes occur over time within the same group

84
New cards

Parametric tests

These tests assume your data is normally distributed and uses continuous scales, they generally rely on the mean and standard deviation

85
New cards

Non parametric tests

These tests are distribution-free, they don't care about the mean; instead, they usually rank the data from smallest to largest and analyze the positions, the median