Correlation and regression and parametric and non parametric distributions

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/84

There's no tags or description

Looks like no tags are added yet.

Last updated 10:05 PM on 5/20/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

85 Terms

New cards

Parametric distibution

Assumes data follows a specific pattern usually normal distribution with parameters like mean and variance

New cards

Parametric distribution

Normal bell shaped, t distribution, f distribution, binomial, poisson, exponential

New cards

Parametric distribution

When used, data is continuous, normally distributed, large sample size, meets assumptions, linearity, equal variance, independence

New cards

Non parametric distribution

Does not assume any specific distribution, also called distribution free

New cards

Non parametric distribution

Works with ordinal or nominal data, handles skewed or small datasets, uses ranks instead of raw values

New cards

Non parametric distribution

Spearman correlation, manny whitney u, kruskal wallis, wilcoxon test

New cards

Parametric distribution

Strict assumptions, interval or ratio data, large sample size, higher power, used when data is normal

New cards

Non parametric distribution

Flexible assumptions, ordinal or norminal data, small sample size, lower power, used when data is skewed

New cards

Correlation

Is a statistical measure that shows the relationship between two variables

New cards

Correlation

Measures how variables change together, describes relationships but does not indicate cause and effect

New cards

Purpose of correlation

Identifies patterns and relationships between variables, measures the strength and direction of relationships, helps in prediction and decision making, simplifies data by identifying important variables, useful in fields like finance, healthcare, and research

New cards

Positive correlation

Both variables move in the same direction, when one increases, the other also increases, when one decreases, the other also decreases

New cards

Negative correlation

Variables move in opposite directions, when one variable increases, the other decreases, inverse correlation

New cards

Zero correlation

No relationship exists between the variables, changes in one variable do not affect the other

New cards

Strong correlation

Means the variables have a very close relationship, where changes in one consistently associated with changes in the other

New cards

Moderate correlation

Indicates a noticeable relationship, but it is not perfectly consistent and may have some variation

New cards

Weak correlation

Shows little to no clear relationship, meaning changes in one variable do not strongly predict changes in the other

New cards

Pearson correlation coefficient

Is a statistical measure of the linesr relationship between two variables, it is a descriptive statistic that summarizes data characteristics

New cards

Perfect positive relationship

+1 correlation

New cards

Perfect negative relationship

-1 correlation

New cards

No relationship

0 correlation

New cards

Pearson correlation coefficient

Used for continuous data, applied when both variables are measured on a continuous scale, best used when the relationship between variables is linear

New cards

Pearson correlation

Assumes that the data are normally distributed, it also assumes a linesr relationship between the two variables, this means the data should follow a bell shaped pattern and change consistently in one direction, if these assumptions are not met, the results may be inaccurate

New cards

Spearman rank correlation

Measures the strength and direction of a linear relationship between two variables, commonly used for continuous data such as height, weight, or test scores, assumes normal distribution and a linear relationship between variables

New cards

Kendall’s Tau

Nonparametric measure of association between two ranked variables, more accurate when there are many tied ranks, commonly used for small sample sizes

New cards

Regression

Is a statistical method used to examine relationships between variables

New cards

Regression

Helps determine how one variable can predict another

New cards

Regression

Used to identify patterns and trends for analysis and decision-making

New cards

Regression

Explains how one variable affects another and is used for prediction

New cards

Purpose of regression analysis

Understands relationships between variable, predicts future outcomes, identifies variables that strongly influence an outcome

New cards

Regression

Estimates the value of one variable based on another variable, useful for forecasting and decision-making

New cards

Regression

Measures how an independent variable affects a dependent variable, shows whether the effect is positive, negative, or no effect, helps determine the strength and direction of the effect between variables

New cards

Simple linear regression

Is used when there is one independent variable and one dependent variable

New cards

Simple linear regression

The method assumes a straight-line relationship, meaning as the independent variable changes, the dependent variable changes at a constant rate

New cards

Multiple regression

Involves two or more independent variables affecting a single dependent variable

New cards

Multiple regression

This type helps measure how several factors together influence an outcome and can show which variables have the strongest impact

New cards

Logistic regression

It is when the dependent variable is categorical pass or fail, estimates the probability of a certain

outcome

New cards

Logistic regression

Used when the dependent variable is categorical, such as yes or no or passor fail outcomes, instead of predicting a numeric value, it estimates the probability of an event occurring

New cards

Linear regression

Predicts continuous values, uses best-fit line, solves regression problems

New cards

Logistic regression

Predicts categorical classes, uses sigmoid s curve, solves classification problems

New cards

Non linear regression

Is a statistical method that models complex, curved relationships between a dependent variable and one or more independent variables

New cards

Non linear regression

Uses flexible curves such as exponential, logarithmic, or logistic functions to fit data that does not follow a linear pattern

New cards

Parametric distribution

Assume data follows a specific probability distribution, described using parameters such as mean and variance

New cards

Parametric distribution

Used for prediction, hypothesis testing, and statistical analysis, accurate only when the data fits the assumed distribution

New cards

Normal distribution

Also called the Gaussian or bell-shaped distribution, symmetrical around the mean, defined by mean and standard deviation, commonly used for heights, test scores, and measurement errors

New cards

t distribution

Similar to the normal distribution but with wider tails, used for small sample sizes usually below 30

New cards

t distribution

Defined by degrees of freedom, commonly used in t-tests for comparing means

New cards

F distribution

Positive and right-skewed distribution, based on the ratio of two variances, defined by two degrees of freedom, commonly used in anova to compare group means

New cards

Parametric tests

Use when data is continuous, interval or ratio scale, data should be approximately normally distributed bell-shaped, variances of groups should be equal homoscedasticity, observations must be independent, best used with moderate to large sample sizes

New cards

Non parametric distribution

Methods do not assume a normal distribution of data, also called distribution-free methods, analyze data using ranks, order, or categories instead of means and standard deviation

New cards

Non parametric tests

Does not require normality, can handle skewed data and outliers, suitable for small sample sizes, can be used for ordinal and nominal data

New cards

Spearman correlation

Measures the relationship between two ranked variables, used for ordinal or non-normally distributed data

New cards

Mann-Whitney U test

Compares two independent groups, non-parametric alternative to the independent samples t-test, uses ranks instead of means

New cards

Kruskal-Wallis test

Compares three or more independent groups, non-parametric alternative to one-way anova, determines if significant differences exist among groups

New cards

Wilcoxon test

Compares two related samples or repeated measurements, non-parametric alternative to the paired samples t-test, often used for before-and-after measurements

New cards

Parametric assumptions

Requires the data to follow certain conditions such as normal distribution, equal variance, and independence of observations, these assumptions allow more accurate and powerful statistical conclusions

New cards

Non parametric assumptions

Does not require strict assumptions about the distribution of data, it can be used even when data are skewed, not normally distributed, or contain outliers

New cards

Parametric type of data

Used for interval and ratio data, where numerical values have equal intervals, these data allow computation of mean and standard deviation

New cards

Non parametric type of data

Used for ordinal and nominal data, where values represent categories or rankings

New cards

Parametric sample size

Works best with larger sample sizes because the assumption of normality becomes more reliable, larger samples improve accuracy of results

New cards

Non parametric sample size

Can be used with small sample sizes since it does not rely heavily on distribution assumptions, it is useful when there are few participants

New cards

Parametric statistical power

Has higher statistical power, meaning it is more likely to detect a true difference or relationship if one exists, this makes parametric tests more sensitive

New cards

Non parametric statistical power

Has lower statistical power, meaning it may fail to detect small differences, it is safer when assumptions for parametric tests are violated

New cards

One sample t test

Compares one group with a known standard or population mean

New cards

Independent samples t test

Compares the means of two independent groups

New cards

Paired samples t test

Compares the mean of the same group before and after

New cards

One-Way ANOVA

Compares the means of three or more independent groups

New cards

Pearson r

Measures the relationship between two variables

New cards

Mann-Whitney U test

Alternative to independent t-test, compares 2 independent groups, it calculates a U statistic based on ranks

New cards

Wilcoxon signed-ranks test

Alternative to paired t-test, compares 1 group measured twice before to after

New cards

Kruskal-Wallis H test

Alternative to anova, compares 3 or more independent groups

New cards

Chi-Square test of goodness of fit

Compares observed and expected frequencies

New cards

Spearman rho

Alternative to pearson correlation, measures the relationship between 2 ranked variables

New cards

Chi-Square test or independence

Determines the association between twocategorical variables

New cards

Parametric

Used when data are normally distributed, the sample size is large, data are interval or ratio, you want more powerful results

New cards

Non parametric

Used when data are not normally distributed, sample size is small, data are ordinal or nominal, there are outliers or skewed data

New cards

Encoding

Determines correct statistical test, prevents wrong conclusions, ensures valid results

New cards

Friedman test

Is a non-parametric test used to compare three or more related groups or repeated measurements when data are not normally distributed

New cards

Friedman test

It is the non-parametric alternative to repeated measures anova and compares the ranks of scores rather than means

New cards

Friedman test

It helps analyze repeated data when parametric assumptions are not met

New cards

Repeated measures ANOVA

Is a parametric test used to compare the means of three or more repeated measurements from the same participants

New cards

Repeated measures ANOVA

It is used when data are normally distributed, making it suitable for studies with a repeating cycle or repeated observations

New cards

Repeated measures ANOVA

It helps determine whether significant changes occur over time within the same group

New cards

Parametric tests

These tests assume your data is normally distributed and uses continuous scales, they generally rely on the mean and standard deviation

New cards

Non parametric tests

These tests are distribution-free, they don't care about the mean; instead, they usually rank the data from smallest to largest and analyze the positions, the median