Quantitative Data Analysis I Flashcards

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/80

flashcard set

Earn XP

Description and Tags

Flashcards based on Quantitative Data Analysis I lecture notes.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

81 Terms

1
New cards

What are the four types of data based on measurement properties?

Nominal, ordinal, interval, and ratio.

2
New cards

What is the key difference between parametric and non-parametric tests?

Parametric tests assume data is normally distributed, while non-parametric tests do not.

3
New cards

What type of test is used to determine if data is normally distributed?

Normality test (e.g., Shapiro-Wilk test).

4
New cards

Name three statistical tests used for testing differences between two groups.

Student’s t-test, Mann-Whitney U-test, and Chi-squared test.

5
New cards

What is an explanatory variable?

A variable that is expected to be a cause.

6
New cards

What is a response variable?

A variable that is expected to be an effect.

7
New cards

Give an example of a nominal variable.

Hair color (e.g., red, blonde, brown).

8
New cards

Give an example of an ordinal variable.

Likert scales (e.g., strongly disagree to strongly agree).

9
New cards

Give an example of an interval variable.

Temperature (Celsius or Fahrenheit).

10
New cards

Give an example of a ratio variable.

Weight.

11
New cards

What type of data is categorized and ranked, but not evenly spaced?

Ordinal data.

12
New cards

What type of data is categorized, ranked, evenly spaced, and has a natural zero?

Ratio data.

13
New cards

What type of data is categorical and not ranked?

Nominal data.

14
New cards

What is a dichotomous variable?

A variable with only two categories (e.g., yes/no).

15
New cards

Which type of statistical test is more powerful?

Parametric tests are generally more powerful than non-parametric tests.

16
New cards

What does a normality test assess?

Whether the distribution of the data is normal or skewed.

17
New cards

Name a graphical method for assessing normality.

Histogram plot.

18
New cards

Name a numerical test for assessing normality.

Shapiro-Wilk test or Kolmogorov-Smirnov test.

19
New cards

What does a significant p-value in a Shapiro-Wilk test indicate?

Deviation from normality.

20
New cards

Which test should you perform if p-value is higher than 0.05 in Shapiro-Wilk test?

Parametric test

21
New cards

What is the null hypothesis (H0) in the Shapiro-Wilk test?

The data distribution is not different from a normal distribution.

22
New cards

What is the alternative hypothesis (H1) in the Shapiro-Wilk test?

The data distribution is different from a normal distribution.

23
New cards

What is checked when testing for differences between two groups?

Independent variable

24
New cards

What are the three assumptions of Student's t-test?

Random samples, normally distributed data, and homogeneity of variance.

25
New cards

What does Levene's test check?

Homogeneity of variance (homoscedasticity).

26
New cards

In Student's t-test, a larger T value indicates what?

A stronger trend (larger difference between means, smaller standard deviation).

27
New cards

What probability indicates statistical significance in Student's T-test?

P=0.026 (an example)

28
New cards

If the dependent variable deviates from normality, can a T-test preformed?

No

29
New cards

Which test should you perform if the dependent variable is numerical, but is not normally distributed?

Mann Whitney U-test

30
New cards

On what data the Mann Whitney U-test performs the test?

Data is ranked from low to high

31
New cards

What is reported when performing Mann Whitney U-test?

Differences between groups reported as medians and not means

32
New cards

With what kind of data the Chi-squared test should preform the test?

Nominal data/categorical variable

33
New cards

What are the expected values in Chi-squared test?

Calculated (theoretical) frequencies that would be expected if the null hypotheses were true – no difference between groups.

34
New cards

What are the observed values in Chi-squared test?

Actual values from your data collection (counts or frequencies)

35
New cards

The higher perception of taller than average GSS students are examined by which test?

Chi-squared test

36
New cards

What is the formula for the expected values in Chi-squared test?

(column total*row total)/grand total

37
New cards

What does the degrees of freedom in X2 indicates?

Number of observed columns minus one, multiplied by the number of observed rows minus one

38
New cards

Is the consumption of friut associated with frequency of physical activity?

Analyzed by Chi-squared test

39
New cards

What does 'location parameter' indicates in Mann Whitney U-test?

Difference between groups is significant

40
New cards

What is a continuous variable?

Numerical data.

41
New cards

What is another term for independent variable?

Explanatory/exposure variable.

42
New cards

What is another term for dependent variable?

Response/outcome variable.

43
New cards

Is pain rating in a 1-10 scale qualitative or quantitative?

Quantitative.

44
New cards

Is hair color considered numerical data?

No, it's non-numerical.

45
New cards

What does 'increasing degree of information' indicate?

More arithmetical operations.

46
New cards

What operation can be applied on nominal data?

Frequency or percentage.

47
New cards

If data is not normally distributed, what type of test is appropriate?

Non-parametric test.

48
New cards

What descriptive statistics are used for central tendency?

Mean, median, and mode.

49
New cards

What descriptive statistics are used for dispersion?

Range, variance, and standard deviation.

50
New cards

Define skewness.

A measure of the asymmetry of the probability distribution of a real-valued random variable about its mean.

51
New cards

What are the two approaches to test for normality?

Graphical and numerical.

52
New cards

Pros of graphical assessment of normality.

Easy and quick.

53
New cards

Cons of graphical assessment of normality.

Experience needed and not always objective.

54
New cards

Cons of numerical assessment of normality.

Unsensitive at small sample sizes and overly sensitive at large sample sizes.

55
New cards

What should you do when still in doubt of data normality?

Run both parametric and non-parametric versions of same statistical test.

56
New cards

What is the H0 of Shapiro-Wilk?

The data distribution is not different from a normal distribution.

57
New cards

What does a significant p-value in Shapiro-Wilk test indicate?

Deviation from normality.

58
New cards

When is a parametric test not applicable?

When data deviates from normality.

59
New cards

What are the three types of questions used as examples as the lecture?

Willingness to install solar panels, willingness to use CO2 neutral forms of transportation, the effect of wind stress on plant diversity.

60
New cards

What is the key component of 'Differences between two groups'?

Independent variable = two groups.

61
New cards

If the p-value > 0.05 in Levene's test, what does that indicate?

The variances are equal.

62
New cards

What is ranked from low to high in the Mann Whitney U-test?

Data.

63
New cards

Research question: Is the perception of corruption in Africa the same as in Asia?

Dependent variable: Perception of corruption, Independent variable: Country.

64
New cards

In Chi square test What does comparing observed values with expected values test?

Test relationship between two categorical variables

65
New cards

If there is no difference in preference among the groups what can be said about expected values?

Expected values are calculated frequencies that would be expected if there were no difference between groups

66
New cards

What does GSS stands for?

I am sorry, this was never defined in the text.

67
New cards

What indicates that students taller than average are more likely to indicate to be taller than average?

H0: Students taller than average are as likely to indicate to be taller than average than students shorter than average.

68
New cards

What is the difference between Yes (observed) and Yes (expected)?

Yes(observed) are just the amount of people who said yes and were shorter or taller, Yes(expected) accounts that it is by chance.

69
New cards

What does the formula χ2 = ∑(f1-e1) 2/e1 indicates?

That students taller than average perceived themselves as taller than average more frequently than students that are shorter than average.

70
New cards

What are the options to check in JASP for Mann-Whitney test?

✓ Mann-Whitney ✓ Location parameter ✓ Effect size

71
New cards

To what is the frequency of fruit consumption associated?

Frequency of physical activity

72
New cards

The summary part consists of?

Part 1: (more) Fundamental terminology, Part 2: Testing differences between two groups

73
New cards

What statistical test is used with nominal data and two groups?

Chi-square test.

74
New cards

What statistical test is used with ordinal data and two groups?

Mann-Whitney U test.

75
New cards

You have two related sets of data and want to examine the association between them. What test could you use?

Spearman’s R or Pearson’s R

76
New cards

Which test requires normal distribution of scale?

T-test.

77
New cards

Is Spearman’s R parametric of non-parametric test?

Non-parametric

78
New cards

Is Shapiro-Wilk test graphical or numerical?

Numerical.

79
New cards

Is normality test objective or subjective?

Numerical normality test is objective.

80
New cards

Is statistical test run before or after plotting histogram to see the data distribution?

After.

81
New cards

If results differ between parametric and non-parametric tests, what actions should be taken?

Be very careful in your interpretation of the results.