mlr with outliers

0.0(0)
studied byStudied by 1 person
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/15

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

16 Terms

1
New cards

Types of outliers and where to detect them

A. in y-space: Standardized residuals

B. in x-space: Mahalanobis distance

C. in xy-space: Cook’s distance

2
New cards

Outliers in y-space diagnosis and reason

Standardized residuals Poorly predicted cases (what is right and what is wrong)

<p>Standardized residuals <span data-name="black_small_square" data-type="emoji">▪</span> Poorly predicted cases (what is right and what is wrong)</p>
3
New cards

Standardized residuals are normally distributed, rule of thumb

cases with |z| > 3.3 are outliers in y-space, only frequentist on JASP

4
New cards

Outliers in x-space diagnosis and why

Mahalanobis distance

Extreme combination of scores on predictors Large value indicates outlier

<p>Mahalanobis distance </p><p><span data-name="black_small_square" data-type="emoji">▪</span> Extreme combination of scores on predictors <span data-name="black_small_square" data-type="emoji">▪</span> Large value indicates outlier</p>
5
New cards

x-space outliers rule of thumb and critical value

Critical value depends on # predictor.

MDcritical ≈ 10 + 2×(# predictors)

6
New cards
<p>Outliers in xy-space</p>

Outliers in xy-space

seriously affect your conclusions!

<p>seriously affect your conclusions!</p>
7
New cards

Outliers in xy-space • Diagnosis and why

Cook’s distance Outlier on both predictors and dependent variable

8
New cards

Outliers in xy-space Rule of thumb

Cases with CD > 1 are outliers in xy-space

9
New cards
term image

Y space

Standardized residuals

True problem if |z| > 3.3

10
New cards
term image

X space

Mahalanobis distance

True problem if Mahalanobis distance is > 10.82 (or approximately 10 + 2 × 1 = 12)

11
New cards
term image

XY space

Cook’s distance

True problem if CD > 1

12
New cards
term image

Both: • X space • Y space

Mahalanobis distance for X and standardized residual for Y

13
New cards

Reasons for outliers

Typo or miscoded missing value

Member or no member of the intended population

If no observable errors, keep the observation in your dataset

14
New cards

Typo or miscoded missing value

Exclude impossible values • Fix unambiguous typos made by the researcher/coder • Investigate suspected typos made by respondent

15
New cards

Member or no member of the intended population

Student of 60 years old • If not member, remove from the analysis (and explain why)

16
New cards

If no observable errors, keep the observation in your dataset

Report on this potential outlier • May investigate if the outlier influences the conclusions