Lecture Notes: Outliers in Data

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/5

flashcard set

Earn XP

Description and Tags

Question-and-answer flashcards focusing on the concept of outliers in data analysis.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

6 Terms

1
New cards

What term describes data points that deviate markedly from the rest of the dataset?

Outliers.

2
New cards

True or False: Outliers must always be removed from a dataset.

False; they may be errors or legitimate observations that require investigation.

3
New cards

Which measure of central tendency is most affected by outliers?

The mean.

4
New cards

What graphical method is commonly used to identify outliers in a distribution?

Box plot (box-and-whisker plot).

5
New cards

Name one technique used to reduce the impact of outliers on analysis.

Winsorizing (capping extreme values).

6
New cards

If an outlier is due to a data entry error, what is the appropriate action before analysis?

Correct the error if possible; otherwise consider removing the observation after documenting the reason.