Data standardization, cleaning, and validation

Last updated 12:50 AM on 3/24/26

16 Terms

Cryptic data values

data items that have no meaning without understanding a coding scheme
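
As a minimal sketch, a cryptic value becomes readable once it is looked up in its coding scheme. The department codes and labels below are hypothetical, made up only to illustrate the idea.

```python
# Hypothetical coding scheme: these codes and labels are illustrative only.
DEPARTMENT_CODES = {"1": "Accounting", "2": "Marketing", "3": "Operations"}

def decode(value, scheme):
    """Return the label for a coded value, or flag the code if it is unknown."""
    return scheme.get(value, f"UNKNOWN({value})")

# "9" is not in the scheme, so it stays flagged rather than silently guessed.
decoded = [decode(v, DEPARTMENT_CODES) for v in ["1", "3", "9"]]
```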

Misfielded data values

data values that are correctly formatted but not listed in the correct field
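
One possible way to spot a misfielded value is to test whether a field holds something shaped like a different field. The sketch below assumes 5-digit US ZIP codes and uses hypothetical rows and field names.

```python
import re

# Hypothetical rows: the second row has a ZIP-like value in the city field.
rows = [
    {"city": "Provo", "zip": "84602"},
    {"city": "84602", "zip": "Provo"},
]

ZIP_PATTERN = re.compile(r"^\d{5}$")  # assumption: 5-digit US ZIP codes

def looks_misfielded(row):
    """True when the city field holds a value shaped like a ZIP code."""
    return bool(ZIP_PATTERN.match(row["city"]))

# Indexes of rows worth a closer look.
suspects = [i for i, row in enumerate(rows) if looks_misfielded(row)]
```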

Data consistency

the principle that every value in a field should be stored in the same way
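
For illustration, the sketch below stores phone numbers in one consistent way by keeping digits only. The sample numbers and the digits-only convention are assumptions, not a rule from the source.

```python
import re

def standardize_phone(raw):
    """Strip everything except digits so every number is stored the same way."""
    return re.sub(r"\D", "", raw)

# Three spellings of the same hypothetical number.
phones = ["(801) 555-0100", "801.555.0100", "801-555-0100"]

# After standardizing, all three collapse to a single stored form.
standardized = {standardize_phone(p) for p in phones}
```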

Data cleaning

process of updating data to be consistent, accurate, and complete

Data de-duplication

process of analyzing data and removing two or more records that contain identical information
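
A minimal sketch of de-duplication, assuming records are duplicates only when every field matches exactly. The customer records are hypothetical.

```python
# Hypothetical customer records; the third repeats the first exactly.
records = [
    ("C001", "Ana Lopez", "ana@example.com"),
    ("C002", "Ben Wu", "ben@example.com"),
    ("C001", "Ana Lopez", "ana@example.com"),
]

def deduplicate(rows):
    """Keep the first copy of each identical record, preserving order."""
    seen = set()
    unique = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique

unique_records = deduplicate(records)
```

Records that differ only in spelling or formatting would not be caught by this exact-match approach; they would need standardizing first.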

Data filtering

process of removing records or fields of information from a data source
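
As a rough sketch, filtering can drop whole records, whole fields, or both. The rows, the field names, and the "West only" rule below are hypothetical.

```python
# Hypothetical sales rows.
rows = [
    {"id": 1, "region": "West", "amount": 120.0, "notes": "n/a"},
    {"id": 2, "region": "East", "amount": 75.5, "notes": "rush"},
]

kept_fields = ("id", "amount")  # fields to keep; the rest are filtered out

# Remove records outside the West region, then remove the unwanted fields.
filtered = [
    {field: row[field] for field in kept_fields}
    for row in rows
    if row["region"] == "West"
]
```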

Data imputation

process of replacing a null or missing value with a substituted value
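
A minimal sketch of imputation, assuming the substitute is the mean of the observed values. Mean imputation is only one possible choice, and the ages below are hypothetical.

```python
from statistics import mean

# Hypothetical ages; None marks a missing value.
ages = [30, None, 40, 50, None]

# Compute the substitute from the values that are present.
observed = [a for a in ages if a is not None]
fill_value = mean(observed)

# Replace each missing value with the substitute.
imputed = [fill_value if a is None else a for a in ages]
```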

Data contradiction errors

errors that exist when the same entity is described in two conflicting ways
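
One way to surface such errors, sketched under the assumption that each customer ID should map to exactly one birth date. The IDs and dates are hypothetical.

```python
from collections import defaultdict

# Hypothetical (customer_id, birth_date) rows; C001 appears with two dates.
rows = [
    ("C001", "1990-04-12"),
    ("C002", "1985-09-30"),
    ("C001", "1991-04-12"),
]

# Collect every distinct birth date recorded for each customer.
dates_by_customer = defaultdict(set)
for customer_id, birth_date in rows:
    dates_by_customer[customer_id].add(birth_date)

# A customer with more than one distinct date is described in conflicting ways.
contradictions = sorted(
    cid for cid, dates in dates_by_customer.items() if len(dates) > 1
)
```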

Data threshold violations

data errors that occur when a data value falls outside of an allowable level
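
As an illustrative sketch, a threshold check compares each value against an allowable range. The 0 to 80 hour limit and the employee IDs are assumptions chosen for the example.

```python
# Hypothetical allowable range for weekly hours worked.
MIN_HOURS, MAX_HOURS = 0, 80

hours_worked = {"E1": 40, "E2": 95, "E3": -3}

# Any value outside the allowable range is a threshold violation.
violations = {
    employee: hours
    for employee, hours in hours_worked.items()
    if not MIN_HOURS <= hours <= MAX_HOURS
}
```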

Violated attribute dependencies

errors that occur when a secondary attribute in a row of data does not match the primary attribute
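
A minimal sketch, assuming the ZIP code is the primary attribute and the city is the secondary attribute that must agree with it. The reference table and rows are hypothetical.

```python
# Hypothetical reference table: each ZIP code determines one city.
ZIP_TO_CITY = {"84602": "Provo", "10001": "New York"}

rows = [
    {"zip": "84602", "city": "Provo"},
    {"zip": "10001", "city": "Provo"},  # city does not match the ZIP code
]

# Flag rows whose city disagrees with the city implied by the ZIP code.
# A ZIP code missing from the reference table is flagged as well.
violations = [row for row in rows if ZIP_TO_CITY.get(row["zip"]) != row["city"]]
```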

Data entry errors

all types of errors that come from inputting data incorrectly

Data validation

process of analyzing data to make certain the data has the properties of high-quality data

Visual inspection

process of examining data using human vision to see if there are problems

Basic statistical tests

statistical checks performed to validate the data, such as counts, minimums, maximums, and means
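
As a rough sketch, a few summary statistics can reveal values that deserve a second look. The amounts are hypothetical, and which statistics to compute is an assumption rather than a fixed list.

```python
from statistics import mean

# Hypothetical invoice amounts; 5000.0 and 0.0 stand out from the rest.
amounts = [120.0, 75.5, 98.25, 0.0, 5000.0]

# A small summary to compare against what the data is expected to look like.
summary = {
    "count": len(amounts),
    "min": min(amounts),
    "max": max(amounts),
    "mean": mean(amounts),
}
```

A maximum far above the typical value, or a count that differs from the source system, is a prompt to investigate rather than proof of an error.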

Audit a sample

checking a sample of records in detail; one of the best techniques for assuring data quality
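
One way to pick records for an audit is a simple random sample, sketched below. The record IDs, the sample size of 10, and the fixed seed are assumptions made for the example.

```python
import random

# Hypothetical record IDs 1 through 100.
record_ids = list(range(1, 101))

# A seeded generator makes the selection reproducible for documentation.
rng = random.Random(42)

# Draw 10 distinct records to check by hand against source documents.
audit_sample = rng.sample(record_ids, k=10)
```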

Advanced testing techniques

validation tests that become possible with a deeper understanding of the content of the data
