chapter 6 slater
18 Terms
1
Data Cleaning
The process of correcting errors and inconsistencies in data to enhance its quality.
2
Data Structuring
The organization of data into a defined format for analysis, including methods like aggregation and joining.
3
De-duplication
The method of removing duplicate entries in data while preserving original data integrity.
4
Data Joining
The process of combining data from various sources to create a comprehensive dataset for analysis.
5
Data Pivoting
Reshaping data so that values stored in rows become columns (for example, from a long format to a wide format) to ease analysis.
6
Imputation
The technique of replacing missing data with substitute values, with each substitution documented for future reference.
7
Filtering
The act of removing unnecessary or irrelevant data from a dataset.
8
Consistency
The practice of maintaining uniform data formats and structures across different systems.
9
Visual Inspection
Manually checking data to identify errors or inconsistencies in a dataset.
10
Statistical Tests
Using statistical analysis methods to validate the integrity of the data and detect anomalies.
11
Sample Audit
Examining a subset of data to verify its accuracy and how well it represents the whole dataset.
12
Threshold Violations
Situations where data values fall outside accepted numerical ranges and require correction.
13
Entry Errors
Mistakes made during data input, whether due to human error or system limitations.
14
Parsing
The process of dividing data into multiple fields by recognizing patterns.
15
Concatenation
The act of merging multiple fields into a single unit, often for easier identification.
16
Contradiction Errors
Conflicts that arise when data entries describe the same entity in inconsistent ways.
17
Attribute Dependencies
Errors that occur when related data points do not align or match accurately.
18
Handling Cryptic Data Values
Decoding data values that appear nonsensical and require contextual understanding to interpret.
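Several of the terms above (de-duplication, threshold violations, imputation, parsing, concatenation) can be illustrated with a short Python sketch. This is only one possible reading of each definition, not a method taken from the chapter: the records, the field names, the accepted age range of 0 to 120, and the choice of the median as the substitute value are all assumptions made for the example.

```python
from statistics import median

# Hypothetical raw records; every name and value here is illustrative only.
raw = [
    {"name": "Ann Lee", "city": "Austin", "age": 34},
    {"name": "Ann Lee", "city": "Austin", "age": 34},    # exact duplicate
    {"name": "Bob Ray", "city": "Boston", "age": None},  # missing value
    {"name": "Cy Dunn", "city": "Denver", "age": 212},   # outside accepted range
]


def deduplicate(rows):
    """De-duplication: keep the first occurrence of each identical record."""
    seen, unique = set(), []
    for row in rows:
        key = tuple(sorted(row.items()))  # hashable fingerprint of the record
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def threshold_violations(rows, low=0, high=120):
    """Threshold violations: rows whose age lies outside [low, high] (assumed range)."""
    return [r for r in rows
            if r["age"] is not None and not low <= r["age"] <= high]


def impute_age(rows, low=0, high=120):
    """Imputation: fill missing ages with the median of in-range ages.

    An 'age_imputed' flag is added so the substitution stays documented.
    """
    valid = [r["age"] for r in rows
             if r["age"] is not None and low <= r["age"] <= high]
    fill = median(valid)
    result = []
    for r in rows:
        new = dict(r)  # copy so the original record is left untouched
        new["age_imputed"] = r["age"] is None
        if r["age"] is None:
            new["age"] = fill
        result.append(new)
    return result


def parse_name(full_name):
    """Parsing: split one field into two using the 'first last' pattern."""
    first, last = full_name.split(" ", 1)
    return first, last


def concatenate(first, last, city):
    """Concatenation: merge several fields into a single identifier."""
    return f"{last}_{first}_{city}".lower()


unique = deduplicate(raw)
violations = threshold_violations(unique)
imputed = impute_age(unique)
```

Imputing after flagging threshold violations, and excluding the out-of-range value from the median, is a design choice in this sketch so that one bad entry does not distort the substitute value.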