Data Quality

5.0(1)
studied byStudied by 16 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/54

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

55 Terms

1
New cards

Data

numbers word or images that have yet to be organized or analyzed to answer a specific question

2
New cards

Information

Produced through processing, manipulating, and organizing data to answer questions adding to knowledge of the receiver

3
New cards

Data Quality

it is the overall utility of a datasets as a function of its ability to be processed easily and analyzed for a database, data warehouse, or data analytics system

4
New cards

Accuracy, Completeness, Update status, Relevance, Consistency, Reliability, Appropriate presentation, and Accessibility

Aspects of Data Quality

5
New cards

Accuracy

indicates whether the data is free from significant errors and whether the numbers seem to make sense

6
New cards

Completeness

indicates whether there is enough information to draw a conclusion about the data and whether enough individuals responded to it to ensure representativeness

7
New cards

Relevance

refers to the degree to which data are important to users and their needs

8
New cards

Consistency

considers the extent to which data is collected using the same process and which procedures by everyone doing the collecting and in all locations over time

9
New cards

Reliability

determined by the degree to which measurements are similar (consistent) on repeated measurements

10
New cards

Appropriate presentation

degree from which the data is easily understood and well organized

11
New cards

Lot Quality Assurance Sampling

is a tool that allows the sue of small random samples to distinguish between different group of data elements with high and low data quality

12
New cards

Lot Quality Assurance Sampling

LQAS

13
New cards

Step by step of LQAS

-define the service to be assessed (DQA of DHIS)

-identify the unit of interest

-define the higher and lower thresholds of performance

-determine the level of acceptable error

-determine the size sample size and decision rule for acceptable errors

-identify the number of errors observed

14
New cards

Routine Data Quality Assessment

RDQA

15
New cards

Routine Data Quality Assessment

it is a simplified version of the data quality audit tool which allows programs and projects to verify and assess the quality of their reported data

16
New cards

Verify Rapidly

  • The quality of reported data for key indicators at selected sites;

  • The ability of data-management systems to collect, manage, and report quality data

17
New cards

Implement

Corrective measures with action plans for strengthening the data management and reporting system and improving data quality

18
New cards

Monitor

Capacity improvements and performance of the data management and reporting system to produce quality data.

19
New cards
  • Verify Rapidly

  • Implement

  • Monitor

Objectives of RDQA

20
New cards

Development Implementation Plan

-project management tool that illustrates how a project is expected to progress at a high level

-helps ensure that a development team is working to deliver and complete tasks on time

21
New cards

Development Implementation Plan

DIP

22
New cards

Steps of DIP

-define goals/objectives

-schedule milestone

-allocate resources

-designate team member responsibilities

-define metric for success

23
New cards

Data Quality Tool

it analyzes information and identifies incomplete or incorrect data

24
New cards

Data Cleansing

can be done to raise the quality of available data

25
New cards

Define goals/objectives

address the question "what do you want to accomplish?"

26
New cards

Schedule milestone

Outline the deadline and timeline in the implementation phase

27
New cards

Allocate resources

determine whether you have sufficient resources and decide how you will procure those missing

28
New cards

Designate team member responsibilities

create a general team plan with overall roles that each team member will play

29
New cards

Define metrics for success

how will you determine if you have achieved your goal?

30
New cards

Parsing and Standardization

refers to the decomposition of fields into component parts and formatting the values into consistent layouts based on industry standards and patterns and user defined business rules

31
New cards

Generalized "cleansing"

is the modification of data values to meet domain restrictions constraints on integrity or other rules that define data quality as sufficient for the organization

32
New cards

Matching

is the identification and merging of related entries within or across data sets

33
New cards

Profiling

refers to the analysis of data to capture statistics or metadata to determine the quality of the data and identify data quality

34
New cards

Monitoring

refers to the deployment of controls to ensure conformity of data to business rules set by the organization

35
New cards

Enrichment

is the enhancement of the value of the data by using related attributes from external sources such as consumer demographic attributes or geographic descriptors

36
New cards

Generalization of ETL

tools which allow optimization of the alimentation process

37
New cards

Extract, Transform, Load

ETL

38
New cards

Root Cause Analysis

is a problem solving method that identifies the root case of problems or events instead of simply addressing the obvious symptoms

39
New cards

Failure Mode and Effect Analysis

-aims to find various modes of failures within a system

-is used when there is a new product or process or when there are changes or updates in a product and when a problem is reported through customer feed back

40
New cards

Failure Mode and Effect Analysis

FMEA

41
New cards

Pareto Analysis

-uses 20% work produces 80% of result

-used when there are multiple potential causes to a problem

42
New cards

Fault Tree Analysis

-is used in risk and safety analysis

-uses boolean logic to determine the root causes of an undesirable event

43
New cards

Fault Tree Analysis

FTA

44
New cards

Current Reality Tree

CRT

45
New cards

CRT

-used when the root causes of multiple problems need to be analyzed all at once

-problems listed down followed by the potential cause for a problem

46
New cards

Fishbone Diagram

-aka Ishiwaka or cause and effect diagram

-categorizes the causes and sub causes of problem

-is useful in grouping causes into categories

47
New cards

Kepner-Tregoe Technique

-breaks a problem down to its root cause by assessing a situation using priorities and orders of concern for a specific issues

48
New cards

Rapid Problem Resolution Diagnosis

RPR

49
New cards

RPR

diagnoses the problem by

-discover

-investigate

-fix

50
New cards

Discover

data gathering and analysis of findings

51
New cards

Investigate

creation of diagnostic plan and identification of the root cause through careful analysis of the diagnostic data

52
New cards

Fix

Fixing the problem and monitoring to confirm and validate that the correct root cause was identified

53
New cards

Information Culture

is determined by the ff variables:

-mission

-history

-leadership

-employee traits

-industry

-national culture

-can also be cognitive and epistemic

54
New cards

Sustaining culture of information use

suggest that in order to have a sense of information attitudes and values managers should consider taking the pulse of information of their organizations

55
New cards

The management

Plays an important role in sustaining the culture of information