1/15
A set of practice flashcards covering the role, structure, stages, core contents, and examples of data dictionaries and metadata repositories as discussed in the lecture.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
What is the role of data dictionaries and metadata repositories in the data life cycle?
They store metadata about data so it can be easily understood and referenced, and they span the entire data life cycle from concept to repurposing.
What is metadata?
Properties that describe data; data about the data.
What is a data dictionary?
A resource describing data processes, including the meaning, relationships, origin, usage, and format of data.
How is a data dictionary typically structured?
In tabular form, with rows describing data fields/elements and columns describing the properties of those elements.
In the example, how is the patient’s last name described in the data dictionary?
As a data element stored as text with a length of 25 characters.
What core properties are typically included for each data element in a data dictionary?
Field name/Identifier, Type of data, Format, Rules, Description (and Last Updated Date for temporal data).
List the stages of the data life cycle that rely on the data dictionary.
Concept, Collection, Processing, Archiving, Distribution, Discovery, Analysis, and Repurposing.
What is the data dictionary’s role in Concept and Collection stages?
Defines data required for the study concept and ensures data collection aligns with what is needed; catalogs allowed values and maximum lengths.
What is the data dictionary’s role in Processing, Archiving, and Distribution stages?
Guides how data should be processed and stored; specifies data types for database columns; essential for programmatic processing.
What is the data dictionary’s role in Discovery and Analysis stages?
Provides clarity on data meanings not immediately evident and supports deeper understanding during analysis.
What is the data dictionary’s role in the Repurposing stage?
Critical for productive repurposing because metadata about the data is needed to reuse it appropriately.
What are the minimum core contents of a data dictionary?
Field name or Identifier; Type of data; Format; Rules; Description (and Last Updated Date for temporal data).
Why is Last Updated Date sometimes included in a data dictionary?
To track changes over time for temporal data.
What examples of data types and subtypes are mentioned?
Type of data includes text, numeric, date, time, Boolean; numeric data may be integer or decimal.
Are data dictionaries and metadata repositories the same thing?
Yes; a data dictionary is a type of metadata repository that stores metadata about data.
In the MRN example, which item would NOT be included in the data dictionary?
The actual Patient MRN value; the dictionary would store metadata like the source system, allowed ranges, and data type, not the MRN itself.