Data: Everywhere in an organization and comes in different levels, formats, and granularities.
Employees must be able to analyze organizational data to make decisions.
Collecting, compiling, sorting, and analyzing data provides insight into organizational performance.
Data Type: Transactional and Analytical
Transactional Data: Data within a single business process or unit of work that supports daily operational tasks.
Analytical Data: Encompasses all organizational data, supporting managerial analysis tasks.
Data Timeliness
Real-time data: Immediate, up-to-date data.
Real-time system: Provides real-time data in response to requests.
Data Quality
Business decisions are only as good as the quality of the data used to make them.
Data inconsistency: Occurs when the same data element has different values.
Data integrity issues: Occur when a system produces incorrect, inconsistent, or duplicate data.
Costs of low-quality data:
Inability to accurately track customers, identify valuable customers, identify selling opportunities, marketing to nonexistent customers.
Difficulty tracking revenue and building strong customer relationships.
Benefits of Good Data: High quality data can significantly improve the chances of making a good decision, which can directly impact an organization's bottom line.
Data steward: Responsible for ensuring data policies and procedures are implemented across an organization.
Data Governance
Data governance: Overall management of the availability, usability, integrity, and security of company data.
Master data management (MDM): Gathering data and ensuring it is uniform, accurate, consistent, and complete.
Includes entities such as customers, suppliers, products, sales, employees, and other critical entities.
Data validation: Tests and evaluations used to determine compliance with data governance policies.
Database: Maintains data about various types of objects (inventory), events (transactions), people (employees), and places (warehouses).
Database management systems (DBMS): Allows users to create, read, update, and delete data in a relational database.
Data element: The smallest or basic unit of data.
Data model: Logical data structures that detail the relationships among data elements using graphics or pictures.
Metadata: Details about data.
Data dictionary: Compiles all of the metadata about the data elements in the data model.
Entity: A person, place, thing, transaction, or event about which data is stored.
Rows in a table contain entities.
Attribute (field, column): The data elements associated with an entity.
Columns in each table contain the attributes.
Record: A collection of related data elements.
Primary keys and foreign keys identify the various entities (tables) in the database.
Primary key: A field (or group of fields) that uniquely identifies a given entity in a table.
Foreign key: A primary key of one table that appears as an attribute in another table and acts to provide a logical relationship among the two tables.
Increased Flexibility
Handle changes quickly and easily.
Provide users with different views.
Physical view: Deals with the physical storage of data on a storage device.
Logical views: Focuses on how individual users logically access data to meet their own particular business needs.
Increased Scalability and Performance
Scalability: Refers to how well a system can adapt to increased demands
Performance: Measures how quickly a system performs a certain process or transaction.
Reduced Data Redundancy
Data redundancy: The duplication of data or storing the same data in multiple places.
Inconsistency is one of the primary problems with redundant data.
Increased Data Integrity (Quality)
Data integrity: Measures the quality of data.
Integrity constraint: Rules that help ensure the quality of data.
Relational integrity constraint.
Business-critical integrity constraint.
Increased Data Security
Password: Provides authentication of the user.
Access level: Determines who has access to the different types of data.
Access control: Determines types of user access, such as read-only access.
Organizational data is difficult to access and contains structured data in databases, and unstructured data such as voice mail, phone calls, text messages, and video clips.
Data Rich, Information Poor: Many organizations find themselves in the position of being data rich and information poor.
BI enables business users to receive data for analysis that is: reliable, consistent, understandable, easily manipulated.
Data warehouses extend the transformation of data into information.
Data warehouse provided the ability to support decision making without disrupting the day-to-day operations.
Data warehouse: A logical collection of data – gathered from many different operational databases – that supports business analysis activities and decision-making tasks.
The primary purpose of a data warehouse is to aggregate data throughout an organization into a single repository for decision-making purposes.
Inconsistent Data Definitions, Lack of Data Standards, Poor Data Quality, Inadequate Data Usefulness, Ineffective Direct Data Access.
Data Aggregation: Collection of data from various sources for the purpose of data processing.
Extraction, transformation, and loading (ETL): A process that extracts data from internal and external databases, transforms the data using a common set of enterprise definitions, and loads the data into a data warehouse.
Data mart: Contains a subset of data warehouse data.
Data lake: A storage repository that holds a vast amount of raw data in its original format until the business needs it.
An organization must maintain high-quality data in the data warehouse.
Dirty data: Erroneous or flawed data.
Data cleansing or scrubbing: A process that weeds out and fixes or discards inconsistent, incorrect, or incomplete data.
Data Visualization: Describes technologies that allow users to “see” or visualize data to transform data into a business perspective.
Data visualization tools: Move beyond Excel graphs and charts into sophisticated analysis techniques such as pie charts, controls, instruments, maps, time-series graphs, and more.