[CSCI 113i] Long Test 1 Coverage

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/128

flashcard set

Earn XP

Description and Tags

Comprises Lectures 0–3 (Introduction to BI, Corporate Strategy and BI in the Value Chain, Data Lifecycle Management, Data Warehouse and OLAP)

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

129 Terms

1
New cards

Business Intelligence

The set of techniques and tools for the transformation of raw data into meaningful and useful information for business analysis purposes

2
New cards

Data Driven Decision Making

Decisions based on data instead of instinct or heuristics

3
New cards

Data

Events captured by a business’ system

<p>Events captured by a business’ system</p>
4
New cards

Information

Organized and categorized data usually in a data warehouse

<p>Organized and categorized data usually in a data warehouse</p>
5
New cards

Knowledge

Patterns, context, and insights discovered from information

<p>Patterns, context, and insights discovered from information</p>
6
New cards

Wisdom

Application of knowledge to solve business problems

<p>Application of knowledge to solve business problems</p>
7
New cards

Data Mining

Sometimes referred to as exploratory data analysis due to analysis of large quantities of data using statistical, technical, and business knowledge

8
New cards

Knowledge

The goal of data mining

9
New cards

Data Science

A set of fundamental principles that guide the extraction of knowledge from data

10
New cards

Data Mining

The process of extracting knowledge from data, via technologies that incorporate data science principles

11
New cards

Hypothesis Testing

A general approach to data mining. For decisions that repeat, especially in a massive scale, to improve decision making accuracy.

Sense & Respond

12
New cards

Knowledge Discovery

A general approach to data mining. For decisions that discoveries need to be made from data.

Predict & Act

13
New cards

Mission

Why we exist

14
New cards

Vision

What we want to be

15
New cards

Values

What we believe in and how we will behave

16
New cards

Competitive Advantage

Traits or ways an organization has that keep them ahead of their competitors; these are traits that rivals desire.

17
New cards

Differentiation Advantage

A type of competitive advantage where customer’s perceived value is greater than its competitors

18
New cards

Low Cost Advantage

A type of competitive advantage where company’s profit margin is higher than its competitors

19
New cards

Value Chain

A series of activities in an organization’s operations that add value to its final product or service.

<p>A series of activities in an organization’s operations that add value to its final product or service.</p>
20
New cards

Inbound Logistics

A primary activity in the value chain. Receiving and storing of raw materials.

21
New cards

Operations

A primary activity in the value chain. Conversion of raw materials into finished goods.

22
New cards

Outbound Logistics

A primary activity in the value chain. Storing and distributing finished goods to customers.

23
New cards

Marketing & Sales

A primary activity in the value chain. Making the customers aware of the product or service and provides them with an opportunity to buy.

24
New cards

Service

A primary activity in the value chain. Activities after the point of sale (training, installation, support).

25
New cards

Firm Infrastructure

A supporting activity in the value chain. Organizes a firm for it to function.

26
New cards

Human Resource Management

A supporting activity in the value chain. Supports all concerns related to staffing.

27
New cards

IT

A supporting activity in the value chain. Supports the organization’s IT infrastructure.

28
New cards

Procurement

A supporting activity in the value chain. Supports purchasing of materials used for production.

29
New cards

Business Intelligence

Provides historical, current, and predictive views of the business

30
New cards

Corporate Strategy

An organization’s game plan. Consists of a goal, scope, and means.

31
New cards

Business Motivation Model

A framework used to define and align a company’s goals and strategy to its overall vision.

<p>A framework used to define and align a company’s goals and strategy to its overall vision.</p>
32
New cards

Strategy

What our competitive game plan will be. Consists of objective (ends), scope (domain), and advantage (means).

33
New cards

Objective

A basic element of a strategy statement that corresponds to ends

34
New cards

Scope

A basic element of a strategy statement that corresponds to domain

35
New cards

Advantage

A basic element of a strategy statement that corresponds to means

36
New cards

Balanced Scorecard

How we will implement and monitor the game plan

<p>How we will implement and monitor the game plan</p>
37
New cards

Measures

A unit-specific term used to describe business objects or entities (e.g. 1M USD in revenue, 1000 monthly sales)

38
New cards

Metrics

A quantifiable measurement used to track and assess the performance of a business process (e.g. 10% increase in month on month sales, 5% increase in subscriber growth)

39
New cards

KPI

A subset of metrics used to measure how well a business is achieving its goals (e.g. after migrating from on-premise services to cloud servers, there is a 30% cost reduction in IT operations)

40
New cards

Measure

Classify as measure, metric, or knowledge: 3,273 new postpaid subscribers for the month of January

41
New cards

Knowledge

Classify as measure, metric, or knowledge: 1% growth on new postpaid subscribers for the month of January as compared to January last year

42
New cards

Measure

Classify as measure, metric, or knowledge: 20,000 km distance traveled by a sedan

43
New cards

Measure

Classify as measure, metric, or knowledge: 51 degrees celsius average CPU temperature

44
New cards

Metric

Classify as measure, metric, or knowledge: 51 degrees celsius average CPU temperature when playing Valorant vs 42 degrees celsius average CPU temperature when idle

45
New cards

Data Strategy

A subset of the corporate strategy that is specifically focused on data

46
New cards

Data Strategy

Creates the necessary alignment across the organization to create more value from data and ideally leads to improved decision-making and operational efficiency

47
New cards

Business Strategy

Influences on Data Strategy

  • What are your goals for this strategy?

  • What is your scope?

  • What are your advantages or means for this strategy to materialize?

<p><strong>Influences on Data Strategy</strong></p><ul><li><p>What are your goals for this strategy?</p></li><li><p>What is your scope?</p></li><li><p>What are your advantages or means for this strategy to materialize?</p></li></ul><p></p>
48
New cards

Business Model

Influences on Data Strategy

  • How does your company operate?

  • How do you earn (or what are your different revenue streams?)

  • How can your customers purchase your products?

  • What’s your value proposition?

  • What’s your cost structure?

<p><strong>Influences on Data Strategy</strong></p><ul><li><p>How does your company operate?</p></li><li><p>How do you earn (or what are your different revenue streams?)</p></li><li><p>How can your customers purchase your products?</p></li><li><p>What’s your value proposition?</p></li><li><p>What’s your cost structure?</p></li></ul><p></p>
49
New cards

Data Availability/Inventory

Influences on Data Strategy

  • What data is available?

  • Which activities in the value chain produce data?

  • Which data are relevant to the data strategy?

<p><strong>Influences on Data Strategy</strong></p><ul><li><p>What data is available?</p></li><li><p>Which activities in the value chain produce data?</p></li><li><p>Which data are relevant to the data strategy?</p></li></ul><p></p>
50
New cards

Data Literacy

Influences on Data Strategy

  • How well do your business functions understand data?

  • How well do your business functions understand statistical/mathematical principles?

  • How well do your business functions understand YOUR data?

<p><strong>Influences on Data Strategy</strong></p><ul><li><p>How well do your business functions understand data?</p></li><li><p>How well do your business functions understand statistical/mathematical principles?</p></li><li><p>How well do your business functions understand YOUR data?</p></li></ul><p></p>
51
New cards

Analytical Maturity

Influences on Data Strategy

  • Where are you in the (business intelligence) curve?

<p><strong>Influences on Data Strategy</strong></p><ul><li><p>Where are you in the (business intelligence) curve?</p></li></ul><p></p>
52
New cards

Data Operations

Influences on Data Strategy

  • Do you have a proper data platform?

  • Can your IT infrastructure support your data platform?

  • How mature is your data platform?

<p><strong>Influences on Data Strategy</strong></p><ul><li><p>Do you have a proper data platform?</p></li><li><p>Can your IT infrastructure support your data platform?</p></li><li><p>How mature is your data platform?</p></li></ul><p></p>
53
New cards

Business Understanding

The first phase of formulating a data strategy

<p>The <strong>first phase</strong> of formulating a <strong>data strategy</strong></p>
54
New cards

Data Understanding

The second phase of formulating a data strategy

<p>The <strong>second phase</strong> of formulating a <strong>data strategy</strong></p>
55
New cards

Determine Business Objectives

The first activity in the business understanding phase of formulating a data strategy

Outputs

  • Background

  • Business Objectives

  • Business Success Criteria

56
New cards

Assess Situations

The second activity in the business understanding phase of formulating a data strategy

Outputs

  • Inventory of Resources

  • Requirements, Assumptions, and Constraints

  • Risk and Contingencies

  • Terminology

  • Cost and Benefit

57
New cards

Determine Goals

The third activity in the business understanding phase of formulating a data strategy

Outputs

  • Data Science Goals

  • Data Science Success Criteria

58
New cards

Produce Project Plan

The fourth activity in the business understanding phase of formulating a data strategy

Outputs

  • Project Plan

  • Initial Assessment of Tools and Techniques

<p>The <strong>fourth activity</strong> in the <strong>business understanding</strong> phase of formulating a data strategy</p><p></p><p><strong>Outputs</strong></p><ul><li><p>Project Plan</p></li><li><p>Initial Assessment of Tools and Techniques</p></li></ul><p></p>
59
New cards

Collect Initial Data

The first activity in the data understanding phase of formulating a data strategy

Outputs

  • Datasets acquired

  • Data sources

  • Methods used to acquire the datasets

  • Any problems encountered and solutions (if existing)

60
New cards

Describe Data

The second activity in the data understanding phase of formulating a data strategy

Outputs

  • Format of the data

  • Quantity of data (no. of records and fields)

  • Identities of the fields

  • Any other surface features of the data that have been discovered

61
New cards

Explore Data

The third activity in the data understanding phase of formulating a data strategy

Outputs

  • First findings or initial hypothesis and their impact on the remainder of the project

  • If appropriate, include graph and plots

62
New cards

Verify Data Quality

The fourth activity in the data understanding phase of formulating a data strategy

Outputs

  • List of results of the data quality verifications

  • If quality problems exist, list possible solutions

63
New cards

Data Management

The development, execution, and supervision of plans, policies, programs, and practices that deliver, control, protect, and enhance the value of data and information assets throughout their life cycles

64
New cards

Data Governance and Stewardship

A knowledge area of data management that corresponds to the exercise of authority, control, and shared decision-making (planning, monitoring, and enforcement) over the management of data assets.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to the <strong>exercise of authority</strong>, control, and shared decision-making (planning, monitoring, and enforcement) over the management of data assets.</p>
65
New cards

Data Architecture

A knowledge area of data management that corresponds to identifying the needs of the enterprise (regardless of the structure) and maintaining the master blueprints to meet those needs.

Using master blueprints to guide data integration, control data assets, and align data investments with business strategy.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to <strong>identifying</strong> the <strong>needs</strong> of the enterprise (regardless of the structure) and <strong>maintaining</strong> the <strong>master blueprints</strong> to meet those needs.</p><p>Using master blueprints to guide data integration, control data assets, and align data investments with business strategy.</p>
66
New cards

Data Modeling and Design

A knowledge area of data management that corresponds to the process of discovering, analyzing, and scoping data requirements, and then representing and communicating these data requirements in a precise form.

This process is iterative and involves conceptual, logical, and physical models.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to the process of discovering, analyzing, and scoping <strong>data requirements</strong>, and then <strong>representing</strong> and communicating these data requirements in a <strong>precise form</strong>.</p><p>This process is iterative and involves conceptual, logical, and physical models.</p>
67
New cards

Data Storage and Operations

A knowledge area of data management that corresponds to the design, implementation, and support of stored data to maximize its value.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to the design, implementation, and support of <strong>stored data</strong> to maximize its value.</p>
68
New cards

Data Security

A knowledge area of data management that corresponds to the planning, development, and execution of security policies and procedures to provide proper authentication, authorization, access, and auditing of data and information assets within cultural and regulatory consideration.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to the planning, development, and execution of <strong>security policies</strong> and <strong>procedures</strong> to provide proper authentication, authorization, access, and auditing of data and information assets within cultural and regulatory consideration.</p>
69
New cards

Data Integration and Interoperability

A knowledge area of data management. __________ is the movement and consolidation of data within and between data stores, applications, and organizations, while __________ is the ability for multiple systems to communicate.

<p>A <strong>knowledge area</strong> of <strong>data management</strong>. __________ is the movement and <strong>consolidation of data</strong> within and between data stores, applications, and organizations, while __________ is the ability for multiple systems to communicate.</p>
70
New cards

Document and Content Management

A knowledge area of data management that corresponds to controlling the capture, storage, access, and use of data and information stored predominantly relational databases.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to <strong>controlling</strong> the capture, storage, access, and use of <strong>data</strong> and <strong>information </strong>stored predominantly relational databases.</p>
71
New cards

Reference and Master Data

A knowledge area of data management that corresponds to managing reconciled and integrated data through stewardship and semantic consistency in support of enterprise-wide needs to share its data assets.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to <strong>managing</strong> reconciled and integrated data through <strong>stewardship</strong> and <strong>semantic consistency</strong> in support of enterprise-wide needs to share its data assets.</p>
72
New cards

Data Warehousing and Business Intelligence

A knowledge area of data management that corresponds to planning, implementation, and managing an integrated data system to support knowledge workers engaged in reporting, query, and analysis.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to planning, implementation, and managing an <strong>integrated data system</strong> to support knowledge workers engaged in reporting, query, and analysis.</p>
73
New cards

Metadata Management

A knowledge area of data management that corresponds to planning, implementation, and control of activities that contribute to the ability to process, maintain, integrate, secure, audit and govern other data.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to planning, implementation, and control of activities that contribute to the ability to process, maintain, integrate, secure, audit and govern <strong>other data</strong>.</p>
74
New cards

Data Quality Management

A knowledge area of data management that corresponds to the planning, implementation, and control of activities that apply techniques for collecting and handing data ensuring it addresses the needs of the enterprise and local consumer is fit for use.

<p>A <strong>knowledge area</strong> of <strong>data management </strong>that corresponds to the planning, implementation, and control of activities that apply techniques for collecting and handing data <strong>ensuring</strong> it addresses the needs of the enterprise and local consumer is <strong>fit for use</strong>.</p>
75
New cards

Data Lifecycle

Outlines the stages that a particular set of data goes through in analytics projects

76
New cards

Generation

A stage in the data lifecycle that comes in two forms: digital and analog data. Samples include sales transactions, promo subscriptions, cash in and cash out, stock in and stock out, and sensor generated data.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that comes in two forms: <strong>digital</strong> and <strong>analog data</strong>. Samples include sales transactions, promo subscriptions, cash in and cash out, stock in and stock out, and sensor generated data.</p>
77
New cards

Collection

A stage in the data lifecycle that corresponds to the process of retrieving data from data sources and putting them into a single data platform. This is the “E” in ETL. Can be automated or manual.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that corresponds to the process of <strong>retrieving data</strong> from data sources and putting them into a single data platform. This is the <strong>“E”</strong> in ETL. Can be <strong>automated</strong> or <strong>manual</strong>.</p>
78
New cards

Processing

When data is collected, they may not be in the format that you need them to be. A stage in the data lifecycle that corresponds to the “T” in ETL. Sample activities include data cleansing, data transformations, computations, and data aggregations.

<p>When data is collected, they may not be in the format that you need them to be. A <strong>stage</strong> in the <strong>data lifecycle</strong> that corresponds to the <strong>“T”</strong> in ETL. Sample activities include data cleansing, data transformations, computations, and data aggregations.</p>
79
New cards

Storage

A stage in the data lifecycle that corresponds to where data is loaded into the data platform infrastructure. This is the “L” in ETL.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that corresponds to where data is loaded into the data platform infrastructure. This is the <strong>“L”</strong> in ETL.</p>
80
New cards

Management

A stage in the data lifecycle that corresponds to an ongoing process that involves, organizing, storing, and retrieving data as necessary over the life of a data project. Also includes encryption, access control, and other related functions that support the operations of the data platform.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that corresponds to an <strong>ongoing process</strong> that involves, organizing, storing, and retrieving data as necessary over the life of a data project. Also includes encryption, access control, and other related functions that support the operations of the data platform.</p>
81
New cards

Analysis

A stage in the data lifecycle that corresponds to the processes that attempt to glean meaningful insights from raw data with the appropriate tools and strategies. Requires a question/challenge. Sample activities include statistical modeling, ad hoc analytics, artificial intelligence, data mining, and machine learning.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that corresponds to the processes that attempt to glean meaningful <strong>insights</strong> from raw data with the appropriate tools and strategies. Requires a question/challenge. Sample activities include statistical modeling, ad hoc analytics, artificial intelligence, data mining, and machine learning.</p>
82
New cards

Visualization

A stage in the data lifecycle that corresponds to the process of creating graphical representations of your information or knowledge to quickly communicate your analysis. Should take audience and data story into account.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that corresponds to the process of creating <strong>graphical representations</strong> of your information or knowledge to quickly communicate your analysis. Should take audience and data story into account.</p>
83
New cards

Interpretation

A stage in the data lifecycle that makes sense of your analysis and visualization and investigates it through the lens of your expertise and understanding (domain knowledge).

May include a description or explanation of what the data shows and the implications.

<p>A <strong>stage</strong> in the <strong>data lifecycle</strong> that makes sense of your analysis and visualization and investigates it through the lens of your expertise and understanding (domain knowledge).</p><p>May include a description or explanation of what the data shows and the implications.</p>
84
New cards

Data Pipelines

Support the Collection, Processing, and Storage stages of the Data Lifecycle. They are microservices that perform different tasks to prepare the data for analytics.

85
New cards

Data Sources

A component of the data pipeline that corresponds to the entry points for data into the pipeline, including applications and devices

<p>A component of the data pipeline that corresponds to the entry points for data into the pipeline, including applications and devices</p>
86
New cards

Transformations

A component of the data pipeline that corresponds to the operations that modify data to meet analysis requirements.

<p>A component of the data pipeline that corresponds to the operations that modify data to meet analysis requirements.</p>
87
New cards

Dependencies

A component of the data pipeline that corresponds to the factors that determine the timing and progression of data processing.

<p>A component of the data pipeline that corresponds to the factors that determine the timing and progression of data processing.</p>
88
New cards

Destinations

A component of the data pipeline that corresponds to the final endpoints where data is stored or analyzed.

<p>A component of the data pipeline that corresponds to the final endpoints where data is stored or analyzed.</p>
89
New cards

ETL (Extract, Transform, Load)

A special type of data pipeline that moves the data from a raw data source, transforms it to meet specific requirements, and loads it to a data storage. Not all data pipelines follow this sequence.

90
New cards

Batch

Data pipelines that are run on a specific schedule, and on specific intervals

91
New cards

Streaming

Data pipelines that are continuously running and ingesting data from one place to another

92
New cards

Cloud-Native

Data pipelines that use cloud managed services in performing their tasks. This can be batch or streaming

93
New cards

Batch

What kind of data pipeline operation is an API call?

94
New cards

Streaming

What kind of data pipeline operation is sensor generated data?

95
New cards

Streaming

What kind of data pipeline operation is data generated by IOT devices?

96
New cards

Data Analytics

Refers to the fundamental principles that guide analytics on business data. Commonly used interchangeably with data mining.

97
New cards

Data Engineering

The extraction of data from different sources

98
New cards

CRISP-DM (CRoss Industry Standard Process for Data Mining)

Provides a framework to structure our thinking about data analytics problems.

It is similar to the process on how data strategy is structured. The difference is the goal of each.

<p>Provides a framework to structure our thinking about data analytics problems.</p><p>It is similar to the process on how data strategy is structured. The difference is the goal of each.</p>
99
New cards

Business Understanding

Identify the CRISP-DM phase of this scenario: A game development company is looking to improve player retention in their mobile game. They need to identify the factors that contribute to player churn and create a model that will help predict when players are likely to stop playing. They meet with stakeholders to understand key business objectives, like increasing lifetime value and engagement.

<p><strong>Identify the CRISP-DM phase of this scenario</strong>: <span>A game development company is looking to improve player retention in their mobile game. They need to identify the factors that contribute to player churn and create a model that will help predict when players are likely to stop playing. They meet with stakeholders to understand key business objectives, like increasing lifetime value and engagement.</span></p>
100
New cards

Data Understanding

Identify the CRISP-DM phase of this scenario: The game development team collects data on player behaviors, such as in-game purchases, session times, and level completion rates. They examine the raw data to identify trends, outliers, and patterns, such as whether players who reach higher levels tend to stay longer or if certain in-game events correlate with churn.

<p><strong>Identify the CRISP-DM phase of this scenario</strong>: <span>The game development team collects data on player behaviors, such as in-game purchases, session times, and level completion rates. They examine the raw data to identify trends, outliers, and patterns, such as whether players who reach higher levels tend to stay longer or if certain in-game events correlate with churn.</span></p>