1/285
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Which of the following is the best reason to use an extract instead of a live connection?
a. Your data source only supports a live connection via ODBC.
b. You need the freshest possible data at all times.
c. You need to apply an aggregation that takes too long when using a live connection.
d. You need to join tables that are in the data source.
c. You need to apply an aggregation that takes too long when using a live connection.
You created a group by selecting field labels in a view. How can you remove members from the group?
a. In the view, right-click the group members you want to remove and select Exclude.
b. In the Data pane, right-click the group and select Edit Group.
c. In the view, right-click the group members you want to remove and select Format.
d. On a color legend, right-click a member you want to remove and select Format legends.
b. In the Data pane, right-click the group and select Edit Group.
Which of the following can you use to create a calculated field that returns data independent of the data granularity in a view?
a. An INCLUDE LOD calculation
b. A table calculation
c. A basic calculation
d. A FIXED LOD calculation
d. A FIXED LOD calculation
4. Interactive elements that you can add to a dashboard for users include ______. (Select all that apply.)
a. URL actions
b. edit tooltip options
c. filter actions
d. hide and unhide all sheet options
a. URL actions
c. filter actions
A field that shows average home values for the United States in 2016 is most likely:
a. A discrete date part dimension
b. A continuous date value dimension
c. A geographical dimension
d. An aggregated measure
d. An aggregated measure
A Reference Band cannot be based on two fixed points.
False
True
False
A Reference Distribution plot cannot be along a continuous axis.
True
False
False, A Reference Distribution plot can be along a continuous axis.
Which of the following is not a Trend Line model
Linear Trend Line
Exponential Trend Line
Binomial Trend Line
Logarithmic Trend Line
Binomial Trend Line
The aggregation function attr() returns a * when __________________.
There is a single value for all rows in the group.
It is a null value.
There are more than one value in all rows in the group.
The data is not present at the desired level..
There are more than one value in all rows in the group.
Is it possible to deploy a URL action on a dashboard object to open a Web Page within a dashboard rather than opening the system's web browser?
True, with the use of Tableau Server
True, with the use of a Web Page object
False, not possible
True, requires a plug-in
True, with the use of a Web Page object
The Highlighting action can be disabled for the entire workbook.
True
False
True. From the toolbar the Highlighting action can be disabled for the entire workbook.
A sheet cannot be used within a story directly. Either sheets should be used within a dashboard, or a dashboard should be used within a story.
True
False
False. A sheet can be used within a story directly.
How do you identify a continuous field in Tableau?
It is identified by a blue pill in the visualization.
It is identified by a green pill in a visualization.
It is preceded by a # symbol in the data window.
When added to the visualization, it produces distinct values.
It is identified by a green pill in a visualization.
Is it possible to use measures in the same view multiple times (e.g. SUM of the measure and AVG of the measure)?
No
Yes
Yes, measures can be used multiple times in the same view.
Bins can only be created on:
Measures
Dimensions
Measures, bins can only be created on Measures
For creating variable size bins we use _____________.
Sets
Groups
Calculated fields
Table Calculations
Calculated fields with a logical statement "
Sets can only be created on:
Measures
Dimensions
Dimensions, sets can only be created on Dimensions
A good reason to use a bullet graph.
Analyzing the trend for a time period
Comparing the actual against the target sales
Adding data to bins and calculating count measure
Displaying the sales growth for a particular year
Comparing the actual against the target sales
Trend Lines can only be used with numeric or date fields.
True
False
True. Trend lines can only be used with numeric or date fields.
Disaggregation returns all records in the underlying data source.
True
False
True, Disaggregation returns all records in the underlying data source.
By definition, Tableau displays measures over time as a ____________.
Bar
Line
Histogram
Scatter Plots
Line, By definition, Tableau displays measures over time as a Lines.
The icon associated with the field that has been grouped is a ______________.
Paper Clip
Set
Hash
Equal To
Paper Clip
What is the profitable moving average in November 2013, including four months prior and four months after?
$8,553
$8,256
$8,441
$7,501
$8,441.
The best trend model for your view would be the one with?
R-Squared value closest to 1
P-Value more than 1
R-Squared value greater than 1
R-Squared value equal to P-Value
R-Squared value closest to 1 is the best trend model for a view.
How do you differentiate dimension and measure on Tableau?
Dimension is categorical and Measure is numerical
A Reference Line cannot be added from the Analytics pane.
True
False
False, A reference line can be added from the analytics pane.
It is possible to change the geographic roles of a dimension.
True
False
True, It is possible to change the geographic role of a dimension.
Groups can be used in a calculated field.
TRUE
FALSE
True, Groups can also be used in a calculated field.
From Tableau 10.x onwards
The highlight action in a dashboard is similar to filtering action in a worksheet.
TRUE
FALSE
False, The highlighting action in dashboard will highlight the selection and related data in other view in a dashboard. It won't filter the selection.
The default join type in case of Blended data sources is?
Cross Join
Inner Join
Left outer Join
Full outer Join
Left outer Join is the default join type in case of Blended data sources. The primary dataset is considered to be the left table.
What is a cross-tab?
a text table or a table of numbers
When do you use an extract connection and when do you use a live connection?
Extracts are faster, especially in more complex visualizations with large data sets, filters, calculations. These extracts are snapshots of data optimized for aggregation and loaded into system memory to be quickly recalled for visualization. This is for if hospitals need weekly/monthly trends.
Live connection are real time updates. This relies on the database for all queries and not always optimized for fast performance. Your data queries are only as fast as the database itself. This is for if hospitals need real-time updates.
What are scatter plots best used for?
Visualizing relationships between numerical variables.
What are scatter plots best used for?
Visualizing relationships between numerical variables.
What are scatter plots best used for?
Visualizing relationships between numerical variables.
Data warehouses are subsets of data marts.
false
One way an operational data store differs from a data warehouse is the recency of their data.
true
Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.
false
Without middleware, different BI programs cannot easily connect to the data warehouse
true
Moving the data into a data warehouse is usually the easiest part of its creation.
false
With key performance indicators, driver KPIs have a significant effect on outcome KPIs, but the reverse is not necessarily true.
true
OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.
false
Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure
false
Why is a performance management system superior to a performance measurement system?
A) because performance measurement systems are only in their infancy
B) because measurement automatically leads to problem solution
C) because performance management systems cost more
D) because measurement alone has little use without action
d
Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are
A) subject-oriented and nonvolatile.
B) product-oriented and nonvolatile.
C) product-oriented and volatile.
D) subject-oriented and volatile.
a
Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?
A) sectional data mart
B) public data mart
C) independent data mart
D) volatile data mart
c
Oper marts are created when operational data needs to be analyzed
A) linearly.
B) in a dashboard.
C) unidimensionally.
D) multidimensionally.
d
Which approach to data warehouse integration focuses more on sharing process functionality than data across systems?
A) extraction, transformation, and load
B) enterprise application integration
C) enterprise information integration
D) enterprise function integration
b
In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected?
A) transformation
B) extraction
C) load
D) cleanse
d
Data warehouses provide direct and indirect benefits to organizations. Which of the following is an indirect benefit of data warehouses?
A) better and more timely information
B) extensive new analyses performed by users
C) simplified access to data
D) improved customer service
d
All of the following are benefits of hosted data warehouses EXCEPT
A) smaller upfront investment.
B) better quality hardware.
C) greater control of data.
D) frees up in-house systems.
c
When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure?
A) star schema
B) snowflake schema
C) relational schema
D) dimensional schema
a
Real-time data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is
A) country of (data) origin.
B) nature of the data.
C) speed of data transfer.
D) source of the data.
c
A large storage location that can hold vast quantities of data (mostly unstructured) in its native/raw format for future/potential analytics consumption is referred to as a(n)
A) extended ASP.
B) data cloud.
C) data lake.
D) relational database.
c
All of the following are true about in-database processing technology EXCEPT
A) it pushes the algorithms to where the data is.
B) it makes the response to queries much faster than conventional databases.
C) it is often used for apps like credit card fraud detection and investment risk management.
D) it is the same as in-memory storage technology.
d
The three main types of data warehouses are data marts, operational ________, and enterprise data warehouses.
data stores
Most data warehouses are built using ________ database management systems to control and manage the data.
relational
Given that the size of data warehouses is expanding at an exponential rate, ________ is an important issue.
scalability
Online ________ is a term used for a transaction system that is primarily responsible for capturing and storing data related to day-to-day business functions such as ERP, CRM, SCM, and point of sale.
transaction processing
________ modeling is a retrieval-based system that supports high-volume query access.
dimensional
Performing extensive ________ to move data to the data warehouse may be a sign of poorly managed data and a fundamental lack of a coherent data management strategy.
extraction, transformation, and load (ETL)
What is the definition of a data warehouse (DW) in simple terms?
a pool of data produced to support decision making; it is also a repository of current and historical data of potential interest to managers throughout the organization.
What is the definition of a data mart?
a subset of a data warehouse, typically consisting of a single subject area (e.g., marketing, operations). Whereas a data warehouse combines databases across an entire enterprise, a data mart is usually smaller and focuses on a particular subject or department.
T/F: Subject oriented databases for data warehousing are organized by detailed subjects such as disk drives, computers, and networks
False:
Page Ref: 42 Subject oriented databases for data warehousing are organized by detailed subjects such as sales, products, or customers, containing only information relevant for decision support.
Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are
a.) subject-oriented and nonvolatile.
b.) product-oriented and nonvolatile.
c.) product-oriented and volatile.
d.) subject-oriented and volatile.
a.) subject-oriented and nonvolatile.
Page Ref: 40 The characteristics of data warehouses are subject oriented, integrated, time variant and nonvolatile.
T/F: Data warehouses are subsets of data marts.
False
Page Ref: 43 Data mart is a subset of a data warehouse, typically consisting of a single subject area.
T/F: One way an operational data store differs from a data warehouse is the recency of their data.
True
Page Ref: 43-44 An Operational Data Store (ODS) provides a fairly recent form of customer information file (CIF).
Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is:
a.) country of (data) origin.
b.) nature of the data.
c.) speed of data transfer.
d.) source of the data.
c.) speed of data transfer.
Page Ref: 77 Real-time data warehousing (RDW), also known as active data warehousing (ADW), is the process of loading and providing data via the data warehouse as they become available.
T/F: Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.
False
Page Ref: 45-46 Metadata describe the structure of and some meaning about data, thereby contributing to their effective or ineffective use.
T/F: OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.
False
Page Ref: 70 OLTP systems focus routine, periodic, narrow reports.
T/F: Because of performance and data quality issues, most experts agree that the federated architecture should supplement data warehouses, not replace them.
True
Page Ref: 54 Because of performance and data quality issues, most experts agree that federated approaches work well to supplement data warehouses, not replace them.
T/F: OLTP databases are optimized for output (querying/asking questions of the data) and data warehouses are optimized for input (getting new or updated data into the database).
False
T/F: Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure.
False
Page Ref: 82 DWA should be familiar with high-performance software, hardware, and networking technologies.
In the Isle of Capri case, the only capability added by the new software was increased processing speed of processing reports. (T/F)
F
Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure. (T/F)
F
The "single version of the truth" embodied in a data warehouse such as Capri Casinos' means all of the following EXCEPT
A)
decision makers get to see the same results to queries.
B)
decision makers get to use more dependable data for their decisions.
C)
decision makers have unfettered access to all data in the warehouse.
D)
decision makers have the same data available to support their decisions.
decision makers have unfettered access to all data in the warehouse.
Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?
A)
independent data mart
B)
sectional data mart
C)
volatile data mart
D)
public data mart
independent data mart
A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a
A)
one tier architecture.
B)
two tier architecture.
C)
three tier architecture.
D)
four tier architecture.
three tier architecture.
Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses?
A)
independent data marts architecture
B)
hub-and-spoke data warehouse architecture
C)
centralized data warehouse architecture
D)
federated architecture
federated architecture
Which approach to data warehouse integration focuses more on sharing process functionality than data across systems?
A)
extraction, transformation, and load
B)
enterprise function integration
C)
enterprise information integration
D)
enterprise application integration
enterprise application integration
In which stage of extraction, transformation, and load (ETL) into a data warehouse are data aggregated?
A)
extraction
B)
cleanse
C)
transformation
D)
load
transformation
In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected?
A)
cleanse
B)
transformation
C)
load
D)
extraction
cleanse
Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses?
A)
better and more timely information
B)
extensive new analyses performed by users
C)
simplified access to data
D)
improved customer service
improved customer service
All of the following are benefits of hosted data warehouses EXCEPT
A)
greater control of data.
B)
smaller upfront investment.
C)
better quality hardware.
D)
frees up in-house systems.
greater control of data.
When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure?
A)
star schema
B)
snowflake schema
C)
relational schema
D)
dimensional schema
star schema
When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is
A)
slice.
B)
dice.
C)
drill down.
D)
roll-up.
drill down.
Which of the following online analytical processing (OLAP) technologies does NOT require the precomputation and storage of information?
A)
SQL
B)
HOLAP
C)
MOLAP
D)
ROLAP
ROLAP
Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is
A)
nature of the data.
B)
speed of data transfer.
C)
country of (data) origin.
D)
source of the data.
speed of data transfer.
Which of the following statements is more descriptive of active data warehouses in contrast with traditional data warehouses?
A)
restrictive reporting with daily and weekly data currency
B)
large numbers of users, including operational staffs
C)
detailed data available for strategic use only
D)
strategic decisions whose impacts are hard to measure
large numbers of users, including operational staffs
How does the use of cloud computing affect the scalability of a data warehouse?
A)
Cloud computing vendors bring as much hardware as needed to users' offices.
B)
Cloud vendors are mostly based overseas where the cost of labor is low.
C)
Cloud computing has little effect on a data warehouse's scalability.
D)
Hardware resources are dynamically allocated as use increases.
Hardware resources are dynamically allocated as use increases.
All of the following are true about in-database processing technology EXCEPT
A)
it is the same as in-memory storage technology.
B)
it is often used for apps like credit card fraud detection and investment risk management.
C)
it makes the response to queries much faster than conventional databases.
D)
it pushes the algorithms to where the data is.
it is the same as in-memory storage technology.
________ Analytics answers questions like "what happened" or "Why did it happen".
descriptive
The three main types of data warehouses are data marts, operational ________, and enterprise data warehouses.
data stores
How are descriptive analytics methods different from the other two types?
They answer the "what-is?" queries, not "what will be?" queries
In the Magpie Sensing case study, the automated collection of temperature and humidity data on shipped goods helped with various types of analytics. Which of the following is an example of prescriptive analytics?
optimal temperature setting
If a company's strategy is properly aligned with DW and BI initiatives, and if the company's IS organization can be made capable of playing its role in such a project, and if the requisite user community is in place and has the proper motivation, then
it is wise to start BI and establish a BI Competency Center (BICC) within a company
Big Data often involves a form of distributed storage and processing using Hadoop and MapReduce. One reason for this is
the processing power needed for the centralized model would overload a single computer
In answering the question "Which customers are likely to be using fake credit cards?" you are most likely to use which of the following analytic applications?
fraud detection
Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses?
federated architecture
Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?
independent data mart