ISDS 415 Final

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/142

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

143 Terms

1
New cards

A light bulb manufacturer uses descriptive analytics

to present supply chain to managers visually.

2
New cards

In the Opening Vignette on Sports Analytics, what was adjusted to drive one-time ticket sales?

ticket prices

3
New cards

In the Opening Vignette on Sports Analytics, what type of modeling was used to predict offensive tactics?

heat maps

4
New cards

Business applications have moved from transaction processing and monitoring to other activities. Which of the following is NOT one of those activities?

data monitoring

5
New cards

Which of the following is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies?

BI

6
New cards

The competitive imperatives for BI include all of the following EXCEPT

Right user

7
New cards

Online transaction processing (OLTP) systems handle a company's routine ongoing business. In contrast, a data warehouse is typically

a distinct system that provides storage for data that will be made use of in analysis.

8
New cards

The very design that makes an OLTP system efficient for transaction processing makes it inefficient for

end-user ad hoc reports, queries, and analysis.

9
New cards

How are enterprise resources planning (ERP) systems related to supply chain management (SCM) systems?

complementary systems

10
New cards

BI applications must be integrated with

legacy systems, enterprise systems, databases

11
New cards

What has caused the growth of the demand for instant, on-demand access to dispersed information?

the more pressing need to close the gap between the operational data and strategic objectives

12
New cards

What type of analytics seeks to recognize what is going on as well as the likely forecast and make decisions to achieve the best performance possible?

prescriptive

13
New cards

What type of analytics seeks to determine what is likely to happen in the future?

Predictive

14
New cards

Which of the following statements about Big Data is true?

Pure Big Data systems do not involve fault tolerance.

15
New cards

Big Data often involves a form of distributed storage and processing using Hadoop and MapReduce. One reason for this is

the processing power needed for the centralized model would overload a single computer.

16
New cards

Organizations using BI systems are typically seeking to _________ the gap between the operational data and strategic objectives has become more pressing.

Close

17
New cards

A(n) ____________ is a major component of a Business Intelligence (BI) system that holds source data.

Data Warehouse

18
New cards

A(n) ____________ is a major component of a Business Intelligence (BI) system that is often browser based and often presents a portal or dashboard.

User Interface

19
New cards

The user interface of a BI system is often referred to as a(n) ____________

Dashboard

20
New cards

The programing algorithm developed by Google to handle Big Data computational challenges is known as __________

MapReduce

21
New cards

Demands for instant, on-demand access to dispersed information decrease as firms successfully integrate BI into their operations.

False

22
New cards

How does Amazon.com use predictive analytics to respond to product searches by the customer?

They suggest related products to the user based on similar items that other people have searched for - association mining technique

23
New cards

Today, many vendors offer diversified tools, some of which are completely preprogrammed (called shells). How are these shells utilized?

All a user needs to do is insert the numbers.

24
New cards

Business intelligence (BI) is a specific term that describes architectures and tools only.

False

25
New cards

Data generation is a precursor, and is not included in the analytics ecosystem.

False

26
New cards

Which of the following developments is NOT contributing to facilitating growth of decision support and analytics?

locally concentrated workforces

27
New cards

Due to industry consolidation, the analytics ecosystem consists of only a handful of players across several functional areas.

False

28
New cards

Which characteristic of data means that all the required data elements are included in the data set?

data richness

29
New cards

Key performance indicators (KPIs) are metrics typically used to measure

Internal results

30
New cards

Which characteristic of data requires that the variables and data values be defined at the lowest (or as low as required) level of detail for the intended use of the data?

data granularity

31
New cards

Which type of visualization tool can be very helpful when a data set contains location data?

Geographic map

32
New cards

Which type of question does visual analytics seeks to answer?

Why is it happening?

33
New cards

When you tell a story in a presentation, all of the following are true EXCEPT

a well-told story should have no need for subsequent discussion.

34
New cards

Benefits of the latest visual analytics tools, such as SAS Visual Analytics, include all of the following EXCEPT

they explore massive amounts of data in hours, not days.

35
New cards

What is the management feature of a dashboard?

operational data that identify what actions to take to resolve a problem

36
New cards

What is the fundamental challenge of dashboard design?

ensuring that the required information is shown clearly on a single screen

37
New cards

This measure of central tendency is the sum of all the values/observations divided by the number of observations in the data set.

arithmetic mean

38
New cards

This plot is a graphical illustration of several descriptive statistics about a given data set.

box-and-whiskers plot

39
New cards

Due to the ____________ expansion of information technology coupled with the need for improved competitiveness in business, there has been an increase in the use of computing power to produce unified reports that join different views of the enterprise in one place.

rapid

40
New cards

Data is the main ingredient for any BI, data science, and business analytics initiative.

true

41
New cards

The data storage component of a business reporting system builds the various reports and hosts them for, or disseminates them to users. It also provides notification, annotation, collaboration, and other services.

false

42
New cards

There are basic chart types and specialized chart types. A Gantt chart is a specialized chart type.

true

43
New cards

Visualization differs from traditional charts and graphs in complexity of data sets and use of multiple dimensions and measures.

true

44
New cards

Visual analytics is aimed at answering, "What is it happening?" and is usually associated with business analytics.

false

45
New cards

Dashboards provide visual displays of important information that is consolidated and arranged across several screens to maintain data order.

false

46
New cards

Data source reliability means that data are correct and are a good match for the analytics problem.

false

47
New cards

Data accessibility means that the data are easily and readily obtainable.

true

48
New cards

Structured data is what data mining algorithms use and can be classified as categorical or numeric.

true

49
New cards

Interval data are variables that can be measured on interval scales.

true

50
New cards

Nominal data represent the labels of multiple classes used to divide a variable into specific groups.

false

51
New cards

Descriptive statistics is all about describing the sample data on hand.

true

52
New cards

Describe the difference between simple and multiple regression.

A simple regression only contains 1 independent variable and 1 dependent variable. A multiple regression model contains 1 independent variable and multiple dependent variables.

53
New cards

This measure of dispersion is calculated by simply taking the square root of the variations.

standard deviation

54
New cards

Which type of visualization tool can be very helpful when the intention is to show relative proportions of dollars per department allocated by a university administration?

pie chart

55
New cards

Regression Models of _________________ data focus on predicting the future

time series

56
New cards

A regression model that involves a single independent variable is called

simple regression

57
New cards

If using a mining analogy, "knowledge mining" would be a more appropriate term than "data mining."

true

58
New cards

Open-source data mining tools include applications such as IBM SPSS Modeler and Dell Statistica.

false

59
New cards

All of the following statements about data mining are true EXCEPT

the process aspect means that data mining should be a one-step process to results.

60
New cards

What is the main reason parallel processing is sometimes used for data mining?

because of the massive data amounts and search efforts involved

61
New cards

A data mining study is specific to addressing a well-defined business task, and different business tasks require

different sets of data.

62
New cards

Which broad area of data mining applications analyzes data, forming rules to distinguish between defined classes?

classification

63
New cards

Which broad area of data mining applications partitions a collection of objects into natural groupings with similar features?

clustering

64
New cards

Identifying and preventing incorrect claim payments and fraudulent activities falls under which type of data mining applications?

insurance

65
New cards

What does the robustness of a data mining method refer to?

its ability to overcome noisy data to make somewhat accurate predictions

66
New cards

What does the scalability of a data mining method refer to?

its ability to construct a prediction model efficiently given a large amount of data

67
New cards

In estimating the accuracy of data mining (or other) classification models, the true positive rate is

the ratio of correctly classified positives divided by the total positive count.

68
New cards

Which of the following is a data mining myth?

Data mining requires a separate, dedicated database.

69
New cards

The cost of data storage has plummeted recently, making data mining feasible for more firms.

true

70
New cards

Data mining can be very useful in detecting patterns such as credit card fraud, but is of little help in improving sales.

false

71
New cards

Data mining requires specialized data analysts to ask ad hoc questions and obtain answers quickly from the system.

false

72
New cards

Statistics and data mining both look for data sets that are as large as possible.

false

73
New cards

Using data mining on data about imports and exports can help to detect tax avoidance and money laundering.

true

74
New cards

When a problem has many attributes that impact the classification of different patterns, decision trees may be a useful approach.

true

75
New cards

Data that is collected, stored, and analyzed in data mining is often private and personal. There is no way to maintain individuals' privacy other than being very careful about physical data security.

false

76
New cards

In the Miami-Dade Police Department case study, predictive analytics helped to identify the best schedule for officers in order to pay the least overtime.

false

77
New cards

Sentiment analysis projects require a lexicon for use. If a project in English is undertaken, you must generally make sure to

use an English lexicon appropriate to the project at your discretion.

78
New cards

What does advanced analytics for social media do?

It examines the content of online conversations.

79
New cards

_________ statistics help you understand whether your specific marketing objective for a Web page is being achieved.

Conversion

80
New cards

In the Tito's Vodka case, it was important that social media users all had a(n) __________ brand experience.

Consistent

81
New cards

___________ is a connections metric for social networks that measures the ties that actors in a network have with others that are geographically close.

Propinquity

82
New cards

___________ is a segmentation metric for social networks that measures the strength of the bonds between actors in a social network.

cohesion

83
New cards

In sentiment analysis, sentiment suggests a transient, temporary opinion reflective of one's feelings.

false

84
New cards

Current use of sentiment analysis in voice of the customer applications allows companies to change their products or services in real time in response to customer sentiment.

true

85
New cards

In sentiment analysis, it is hard to classify some subjects such as news as good or bad, but easier to classify others, e.g., movie reviews, in the same way.

true

86
New cards

Consistent high quality, higher publishing frequency, and longer time lag are all attributes of industrial publishing when compared to Web publishing.

false

87
New cards

In the evolution of social media user engagement, the largest recent change is the growth of creators.

false

88
New cards

Descriptive analytics for social media feature such items as your followers as well as the content in online conversations that help you to identify themes and sentiments.

false

89
New cards

Companies understand that when their product goes "viral," the content of the online conversations about their product does not matter, only the volume of conversations.

false

90
New cards

Which of the following is NOT a characteristic displayed by a LP allocation problem?

The problem is not bound by constraints.

91
New cards

Which of the following is NOT a characteristic displayed by a LP allocation problem?

There is a single way in which the resources can be used.

92
New cards

Which of the following is NOT an assumption used by a LP allocation problem?

All data are unknown with decision making under uncertainty.

93
New cards

The most common method for solving a risk analysis problem is to select the alternative with the

greatest expected value

94
New cards

Every LP model is composed of __________ variables whose values are unknown and are searched for.

decision

95
New cards

_________ analysis attempts to assess the impact of a change in the input data or parameters on the proposed solution.

Sensitivity

96
New cards

_________ analysis is structured as "What will happen to the solution if an input variable, an assumption, or a parameter value is changed?"

What if

97
New cards

Spreadsheets include all possible tools needed to deploy a custom DSS.

false

98
New cards

Spreadsheets are clearly the most popular developer modeling tool.

false

99
New cards

Every LP model has some internal intermediate variables that are not explicitly stated.

true

100
New cards

GE produces 5 different sizes of TVs using 2 machines. Different sizes of TVs take different amount of time to get processed at each of the two machines, and bring in different amount of profit for the company per unit. GE has a limited amount of time to process TVs during each month but would like to find out how many of each size of TV to produce each month to maximize its total profit.

How many decision variables are needed to properly formulate this problem?

5