ISDS 415 Final Review

86 Terms

1

Nathan wants to fill in the words "Call" and "Don't Call" on the spreadsheet based on the value in column C. He will list "Call" if the value is FALSE and "Don't Call" if the value is TRUE. What is the formula Nathan could write?

=IF(C5, "Don't Call", "Call")

2

The __________________ function pairs each element of the first array with its counterpart in the second array, multiplies the elements of the pairs together, and adds the results.

SUMPRODUCT
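
As a rough illustration of the pairing, multiplying, and adding described above, here is a minimal Python sketch. The `sumproduct` helper and the sample `units` and `prices` lists are hypothetical; they only mirror the spreadsheet function's behavior for two equal-length arrays.

```python
def sumproduct(first, second):
    """Multiply paired elements of two equal-length sequences and add the results."""
    if len(first) != len(second):
        raise ValueError("arrays must have the same length")
    return sum(a * b for a, b in zip(first, second))


# Hypothetical data: units sold and price per unit.
units = [2, 5, 3]
prices = [10.0, 4.0, 1.5]

# 2*10.0 + 5*4.0 + 3*1.5
total = sumproduct(units, prices)
print(total)
```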

3

To perform the task of displaying "credit approved" or "credit denied" based on the corresponding Boolean value in column H, you can write a formula in cell I3 containing an IF function, as follows:

=IF(H3, "credit approved", "credit denied")

4

Within a given range of cells, the number of times a particular condition is satisfied is computed by using the ____________ function.

COUNTIF

5

Julia wants to count the employees who are participating in different committees. Some employees are participating in more than one committee. The data is listed in column F of a worksheet. If she wants to count all employees who are participating in exactly one committee, a correct formula would be:

=COUNTIF(F3:F13,1)

6

Julia wants to count the employees who are participating in different committees. Some employees are participating in more than one committee. The data is listed in column F of a worksheet. If she wants to count all employees who are participating in more than one committee, a correct formula would be:

=COUNTIF(F3:F13,">1")
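
The two COUNTIF formulas above can be mimicked in Python as a quick check of the logic. This is only a sketch: the `countif` helper and the `committees` list (standing in for the 11 cells F3:F13) are made up for illustration.

```python
def countif(values, predicate):
    """Count how many values satisfy the given condition."""
    return sum(1 for v in values if predicate(v))


# Hypothetical contents of F3:F13: committees per employee.
committees = [1, 2, 1, 3, 0, 1, 2, 1, 1, 4, 1]

# Comparable to =COUNTIF(F3:F13,1)
exactly_one = countif(committees, lambda n: n == 1)

# Comparable to =COUNTIF(F3:F13,">1")
more_than_one = countif(committees, lambda n: n > 1)

print(exactly_one, more_than_one)
```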

7

When you write a VLOOKUP formula, you indicate the value you want to look up in a reference ____________.

table

8

In a PMT function, the argument referring to the number of compounding periods is called

nper

9

In a PMT function, the argument referring to the original principal value at the beginning of the financial transaction is called

pv

10

In a PMT function, the argument referring to the interest rate per compounding period is called

rate

11

A bank account would have an fv equal to the ____________ plus any accrued interest, plus or minus any payments into or out of the account.

pv

12

When you write a VLOOKUP formula, you indicate the value you want to look up in a table.

True

13

The syntax of the IF function is =IF(logical_test, value_if_true, value_if_false).

True

14

In a VLOOKUP function, the argument ____________ refers to the type of lookup you want to perform—TRUE or FALSE.

range_lookup
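
To see what the TRUE/FALSE choice changes, here is a small Python sketch of the two lookup types. It is an approximation, not Excel's implementation: the `vlookup` helper and the `grades` table are hypothetical, `None` stands in for Excel's #N/A, and the approximate branch assumes the first column is sorted in ascending order.

```python
from bisect import bisect_right


def vlookup(lookup_value, table, col_index, range_lookup=True):
    """Look up a value in the first column of table; col_index is 1-based."""
    keys = [row[0] for row in table]
    if range_lookup:
        # Approximate match: largest key that is <= lookup_value.
        pos = bisect_right(keys, lookup_value) - 1
        if pos < 0:
            return None  # stands in for #N/A
        return table[pos][col_index - 1]
    # Exact match: the first row whose key equals lookup_value.
    for row in table:
        if row[0] == lookup_value:
            return row[col_index - 1]
    return None  # stands in for #N/A


# Hypothetical reference table: minimum score and letter grade.
grades = [(0, "F"), (60, "D"), (70, "C"), (80, "B"), (90, "A")]

approximate = vlookup(85, grades, 2, range_lookup=True)
exact = vlookup(85, grades, 2, range_lookup=False)
print(approximate, exact)
```

With this table, the approximate lookup of 85 falls back to the row for 80, while the exact lookup finds no row with a key of 85.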

15

The PMT function finds the value of the payment per period, assuming that there are constant payments and a constant interest rate for the duration of the loan.

True
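
The rate, nper, and pv arguments from the PMT cards can be tied together with the standard annuity formula. The sketch below assumes payments at the end of each period and a future value of zero; the `pmt` helper and the loan figures are illustrative, and the negative sign follows the spreadsheet convention that a payment is a cash outflow.

```python
def pmt(rate, nper, pv):
    """Constant payment per period for a loan of pv over nper periods."""
    if rate == 0:
        return -pv / nper
    return -(rate * pv) / (1 - (1 + rate) ** -nper)


# Hypothetical loan: 10,000 borrowed at 6% per year, repaid monthly over 3 years.
monthly_rate = 0.06 / 12   # rate: interest rate per compounding period
periods = 36               # nper: number of compounding periods
principal = 10_000         # pv: original principal

payment = pmt(monthly_rate, periods, principal)
print(round(payment, 2))
```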

16

The syntax of the COUNTIF function is

=COUNTIF(range, criteria)

17

Categorization and clustering of documents during text mining differ only in the pre-selection of categories.

True

18

In this step, all text data points in the document are aggregated and converted to a single sentiment measure for the whole document.

Collection and Aggregation

19

The goal is to differentiate between a fact and an opinion, which may be viewed as classification of text as objective or subjective.

Sentiment Detection

20

The goal of this step is to accurately identify the target of the expressed sentiment.

Target Identification

21

The goal is to classify the opinion as falling under one of two opposing sentiment polarities, given an opinionated piece of text.

N-P Polarity Classification

22

In the research literature case study, the researchers analyzing academic papers extracted information from which source?

The paper abstract.

23

Text analytics is the subset of text mining that handles information retrieval and extraction, plus data mining.

False

24

Regional accents present challenges for natural language processing.

True

25

In the evolution of social media user engagement, the largest recent change is the growth of creators.

False

26

A(n) _____________ is one or more web pages that provide a collection of links to authoritative web pages while _______________ is a software program that searches for websites or files based on keywords.

hub; search engine

27

____________ is a technique used to detect favorable and unfavorable opinions toward specific products and services using large numbers of textual data sources.

Sentiment analysis

28

Search engine optimization (SEO) is a means by which

website developers can increase Web site search rankings.

29

Web ___________ are used to automatically read through the contents of websites.

crawlers

30

What does advanced analytics for social media do?

It examines the content of online conversations.

31

All of the following are challenges associated with natural language processing EXCEPT

dividing up a text into individual words in English.

32

What does Web content mining involve?

Analyzing the unstructured content of Web pages

33

In text analysis, what is a lexicon?

A catalog of words, their synonyms, and their meanings.

34

What do voice of the market (VOM) applications of sentiment analysis do?

They examine customer sentiment at the aggregate level.

35

Search engines are only used in the context of the World Wide Web (WWW).

False

36

In the opening vignette, the architectural system that supported Watson used all the following elements EXCEPT

a core engine that could operate seamlessly in another domain without changes.

37

In sentiment analysis, sentiment suggests a transient, temporary opinion reflective of one's feelings.

False

38

In the Mining for Lies case study, a text-based deception-detection method used by Fuller and others in 2008 was based on a process known as ______________, which relies on elements of data and text mining techniques.

message feature mining

39

Current use of sentiment analysis in voice of the customer applications allows companies to change their products or services in real time in response to customer sentiment.

True

40

Spreadsheets include all possible tools needed to deploy a custom decision support system (DSS).

False

41

A decision made under risk is also known as a probabilistic or stochastic decision-making situation.

True

42

Risk _____________ is a decision-making method that analyzes the risk (based on assumed known probabilities) associated with different alternatives.

analysis

43

There are two common approaches to dealing with uncertainty. The first is the ___________ approach, which assumes that the outcomes for all alternatives will be the best possible; the ____________ of each of those may then be selected. The second is the ____________ approach. Under this approach, the worst possible outcome is assumed for each alternative, and then the ________ of the _________ is selected.

optimistic; best; pessimistic; best; worst
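
The two approaches can be compared on a small payoff table. The following Python sketch is illustrative only: the `payoffs` numbers, the alternative names, and both helper functions are invented for the example.

```python
# Hypothetical payoffs (% return) for each alternative under three
# possible states of the economy: growth, stagnation, recession.
payoffs = {
    "bonds": [12, 6, 3],
    "stocks": [15, 3, -2],
    "deposits": [6.5, 6.5, 6.5],
}


def optimistic_choice(table):
    """Assume the best outcome for every alternative, then pick the best of those."""
    return max(table, key=lambda alt: max(table[alt]))


def pessimistic_choice(table):
    """Assume the worst outcome for every alternative, then pick the best of the worst."""
    return max(table, key=lambda alt: min(table[alt]))


print(optimistic_choice(payoffs))   # best of the best outcomes
print(pessimistic_choice(payoffs))  # best of the worst outcomes
```

In this made-up table the optimistic approach favors stocks (best case 15), while the pessimistic approach favors deposits (worst case 6.5).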

44

________________ is performed by indicating a target cell, its desired value, and a changing cell.

Goal seeking

45

Which of the following is NOT a component of a quantitative model?

classes

46

A(n) ____________ model can be constructed under assumed environments of certainty.

dynamic

47

The most common method for solving a risk analysis problem is to select the alternative with the

greatest expected value.
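
A short Python sketch of the expected-value rule, assuming known probabilities for each state. The `probabilities` and `payoffs` values and the helper names are hypothetical and chosen only to show the calculation.

```python
# Hypothetical probabilities for three states: growth, stagnation, recession.
probabilities = [0.5, 0.3, 0.2]

# Hypothetical payoffs (% return) for each alternative under each state.
payoffs = {
    "bonds": [12, 6, 3],
    "stocks": [15, 3, -2],
    "deposits": [6.5, 6.5, 6.5],
}


def expected_value(outcomes, probs):
    """Probability-weighted average of the outcomes."""
    return sum(o * p for o, p in zip(outcomes, probs))


def best_by_expected_value(table, probs):
    """Select the alternative with the greatest expected value."""
    return max(table, key=lambda alt: expected_value(table[alt], probs))


for name, outcomes in payoffs.items():
    print(name, round(expected_value(outcomes, probabilities), 2))
print(best_by_expected_value(payoffs, probabilities))
```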

48

Simulation is normally used only when a problem is too complex to be treated using numerical optimization techniques.

True

49

Every linear programming (LP) model has some internal intermediate variables that are not explicitly stated.

True

50

A(n) __________ spreadsheet model represents behavior over time.

dynamic

51

Factors that are not under the control of the decision maker but can be fixed are called

parameters

52

A model builder makes predictions and assumptions regarding input data, many of which deal with the assessment of certain futures.

False

53

This method calculates the values of the inputs necessary to achieve a desired level of an output.

goal seek
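
The idea of working backward from a desired output to the required input can be sketched in Python with a simple bisection search. This is not how any particular spreadsheet implements its goal-seek feature; the `goal_seek` helper, the `profit` model, and its numbers are assumptions made for illustration, and the method requires that the target lies between the model's values at `low` and `high`.

```python
def goal_seek(model, target, low, high, tol=1e-9, max_iter=200):
    """Find an input between low and high for which model(input) is close to target."""
    f_low = model(low) - target
    f_high = model(high) - target
    if f_low * f_high > 0:
        raise ValueError("target is not bracketed by low and high")
    for _ in range(max_iter):
        mid = (low + high) / 2
        f_mid = model(mid) - target
        if abs(f_mid) < tol or (high - low) / 2 < tol:
            return mid
        if f_low * f_mid <= 0:
            high = mid              # solution lies in the lower half
        else:
            low, f_low = mid, f_mid  # solution lies in the upper half
    return (low + high) / 2


def profit(units):
    """Hypothetical model: 25 margin per unit, 5,000 fixed cost."""
    return 25 * units - 5000


# Changing cell: units. Target cell: profit. Desired value: 0 (break-even).
break_even_units = goal_seek(profit, target=0, low=0, high=1000)
print(round(break_even_units, 4))
```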

54

The components of a quantitative model are linked by ____________ expressions.

algebraic

55

Which of the following is NOT a characteristic displayed by a linear programming (LP) allocation problem?

There is a single way in which the resources can be used.

56

Important spreadsheet features for modeling include all of the following EXCEPT

pivot tables.

57

When the decision maker must consider several possible outcomes for each alternative, each with a given probability of occurrence, this is decision making under

risk.

58

A decision table shows the relationships of the problem graphically and can handle complex situations in a compact form.

False

59

Four major components of a quantitative model include: The variables that describe alternative courses of action are called

decision variables

60

Four major components of a quantitative model include: The variables in any decision-making situation that are not under the control of the decision maker are called

uncontrollable variables

61

Four major components of a quantitative model include: The variables that reflect intermediate outcomes in mathematical models are called

intermediate result variables

62

Four major components of a quantitative model include: The variables that reflect the level of effectiveness of a system; that is, they indicate how well the system performs or attains its goal(s) are called

result (outcome) variables

63

For individual decision makers, ______________ values constitute a major factor in the issue of ethical decision making.

personal

64

Big Data is being driven by the exponential growth, availability, and use of information.

True

65

Predictive analytics is beginning to enable development of software that is directly used by a consumer. One key concern in employing these technologies is the loss of ___________

privacy

66

Which of the following is true about the furtherance of homeland security?

There is a greater need for oversight.

67

Despite their potential, many current NoSQL tools lack mature management and monitoring tools.

True

68

Using this model, companies can deploy their software and applications in the cloud so that their customers can use them.

PaaS

69

Internet of Things (IoT) is the phenomenon of connecting the physical world to the Internet.

True

70

One reason the IoT is growing exponentially is because hardware is smaller and more affordable.

True

71

How does Hadoop work?

It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.
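
The split-then-process-in-parallel idea can be imitated on a single machine. The Python sketch below is only an analogy and does not use Hadoop: worker threads stand in for separate computers, and the function names and sample `lines` are hypothetical. Each part is counted independently (a map-like step), and the partial results are then merged (a reduce-like step).

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_parts(records, n_parts):
    """Break the data into roughly equal parts."""
    size = max(1, -(-len(records) // n_parts))  # ceiling division
    return [records[i:i + size] for i in range(0, len(records), size)]


def count_words(part):
    """Process one part on its own: count the words it contains."""
    counts = Counter()
    for line in part:
        counts.update(line.lower().split())
    return counts


def parallel_word_count(records, n_parts=3):
    """Process all parts at the same time, then merge the partial counts."""
    parts = split_into_parts(records, n_parts)
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        partials = list(pool.map(count_words, parts))
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


# Hypothetical input data.
lines = ["big data big", "data is big", "hadoop splits data"]
totals = parallel_word_count(lines)
print(totals.most_common(2))
```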

72

Why are companies like IBM shifting to provide more services and consulting?

Customers see that significant value can be created with the application of analytics, and need help completing these tasks.

73

Data today comes in all types of formats—ranging from traditional databases to hierarchical data stores created by the end users and OLAP systems, to text documents, e-mail, XML, meter-collected, sensor-captured data, to video, audio, and stock ticker data. By some estimates, 80 to 85 percent of all organizations' data is in some sort of unstructured or semistructured format.

Variety

74

This refers to both how fast data is being produced and how fast the data must be processed (i.e., captured, stored, and analyzed) to meet the need or demand. RFID tags, automated sensors, GPS devices, and smart meters are driving an increasing need to deal with torrents of data in near-real time.

Velocity

75

This is obviously the most common trait of Big Data. Many factors contributed to the exponential increase in data volume, such as transaction-based data stored through the years, text data constantly streaming in from social media, increasing amounts of sensor data being collected, automatically generated RFID and GPS data, and so forth.

Volume

76

In the classification of location-based analytic applications, examining geographic site locations falls in the consumer-oriented category.

False

77

Hadoop is primarily a distributed file system and lacks capabilities we'd associate with a DBMS, such as indexing, random access to data, and support for SQL.

True

78

As the size and the complexity of analytical systems increase, the need for more ____________ analytical systems is also increasing to obtain the best performance.

efficient

79

MapReduce can be easily understood by skilled programmers due to its procedural nature.

True

80

Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called?

Variability

81

The portion of the IoT technology infrastructure that focuses on how to manage incoming data and analyze it is

software backend.

82

The portion of the IoT technology infrastructure that focuses on the sensors themselves is

hardware.

83

Big Data comes from

everywhere

84

Traditional data warehouses have not been able to keep up with

the variety and complexity of data

85

Which Big Data approach promotes efficiency, lower cost, and better performance by processing jobs in a shared, centrally managed pool of IT resources?

Grid computing

86

Which process allows Big Data to be processed in memory and distributed across a dedicated set of nodes, solving complex problems in near-real time with highly accurate insights?

In-memory analytics