Nathan wants to fill in the words "Call" and "Don't Call" on the spreadsheet based on the value in column C. He will list "Call" if the value is FALSE and "Don't Call" if the value is TRUE. What is the formula Nathan could write?
=IF(C5, "Don't Call", "Call")
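The same branching can be sketched in Python. This is only an illustration of how IF picks between its two result arguments; the function name and the idea that column C holds a plain Boolean are assumptions, not part of the card.

```python
def call_label(column_c_value: bool) -> str:
    """Mirror =IF(C5, "Don't Call", "Call"): TRUE gives the first result, FALSE the second."""
    # column_c_value stands in for the Boolean in cell C5 (hypothetical name).
    return "Don't Call" if column_c_value else "Call"


label_when_true = call_label(True)
label_when_false = call_label(False)
```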
The __________________ function pairs each element of the first array with its counterpart in the second array, multiplies the elements of the pairs together, and adds the results.
SUMPRODUCT
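A minimal Python sketch of the pair, multiply, and add behavior described on this card. The function name and the sample quantities and prices are made up for illustration; this is not Excel's implementation.

```python
def sumproduct(first, second):
    """Pair elements by position, multiply each pair, and add the results."""
    if len(first) != len(second):
        raise ValueError("both arrays must have the same length")
    return sum(a * b for a, b in zip(first, second))


# Hypothetical data: units sold and unit prices for three products.
quantities = [2, 3, 4]
unit_prices = [10, 20, 30]
total = sumproduct(quantities, unit_prices)  # 2*10 + 3*20 + 4*30
```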
To perform the task of displaying "credit approved" or "credit denied" based on the corresponding Boolean value in column H, you can write a formula in cell I3 containing an IF function, as follows:
=IF(H3, "credit approved", "credit denied")
Within a given range of cells, the number of times a particular condition is satisfied is computed by using the ____________ function.
COUNTIF
Julia wants to count the employees who participate in committees. Some employees participate in more than one committee. The data is listed in column F of a worksheet. To count the employees who participate in exactly one committee, a correct formula would be:
=COUNTIF(F3:F13,1)
Julia wants to count the employees who participate in committees. Some employees participate in more than one committee. The data is listed in column F of a worksheet. To count the employees who participate in more than one committee, a correct formula would be:
=COUNTIF(F3:F13,">1")
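The two COUNTIF cards above can be mimicked in Python as a rough sketch. The eleven values standing in for F3:F13 are invented, and the criterion is passed as a function rather than as a string like ">1", which is a simplification of how COUNTIF takes its criteria.

```python
def countif(values, criterion):
    """Count how many values satisfy the criterion (a one-argument function)."""
    return sum(1 for value in values if criterion(value))


# Hypothetical contents of F3:F13: committees per employee.
committees_per_employee = [1, 2, 1, 3, 1, 0, 2, 1, 1, 4, 1]

exactly_one = countif(committees_per_employee, lambda n: n == 1)   # like =COUNTIF(F3:F13,1)
more_than_one = countif(committees_per_employee, lambda n: n > 1)  # like =COUNTIF(F3:F13,">1")
```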
When you write a VLOOKUP formula, you indicate the value you want to look up in a reference
table
In a PMT function, the argument referring to the number of compounding periods is called
nper
In a PMT function, the argument referring to the original principal value at the beginning of the financial transaction is called
pv
In a PMT function, the argument referring to the interest rate per compounding period is called
rate
A bank account would have an fv equal to the ____________ plus any accrued interest, plus or minus any payments into or out of the account.
pv
When you write a VLOOKUP formula, you indicate the value you want to look up in a table.
True
The syntax of the IF function is =IF(logical_test, value_if_true, value_if_false).
True
In a VLOOKUP function, the argument ____________ refers to the type of lookup you want to perform—TRUE or FALSE.
range_lookup
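To show what the two range_lookup settings do, here is a hedged Python sketch. The function, the grade table, and the use of None in place of Excel's #N/A error are all illustrative assumptions. TRUE performs an approximate match (the largest key not exceeding the lookup value, with the first column sorted ascending), while FALSE requires an exact match.

```python
from bisect import bisect_right


def vlookup(lookup_value, table, col_index, range_lookup=True):
    """Look up a value in the first column of table and return the cell in col_index (1-based)."""
    keys = [row[0] for row in table]
    if range_lookup:
        # Approximate match: assumes the first column is sorted ascending.
        position = bisect_right(keys, lookup_value) - 1
        if position < 0:
            return None  # Excel would show #N/A here
        return table[position][col_index - 1]
    # Exact match: return the first row whose key equals the lookup value.
    for row in table:
        if row[0] == lookup_value:
            return row[col_index - 1]
    return None  # Excel would show #N/A here


# Hypothetical reference table: minimum score and letter grade.
grades = [(0, "F"), (60, "D"), (70, "C"), (80, "B"), (90, "A")]

approximate = vlookup(85, grades, 2, range_lookup=True)
exact = vlookup(85, grades, 2, range_lookup=False)
```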
The PMT function finds the value of the payment per period, assuming that there are constant payments and a constant interest rate for the duration of the loan.
True
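The rate, nper, and pv arguments from the PMT cards combine in the standard annuity formula. The Python sketch below assumes a zero future value and end-of-period payments, and leaves out PMT's optional fv and type arguments. The loan figures are hypothetical. As in Excel, the result is negative for a positive pv because it represents cash paid out.

```python
def pmt(rate, nper, pv):
    """Constant payment per period for a loan of pv at a constant rate over nper periods."""
    if rate == 0:
        # With no interest the principal is simply split evenly.
        return -pv / nper
    return -pv * rate / (1 - (1 + rate) ** -nper)


# Hypothetical loan: 200,000 at 6% annual interest, paid monthly over 30 years.
monthly_payment = pmt(0.06 / 12, 30 * 12, 200_000)
```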
The syntax of the COUNTIF function is
=COUNTIF(range, criteria)
Categorization and clustering of documents during text mining differ only in the pre-selection of categories.
True
The goal of this step is to aggregate all text data points in the document and convert them to a single sentiment measure for the whole document.
Collection and Aggregation
The goal is to differentiate between a fact and an opinion, which may be viewed as classification of text as objective or subjective.
Sentiment Detection
The goal of this step is to accurately identify the target of the expressed sentiment.
Target Identification
The goal is to classify the opinion as falling under one of two opposing sentiment polarities, given an opinionated piece of text.
N-P Polarity Classification
In the research literature case study, the researchers analyzing academic papers extracted information from which source?
The paper abstract.
Text analytics is the subset of text mining that handles information retrieval and extraction, plus data mining.
False
Regional accents present challenges for natural language processing.
True
In the evolution of social media user engagement, the largest recent change is the growth of creators.
False
A(n) _____________ is one or more web pages that provide a collection of links to authoritative web pages while _______________ is a software program that searches for websites or files based on keywords.
hub; search engine
____________ is a technique used to detect favorable and unfavorable opinions toward specific products and services using large numbers of textual data sources.
Sentiment analysis
Search engine optimization (SEO) is a means by which
website developers can increase website search rankings.
Web ___________ are used to automatically read through the contents of websites.
crawlers
What does advanced analytics for social media do?
It examines the content of online conversations.
All of the following are challenges associated with natural language processing EXCEPT
dividing up a text into individual words in English.
What does Web content mining involve?
Analyzing the unstructured content of Web pages
In text analysis, what is a lexicon?
A catalog of words, their synonyms, and their meanings.
What do voice of the market (VOM) applications of sentiment analysis do?
They examine customer sentiment at the aggregate level.
Search engines are only used in the context of the World Wide Web (WWW).
False
In the opening vignette, the architectural system that supported Watson used all the following elements EXCEPT
a core engine that could operate seamlessly in another domain without changes.
In sentiment analysis, sentiment suggests a transient, temporary opinion reflective of one's feelings.
False
In the Mining for Lies case study, a text based deception-detection method used by Fuller and others in 2008 was based on a process known as ______________ which relies on elements of data and text mining techniques.
message feature mining
Current use of sentiment analysis in voice of the customer applications allows companies to change their products or services in real time in response to customer sentiment.
True
Spreadsheets include all possible tools needed to deploy a custom decision support system (DSS).
False
A decision made under risk is also known as a probabilistic or stochastic decision-making situation.
True
Risk _____________ is a decision-making method that analyzes the risk (based on assumed known probabilities) associated with different alternatives.
analysis
There are two common approaches to dealing with uncertainty. The first is the ___________ approach, which assumes that the outcomes for all alternatives will be the best possible; the ____________ of each of those may then be selected. The second is the ____________ approach. Under this approach, the worst possible outcome is assumed for each alternative, and then the ________ of the _________ is selected.
optimistic; best; pessimistic; best; worst
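The two approaches can be sketched in Python as "best of the best" and "best of the worst" over a payoff table. The alternatives and payoff numbers below are invented for illustration.

```python
# Hypothetical payoff table: each alternative's payoff under three possible outcomes.
payoffs = {
    "bonds": [12, 6, 3],
    "stocks": [15, 3, -2],
    "deposits": [6.5, 6.5, 6.5],
}

# Optimistic approach: assume the best outcome for each alternative, then pick the best of those.
optimistic_choice = max(payoffs, key=lambda name: max(payoffs[name]))

# Pessimistic approach: assume the worst outcome for each alternative, then pick the best of the worst.
pessimistic_choice = max(payoffs, key=lambda name: min(payoffs[name]))
```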
________________ is performed by indicating a target cell, its desired value, and a changing cell.
Goal seeking
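A rough Python sketch of the idea: the model function plays the target cell, target is its desired value, and the returned input plays the changing cell. Bisection is used here only as a simple stand-in search; the card does not say which algorithm a spreadsheet uses. The profit model is hypothetical.

```python
def goal_seek(model, target, low, high, tol=1e-9, max_iter=200):
    """Find an input in [low, high] where model(input) is close to target, by bisection.

    Assumes model is continuous and that model(low) - target and
    model(high) - target have opposite signs (or one of them is zero).
    """
    f_low = model(low) - target
    f_high = model(high) - target
    if f_low * f_high > 0:
        raise ValueError("target is not bracketed by low and high")
    for _ in range(max_iter):
        mid = (low + high) / 2
        f_mid = model(mid) - target
        if abs(f_mid) < tol or (high - low) / 2 < tol:
            return mid
        if f_low * f_mid <= 0:
            high = mid              # the solution lies in the lower half
        else:
            low, f_low = mid, f_mid  # the solution lies in the upper half
    return (low + high) / 2


def profit(units):
    # Hypothetical model: 25 profit per unit minus 10,000 in fixed costs.
    return 25 * units - 10_000


break_even_units = goal_seek(profit, target=0, low=0, high=10_000)
```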
Which of the following is NOT a component of a quantitative model?
classes
A(n) ____________ model can be constructed under assumed environments of certainty.
dynamic
The most common method for solving a risk analysis problem is to select the alternative with the
greatest expected value.
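Selecting by expected value can be sketched as follows. The probabilities and payoffs are assumed for illustration; each alternative's expected value is the probability-weighted sum of its payoffs.

```python
# Hypothetical probabilities of three outcomes (they sum to 1).
probabilities = [0.5, 0.3, 0.2]

# Hypothetical payoff of each alternative under each outcome.
payoffs = {
    "bonds": [12, 6, 3],
    "stocks": [15, 3, -2],
    "deposits": [6.5, 6.5, 6.5],
}

expected_values = {
    name: sum(p * payoff for p, payoff in zip(probabilities, outcomes))
    for name, outcomes in payoffs.items()
}

# Pick the alternative with the greatest expected value.
best_alternative = max(expected_values, key=expected_values.get)
```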
Simulation is normally used only when a problem is too complex to be treated using numerical optimization techniques.
True
Every linear programming (LP) model has some internal intermediate variables that are not explicitly stated.
True
A(n) __________ spreadsheet model represents behavior over time.
dynamic
Factors that are not under the control of the decision maker but can be fixed, are called
parameters
A model builder makes predictions and assumptions regarding input data, many of which deal with the assessment of certain futures.
False
This method calculates the values of the inputs necessary to achieve a desired level of an output.
goal seek
The components of a quantitative model are linked by ____________ expressions.
algebraic
Which of the following is NOT a characteristic displayed by a linear programming (LP) allocation problem?
There is a single way in which the resources can be used.
Important spreadsheet features for modeling include all of the following EXCEPT
pivot tables.
When the decision maker must consider several possible outcomes for each alternative, each with a given probability of occurrence, this is decision making under
risk.
A decision table shows the relationships of the problem graphically and can handle complex situations in a compact form.
False
Four major components of a quantitative model include: The variables that describe alternative courses of action are called
decision variables
Four major components of a quantitative model include: The variables in any decision-making situation that are not under the control of the decision maker are called
uncontrollable variables
Four major components of a quantitative model include: The variables that reflect intermediate outcomes in mathematical models are called
intermediate result variables
Four major components of a quantitative model include: The variables that reflect the level of effectiveness of a system; that is, they indicate how well the system performs or attains its goal(s) are called
result (outcome) variables
For individual decision makers, ______________ values constitute a major factor in the issue of ethical decision making.
personal
Big Data is being driven by the exponential growth, availability, and use of information.
True
Predictive analytics is beginning to enable development of software that is directly used by a consumer. One key concern in employing these technologies is the loss of ___________
privacy
Which of the following is true about the furtherance of homeland security?
There is a greater need for oversight.
Despite their potential, many current NoSQL tools lack mature management and monitoring tools.
True
Using this model, companies can deploy their software and applications in the cloud so that their customers can use them.
PaaS
Internet of Things (IoT) is the phenomenon of connecting the physical world to the Internet.
True
One reason the IoT is growing exponentially is because hardware is smaller and more affordable.
True
How does Hadoop work?
It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.
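The split, process in parallel, and combine pattern described on this card can be sketched in plain Python. This is not Hadoop's API; the function names, the word-count task, and the use of a thread pool in place of multiple computers are all assumptions made for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_parts(records, n_parts):
    """Break the data into n_parts roughly equal parts."""
    return [records[i::n_parts] for i in range(n_parts)]


def map_part(lines):
    """Process one part on its own: count the words in its lines."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def reduce_counts(partial_counts):
    """Combine the per-part results into one overall result."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return total


def word_count(records, n_parts=3):
    parts = split_into_parts(records, n_parts)
    # Worker threads stand in for the separate computers of a cluster.
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        partial_counts = list(pool.map(map_part, parts))
    return reduce_counts(partial_counts)


lines = ["big data is big", "data flows fast", "big data everywhere"]
totals = word_count(lines)
```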
Why are companies like IBM shifting to provide more services and consulting?
Customers see that significant value can be created with the application of analytics, and need help completing these tasks.
Data today comes in all types of formats—ranging from traditional databases to hierarchical data stores created by the end users and OLAP systems, to text documents, e-mail, XML, meter-collected, sensor-captured data, to video, audio, and stock ticker data. By some estimates, 80 to 85 percent of all organizations' data is in some sort of unstructured or semistructured format.
Variety
This refers to both how fast data is being produced and how fast the data must be processed (i.e., captured, stored, and analyzed) to meet the need or demand. RFID tags, automated sensors, GPS devices, and smart meters are driving an increasing need to deal with torrents of data in near-real time.
Velocity
This is obviously the most common trait of Big Data. Many factors contributed to the exponential increase in data volume, such as transaction-based data stored through the years, text data constantly streaming in from social media, increasing amounts of sensor data being collected, automatically generated RFID and GPS data, and so forth.
Volume
In the classification of location-based analytic applications, examining geographic site locations falls in the consumer-oriented category.
False
Hadoop is primarily a distributed file system and lacks capabilities we'd associate with a DBMS, such as indexing, random access to data, and support for SQL.
True
As the size and the complexity of analytical systems increase, the need for more ____________ analytical systems is also increasing to obtain the best performance.
efficient
MapReduce can be easily understood by skilled programmers due to its procedural nature.
True
Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called?
Variability
The portion of the IoT technology infrastructure that focuses on how to manage incoming data and analyze it is
software backend.
The portion of the IoT technology infrastructure that focuses on the sensors themselves is
hardware.
Big Data comes from
everywhere
Traditional data warehouses have not been able to keep up with
the variety and complexity of data
Which Big Data approach promotes efficiency, lower cost, and better performance by processing jobs in a shared, centrally managed pool of IT resources?
Grid computing
Which process allows Big Data to be processed in memory and distributed across a dedicated set of nodes, so that complex problems can be solved in near-real time with highly accurate insights?
In-memory analytics