Quiz #10

22 Terms

1
*Veracity, Variability*, *Value*, and *Visualization* of data are terms applicable exclusively to Big Data.
FALSE
2
A fully developed data warehouse and a pure data lake are *the only two choices* for large analytical data repositories.
FALSE
3
Unstructured and semi-structured data can become a *source of data for a data warehouse* after being processed by Hadoop.
TRUE
4
The *same data* in the data lake can be transformed and analyzed by multiple groups of users in different ways for a variety of different purposes.
TRUE
5
Big data techniques *increase* the ability to analyze the data that the organization owns or to which it has access.
TRUE
6
A collection of alphanumeric comma-separated data showing temperatures for each second of the day for a set of 10,000 commercial smart refrigerators is an *example of big data*.
TRUE
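
The scale behind this card can be checked with quick arithmetic. The sketch below uses the two figures from the card (10,000 refrigerators, one reading per second); the bytes-per-reading value is a hypothetical assumption, since the card does not state a record size.

```python
# Figures taken from the card: 10,000 refrigerators, one reading per second.
REFRIGERATORS = 10_000
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds in a day

# Total temperature readings produced by the whole fleet in one day.
readings_per_day = REFRIGERATORS * SECONDS_PER_DAY

# Hypothetical record size (assumption, not stated on the card).
ASSUMED_BYTES_PER_READING = 20
bytes_per_day = readings_per_day * ASSUMED_BYTES_PER_READING

print(readings_per_day)     # 864000000
print(bytes_per_day / 1e9)  # about 17.28 (GB per day under the assumed record size)
```

Even with a small assumed record size, the fleet produces hundreds of millions of readings per day, which is consistent with the card's answer.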
7
Big data methods, such as MapReduce, *replace* database and data warehousing approaches developed for managing and utilizing formally modeled data assets.
FALSE
8
Big data sets, when *compared* with most databases and data warehouses:
Have a higher possibility of different interpretations
9
A corporation engaged in data analysis can create and maintain *both* data warehouses and data lakes at the same time.
TRUE
10
CONSTANT DOCUMENTARIAN is an organization that each day receives a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-word email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets:
 - Set A: collection of daily one-hour videos from the 50 contributors
 - Set B: collection of daily 200-word emails from its contributors
 - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters
 - Set D: relational table containing first name, last name, phone number, and email address of each contributor

Which data set exhibits the lowest *volume*?
Set D
11
*Volume, Variety,* and *Velocity* of data are terms applicable exclusively to big data.
FALSE
12
CONSTANT DOCUMENTARIAN is an organization that each day receives a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-word email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets:
 - Set A: collection of daily one-hour videos from the 50 contributors
 - Set B: collection of daily 200-word emails from its contributors
 - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters
 - Set D: relational table containing first name, last name, phone number, and email address of each contributor

Which data set exhibits the lowest *velocity*?
Set D
13
Which of the following is *NOT* true?
The programmer using Hadoop has to write the functions for distributing the data among nodes.
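
To illustrate why that statement is false, here is a minimal plain-Python sketch of the MapReduce division of labor, not actual Hadoop code. The programmer supplies only `map_fn` and `reduce_fn`; the hypothetical `run_job` function stands in for the framework, which in Hadoop also handles splitting the data and distributing it among nodes. All three names are illustrative assumptions.

```python
from collections import defaultdict


def map_fn(line):
    """User-written map step: emit (word, 1) for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def reduce_fn(word, counts):
    """User-written reduce step: sum the counts collected for one word."""
    return word, sum(counts)


def run_job(lines, mapper, reducer):
    """Hypothetical stand-in for the framework (single process here).

    It groups mapped pairs by key and calls the reducer once per key.
    In Hadoop, this grouping and the distribution of data across nodes
    are done by the framework, not by the programmer.
    """
    groups = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in groups.items())


result = run_job(["big data big lake", "data lake"], map_fn, reduce_fn)
print(result)  # {'big': 2, 'data': 2, 'lake': 2}
```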
14
Where on the Spectrum of Solutions for Large Analytical Data Repositories would the following example best fit?

*Data from 10 data sources is extracted.  Two of those sources have the exact same structure, so the data from those two sources is pasted together before loading. There is a small overlap of data in those two sources, so the duplicates are eliminated before loading. The rest of the sources are loaded as they were.*
Inside the spectrum, closer to the left edge of the spectrum (Pure Data Lake).
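
The light processing described in this card (appending two same-structure sources and dropping duplicates, with everything else loaded as is) can be sketched as follows. The sample rows and the helper name `append_and_dedupe` are hypothetical, chosen only for illustration.

```python
# Two hypothetical sources with the exact same structure: (id, name).
source_a = [(1, "Ann"), (2, "Bob"), (3, "Cy")]
source_b = [(3, "Cy"), (4, "Dee")]  # (3, "Cy") overlaps with source_a


def append_and_dedupe(*sources):
    """Append same-structure sources and keep the first copy of each row."""
    seen = set()
    combined = []
    for source in sources:
        for row in source:
            if row not in seen:
                seen.add(row)
                combined.append(row)
    return combined


combined = append_and_dedupe(source_a, source_b)
print(combined)  # [(1, 'Ann'), (2, 'Bob'), (3, 'Cy'), (4, 'Dee')]
```

Only this small amount of integration is applied before loading, and the other eight sources are untouched, which is why the example sits near the Pure Data Lake edge rather than at it.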
15
*SQL* provides all the necessary functionalities for managing and analyzing big data.
FALSE
16
CONSTANT DOCUMENTARIAN is an organization that each day receives a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-word email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets:
 - Set A: collection of daily one-hour videos from the 50 contributors
 - Set B: collection of daily 200-word emails from its contributors
 - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters
 - Set D: relational table containing first name, last name, phone number, and email address of each contributor

Which data set exhibits the highest *velocity*?
Set C
17
Which of the following data examples has the most *structure*?
A row in a relational table
18
Which of the following is applicable to a *data lake*?
Potentially analytically useful data is extracted from sources and then placed in the data lake.
19
CONSTANT DOCUMENTARIAN is an organization that each day receives a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-word email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets:
 - Set A: collection of daily one-hour videos from the 50 contributors
 - Set B: collection of daily 200-word emails from its contributors
 - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters
 - Set D: relational table containing first name, last name, phone number, and email address of each contributor

Which data set exhibits the highest *volume*?
Set A
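
A rough daily comparison supports this answer. The sketch below assumes each of the 50 contributors sends one video and one email per day, that there is a single CCTV camera, and that all video has comparable size per hour; none of these details are spelled out on the card.

```python
CONTRIBUTORS = 50

# Set A: one one-hour video per contributor per day (assumed one each).
set_a_video_hours_per_day = CONTRIBUTORS * 1

# Set C: a single camera recording around the clock (assumed one camera).
set_c_video_hours_per_day = 24

# Set B: one 200-word email per contributor per day (assumed one each).
set_b_words_per_day = CONTRIBUTORS * 200

print(set_a_video_hours_per_day)  # 50
print(set_c_video_hours_per_day)  # 24
print(set_b_words_per_day)        # 10000
```

Under these assumptions Set A adds 50 hours of video per day against 24 hours for Set C, while Sets B and D hold only small amounts of text.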
20
A large fact table in a data warehouse is an *example of big data*.
FALSE
21
CONSTANT DOCUMENTARIAN is an organization that each day receives a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-word email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets:
 - Set A: collection of daily one-hour videos from the 50 contributors
 - Set B: collection of daily 200-word emails from its contributors
 - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters
 - Set D: relational table containing first name, last name, phone number, and email address of each contributor

Which data set has the most *structure*?
Set D
22
Which of the following (regarding the telemetry data set in the Insurance Company Example) is *NOT* true?
Telemetry data could be used as a source for adding data about the range of prices of new vehicles to the warehouse.