Google Building Blocks Review Flashcards

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/24

flashcard set

Earn XP

Description and Tags

Flashcards reviewing Google's core infrastructure technologies: MapReduce, Google File System (GFS), and BigTable.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

Google's major software systems include MapReduce, GFS (Google File System), and __.

BigTable

2
New cards

MapReduce is a methodology for exploiting __ in computing clouds.

parallelism

3
New cards

In the context of search engines, MapReduce is used for building Google's Search Index and __.

Article clustering for Google News

4
New cards

Examples of modern Internet applications that require managing immense amounts of data quickly include Dish network click collection and __ data collection.

Tesla car usage

5
New cards

MapReduce solves the problems of parallelization, fault tolerance, I/O scheduling, and __ for the programmer.

Monitoring & Status updates

6
New cards

The Map/Reduce paradigm involves breaking records into segments, mapping to extract something of interest, grouping intermediate results, reducing to aggregate results, and generating __.

final output

7
New cards

In MapReduce, the __ manages the parallel execution and coordination of tasks automatically.

system

8
New cards

In a MapReduce computation, map tasks turn a chunk into a sequence of __.

key-value pairs

9
New cards

The Map function parses a document, extracts each word and uses each word as a __.

key

10
New cards

The Reduce function aggregates intermediate results by __.

key

11
New cards

The master controller knows how many __ tasks there will be.

Reduce

12
New cards

The Reduce function is generally __ and commutative.

associative

13
New cards

The Google File System (GFS) is designed for efficient, reliable access to data using large clusters of __.

commodity hardware

14
New cards

GFS supports automatic sharding of large files, automatic recovery from failures, and is optimized for __ access to huge files.

sequential

15
New cards

In GFS, files are divided into fixed-size chunks of __ megabytes.

64

16
New cards

In GFS, the __ server holds all metadata, like namespace, access control, and chunk locations.

Master

17
New cards

In GFS, data transfers happen directly between __ and chunkservers.

clients

18
New cards

GFS is optimized for __ files rather than rewrites.

appended

19
New cards

Bigtable is a compressed, high performance, proprietary data storage system built on top of __.

the Google File System

20
New cards

A table in Bigtable is sparse, distributed, persistent, multidimensional, and __.

sorted map

21
New cards

In Bigtable, data is treated as __.

uninterpreted strings

22
New cards

In Bigtable, rows are ordered __.

lexicographically

23
New cards

In Bigtable, each cell contains a unique __ version of the data for that row and column.

timestamped

24
New cards

In Bigtable, columns have two-level name structure consisting of family and __.

optional qualifier

25
New cards

In Bigtable, Timestamps are used to store __ versions of data in a cell.

different