APCSP Unit 10 - Big Data

29 Terms

1

Big Data

extremely large and ever-growing amounts of data, both structured and unstructured, that are too large or complex to be dealt with by traditional data-processing application software

2

structured data

data that resides in a fixed field within a spreadsheet or relational database; data that has been intentionally collected in an organized manner

3

unstructured data

data that is not stored in a predefined way; it is not structured or categorized

4

The Five V’s of Big Data

volume, velocity, variety, veracity, value

5

data center

a physical room, building, or facility that houses the equipment necessary for storing data and hosting applications and services

6

data compression

a reduction in the number of bits needed to represent data

7

lossy compression

a non-reversible process that reduces the number of bits in a digital file by discarding some of the data and information
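
A minimal sketch of the idea, assuming a hypothetical `quantize` helper and made-up sample values: rounding each number to a coarser step leaves fewer distinct values to store, but the exact originals cannot be recovered afterward.

```python
def quantize(samples, step):
    """Round each sample to the nearest multiple of step (detail is discarded)."""
    return [round(s / step) * step for s in samples]

# Made-up sample values, for illustration only.
original = [3, 7, 12, 18, 26]
approximate = quantize(original, 10)

print(approximate)              # [0, 10, 10, 20, 30]
print(approximate == original)  # False: the discarded detail is gone for good
```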

8

lossless compression

a reversible process that reduces the number of bits in a digital file without losing any data
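
One way to see the reversibility is a round trip through Python's standard `zlib` module. This is only an illustrative sketch; the repeated sample text is invented so that there is something to compress.

```python
import zlib

# Highly repetitive sample text (invented), which compresses well.
text = b"big data " * 100

compressed = zlib.compress(text)
restored = zlib.decompress(compressed)

print(len(text), len(compressed))  # the compressed form uses fewer bytes
print(restored == text)            # True: every original byte is recovered
```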

9

byte

a group of 8 bits

10

kilobyte

one thousand bytes

11

megabyte

one million bytes (or one thousand kilobytes)

12

gigabyte

one billion bytes (or one thousand megabytes)

13

terabyte

one trillion bytes (or one thousand gigabytes)
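
The unit definitions above (byte through terabyte) can be checked with a short sketch. It assumes the decimal (powers of 1,000) definitions used in these cards; the constant names and the `human_readable` helper are hypothetical.

```python
BITS_PER_BYTE = 8
KILOBYTE = 1_000             # bytes
MEGABYTE = 1_000 * KILOBYTE
GIGABYTE = 1_000 * MEGABYTE
TERABYTE = 1_000 * GIGABYTE

def human_readable(num_bytes):
    """Express a byte count in the largest unit that fits."""
    units = [("TB", TERABYTE), ("GB", GIGABYTE), ("MB", MEGABYTE), ("KB", KILOBYTE)]
    for name, size in units:
        if num_bytes >= size:
            return f"{num_bytes / size:g} {name}"
    return f"{num_bytes} bytes"

print(human_readable(2_500_000))   # 2.5 MB
print(KILOBYTE * BITS_PER_BYTE)    # 8000 bits in one kilobyte
```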

14

cleaning data

a process that makes data uniform without changing its meaning
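
As a small illustration, assuming invented survey answers and a hypothetical `clean` helper: the entries below all mean the same city but are typed inconsistently, and cleaning gives them one uniform spelling without changing what they mean.

```python
# Invented raw entries with inconsistent spacing and capitalization.
raw = [" New York", "new york", "NEW YORK ", "Boston"]

def clean(value):
    """Trim surrounding spaces and standardize capitalization."""
    return value.strip().title()

cleaned = [clean(v) for v in raw]
print(cleaned)  # ['New York', 'New York', 'New York', 'Boston']
```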

15

bias

prejudice in favor of or against one thing, person, or group compared with another, usually in a way considered to be unfair

16

COPPA

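Children’s Online Privacy Protection Act; a U.S. federal law that regulates the online collection of personal information from children under 13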

17

data

raw facts and statistics collected together for reference or analysis

18

information

data that has been processed, interpreted, analyzed, organized, or structured to make it more meaningful or useful

19

extraction

retrieving data, processing it, and placing it in a structure that can be analyzed

20

filtering

temporarily removing or hiding unwanted data

21

sorting

arranging data in alphabetical or numerical order
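
Filtering and sorting can be sketched together on a small, invented list of records. Note that both operations build a new view and leave the original list untouched, which matches the "temporarily" in the filtering definition.

```python
# Invented records, for illustration only.
records = [
    {"name": "Dana", "score": 88},
    {"name": "Avi", "score": 72},
    {"name": "Cleo", "score": 95},
]

# Filtering: hide the rows that do not meet a condition.
passing = [r for r in records if r["score"] >= 80]

# Sorting: alphabetical by name, then numerical by score (highest first).
by_name = sorted(records, key=lambda r: r["name"])
by_score = sorted(records, key=lambda r: r["score"], reverse=True)

print([r["name"] for r in passing])   # ['Dana', 'Cleo']
print([r["name"] for r in by_name])   # ['Avi', 'Cleo', 'Dana']
print([r["name"] for r in by_score])  # ['Cleo', 'Dana', 'Avi']
print(len(records))                   # 3: the original data is unchanged
```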

22

data mining

the practice of analyzing large databases in order to generate new information

23

trend analysis

the practice of attempting to spot a pattern in data

24

pattern

consistencies or repetitions in data; useful for predicting future behavior or events

25

visualization

the presentation of data in a pictorial or graphical form; allows decision makers to grasp a difficult concept or identify new patterns

26

causation

proof that one event or condition directly led to another event or condition

27

correlation

how closely related two or more events or conditions are; just because trend graphs look similar does not mean that one trend caused another trend
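
One common way to measure how closely two series move together is the Pearson correlation coefficient, which ranges from -1 to 1. The sketch below assumes a hypothetical `pearson` helper and invented numbers. A value near 1 shows only that the two series rise together, not that one causes the other; in this made-up case both could simply be driven by hot weather.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Invented weekly figures, for illustration only.
ice_cream_sales = [10, 20, 30, 40, 50]
sunburn_cases = [3, 5, 7, 9, 11]

r = pearson(ice_cream_sales, sunburn_cases)
print(r)  # 1.0: perfectly correlated, yet ice cream does not cause sunburn
```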

28

model

a physical or virtual representation of an object, concept, or idea

29

simulation

a method for testing a hypothesis about a situation using a model
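
A minimal sketch of a simulation, assuming a very simple model (a fair coin represented by a random number generator) and a hypothetical `simulate_coin_flips` function. The hypothesis under test is that heads comes up about half the time.

```python
import random

def simulate_coin_flips(num_flips, seed=None):
    """Model a fair coin and return the fraction of flips that land heads."""
    rng = random.Random(seed)  # a fixed seed makes the run repeatable
    heads = sum(rng.random() < 0.5 for _ in range(num_flips))
    return heads / num_flips

fraction_heads = simulate_coin_flips(10_000, seed=42)
print(fraction_heads)  # expected to be close to 0.5
```

Running more flips generally brings the result closer to 0.5, which is one reason simulations are often repeated many times.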