csp chapter 4: storing data - spreadsheets and databases

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 36

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

37 Terms

1

crowdsourcing

tapping into the collective intelligence of a large group of people to achieve a specific goal or solve a problem

New cards
2

citizen science

type of scientific research that is conducted by distributed individuals who contribute relevant data to research using their own computer devices

New cards
3

examples of data visualization

  1. charles minard’s map of napoleon’s 1912-13 russian campaign

  2. john snow’s mapping of cholera outbreaks in 1854

New cards
4

common misrepresentation in data

correlation does NOT mean causation

New cards
5

simpson’s paradox

groups of data individually trend in one direction but when combined, the trend disappears or is reversed

New cards
6

first electronic spreadsheets

visicalc (apple II) and lotus 1-2-3 (IBM PC)

New cards
7

rows

go from left to right, labeled using numbers (1…)

New cards
8

columns

go from top to bottom, labeled using letters (a…)

New cards
9

cell

each individual piece of grid (labeled with letter-number)

New cards
10

label

text that descirbes some part of the spreadsheet

New cards
11

constant

any number the user enters into the spreadsheet

New cards
12

formula

equation that can perform calculations on existing cells

New cards
13

formatting

makes data more visually appealing

  • right click on cell, select formatting, format cells and check under home tab

New cards
14

conditional formatting

highlights cells that meet a specific criteria (= a value, > value, top __), can also turn cells into mini graphs

New cards
15

auto formatting

pre-made templates to change spreadsheet

  • select cells, select formatting, format as table

New cards
16

database

organized collection of data stores in tables

  • data is consistent

New cards
17

consistency in databases

info in one table does not contradict itself in any other table

New cards
18

idempotency

an operation will result in the same end result no matter how many times its performed

New cards
19

write-ahead logging

all changes are written and saved to a log before they’re applied to the database so all components in a transaction need to be carried out before the transaction is considered complete

New cards
20

atomic transactions

transactions that cannot be broken down while being executed

New cards
21

rollback

returning back to the state before the write-ahead log began

New cards
22

deadlock

when two transactions are trying to lock the same row and neither can continue until the other is complete

  • one of the transactions must be rolled back

New cards
23

two-phase commit protocol

standardized way to make sure all data can be written without any inconsistencies

  • first phase: check to see if all processes can be completed

  • second: if they can be written without issue, then the processes will be committed

    • if not, the phase will rollback

New cards
24

relational database

have multiple tables that are connected or related through the use of unique keys, a column holding a unique value that distinguishes each record from all others

New cards
25

virtual table

temporary tables made up of parts of other tables that help to reduce redundant data

New cards
26

SQL (structured query language)

language used to manage, access, and manipulate relational databases

  • ignores white space and is not case-sensitive

New cards
27

SELECT class_year FROM students;

returns every student’s class year, including duplicates

New cards
28

distinct

only lists unique values

New cards
29

order by

sort the data

New cards
30

desc/asc

gives descending/ascending order

New cards
31

limit

limits to a certain # of rows

  • using * specifies all columns

New cards
32

where

used to get records that match specific criteria

  • can be used in conjunction with and/or/not

New cards
33

like %

% represents 0 or more unknown characters

  • “LIKE ‘W%’” would return entries of any length that start w capital W

  • LIKE %ing would return only entries ending in ‘ing’

New cards
34

like _

_ represents exaclty one character

  • LIKE T_m would return Tim, Tom, Tum

New cards
35

join

combines entries from two or more tables

  • using ON specifies how the tables being joined are related

New cards
36

fault-tolerance

ability of a system to continue to run properly even if one piece fails

New cards
37

big data

sets of data that are larger than a consumer software application can handle

New cards
robot