Big Idea 2 - Data

45 Terms

1

How many values can a byte represent?

256

2

What is the maximum decimal value a byte can represent?

255

3

What is the formula to calculate how many values can be represented by n bits?

2^n

4

To calculate the largest value that n bits can represent, use the formula:

2^n - 1
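The two formulas on these cards can be checked with a short sketch. The function names `count_values` and `max_value` are made up for illustration, and the values are assumed to be unsigned (starting at 0).

```python
def count_values(n_bits):
    """Number of distinct values n bits can represent: 2^n."""
    return 2 ** n_bits


def max_value(n_bits):
    """Largest unsigned value n bits can hold: 2^n - 1, because counting starts at 0."""
    return 2 ** n_bits - 1


for n in (1, 4, 8):
    print(n, "bits:", count_values(n), "values, largest is", max_value(n))
```

For a byte (n = 8) this gives 256 values with a maximum of 255, matching the first two cards.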

5

Examples of abstractions

number systems, colors, characters

6

Abstractions allow

  • generalization of a program

  • This can help shrink the code by referencing one segment instead of repeating, which lowers the chance of errors

7

What is the base and digit range of the octal number system?

base-8 and uses digits 0 to 7

8

How many symbols does the hexadecimal system use, and what are they?

16 symbols (base-16): the digits 0–9 and the letters A–F (where A = 10 through F = 15)
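As a quick illustration of the number systems above, the same value can be written in binary, octal, and hexadecimal. This sketch uses Python's built-in `format` and `int`; the variable names are arbitrary.

```python
value = 255  # the largest value one byte can hold

binary_text = format(value, "b")  # base-2 digits
octal_text = format(value, "o")   # base-8 digits (0 to 7)
hex_text = format(value, "X")     # base-16 digits (0-9, A-F)

print(binary_text, octal_text, hex_text)

# Converting each string back with its base recovers the same number.
assert int(binary_text, 2) == value
assert int(octal_text, 8) == value
assert int(hex_text, 16) == value
```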

9

Pixels

digital images are a collection of pixels that each consist of binary values

10

RGB pixels

each pixel is represented by 3 bytes of data, one each for red, green, and blue; each byte holds a value from 0 to 255
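A minimal sketch of one RGB pixel as three byte-sized values. The helper `rgb_to_hex` is a hypothetical name, and the hex color string is just one common way to display the three bytes.

```python
def rgb_to_hex(red, green, blue):
    """Format three 0-255 channel values as a #RRGGBB string."""
    for channel in (red, green, blue):
        if not 0 <= channel <= 255:
            raise ValueError("each channel must fit in one byte (0-255)")
    return f"#{red:02X}{green:02X}{blue:02X}"


orange = (255, 165, 0)       # red, green, blue
pixel_bytes = bytes(orange)  # the same pixel stored as raw bytes

print(rgb_to_hex(*orange))   # the pixel as a hex color string
print(len(pixel_bytes))      # number of bytes used by one pixel
```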

11

Lossy compressions

  • significantly reduces file size for storage and transmission (e.g., email) but lowers quality

  • the original file cannot be recovered because some data is permanently discarded
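A toy illustration of why lossy compression cannot be undone. This is not how real image or audio formats work; it only assumes that each 8-bit sample is stored with fewer bits by discarding the low-order ones. The name `quantize` is made up for this sketch.

```python
def quantize(sample, keep_bits=4):
    """Keep only the top `keep_bits` bits of an 8-bit sample (0-255)."""
    shift = 8 - keep_bits
    return (sample >> shift) << shift


samples = [200, 201, 202, 203]
compressed = [quantize(s) for s in samples]

print(samples)
print(compressed)  # several different inputs collapse to the same value
```

Because different original samples map to the same stored value, there is no way to tell afterward which one was the original.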

12

Lossless data compression

  • no data is lost, so the original file can be reproduced exactly, but the compressed file is larger than a lossy version and takes more space to store
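Lossless compression can be checked with a round trip: compress, decompress, and compare. The sketch below uses Python's standard `zlib` module as one example of a lossless compressor; the sample data is made up and deliberately repetitive.

```python
import zlib

original = b"AAAAABBBBBCCCCC" * 100  # repetitive data compresses well

compressed = zlib.compress(original)
restored = zlib.decompress(compressed)

print(len(original), "bytes before,", len(compressed), "bytes after")
print("identical after round trip:", restored == original)
```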

13

Large datasets

may be difficult to work with; the ability to process the data depends on the capabilities of the users and their tools

14

Bias in datasets

  • not eliminated by adding more data, and algorithms that use the biased data will reflect this bias

  • Large sample size could magnify bias

15

Cleaning

removing corrupt data, removing or repairing incomplete data, removing or flagging invalid data
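A small sketch of cleaning, assuming a made-up list of records where the name may be missing and the age may not be a valid number. The `clean` function and the rules it applies are illustrative only.

```python
raw_rows = [
    {"name": "Ada", "age": "36"},
    {"name": "", "age": "41"},        # incomplete: missing name
    {"name": "Grace", "age": "abc"},  # invalid: age is not a number
    {"name": "Alan", "age": "-5"},    # invalid: negative age
    {"name": "Linus", "age": "54"},
]


def clean(rows):
    """Split rows into usable records and flagged (row, reason) pairs."""
    kept, flagged = [], []
    for row in rows:
        name = row["name"].strip()
        age_text = row["age"].strip()
        if not name:
            flagged.append((row, "missing name"))
        elif not age_text.isdigit():
            flagged.append((row, "invalid age"))
        else:
            kept.append({"name": name, "age": int(age_text)})
    return kept, flagged


kept, flagged = clean(raw_rows)
print(kept)
print([reason for _, reason in flagged])
```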

16

Filtering

different subsets are identified and extracted to make meaning of data

17

Classifying

make meaning of large data sets by grouping by common attributes based on the criteria of individuals processing the data
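Filtering and classifying can be sketched side by side on one made-up data set. The field names and the score threshold of 90 are assumptions chosen for the example.

```python
from collections import defaultdict

students = [
    {"name": "Ana", "grade": 9, "score": 88},
    {"name": "Ben", "grade": 10, "score": 72},
    {"name": "Cy", "grade": 9, "score": 95},
    {"name": "Di", "grade": 10, "score": 91},
]

# Filtering: extract the subset that meets a condition.
high_scorers = [s["name"] for s in students if s["score"] >= 90]

# Classifying: group every record by a shared attribute.
by_grade = defaultdict(list)
for s in students:
    by_grade[s["grade"]].append(s["name"])

print(high_scorers)
print(dict(by_grade))
```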

18

Bias

occurs when the collected data does not represent the full range of possibilities in the population being studied

19

Patterns

the analysis starts with a question and uses rules to find patterns in data. Computers can find patterns better than people in large data sets.

20

Correlations

a correlation between two elements does not necessarily mean that one causes the other
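To illustrate, the sketch below computes a Pearson correlation coefficient by hand for two invented series. The numbers are made up; in this scenario both could plausibly rise with hot weather, so a strong correlation would not show that either one causes the other.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / sqrt(spread_x * spread_y)


# Hypothetical weekly figures, invented for this example.
ice_cream_sales = [20, 35, 50, 65, 80]
sunburn_cases = [2, 4, 5, 7, 9]

r = pearson(ice_cream_sales, sunburn_cases)
print(round(r, 3))  # close to 1 means strongly correlated, not causal
```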

21

Scalability

the ability to increase (or decrease) the capacity of a resource without having to switch to a completely new solution; important for storing and processing large data sets

22

PII ( personally identifiable information )

should be kept safe online, for example through multi-factor authentication

23

Metadata

Data that describes data

Used to find, organize, and manage information about data and increases the effective use of data

24

Changes and deletions made to metadata do not change primary data

True
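A minimal sketch of this idea, assuming a made-up `photo` record that stores the primary data (pixel bytes) separately from its metadata. Real file formats differ; the layout here is only for illustration.

```python
photo = {
    "data": bytes([255, 0, 0, 0, 255, 0]),  # primary data: two RGB pixels
    "metadata": {"width": 2, "height": 1, "author": "unknown"},
}

data_before = photo["data"]

# Change one metadata field and delete another.
photo["metadata"]["author"] = "A. Student"
del photo["metadata"]["height"]

print(photo["metadata"])
print("primary data unchanged:", photo["data"] == data_before)
```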

25

Which file type is lossy compressed sound?

mp3

26

Which type of compression do zip files use?

Lossless

27

Bit

a single binary digit, 0 or 1 (2^0 = 1 bit)

28

Byte

8 bits

29

Kilobyte

1,024 bytes (2^10)

30

Megabyte

1,048,576 bytes (2^20), roughly 1 million

31

Gigabyte

1,073,741,824 bytes (2^30), roughly 1 billion

32

Terabyte

1,099,511,627,776 bytes (2^40), roughly 1 trillion

33

Petabyte

1,125,899,906,842,624 bytes (2^50), roughly 1 quadrillion
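The exact powers of two behind these units can be computed directly. The binary values are close to, but not equal to, the round decimal figures of a thousand, a million, and so on. The `UNIT_EXPONENTS` table and `bytes_in` helper are names invented for this sketch.

```python
# Exponent of 2 for each unit, as listed on the cards above.
UNIT_EXPONENTS = {
    "kilobyte": 10,
    "megabyte": 20,
    "gigabyte": 30,
    "terabyte": 40,
    "petabyte": 50,
}


def bytes_in(unit):
    """Number of bytes in one unit, using the power-of-two definition."""
    return 2 ** UNIT_EXPONENTS[unit]


for unit in UNIT_EXPONENTS:
    print(f"1 {unit} = 2^{UNIT_EXPONENTS[unit]} = {bytes_in(unit):,} bytes")
```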

34

Digital images

a collection of pixels, where each pixel consists of binary numbers

35

Computers need metadata to know…

the size of an image

36

Data compression

is useful in computing to save disk space or to reduce the bandwidth used when sending data (e.g., over the Internet).

37

Text compression

  • lossless compression.

  • The reconstruction must be identical to the original text, as very small differences can result in statements with very different meanings.
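Run-length encoding is one simple lossless scheme that can be applied to text, shown here only as an example; the card does not say which algorithm is used. The function names are invented for this sketch.

```python
def rle_encode(text):
    """Collapse runs of repeated characters into (character, count) pairs."""
    runs = []
    for ch in text:
        if runs and runs[-1][0] == ch:
            runs[-1] = (ch, runs[-1][1] + 1)
        else:
            runs.append((ch, 1))
    return runs


def rle_decode(runs):
    """Rebuild the exact original text from (character, count) pairs."""
    return "".join(ch * count for ch, count in runs)


message = "aaabccdddd"
encoded = rle_encode(message)
print(encoded)
print(rle_decode(encoded) == message)  # the text comes back unchanged
```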

38

Data extraction (first step)

the process of obtaining data from a database or from software such as a social media website so that it can be transferred to other software (such as a spreadsheet) designed to support online analytical processing.

39

Transform (second step)

Clean, filter, organize, or reformat the data into a usable structure.

40

Analyze (third step)

Interpret the transformed data to find trends, patterns, or insights.
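The three steps (extract, transform, analyze) can be sketched end to end. The CSV text below stands in for data exported from some other software; its columns and values are invented for the example.

```python
import csv
import io

# Extract: read rows from the exported data (a made-up CSV string here).
exported = "user,likes\nana,10\nben,\ncy,30\nana,5\n"
rows = list(csv.DictReader(io.StringIO(exported)))

# Transform: drop rows with a missing count and convert text to integers.
cleaned = [
    {"user": row["user"], "likes": int(row["likes"])}
    for row in rows
    if row["likes"].isdigit()
]

# Analyze: total the likes per user and find the highest total.
totals = {}
for row in cleaned:
    totals[row["user"]] = totals.get(row["user"], 0) + row["likes"]
top_user = max(totals, key=totals.get)

print(totals)
print(top_user)
```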

41

Graph

A diagram used to represent data. It usually is used to depict a relationship between two things.

42

Picture graphs

use pictures to represent values.

43

Bar graphs

use either vertical or horizontal bars to represent the values.

44

Line graphs

use lines to represent the values.

45

Scatter plots

represent the data as individual points; a best-fit line is often drawn through or near the points to show the overall trend.
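A best-fit line can be computed with ordinary least squares, which is one common method and is assumed here since the card does not name one. The data points are invented, and `best_fit_line` is a hypothetical helper.

```python
def best_fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical scatter-plot data: hours studied vs. test score.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 71, 78, 89]

slope, intercept = best_fit_line(hours, scores)
print(f"score = {slope:.1f} * hours + {intercept:.1f}")
```

The fitted line summarizes the trend and does not have to pass exactly through any individual point.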