Chapter 8: Data

44 Terms

1
lowest level
Computers read machine code, which, at the ________, is made up of 0s and 1s.
2
Current
"________" and "no ________" are easy conditions to detect.
3
similar features
Classifying: Grouping data with ________ and values helps computers make sense of large datasets.
4
attendees
Data collected from all types of events (such as visits, searches, inquiries, orders, returns, temperatures, scores, ________, acres planted, acres harvested, fish, birds, photos, videos, and audio files) are considered to be raw data.
5
Software tools
________ such as spreadsheets and databases can be used to filter, organize, and search the data.
6
software program
The ________ takes in the binary value and interprets it as a color, text value, or number, based on what the program is expecting.
7
actual data
Changing, adding, or deleting metadata does not impact or change the ________ in any way.
8
Binary
is the base-2 number system, written with only the digits 0 and 1, that computers use to represent data.
9
Abstraction
is a concept that is a little hard for many students to grasp.
10
**Letters**
In addition to the numbers we just reviewed, binary numbers can also represent ______ for text fields.
11
colors
Computer monitors work the same way and add differing amounts of red, green, and blue to create the colors that are displayed.
12
Note
Pad with leading 0s on the left to make a full byte (8 bits).
13
Example 1
Convert 21 to binary
14
Answer
21₁₀ = 00010101₂
15
Example 1
Convert 00011011 to decimal
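The two conversion examples above can be checked with a short Python sketch. The function names `to_byte_string` and `to_decimal` are made up for this illustration and do not come from the chapter.

```python
def to_byte_string(n: int) -> str:
    """Return n as an 8-character binary string, padded with leading 0s."""
    if not 0 <= n <= 255:
        raise ValueError("value does not fit in one byte")
    return format(n, "08b")  # "08b": binary, width 8, padded with zeros


def to_decimal(bits: str) -> int:
    """Add up the place value (a power of 2) of every 1 bit."""
    total = 0
    for position, bit in enumerate(reversed(bits)):
        if bit == "1":
            total += 2 ** position
    return total


print(to_byte_string(21))      # 00010101
print(to_decimal("00011011"))  # 27
```

Reading 00011011 from the right, the 1 bits sit in the 1s, 2s, 8s, and 16s places, so the value is 1 + 2 + 8 + 16 = 27.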
16
If we simply had a binary number
00101001, we would not know what it represented.
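A small Python sketch of the same idea: one byte gives different results depending on what the program expects. The `interpret` function and the gray-level color rule are assumptions made for this illustration, not something the chapter defines.

```python
def interpret(bits: str, expecting: str):
    """Interpret one byte according to what the program is expecting."""
    value = int(bits, 2)  # read the string as a base-2 number
    if expecting == "number":
        return value
    if expecting == "text":
        return chr(value)  # the character with that code
    if expecting == "color":
        # Hypothetical rule: use the byte as a gray level for red, green, and blue.
        return (value, value, value)
    raise ValueError("unknown interpretation: " + expecting)


print(interpret("00101001", "number"))  # 41
print(interpret("00101001", "text"))    # )
print(interpret("00101001", "color"))   # (41, 41, 41)
```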
17
Data Compression
Lossless and Lossy
18
Cleaning
Computers "clean" data.
19
Filtering
Computers filter data easily.
20
Classifying
Grouping data with similar features and values helps computers make sense of large datasets.
21
Bias
This can unintentionally be present in data.
22
Patterns
Data analysis starts with a hypothesis or question to check.
23
**Machine Instructions**
A binary value could be a number, text, color, instruction, or other representation.
24
Overflow errors
occur in computers when the integer to be represented needs more bits than the programming language can represent.
25
fixed number
of bits are assigned to hold integers in many programming languages.
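Python's own integers grow as needed and do not overflow, so the sketch below imitates a fixed 8-bit unsigned integer by keeping only the lowest 8 bits. Wrapping around is just one possible behavior and is assumed here for illustration; some languages report an error instead. The names `fits` and `store_unsigned` are made up.

```python
def fits(value: int, bits: int = 8) -> bool:
    """True if value can be represented as an unsigned integer with `bits` bits."""
    return 0 <= value < 2 ** bits


def store_unsigned(value: int, bits: int = 8) -> int:
    """Keep only the lowest `bits` bits, the way a fixed-width slot would."""
    return value & ((1 << bits) - 1)


print(fits(255), store_unsigned(255))  # True 255  (largest 8-bit value)
print(fits(256), store_unsigned(256))  # False 0   (needs a 9th bit, wraps to 0)
print(fits(300), store_unsigned(300))  # False 44  (300 - 256)
```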
26
Rounding errors
occur because of the way numbers with decimal points are stored in the computer.
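A common way to see a rounding error, shown here in Python: 0.1 and 0.2 cannot be stored exactly in binary, so their sum is slightly off from 0.3. Comparing with a tolerance, as `math.isclose` does, is one usual workaround.

```python
import math

total = 0.1 + 0.2

print(total)                     # 0.30000000000000004
print(total == 0.3)              # False
print(math.isclose(total, 0.3))  # True
```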
27
Analog data
is a continuous stream of data values.
28
Lossless compression
techniques allow the original image to be restored.
29
Lossy compression
techniques lose some data in the compression process.
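The difference between the two kinds of compression can be sketched in Python. Run-length encoding stands in for a lossless technique and rounding pixel values stands in for a lossy one; both are simple illustrations chosen here, not methods named in the chapter, and the pixel row is made-up data.

```python
def rle_encode(pixels):
    """Lossless: store each run of equal values as a (value, count) pair."""
    runs = []
    for p in pixels:
        if runs and runs[-1][0] == p:
            runs[-1] = (p, runs[-1][1] + 1)
        else:
            runs.append((p, 1))
    return runs


def rle_decode(runs):
    """Rebuild the exact original list from the (value, count) pairs."""
    return [p for p, count in runs for _ in range(count)]


def quantize(pixels, step=16):
    """Lossy: round each value down to a multiple of `step`; detail is lost."""
    return [(p // step) * step for p in pixels]


row = [200, 200, 200, 201, 203, 203, 90, 90]  # made-up pixel values

print(rle_decode(rle_encode(row)) == row)  # True: the original is restored
print(quantize(row))                       # [192, 192, 192, 192, 192, 192, 80, 80]
print(rle_encode(quantize(row)))           # [(192, 6), (80, 2)]
```

After quantizing, the row compresses to two runs instead of four, but the original values 200, 201, and 203 can no longer be told apart.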
30
Computers
enable us to process data to turn it into information for decision making and research.
31
Data
collected from all types of events—such as visits, searches, inquiries, orders, returns, temperatures, scores, attendees, acres planted, acres harvested, fish, birds, photos, videos, and audio files—are considered to be raw data.
32
**Cleaning**
Computers "clean" data.
33
Data cleaning
can also change "dr." to "Drive" for consistency.
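A minimal Python sketch of this kind of cleaning, assuming the text is a street address (so "dr." never means "Doctor"). The function name `clean_address` and the sample addresses are made up for illustration.

```python
import re


def clean_address(raw: str) -> str:
    """Tidy spacing and spell out the street abbreviation 'dr.' as 'Drive'."""
    text = " ".join(raw.split())  # collapse repeated spaces, trim the ends
    # Match "dr" as a whole word, with or without a trailing period, in any case.
    return re.sub(r"\bdr\b\.?", "Drive", text, flags=re.IGNORECASE)


print(clean_address("12  Oak dr."))    # 12 Oak Drive
print(clean_address("45 Elm Dr"))      # 45 Elm Drive
print(clean_address("9 Drake Drive"))  # 9 Drake Drive (unchanged)
```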
34
**Filtering**
Computers filter data easily.
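As a rough illustration of filtering in Python, the sketch below keeps only the records that meet a condition. The readings and the 90-degree cutoff are made-up example values.

```python
# Made-up temperature readings for illustration.
readings = [
    {"city": "Austin", "temp_f": 101},
    {"city": "Boston", "temp_f": 72},
    {"city": "Phoenix", "temp_f": 110},
    {"city": "Seattle", "temp_f": 65},
]

# Keep only the readings at or above 90 degrees.
hot = [r for r in readings if r["temp_f"] >= 90]

print([r["city"] for r in hot])  # ['Austin', 'Phoenix']
```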
35
**Classifying**
Grouping data with similar features and values helps computers make sense of large datasets.
36
Groupings
may use one or more criteria.
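Grouping by one criterion or by several can be sketched in Python as follows. The helper `group_by` and the order records are hypothetical and exist only for this example.

```python
from collections import defaultdict


def group_by(records, *keys):
    """Group records by the values of one or more keys."""
    groups = defaultdict(list)
    for record in records:
        label = tuple(record[k] for k in keys)  # one entry per criterion
        groups[label].append(record)
    return dict(groups)


# Made-up order records for illustration.
orders = [
    {"region": "East", "status": "returned", "total": 20},
    {"region": "East", "status": "shipped", "total": 35},
    {"region": "West", "status": "shipped", "total": 15},
    {"region": "East", "status": "shipped", "total": 50},
]

by_region = group_by(orders, "region")
by_region_and_status = group_by(orders, "region", "status")

print({k: len(v) for k, v in by_region.items()})
# {('East',): 3, ('West',): 1}
print({k: len(v) for k, v in by_region_and_status.items()})
# {('East', 'returned'): 1, ('East', 'shipped'): 2, ('West', 'shipped'): 1}
```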
37
**Bias**
This can unintentionally be present in data.
38
**Patterns**
Data analysis starts with a hypothesis or question to check.
39
data mining
Computers are able to identify patterns in data that people either cannot recognize or cannot see because they cannot process enough data.
40
correlation
may not mean one thing caused the other.
41
Scalability
is the ability to increase the capacity of a resource without having to go to a completely new solution, and for that resource to continue to operate at acceptable levels when the increased capacity is being added.
42
Metadata
is data that describes data and can help others find the data and use it more effectively.
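One way to picture metadata in Python is to keep it next to the data but separate from it. In the sketch below, the `photo` record and its field names are made up for illustration; changing, adding, or deleting metadata leaves the data bytes untouched.

```python
# A made-up photo record: the data itself plus metadata that describes it.
photo = {
    "data": bytes([255, 0, 0, 0, 255, 0]),  # the actual pixel bytes
    "metadata": {"author": "unknown", "taken": "2024-01-01"},
}

before = bytes(photo["data"])  # copy of the data before any metadata edits

photo["metadata"]["author"] = "A. Student"  # change a metadata value
photo["metadata"]["location"] = "school"    # add a metadata value
del photo["metadata"]["taken"]              # delete a metadata value

print(photo["data"] == before)  # True: the data is unchanged
```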
43
Charts, tables, and other graphics
help summarize data visually.
44
Search tools and filtering systems
are needed to help analyze the data and recognize patterns.