Unit Five: Writing Classes

studied byStudied by 9 people
5.0(1)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 37

38 Terms

1
collaboration
working together
New cards
2
big data
extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions.
New cards
3
data collection
the process of gathering and analyzing data
New cards
4
usable data
data that can understood without additional information
New cards
5
useful data
data that can be used for reasoning and discussion
New cards
6
data set
a collection of related sets of information that is composed of separate elements but can be manipulated as a unit by a computer.
New cards
7
visualization(s)
any technique for creating images, diagrams, or animations to communicate a message.
New cards
8
concatenation
a series of interconnected things or events
New cards
9
ReCAPTCHA
establishes that the user is a human
New cards
10
citizen science
the collection and analysis of data relating to the natural world by members of the general public
New cards
11
regression analysis
a set of statistical processes for estimating the relationships among variables.
New cards
12
crowdsourcing
the practice of obtaining information or input into a task or project by enlisting the services of a large number of people, either paid or unpaid, typically via the Internet.
New cards
13
human computation
is a computer science technique in which a machine performs its function by outsourcing certain steps to humans, usually as microwork.
New cards
14
extraction
the act of removing something by effort or force
New cards
15
curation of information
the process of gathering information relevant to a particular topic or area of interest
New cards
16
storage
the retention of retrievable data on a computer or other electronic system; memory.
New cards
17
cache
a hardware or software component that stores data so that future requests for that data can be served faste
New cards
18
unstructured data
contain everything collected in "raw" form, but connections and relationships among strands of data are both harder to trace and much slower to process than structured data sets.
New cards
19
structured data
easy to access and organize, but may lack the big picture and details that unstructured data may possess.
New cards
20
screen scraping
the conversion of data formatted for human use to a format more easily used by automated computer processes.
New cards
21
knowledge extraction
the creation of knowledge from structured and unstructured sources. The resulting knowledge needs to be in a machine-readable and machine-interpretable format and must represent knowledge in a manner that facilitates inferencing.
New cards
22
data processing
a series of operations on data, especially by a computer, to retrieve, transform, or classify information.
New cards
23
spider bot
searches the web in advance so they can give you info faster
New cards
24
relational database
a database structured to recognize relations among stored items of information.
New cards
25
data vs. information
data are simply facts or figures — bits of information, but not information itself. When data are processed, interpreted, organized, structured or presented so as to make them meaningful or useful, they are called information. Information provides context for data.
New cards
26
generation loss
refers to the process of a qualitative loss in successively copied data.
New cards
27
data persistence
denotes information that is infrequently accessed and not likely to be modified.
New cards
28
data storage
storing of data
New cards
29
indexing
gathering and recording data in an index
New cards
30
filter bubble
Keeps you within limited information range based on searches and bias
New cards
31
privacy concerns
the right of mandating personal privacy concerning storing, re-purposing, provision to third parties, and displaying of information pertaining to oneself via the Internet.
New cards
32
automated summarization
is the process of shortening a text document with software, in order to create a summary with the major points of the original document
New cards
33
utility
Trading personal information for access to more information
New cards
34
analytics
is the discovery, interpretation, and communication of meaningful patterns in data
New cards
35
descriptive analytics
gives information like statistics on the data it collects.
New cards
36
distribution of means
using data means and finding the standard deviation
New cards
37
predictive analytics
give their hypothesis based on inductive reasoning or past information.
New cards
38
based on descriptive analytics
New cards
robot