working together
big data
extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions.
data collection
the process of gathering and analyzing data
usable data
data that can be understood without additional information
useful data
data that can be used for reasoning and discussion
data set
a collection of related sets of information that is composed of separate elements but can be manipulated as a unit by a computer.
any technique for creating images, diagrams, or animations to communicate a message.
a series of interconnected things or events
establishes that the user is a human
citizen science
the collection and analysis of data relating to the natural world by members of the general public
regression analysis
a set of statistical processes for estimating the relationships among variables.
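As a rough illustration of regression analysis, the sketch below fits a straight line to two variables with ordinary least squares, one of the simplest such statistical processes. The data (`hours`, `scores`) and the helper name `fit_line` are made up for this example.

```python
# Hypothetical data: hours studied (x) and quiz score (y).
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 54.0, 56.0, 58.0, 60.0]


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(hours, scores)
print(slope, intercept)  # prints: 2.0 50.0
```

The slope estimates how much the score changes per extra hour in this invented data set; real data would rarely fit a line this exactly.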
the practice of obtaining information or input into a task or project by enlisting the services of a large number of people, either paid or unpaid, typically via the Internet.
human computation
is a computer science technique in which a machine performs its function by outsourcing certain steps to humans, usually as microwork.
the act of removing something by effort or force
curation of information
the process of gathering information relevant to a particular topic or area of interest
the retention of retrievable data on a computer or other electronic system; memory.
a hardware or software component that stores data so that future requests for that data can be served faster.
unstructured data
contain everything collected in "raw" form, but connections and relationships among strands of data are both harder to trace and much slower to process than structured data sets.
structured data
easy to access and organize, but may lack the big picture and details that unstructured data may possess.
screen scraping
the conversion of data formatted for human use to a format more easily used by automated computer processes.
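One way to picture screen scraping is to take a page fragment laid out for human readers and turn it into rows a program can use. The sketch below uses Python's standard `html.parser` module; the `PAGE` string and the `CellScraper` class are invented for illustration and assume a simple table with no nested markup.

```python
from html.parser import HTMLParser

# Hypothetical page fragment formatted for human readers.
PAGE = (
    "<table>"
    "<tr><td>Oslo</td><td>4</td></tr>"
    "<tr><td>Cairo</td><td>31</td></tr>"
    "</table>"
)


class CellScraper(HTMLParser):
    """Collects the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])      # start a new row
        elif tag == "td":
            self._in_cell = True      # text that follows belongs to a cell

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self.rows[-1].append(data.strip())


scraper = CellScraper()
scraper.feed(PAGE)
scraper.close()
rows = scraper.rows
print(rows)  # prints: [['Oslo', '4'], ['Cairo', '31']]
```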
knowledge extraction
the creation of knowledge from structured and unstructured sources. The resulting knowledge needs to be in a machine-readable and machine-interpretable format and must represent knowledge in a manner that facilitates inferencing.
data processing
a series of operations on data, especially by a computer, to retrieve, transform, or classify information.
spider bot
a program that searches the web in advance so that a search engine can give you information faster
relational database
a database structured to recognize relations among stored items of information.
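To make the idea of relations among stored items concrete, here is a small sketch using Python's built-in `sqlite3` module with an in-memory database. The `authors` and `books` tables and their contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # temporary database, nothing is saved

# Two tables; books.author_id relates each book to a row in authors.
conn.execute("CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE books (title TEXT, author_id INTEGER REFERENCES authors(id))"
)
conn.executemany("INSERT INTO authors VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])
conn.executemany(
    "INSERT INTO books VALUES (?, ?)", [("Notes", 1), ("Compilers", 2)]
)

# The JOIN follows the relation to combine items from both tables.
result = conn.execute(
    "SELECT books.title, authors.name "
    "FROM books JOIN authors ON books.author_id = authors.id "
    "ORDER BY books.title"
).fetchall()
print(result)  # prints: [('Compilers', 'Grace'), ('Notes', 'Ada')]
conn.close()
```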
data vs. information
data are simply facts or figures: bits of information, but not information itself. When data are processed, interpreted, organized, structured or presented so as to make them meaningful or useful, they are called information. Information provides context for data.
generation loss
the loss of quality that builds up as data is copied from one copy to the next.
data persistence
denotes information that is infrequently accessed and not likely to be modified.
data storage
storing of data
gathering and recording data in an index
filter bubble
keeps you within a limited range of information based on your past searches and biases
privacy concerns
the right or mandate of personal privacy concerning the storing, re-purposing, provision to third parties, and displaying of information pertaining to oneself via the Internet.
automated summarization
is the process of shortening a text document with software, in order to create a summary with the major points of the original document
Trading personal information for access to more information
is the discovery, interpretation, and communication of meaningful patterns in data
descriptive analytics
provides summary information, such as statistics, about the data that has been collected.
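A short sketch of descriptive analytics, assuming a made-up list of daily visit counts (`visits`), could compute a few summary statistics with Python's standard `statistics` module:

```python
import statistics

# Hypothetical collected data: visits per day over eight days.
visits = [2, 4, 4, 4, 5, 5, 7, 9]

summary = {
    "count": len(visits),
    "mean": statistics.mean(visits),
    "median": statistics.median(visits),
    "std_dev": statistics.pstdev(visits),  # population standard deviation
    "min": min(visits),
    "max": max(visits),
}
print(summary)
```

These numbers only describe what was already collected; they make no claim about future values.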
distribution of means
taking the means of samples of the data and finding the standard deviation of those means
predictive analytics
makes predictions (hypotheses) based on inductive reasoning from past information.
based on descriptive analytics