5.1-5.5
Big Data
large amounts of structured and unstructured data that can potentially be mined, examined, and used by organizations.
Data Processing
converting information that can be understood by a computer
Useable Data
data that is capable of being used - i.e., data that has been processed so that it can be analyzed or used in its current form
Useful Data
can someone use the data to make predictions, describe some process or solve a problem
Data Collection
gathering and measuring information on targeted variables in order to answer questions and evaluate outcomes
Collaboration
working together to facilitate the application of multiple perspectives and diverse talents and skills
Unstructured Data
raw data with no connections and/or relationships among data detected - requires more storage space
Structured Data
data that is organized in some fashion - utilizes less storage space
Data Set
a collection of numbers or values that relate to a particular subject usually portrayed in a relational database table. Example: column header and row contents for test scores for each student.
Knowledge Extraction
knowledge created from structured relational databases
Relational Database
a collection of data organized and retrieved in various ways between database tables
Data
figures and facts
Information
data that is processed, interpreted and organized to become meaningful
Data Storage
the retention and retrieval of data
Screen Scraping
extracting information that is formatted for human use and converting it into a format for computer use (example: scanner or pdf converter)
Curation of Information
gathering information pertaining to a specific topic
Extraction
retrieving or processing data from unstructured data sources for further data processing, storage and/or analysis.
Knowledge Extraction
knowledge created from structured relational databases
Spiderbot
a virtual robot (program) that visits web sites and reads information to create entries for a search engine index
Generation Loss
the loss of quality between copies of data, usually analog formats (copies of copies) - unlike digital data where copies are identical as long as the format and size remain the same.
Browser
a computer program used to navigate and search the World Wide Web and display HTML files in a graphical format (example: Google Chrome, Internet Explorer, Mozilla Firefox)
Metadata
descriptive data about an image, a web page, or other complex objects (data about data)
Data vs Information
data are figures and facts while information is data that is processed, interpreted and organized to become meaningfulDa
Data Persistence
information that is not often accessed and rarely modified. Data that remains stored after a user has deleted it
Data Storage
static storage of various capacities and speed such as CD's, DVD's, flash memory, main memory, cache memory, magnetic tape, etc.
Indexing
the specific organization and method of keeping track of data
Filter Bubble
limiting a user's perspective by having an algorithm selectively determine what type of information a user would like to see based on past search history and behavior.
Privacy Concerns
digitization of personal data means your data is now easier to reproduce, share, sell and access.
Utility
the measurement of usefulness - example: sharing personal digital data in order to receive something of value in return.
Cache
a memory location to store active data temporarily to shorten data access times and reduce latency.