big data
large amounts of structured and unstructured data that can potentially be mined, examined, and used by organizations.
data processing
converting information into a form that can be understood by a computer.
useable data
data that is capable of being used - i.e., data that has been processed so that it can be analyzed or used in its current form.
useful data
data that someone can use to make predictions, describe a process, or solve a problem.
data collection
gathering and measuring information on targeted variables in order to answer questions and evaluate outcomes.
collaboration
working together to facilitate the application of multiple perspectives and diverse talents and skills.
unstructured data
raw data with no detected connections or relationships among the data - requires more storage space.
structured data
data that is organized in some fashion - utilizes less storage space.
data set
a collection of numbers or values that relate to a particular subject, usually portrayed in a relational database table. Example: column headers and row contents for each student's test scores.
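A minimal sketch of the test-score example, assuming a header row plus one row per student. The student names and scores are made up for illustration.

```python
# Hypothetical test-score data set: one header row plus one row per student.
header = ("student", "test_1", "test_2")
rows = [
    ("Ana", 88, 92),
    ("Ben", 75, 81),
    ("Cy", 93, 89),
]

# Use the column header to find the position of a column, then read that
# value out of every row.
col = header.index("test_2")
test_2_scores = [row[col] for row in rows]
print(test_2_scores)
```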
knowledge extraction
knowledge created from structured relational databases.
relational database
a collection of data organized into tables that can be related to one another and retrieved in various ways.
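One way to picture this is two related tables queried together. The sketch below uses Python's built-in sqlite3 module with an in-memory database; the table names, column names, and rows are assumptions chosen for illustration.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE scores (student_id INTEGER, test TEXT, score INTEGER)")

conn.executemany("INSERT INTO students VALUES (?, ?)", [(1, "Ana"), (2, "Ben")])
conn.executemany(
    "INSERT INTO scores VALUES (?, ?, ?)",
    [(1, "quiz", 88), (2, "quiz", 75), (1, "final", 92)],
)

# The relationship between the tables is the shared student id.
query = """
SELECT s.name, sc.test, sc.score
FROM students AS s
JOIN scores AS sc ON sc.student_id = s.id
ORDER BY s.name, sc.test
"""
joined = conn.execute(query).fetchall()
print(joined)
```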
data
figures and facts
information
information is data that is processed, interpreted and organized to become meaningful.
data storage
the retention and retrieval of data.
screen scraping
extracting information that is formatted for human use and converting it into a format for computer use (example: scanner or PDF converter).
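As a rough illustration, the sketch below takes a small HTML table (formatted for a human reader) and turns it into a list of rows a program can use. It relies on Python's standard html.parser module; the page content and the CellScraper class name are hypothetical.

```python
from html.parser import HTMLParser

class CellScraper(HTMLParser):
    """Collects the text of each table cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.in_cell = False
        self.rows = []

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])      # start a new row
        elif tag in ("td", "th"):
            self.in_cell = True

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self.in_cell = False

    def handle_data(self, data):
        # Keep only text that appears inside a cell of an open row.
        if self.in_cell and self.rows and data.strip():
            self.rows[-1].append(data.strip())

# Hypothetical page fragment written for human viewing.
page = (
    "<table><tr><th>Name</th><th>Score</th></tr>"
    "<tr><td>Ana</td><td>88</td></tr></table>"
)
scraper = CellScraper()
scraper.feed(page)
scraper.close()
scraped_rows = scraper.rows
print(scraped_rows)
```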
curation of information
gathering information pertaining to a specific topic.
Extraction
retrieving or processing data from unstructured data sources for further data processing, storage and/or analysis.
Spiderbot
virtual robot (program) that visits web sites and reads information to create entries for a search engine index.
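The idea can be sketched without touching the network by crawling a made-up, in-memory set of pages. The PAGES dictionary, the paths, and the crawl function are all assumptions for illustration; a real spiderbot would fetch pages over HTTP and respect each site's crawling rules.

```python
from collections import deque
from html.parser import HTMLParser

# Hypothetical in-memory "web" so the sketch runs without network access.
PAGES = {
    "/home": '<a href="/about">About</a> welcome home',
    "/about": '<a href="/home">Home</a> about data',
}

class LinkAndTextParser(HTMLParser):
    """Collects outgoing links and the words on one page."""

    def __init__(self):
        super().__init__()
        self.links, self.words = [], []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.links += [v for k, v in attrs if k == "href" and v]

    def handle_data(self, data):
        self.words += data.lower().split()

def crawl(start):
    """Visit every page reachable from start and map each word to its pages."""
    index, seen, queue = {}, {start}, deque([start])
    while queue:
        url = queue.popleft()
        parser = LinkAndTextParser()
        parser.feed(PAGES.get(url, ""))
        parser.close()
        for word in parser.words:
            index.setdefault(word, set()).add(url)
        for link in parser.links:
            if link not in seen:      # avoid visiting a page twice
                seen.add(link)
                queue.append(link)
    return index

index = crawl("/home")
print(sorted(index["data"]))
```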
Generation loss
the loss of quality between successive copies of data, usually in analog formats (copies of copies) - unlike digital data, where copies are identical as long as the format and size remain the same.
Browser
computer program used to navigate and search the World Wide Web and display HTML files in a graphical format (examples: Google Chrome, Internet Explorer, Mozilla Firefox).
Metadata
descriptive data about an image, web page, or other complex objects (data about data).
Data vs. information
data are figures and facts, while information is data that is processed, interpreted and organized to become meaningful.
Data persistence
information that is not often accessed and rarely modified. Data that remains stored after a user has deleted it.
Indexing
the specific organization and method of keeping track of data.
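A small sketch of one common form of indexing: a lookup table that records where each item lives, so it can be found without scanning everything. The records and function names are hypothetical.

```python
# Hypothetical records stored in arrival order.
records = [
    {"id": 101, "name": "Ana"},
    {"id": 102, "name": "Ben"},
    {"id": 103, "name": "Cy"},
]

def find_by_scan(record_id):
    """Without an index: check every record until one matches."""
    for record in records:
        if record["id"] == record_id:
            return record
    return None

# The index keeps track of each record's position, keyed by id.
index_by_id = {record["id"]: pos for pos, record in enumerate(records)}

def find_by_index(record_id):
    """With an index: jump straight to the stored position."""
    pos = index_by_id.get(record_id)
    return None if pos is None else records[pos]

print(find_by_index(102))
```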
Filter bubble
limiting a user’s perspective by having an algorithm selectively determine what type of information a user would like to see based on past search history & behavior.
Privacy concerns
digitization of personal data means your data is now easier to reproduce, share, sell and access.
Utility
the measurement of usefulness (example: sharing personal digital data in order to receive something of value in return).
Cache
a memory location to store active data temporarily to shorten data access times and reduce latency.
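To make the idea concrete, the sketch below uses functools.lru_cache from Python's standard library to keep recent results in memory. The slow_lookup function and the call counter are stand-ins for a genuinely slow data source.

```python
from functools import lru_cache

call_count = 0  # counts how often the "slow" work actually runs

@lru_cache(maxsize=128)
def slow_lookup(key):
    """Stand-in for an expensive fetch; results are kept in the cache."""
    global call_count
    call_count += 1
    return key.upper()

slow_lookup("a")   # miss: does the work and stores the result
slow_lookup("a")   # hit: answered from the cache, no work done
slow_lookup("b")   # miss: a new key

print(call_count)
print(slow_lookup.cache_info().hits)
```

The second call for the same key never reaches the function body, which is where the shorter access time comes from.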
reCAPTCHA
a digital tool used to deter automated form-filling and exploitation of web-based registration systems.
Crowdsourcing
obtaining information from a large number of people, either paid or unpaid, voluntary or involuntary.
Human computation
using human cognition to provide computational data via techniques such as crowdsourcing.
Descriptive analysis
information about collected data, expressed as statistics (mean, median, mode, range) that describe circumstances.
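The four statistics named above can be computed with Python's standard statistics module. The scores below are invented for illustration.

```python
import statistics

# Hypothetical collected data: five test scores.
scores = [70, 85, 85, 90, 100]

mean_score = statistics.mean(scores)      # arithmetic average
median_score = statistics.median(scores)  # middle value when sorted
mode_score = statistics.mode(scores)      # most frequent value
score_range = max(scores) - min(scores)   # spread from lowest to highest

print(mean_score, median_score, mode_score, score_range)
```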
Predictive analytics
information about future events based on collected and analyzed data.
Analytics
information resulting from the systematic analysis of data or statistics.
Automated summarization
summarizing data to a simpler state by removing redundant or less significant details.
Visualization
the representation of information using a chart, diagram, image, etc.
regression analysis
the forecasting of change through statistical analysis of the strength of the relationship between one dependent variable and other changing independent variables.
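A minimal sketch of the simplest case, assuming one independent variable and a straight-line relationship fitted by ordinary least squares. The data points and the fit_line helper are made up for illustration; real analyses often involve several independent variables.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical observations: x is the independent variable, y the dependent one.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_line(xs, ys)
forecast = slope * 6 + intercept   # forecast y for an unseen x value
print(slope, intercept, forecast)
```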
Models
physical or virtual representations of an object.
Simulations
testing a hypothesis about a situation using a model.
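As a sketch, take the hypothesis "two fair dice sum to 7 about one time in six" and test it with a simple model of a die. The trial count and the fixed seed are arbitrary choices made so the run is repeatable.

```python
import random

def simulate_sevens(trials, seed=0):
    """Model two fair dice and return the fraction of rolls that sum to 7."""
    rng = random.Random(seed)   # fixed seed so the run is repeatable
    sevens = 0
    for _ in range(trials):
        roll = rng.randint(1, 6) + rng.randint(1, 6)
        if roll == 7:
            sevens += 1
    return sevens / trials

estimate = simulate_sevens(10_000)
expected = 1 / 6   # the hypothesis being tested
print(estimate, expected)
```

If the simulated fraction lands close to the expected value, the model supports the hypothesis; a large gap would suggest that the hypothesis or the model is wrong.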