1/46
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Data
Information that is in a computer-readable form
Big Data
Massive amounts of structured and unstructured data that can potentially be mined, examined and used by organizations
Data processing
Converting information that can be understood by a computer, taking data and putting it into a useful format
Useable data
Data that has been processed so that it can be analyzed or used in its current form
Useful data
Data that someone can use to make predictions, describe some process or solve a problem
Data footprint
Receipt gives a bunch of info about the credit card
Data Storage
Relational database link correlate and disburse information
Unpredictable data
Data that is unreliable and non-useable format
Data Visualization
Using charts, graphs, or images to visualize complex data
Data collection
Gathering and measuring information on targeted variables in order to answer questions and evaluate outcomes
Unstructured data
Raw data with no connections and/or relationships among data detected - requires more storage space
Structured data
Data that is organized in some fashion - utilizes less storage space
Data set
A collection of numbers or values that relate to a particular subject usually portrayed in a relational database table
Relational database
A collection of data organized and retrieved in various ways between database tables
Knowledge
Data that is processed, interpreted and organized to become meaningful
Data Cycle
Collection, Extraction, Storage
Screen Scraping
Converting data that was human readable into data that computers can read
Input data
Data that is fed into a program
Output data
Data that is produced by a program
Data + computation
Knowledge
Google spiders
Go out to visit websites to gather information
Logs
Data that can be structured into useful information
generation loss
the loss of quality between copies of data, usually analog formats (copies of copies) - unlike digital data where copies are identical as long as the format and size remain the same.
data vs. knowledge
data are figures and facts while knowledge is data that is processed, interpreted and organized to become meaningful.
data persistence
information that is not often accessed and rarely modified. Data that remains stored after a user has deleted it.
data storage
static storage of various capacities and speed such as CDs', DVD's, flash memory, main memory, cache memory, magnetic tape, etc.
indexing
the specific organization and method of keeping track of data.
cache
a memory location to store active data temporarily to shorten data access times and reduce latency.
Concordances
a list of individual words and location pairings.
analytics
information resulting from the systematic analysis of data or statistics.
Data Limitations
Visualizations can be misleading by skewing the axes or labels, or leaving out relevant data.
Truncated Y-axis
Not starting the y-axis at zero. Shows you data but could look bad since it's manipulated to look bad.
Correlation
A connection between two things.
descriptive analytics
information about collected data using statistics (mean, median, mode, range) which describe circumstances.
predictive analytics
information about future events based on collected and analyzed data.
prescriptive analytics
the use of advanced processes and tools to analyze data and content to recommend the optimal course of action or strategy moving forward.
confirmatory analysis
think Scientific Method; confirms or rejects your hypothesis and checks to see if the two are correlated.
exploratory analysis
think AI coupled with Big Data; takes big data and asks AI to find the correlations.
Correlation vs. Causation
Correlation indicates a relationship between two variables, while causation indicates that one variable directly affects another.
Hemline index
Theory about the rise and fall of the economy is correlated with skirt lengths being longer or shorter.
Table
Precise data.
Pie Chart
Percentages of the whole.
Line Chart
Trends over time.
Bar Chart
Different Categories.
Histogram
Frequency of events.
Heat Map
Showing area of risk/reward.
Interactive Map
Allows us to interact with the map.