Data
information collected about the physical world (numbers, words, measurements, observations, etc.) that is in a computer-readable form
human-readable information
Information like books, notebooks, physical photographs, voice
binary numbers
What is computer readable information?
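As a minimal sketch of what "binary numbers" means here, the snippet below shows how a short piece of text can be viewed as binary digits. The sample string is a hypothetical choice and UTF-8 is assumed as the encoding; neither comes from the cards.

```python
# Hypothetical sample text; any string would work.
text = "Hi"

# Encode the text as bytes (UTF-8 is an assumption), then format each
# byte as eight binary digits.
bits = [format(byte, "08b") for byte in text.encode("utf-8")]

print(bits)  # ['01001000', '01101001']
```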
song files, video files, image files, text files, spreadsheet measurements, and statistics
examples of human-readable data converted into digital form
write programs to store, process, manipulate and visualize data
What can be done once data is structured in a uniform way?
input data
any information, or data, that is sent to a computer for processing (spreadsheets, keyboard inputs, HTTP requests)
take in input data, analyze and manipulate it, and produce output data
what do programs do?
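The input, processing, and output steps can be sketched roughly as follows. The function name and the temperature readings are hypothetical, chosen only for illustration.

```python
def average_temperature(readings):
    """Processing step: reduce the input readings to a single value."""
    return sum(readings) / len(readings)

# Input data: hypothetical temperature measurements in degrees Celsius.
input_data = [21.5, 22.0, 20.5]

# Output data: the result produced by running the program.
output_data = average_temperature(input_data)
print(output_data)
```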
output data
the data resulting from a computer run (sounds, images, HTTP response)
creation of knowledge
what do data and information facilitate?
extract info, identify trends, make predictions, make connections, recognize problems
what does digital data allow us to do?
computing tools
what is essential when working with large data sets in order to extract useful info and gain knowledge?
Computing tools
a computer program that provides ways to search, filter, store, and visualize data
databases and database languages
find solution: storing and processing massive amounts of data efficiently is challenging
encrypt and anonymize data
find solution: keeping data private is challenging
read privacy policy
find solution: not everyone knows how much of their personal data is being tracked
digital universe
all data that has been created and stored
33%
what amount of info could be useful if appropriately tagged and analyzed?
worthless
what is data as raw and unorganized facts?
information
potentially valuable concepts based on data
knowledge
what we understand based upon information
wisdom
the effective use of knowledge in decision making
data visualization
using charts, graphs, or images to visualize complex data
easier to understand and extract useful info, faster, useful in all fields
why visualize data?
Charles Joseph Minard
Made famous visualizations of Napoleon's losses during the Russian campaign of 1812. One of the first to develop information graphics
19th century
When was the pie chart invented?
17th century
When were maps invented?
process large amounts of data quickly, constantly update to stay accurate, add interactivity, blend science and art, rapidly changing
how has computing been beneficial in data visualizations?
table
type of data visualization that is great for showing precise data
pie chart
type of data visualization that is great for showing percentages of the whole
line chart
type of data visualization that is great for showing trend over time
bar chart
type of data visualization that is great for comparing different categories
Histogram
type of data visualization that is great for showing the frequency of events
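One way to see what a histogram shows is to count how often each value occurs and draw one bar per value. This is only a text-based sketch; the login counts are made-up sample data.

```python
from collections import Counter

# Hypothetical data: number of logins observed on each of ten days.
events = [3, 1, 3, 2, 3, 1, 2, 3, 1, 3]

# Count how often each value occurs.
frequencies = Counter(events)

# Print one bar per distinct value; bar length equals its frequency.
for value in sorted(frequencies):
    print(value, "#" * frequencies[value])
```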
widespread access to knowledge
what do public data sets and data visualizations allow for?
surveys, sensors, transactional data from credit cards, websites, crowdsourcing
ways data is collected
Sensors
Input devices used to measure physical traits, such as sound, heat, or light.
local disk memory, databases, encrypted (if sensitive)
how is data stored?
database query languages and data APIs
what allows us to ask a database for the data we want in our programs?
database query languages
computer languages used to make queries in databases and information systems (structured query language)
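A small illustration of a query language in use, assuming Python's built-in `sqlite3` module and an in-memory database. The table name, column names, and rows are hypothetical.

```python
import sqlite3

# Create a temporary in-memory database with a hypothetical table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (city TEXT, temp_c REAL)")
conn.executemany(
    "INSERT INTO readings VALUES (?, ?)",
    [("Oslo", 4.0), ("Cairo", 29.5), ("Lima", 19.0)],
)

# The SQL query asks the database for only the rows we want.
rows = conn.execute(
    "SELECT city FROM readings WHERE temp_c > ? ORDER BY city", (15.0,)
).fetchall()
conn.close()

print(rows)  # [('Cairo',), ('Lima',)]
```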
data sanitization
Throwing out data that is not well formatted
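A rough sketch of that idea: keep only the entries that parse as numbers and discard the rest. The raw values and the helper name `parse_or_none` are hypothetical.

```python
# Hypothetical raw measurements, some of them badly formatted.
raw = ["12.5", "n/a", "7", "", "3.2kg", "9.0"]

def parse_or_none(text):
    """Return the value as a float, or None if it is not a valid number."""
    try:
        return float(text)
    except ValueError:
        return None

# Keep only the entries that parsed successfully.
parsed = (parse_or_none(item) for item in raw)
clean = [value for value in parsed if value is not None]

print(clean)  # [12.5, 7.0, 9.0]
```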
numbers
what is easier to process?: numbers or text
Data Limitations
There are limitations to what we can conclude from examining data and data visualizations
omitting data, not starting graph at zero, breaking conventions, correlation doesn't imply causation
misleading visualizations
Omitting Data
leaving out data points to make a fake trend
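To make the effect concrete, the sketch below drops every point that is lower than the last one shown, so fluctuating data appears to rise steadily. The sales figures are invented for illustration.

```python
# Hypothetical monthly sales that actually go up and down.
monthly_sales = [10, 14, 9, 15, 8, 16]

# Misleading selection: keep a point only if it is higher than the
# last point kept, silently omitting every dip.
shown = [monthly_sales[0]]
for value in monthly_sales[1:]:
    if value > shown[-1]:
        shown.append(value)

print(shown)  # [10, 14, 15, 16]
```

The full series has three dips, but none of them survive in `shown`.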
Metadata
data about data; describes how and when and by whom a particular set of data was collected, and how data is formatted
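For illustration only, a data set and its metadata might be kept side by side like this. Every field name and value below is a hypothetical placeholder, not a standard format.

```python
# Hypothetical data set with its metadata stored alongside the values.
dataset = {
    "metadata": {
        "collected_by": "example field team",  # by whom
        "collected_on": "2020-01-15",          # when
        "method": "handheld thermometer",      # how
        "format": "degrees Celsius, one reading per hour",
    },
    "values": [21.5, 22.0, 20.5],
}

# The metadata describes the data without being part of the measurements.
for key, description in dataset["metadata"].items():
    print(f"{key}: {description}")
```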