Data Visualization

studied byStudied by 9 people
5.0(1)
Get a hint
Hint

General Rules About Pie Charts

1 / 52

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

53 Terms

1

General Rules About Pie Charts

1. appropriate for non technical audiences
2. widely used
3. few categories
4. when data is 100%
5. add labels for percentage
6. Keep it simple

New cards
2

Time

goal is to see the evolution of one or several quantities over time.
- often referred to as time-series data

New cards
3

how to visualize discrete points in time?

- bar graph
- stacked bar graph
- points

New cards
4

Continuous time point

- time series chart
- step chart

New cards
5

why visualize data?

1. Find patterns and see data in context
2. Expand memory
3. Make data accessible to everyone
4. Answer questions (or discover them)5. Make decisions / Persuade others to make decisions

New cards
6

How to se distributions?

- histogram
- continuous density plot
- box plot

New cards
7

Microsoft excel Pros.

Supports processing of data
Compatible with Word and Power Point
Relatively easy to learn
Widely used

New cards
8

Excel Cons

Good for basic visualization - not interactive
Require customization to adhere to design
standards
May not process large dataset (~1GB)

New cards
9

GEOSPATIAL VISUALIZATION TOOLS

ARCGIS
- Built for desktop mapping
- User interface, no coding required
- Used by professional cartographers, graphics departments

- Google, Yahoo, and Microsoft Maps
- Easiest online solution, requires some programming
- Modest maps
- Flash and ActionScript library for tile based maps, coding required
- Other tools such as Tableau and R

New cards
10

PROGRAMMING TOOLS - R

Pros
Free open-source statistical programming language
Built and maintained for statisticians by statisticians.
Capable of both data analysis and data graphics
Libraries used for visualization in R: Graphics, ggplot2, car, lattice, ndtv, plotly
Can write your own functions and packages to make graphics the way you want
Cons
Default chart outputs require design refinements: Lack of titles, Undefined scales for axes
Use R to create graphs and edit and refine using design software: Adobe Illustrator, Inkmap
R is great for exploratory data visualization (analysis) but may not be the best tool for explanatory data visualization (presenting results and storytelling)

New cards
11

PROGRAMMING TOOLS - PYTHON

-pros
Can handle large amounts of data without crashing.
Useful for analyses and heavy computation
Clean and easy to read syntax
Some of pythons data visualization libraries: Matplotlib, seaborn, geoplotlib, ggplot

-cons
Great starting point for data exploration, not very good aesthetically

New cards
12

PROGRAMMING TOOLS - JAVASCRIPT

PROS
- web-based scripting language
- some javascript libraries - d3, rcharts, highcharts, charts.js, dimple.js
- freely available and allow users to create sophisticated web-based visualizations

CONS
- steep learning curve
- require skills in working with HTML and JSON

New cards
13

CHECKLIST FOR DATA VISUALIZATION TOOLS

- preparing data before visualization
- integration
- ease of use
- ease of collaboration
- visualization types
- communication
- performance
- privacy
- price

New cards
14

categorical data

Data that consists of names, labels, or other nonnumerical values

New cards
15

ordinal data

data exists in categories that are ordered but differences cannot be determined or they are meaningless. (Example: 1st, 2nd, 3rd)

New cards
16

Quantitative data collection

Measurable, using only factual content

New cards
17

Exploratory

testing a hypothesis (visual confirmation) and mining for patterns,trends, and anomalies (visual exploration)

New cards
18

Explanatory

usually simple everyday visualizations —line charts, bar charts, pies, andscatter plots conveying a single message

New cards
19

NINE VISUAL CUES

position, length, angle, direction, shapes, area, volume, color saturation, color hue

New cards
20

VISUAL CUES - POSITION

-commonly used on scatter plots
-you compare values bases on where other are placed in the coordinate system
-easy to notice outliers and clustering

New cards
21

VISUAL CUES - LENGTH

- Commonly used on bar charts
- Length of bars in bar graph provides visual cues
- The longer the bar, the longer the absolute value

- Start the axis at zero as people visually compare the distance from 0 to the end of the bar

New cards
22

VISUAL CUES - ANGLES

- Commonly used for pie charts
- Commonly used to represents parts of a whole
- Donut charts do not use angles since the center of the circle is cut out -arc lengths are used as visual cue

New cards
23

VISUAL CUES - DIRECTION

-Commonly noticed in line graphs
-Direction provides one basic visual cue
-Direction helps with noticing trends
-Slope be used to signal sharp/drastic changes in direction

New cards
24

VISUAL CUES - AREA AND VOLUME

bigger object represent greater value

New cards
25

VISUAL CUES - SHAPES

-shapes can be used to denote categories and objects
-visually shapes are readily recognized

New cards
26

VISUAL CUES - COLOR

Hue - refers to the different
colors
Saturation - refers to the
density of a given color e.g
gradients
Color can be used to show
categories
Color can be used to
highlight certain aspects of
your data visualization

New cards
27

steps to preparing data

Structuring data
Cleaning data
Aggregating data from different
sources
Validating data

New cards
28

What is special about a bar chart?

- intuitive
- appropriate for non-technical audience
- useful to visualize discrete data
- start axis at zero

New cards
29

wedge

each portion of the pie represents a category of value.

New cards
30

what are the kinds of coordinates?

cartesian, Polar, Geographic,

New cards
31

Types

Linear, Logarithmic, categorical, ordinal, percent, time

New cards
32

what is a histogram

- encodes data using height as the visual cue
- density is on the vertical axis
- horizontal axis has values

New cards
33

Histogram bin size

- bin size changes by dataset
- you want the bin sizes big enough so that you see variability in the data
- not so small that the histogram is too noisy to interpret

New cards
34

Continuous density plot

Like a histogram butis continuous insteadof bins

New cards
35

what are the three types of distributions?

symmetric distribution, left skewed distribution, right skewed distribution

New cards
36

Box Plot

Shows range, median and quartiles of the
data
Uses position and height/length visual cues
You can use multiple box plots to compare
distributions
Less specific than histograms or density
plots

New cards
37

Union

Merging data that is spread acorss several files, sheets, or tables

New cards
38

what are the kinds of unions?

- inner join
- Left Join
- Right Join
- Full outer Join

New cards
39

What does the pill color indicate?

continuous or discrete

New cards
40

what color are discrete pills?

blue

New cards
41

what color are continuous pills?

green

New cards
42

How do dimensions come out?

Dimensions come out onto the view as themselves

New cards
43

How do measures come out?

Measures come out onto the view as aggregates

New cards
44

What is a scatter plot used for?

Often used to visualize the
relationship between two
variables
Scatter plots use position as
the visual cue
Each dot has X- ad Y-
coordinates that match the
axes

New cards
45

correlation

means one thing tends to change a certain way as another thing changes.

New cards
46

Direction

positive or negative correlation

New cards
47

magnitude

strong or weak correlation

New cards
48

coefficient of correlation

quantifies how tightly coupledthe values of two variables are with respect to each other

New cards
49

Scatterplot Matrix

Scatter plot matrix is useful
to see relationships among
multiple variables
Allows comparison across
multiple dimensions
Plot every variable pair and
look for correlation

New cards
50

bubble chart

Allows you to compare 3
variables at once: x variable, y
variable, and area variable
Bubble should be sized based
on area not radius, diameter
or circumference
Example - Hans Rosling's Tool

New cards
51

symbol maps

Specific geographic locations are marked with circles, squares, or custom shapes
Form, size or color of these marks can vary according to a measure or dimension

New cards
52

choropleth maps

Geographic areas are shaded according to a measure or dimension

New cards
53

density maps

Areas of relative concentration are colored intensely, while those with sparse occurrences of the dimensionare colored lightly

New cards

Explore top notes

note Note
studied byStudied by 16 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 13 people
Updated ... ago
5.0 Stars(2)
note Note
studied byStudied by 4 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 11 people
Updated ... ago
5.0 Stars(2)
note Note
studied byStudied by 17 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 11 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 77 people
Updated ... ago
5.0 Stars(2)
note Note
studied byStudied by 130 people
Updated ... ago
4.0 Stars(1)

Explore top flashcards

flashcards Flashcard167 terms
studied byStudied by 8 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard47 terms
studied byStudied by 38 people
Updated ... ago
5.0 Stars(3)
flashcards Flashcard54 terms
studied byStudied by 7 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard20 terms
studied byStudied by 1 person
Updated ... ago
5.0 Stars(1)
flashcards Flashcard21 terms
studied byStudied by 185 people
Updated ... ago
5.0 Stars(7)
flashcards Flashcard46 terms
studied byStudied by 3 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard77 terms
studied byStudied by 2 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard42 terms
studied byStudied by 23 people
Updated ... ago
5.0 Stars(2)