7. Visualizing Multivariate Data

Why do we need dedicated multivariate visualization techniques?

Because mapping many attributes onto a single scatterplot via channels (size, color, shape) does not scale: channels interfere, attributes become unreadable, and interpretation collapses when you move beyond ~4-5 attributes.

What is the defining condition for using a ternary plot?

The three variables must be parts of a whole: they must sum to a constant (1 or 100%). This constraint is what makes barycentric coordinates meaningful.
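As a minimal sketch of how barycentric coordinates place a composition in a 2D triangle: the function name and the corner layout (A bottom-left, B bottom-right, C top) are assumptions made for illustration, not something the cards specify.

```python
import math

def ternary_to_xy(a, b, c):
    """Map a three-part composition to 2D coordinates in an equilateral triangle.

    Assumed corner layout: A at (0, 0), B at (1, 0), C at (0.5, sqrt(3)/2).
    """
    total = a + b + c
    if total <= 0:
        raise ValueError("parts must sum to a positive total")
    # Normalise so the parts sum to 1; they then act as barycentric weights.
    a, b, c = a / total, b / total, c / total
    x = b + 0.5 * c                 # weighted average of the corner x-coordinates
    y = (math.sqrt(3) / 2) * c      # only corner C has a non-zero y-coordinate
    return x, y
```

A pure-A composition lands on corner A, and equal parts land on the triangle's centroid. The mapping only makes sense because the three parts share one total.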

Why are 3D scatterplots discouraged, and how do ternary plots avoid that issue?

3D scatterplots suffer from non-anchored points: without a visual link to the axes or a ground plane, the depth of a point cannot be judged reliably.

Ternary plots stay in 2D while still depicting three variables.

What is a Scatterplot Matrix?

A matrix of 2D scatterplots showing every pairwise attribute combination for an overview of relationships, correlations, and clusters.

What are the two main types of unused space in a Scatterplot Matrix, and how can they be used?

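The main diagonal, where each attribute would be plotted against itself; it is often filled with histograms or density plots.

The mirrored lower triangle, which repeats the upper triangle with swapped axes; it can show density contours or other summaries instead.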
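The cell roles described above can be sketched as a small layout function. The role names ("scatter", "histogram", "density") and the function name are illustrative assumptions; a plotting library would then draw each cell according to its role.

```python
def splom_layout(attributes):
    """Assign a role to every cell of a scatterplot matrix.

    Returns {(row, col): (x_attribute, y_attribute, role)}.
    """
    layout = {}
    for row, y_attr in enumerate(attributes):
        for col, x_attr in enumerate(attributes):
            if row == col:
                role = "histogram"   # diagonal: attribute against itself
            elif col > row:
                role = "scatter"     # upper triangle: the actual pairwise plots
            else:
                role = "density"     # lower triangle: mirrored pair, reused for a summary
            layout[(row, col)] = (x_attr, y_attr, role)
    return layout
```

For n attributes this yields n diagonal cells and n(n-1)/2 cells in each triangle, which is why the number of panels grows quadratically.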

Why are linking & brushing essential in Scatterplot Matrices?

Because they let the viewer follow selected points across multiple projections, revealing structure across high-dimensional subspaces.

What problem does RadViz attempt to solve?

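Visualizing many numerical attributes in a single view, without small multiples: each attribute is mapped to an anchor on a circle, and each data point is placed where the spring-like forces toward the anchors balance (a barycentric, value-weighted average of the anchor positions).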

What is the major issue with original RadViz, and how does RadViz Deluxe fix it?

Correlated attributes placed opposite each other cancel out forces, distorting patterns. RadViz Deluxe reorders and spaces anchors based on correlations to reduce distortions.
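A minimal sketch of the basic RadViz placement rule and of the cancellation effect described above. It assumes values already normalised to [0, 1] and evenly spaced anchors; the function names are hypothetical, and the correlation-based anchor placement of RadViz Deluxe is not implemented here.

```python
import math

def evenly_spaced_anchors(n):
    """Angles (radians) of n anchors spread evenly around the unit circle."""
    return [2 * math.pi * i / n for i in range(n)]

def radviz_position(values, anchor_angles):
    """Place one record at the value-weighted average of the anchor positions.

    `values` are assumed to be normalised to [0, 1], one per anchor.
    """
    total = sum(values)
    if total == 0:
        return (0.0, 0.0)  # no pull from any anchor: the record sits at the centre
    x = sum(v * math.cos(t) for v, t in zip(values, anchor_angles)) / total
    y = sum(v * math.sin(t) for v, t in zip(values, anchor_angles)) / total
    return (x, y)

anchors = evenly_spaced_anchors(4)  # anchors 0 and 2 face each other

# Two correlated attributes on opposite anchors: their pulls cancel,
# so a high record and a low record both collapse onto the centre.
high = radviz_position([1.0, 0.0, 1.0, 0.0], anchors)
low = radviz_position([0.2, 0.0, 0.2, 0.0], anchors)

# The same attributes on neighbouring anchors pull in a shared direction.
adjacent = radviz_position([1.0, 1.0, 0.0, 0.0], anchors)
```

In this toy case `high` and `low` are indistinguishable even though their values differ by a factor of five, whereas the adjacent arrangement moves the record clearly away from the centre.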

When is RadViz not appropriate?

When you need to interpret absolute positions in the original data space. RadViz only preserves relative structure and should not be used for precise numeric reading.

In parallel coordinates, what do correlations, negative correlations, and clusters look like?

Positive correlation: Lines approximately follow each other, running roughly parallel between two adjacent axes.

Negative correlation: A fan or star shape where lines cross between two adjacent axes.

Clusters: Approximately horizontal bundles of lines
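One way to make the "lines follow each other" versus "lines cross" distinction concrete is to count crossings between two adjacent axes. This is an illustrative sketch, assuming both axes are drawn with the same orientation; the function name is hypothetical.

```python
from itertools import combinations

def crossing_fraction(left, right):
    """Fraction of record pairs whose polylines cross between two adjacent axes.

    `left` and `right` hold each record's value on the two axes. Two lines
    cross exactly when the records swap rank from one axis to the other.
    """
    pairs = list(combinations(range(len(left)), 2))
    if not pairs:
        return 0.0
    crossings = sum(
        1 for i, j in pairs
        if (left[i] - left[j]) * (right[i] - right[j]) < 0
    )
    return crossings / len(pairs)
```

A fraction near 0 corresponds to lines that follow each other (positive correlation), and a fraction near 1 to the crossing pattern of a negative correlation.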

Why does the ordering of axes matter in parallel coordinates?

Because patterns (correlations, clusters) are only visible between adjacent axes. Interpretation depends entirely on axis order.
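Because only neighbouring axes can be compared, a tool may suggest an axis order automatically. The sketch below uses one possible heuristic, chosen here for illustration rather than taken from the cards: pick the order that maximises the summed absolute Pearson correlation between neighbours. It searches exhaustively, so it only suits a handful of axes.

```python
import math
from itertools import permutations

def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either column is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)

def best_axis_order(columns):
    """Return the axis order with the largest summed |r| between neighbours.

    `columns` maps axis name -> list of values. The search is exhaustive,
    so the cost grows factorially with the number of axes.
    """
    def score(order):
        return sum(
            abs(pearson(columns[a], columns[b]))
            for a, b in zip(order, order[1:])
        )
    return max(permutations(columns), key=score)
```

With such an order, strongly related attributes end up next to each other, where their pattern is actually visible.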

What interactions are especially important in parallel coordinates?

Reordering axes (since only neighboring axes can be compared)

Brushing for cross-filtering

Adding/removing axes, including duplicating an axis to compare it with multiple others

What is alpha blending used for in parallel coordinates?

To handle overplotting by making lines semi-transparent so density becomes visible.
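Why transparency turns overplotting into visible density can be sketched with a one-line model. It assumes standard "over" compositing of identically coloured lines, which is an assumption about the renderer rather than something the card states.

```python
def accumulated_opacity(alpha, n_lines):
    """Opacity where n_lines lines of opacity alpha overlap.

    Each additional line covers a fraction alpha of what is still
    showing through, so the background keeps (1 - alpha) ** n_lines.
    """
    return 1.0 - (1.0 - alpha) ** n_lines
```

Opacity rises with the number of overlapping lines but saturates toward 1, so a very low alpha is needed when thousands of lines overlap.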

What is line bundling in parallel coordinates?

Grouping similar trajectories (polylines) into bundles to reduce clutter and emphasize trends.

What is Parallel Sets?

A variation of parallel coordinates for categorical data that uses ribbons instead of polylines to show frequency-weighted flows between categories.
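The "frequency-weighted" part can be sketched as a simple count of category pairs between two neighbouring axes. The record format (a list of dicts) and the field names in the example are hypothetical.

```python
from collections import Counter

def ribbon_widths(records, left_dim, right_dim):
    """Relative ribbon width for each category pair between two adjacent axes.

    Each width is the share of records that fall into that pair of categories.
    """
    counts = Counter((r[left_dim], r[right_dim]) for r in records)
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}

# Made-up toy data, for illustration only.
records = [
    {"group": "A", "outcome": "yes"},
    {"group": "A", "outcome": "yes"},
    {"group": "A", "outcome": "no"},
    {"group": "B", "outcome": "no"},
]
widths = ribbon_widths(records, "group", "outcome")
```

Each ribbon connects a category on the left axis with a category on the right axis, and the widths sum to 1 across all ribbons.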

Why are Parallel Sets often superior to mosaic plots?

They provide clearer flow comparisons, show category distributions along each axis, and avoid the misinterpretation risks of area slicing.

What are Parallel Hierarchies?

An extension of parallel sets allowing interactive hierarchical drilling and rolling up within categorical hierarchies. Useful when categories have many nested levels.

What is the central critique of radar charts?

Radar charts do not scale:

Hard to compare shapes across many axes
Circular layout imposes arbitrary geometry
Overplotting ruins readability
Hans calls them “completely useless” for multivariate analysis

What are radar chart “small multiples,” and do they solve the problem?

Showing one radar chart per data item. This helps slightly but still breaks down when attributes or items increase. Shapes become indistinguishable.

What is a trellis display (small multiples)?

A grid of repeated charts, each showing a subset of the data conditioned on categorical or binned numerical variables.
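Conditioning on a binned numerical variable can be sketched as a partition of the records into panels. The half-open bin convention and the field names in the example are assumptions made for illustration.

```python
from collections import defaultdict

def trellis_panels(records, key, bin_edges):
    """Partition records into panels using half-open bins [lo, hi) on `key`.

    Records outside all bins are dropped. Every remaining record lands in
    exactly one panel, so the panels never share data items.
    """
    panels = defaultdict(list)
    for rec in records:
        for lo, hi in zip(bin_edges, bin_edges[1:]):
            if lo <= rec[key] < hi:
                panels[(lo, hi)].append(rec)
                break
    return dict(panels)

# Made-up toy data, for illustration only.
quakes = [
    {"depth": 10, "mag": 4.1},
    {"depth": 250, "mag": 5.0},
    {"depth": 90, "mag": 4.7},
    {"depth": 600, "mag": 5.3},
]
panels = trellis_panels(quakes, "depth", [0, 100, 300, 700])
```

Each panel would then be drawn with the same chart type and the same axes, which is what makes the panels comparable.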

Why is linking & brushing usually not applicable in trellis displays?

Because each panel contains different subsets of the data; selected items do not appear in other panels.

What makes trellis displays powerful for continuous data?

Binning a continuous variable into panels reveals structures that are invisible in a single aggregated scatterplot, e.g., the “fish-hook” pattern in deep earthquakes that only appears after depth-based slicing.

What is the basic construction rule of mosaic plots?

Repeated slice-and-dice subdivision: alternating vertical and horizontal splits to show proportions across categorical combinations.
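A minimal sketch of the first two slice-and-dice steps in the unit square: a vertical split by one categorical variable, then a horizontal split of each column by a second one. The function name, record format, and alphabetical category order are assumptions made for illustration.

```python
from collections import Counter

def mosaic_rectangles(records, first, second):
    """Two-level mosaic layout in the unit square.

    Returns {(a, b): (x, y, width, height)}. Column width encodes the share
    of category a; tile height encodes the share of b within that column,
    so each tile's area equals the joint proportion of (a, b).
    """
    total = len(records)
    first_counts = Counter(r[first] for r in records)
    rects = {}
    x = 0.0
    for a in sorted(first_counts):
        width = first_counts[a] / total          # vertical split
        column = [r for r in records if r[first] == a]
        second_counts = Counter(r[second] for r in column)
        y = 0.0
        for b in sorted(second_counts):
            height = second_counts[b] / len(column)  # horizontal split
            rects[(a, b)] = (x, y, width, height)
            y += height
        x += width
    return rects
```

Note that the horizontal splits sit at different heights in each column, since each column is divided by its own conditional proportions.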

What is a common mistake when interpreting mosaic plots?

Treating the axes as continuous coordinate axes instead of understanding them as divisions of area, which leads to incorrect category identification.

What are glyphs, and when are they used?

Visual objects whose features encode multiple attributes; used when embedding small multivariate indicators into spatial contexts (maps, matrices).

Why are Chernoff faces discouraged?

Experimental evidence shows that people do not reliably decode data from subtle facial variations.

They are the “rainbow color scale” of multivariate vis.

What are star glyphs / metroglyphs?

Glyphs where attribute values extend spokes or line lengths. These are more interpretable than faces but still limited when many glyphs overlap.
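The spoke construction can be sketched as follows, assuming attribute values normalised to [0, 1] and spokes spread evenly around the glyph centre. The function name and parameters are hypothetical.

```python
import math

def star_glyph_spokes(values, cx=0.0, cy=0.0, radius=1.0):
    """End points of the spokes of one star glyph centred at (cx, cy).

    `values` are assumed normalised to [0, 1]; spoke i points in direction
    2*pi*i/n and its length is values[i] * radius.
    """
    n = len(values)
    spokes = []
    for i, v in enumerate(values):
        angle = 2 * math.pi * i / n
        spokes.append((cx + radius * v * math.cos(angle),
                       cy + radius * v * math.sin(angle)))
    return spokes
```

Setting `cx` and `cy` to a map or matrix position embeds the glyph in its spatial context, and `radius` bounds how far it can reach into neighbouring glyphs.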

What is the main idea behind pixel-based techniques (Keim)?

Use each pixel as the smallest possible mark to display extremely high-dimensional or high-volume data without aggregation. Each attribute becomes its own pixel map.

Why is arrangement crucial in pixel-based visualizations?

Because spatial layout (line-by-line, serpentine, Hilbert curve, calendar layout) determines which patterns become perceptible. Misarrangement hides structure.
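Two of the arrangements named above can be sketched as functions from a data index to a pixel position. The function names are hypothetical; the Hilbert mapping follows the standard iterative index-to-coordinate conversion and assumes a square grid whose side is a power of two.

```python
def serpentine_xy(index, width):
    """Row-by-row layout that reverses direction on every other row."""
    row, offset = divmod(index, width)
    col = offset if row % 2 == 0 else width - 1 - offset
    return col, row

def hilbert_xy(index, side):
    """Position of `index` along a Hilbert curve filling a side x side grid.

    `side` must be a power of two.
    """
    x = y = 0
    t = index
    s = 1
    while s < side:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:
            if rx == 1:
                # Flip the sub-square before swapping the axes.
                x, y = s - 1 - x, s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y
```

Both layouts keep consecutive data items on neighbouring pixels. The Hilbert curve additionally tends to keep items that are close in the sequence within a compact 2D region, whereas a serpentine row layout can place items one row apart that are far apart in the sequence.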

When are pixel-based charts appropriate?

When completeness is required (e.g., climate data) and aggregation must be avoided. They are also good for detecting anomalies over long temporal spans.