09 Graph Similarity Description – Key Vocabulary

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/34

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering key terms, concepts, algorithms, and structures introduced in the paper "Graph Similarity Description: How Are These Graphs Similar?"

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

Graph Similarity Assessment

The task of comparing two or more graphs to understand their commonalities and differences.

2
New cards

Graph Similarity Description

An approach that explains how graphs are similar by describing shared and specific structures instead of outputting a single numeric score.

3
New cards

Minimum Description Length (MDL) Principle

An information-theoretic model-selection framework that chooses the model minimizing the total number of bits needed to describe the model plus the data given the model.

4
New cards

Crude MDL

MDL variant in which model length and data-given-model length are encoded separately; used in this paper to emphasize model interpretability.

5
New cards

Kolmogorov Complexity

The length of the shortest program that outputs a given object on a universal Turing machine; basis for MDL but not computable in practice.

6
New cards

Information Distance

A measure equal to the maximum of the conditional Kolmogorov complexities K(x|y) and K(y|x), capturing the effort to transform x into y and vice-versa.

7
New cards

Common Model (M12)

A model that captures the structures shared by the input graphs being compared.

8
New cards

Individual Model (M1 / M2)

A model summarizing a single input graph, containing both shared and graph-specific structures.

9
New cards

Transformation (Δ1, Δ2)

A set of edit operations that morph the common model into each individual model, encoding graph-specific differences.

10
New cards

Structure Vocabulary (Ω)

The set of allowed subgraph patterns—clique, star, biclique, starclique—used to build models.

11
New cards

Clique

A subgraph in which every pair of nodes is connected; modeled with one constraint (high internal density).

12
New cards

Star

A hub-and-spokes subgraph where one central node connects to many peripheral nodes that are sparsely interconnected.

13
New cards

Biclique

A subgraph whose nodes can be partitioned into left and right sets with dense cross-connections and sparse internal links within each set.

14
New cards

Starclique

A biclique whose left partition forms a clique, combining a dense core with a sparse periphery.

15
New cards

Node Alignment

A (possibly partial) bijection between node sets of two graphs that indicates corresponding vertices.

16
New cards

Partial Alignment

An alignment where some nodes remain unmatched, allowing comparison of graphs of different sizes.

17
New cards

Node Fraction

The proportion of the graph’s nodes that participate in a structure, stored relative to a reference size in the model.

18
New cards

Edge Density

The ratio of present edges to possible edges inside a structure’s specified area of the adjacency matrix.

19
New cards

Transformation Vocabulary (Σ)

The set of edit operations (add, grow, shrink) allowed when converting the common model into individual models.

20
New cards

Momo Framework

‘Model of models’; the overall two-step framework (Beppo + Gigi) proposed to perform graph similarity description.

21
New cards

Beppo Algorithm

Step 1 of Momo; summarizes each graph individually by greedily adding MDL-beneficial structures.

22
New cards

Gigi Algorithm

Step 2 of Momo; aligns two individual models to build the common model and corresponding transformations.

23
New cards

Graph Summarization

The process of creating a compact, interpretable model of a single graph that minimizes MDL coding cost.

24
New cards

Model Alignment

The task of matching structures from two individual models to form a common model while respecting type constraints.

25
New cards

MaximalGreedy Matching

Heuristic used by Gigi that greedily pairs structures of the same type based on node overlap or alignment similarity.

26
New cards

Normalized Model Distance (NMD)

A normalized similarity score derived from MDL models: (L(M12)+L(Δ1,Δ2)−min{L(M1),L(M2)}) / max{L(M1),L(M2)}.

27
New cards

Maximum Entropy Distribution

The probability distribution over adjacency entries with highest entropy that still satisfies all model constraints; used for optimal graph encoding.

28
New cards

Lagrange Multipliers

Variables introduced to enforce structural constraints when computing the maximum-entropy encoding probabilities.

29
New cards

Universal Code for Integers (LN)

A prefix-free encoding that efficiently represents positive integers and is used to transmit counts in MDL descriptions.

30
New cards

Node Overlap Graph

A graph whose vertices are structures from a model and edge weights are Jaccard similarities of their node sets.

31
New cards

Node Overlap Tree

A maximum spanning tree of a node overlap graph, offering an interpretable visualization of structure overlaps.

32
New cards

SlashBurn Decomposition

A technique that iteratively removes high-degree nodes to peel a graph; Beppo adapts this idea to create small-diameter components.

33
New cards

Description Length L(x)

The number of bits required to encode object x under the chosen encoding scheme.

34
New cards

Overlap-Allowed Structures

The modeling choice permitting different structures to share nodes and even edges, increasing expressiveness of the summary.

35
New cards

Shannon-Optimal Code

The code derived from the true probability distribution that minimizes expected message length; applied when encoding graph data under a model.