Data mining MCQ 2

40 Terms

New cards

In graph mining, a graph is best defined as:

A set of nodes connected by edges

New cards

What does the degree of a node represent?

The number of edges connected to the node
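As a minimal sketch of this definition, assuming a simple undirected graph stored as an adjacency dictionary (the node names and the `degree` helper are hypothetical, chosen only for illustration):

```python
# Toy undirected graph: each node maps to the set of its neighbours.
# The data is hypothetical and only serves to illustrate the definition.
adjacency = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c"},
}

def degree(graph, node):
    """Number of edges incident to `node` in a simple undirected graph."""
    return len(graph[node])

degrees = {node: degree(adjacency, node) for node in adjacency}
print(degrees)
```

In an undirected graph the degrees sum to twice the number of edges, which is a quick sanity check on any adjacency structure.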

New cards

In an Erdős–Rényi random graph, what does the parameter p represent?

The probability that an edge exists between two nodes
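One way to see the role of p is to sample the model directly. The sketch below assumes the G(n, p) variant, in which every node pair is kept independently with probability p; the function name `erdos_renyi_edges` and the values of `n`, `p`, and the seed are illustrative choices, not taken from the course.

```python
import itertools
import random

def erdos_renyi_edges(n, p, seed=None):
    """Sample G(n, p): keep each of the n*(n-1)/2 node pairs with probability p."""
    rng = random.Random(seed)
    return [pair for pair in itertools.combinations(range(n), 2)
            if rng.random() < p]

n, p = 200, 0.1
edges = erdos_renyi_edges(n, p, seed=42)

# The expected number of edges is p times the number of node pairs.
expected_edges = p * n * (n - 1) / 2
print(len(edges), expected_edges)
```

The observed edge count fluctuates around the expected value, and the expected degree of each node is p · (n − 1).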

New cards

What typically happens when n · p ≈ 1 in an Erdős–Rényi graph?

A giant connected component starts to emerge
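The transition can be observed numerically. The following sketch, which assumes the G(n, p) model and uses a hypothetical helper `largest_component_fraction`, samples graphs below, at, and above n · p = 1 and measures the share of nodes in the largest connected component with a breadth-first search.

```python
import itertools
import random
from collections import deque

def largest_component_fraction(n, p, seed=None):
    """Sample G(n, p) and return the fraction of nodes in its largest component."""
    rng = random.Random(seed)
    neighbors = [[] for _ in range(n)]
    for u, v in itertools.combinations(range(n), 2):
        if rng.random() < p:
            neighbors[u].append(v)
            neighbors[v].append(u)

    seen = [False] * n
    best = 0
    for start in range(n):
        if seen[start]:
            continue
        # Breadth-first search over the component containing `start`.
        seen[start] = True
        queue = deque([start])
        size = 0
        while queue:
            node = queue.popleft()
            size += 1
            for nxt in neighbors[node]:
                if not seen[nxt]:
                    seen[nxt] = True
                    queue.append(nxt)
        best = max(best, size)
    return best / n

n = 1000
fractions = {c: largest_component_fraction(n, c / n, seed=0) for c in (0.5, 1.0, 3.0)}
for c, frac in fractions.items():
    print(f"n*p = {c}: largest component holds {frac:.1%} of the nodes")
```

Well below the threshold the largest component stays tiny, and well above it a single component contains most of the graph.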

New cards

Why is the largest connected component often studied in random graphs?

It reveals connectivity and phase transition properties

New cards

What information does a degree distribution provide?

How node degrees are distributed across the graph
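A degree distribution can be computed by counting how many nodes have each degree. This is a small sketch on a hypothetical edge list; note that nodes without any edge do not appear in an edge list, so they would have to be added separately.

```python
from collections import Counter

# Hypothetical undirected edge list used only for illustration.
edges = [(0, 1), (0, 2), (0, 3), (1, 2), (3, 4)]

# Degree of each node: count how often it appears as an endpoint.
degree = Counter()
for u, v in edges:
    degree[u] += 1
    degree[v] += 1

# Fraction of nodes having each degree value k.
histogram = Counter(degree.values())
n_nodes = len(degree)
distribution = {k: count / n_nodes for k, count in sorted(histogram.items())}
print(distribution)
```

Plotting such a distribution on log-log axes is the usual way to check whether it looks heavy-tailed.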

New cards

Which characteristic is typical of an LFR benchmark graph?

Power-law degree distribution with known communities

New cards

Why are LFR graphs commonly used in community detection studies?

They provide ground-truth communities

New cards

What is a community in a graph?

A set of nodes densely connected internally and sparsely connected externally

New cards

What is the main principle behind the Louvain community detection algorithm?

Maximizing modularity
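To make the objective concrete, the sketch below evaluates the standard modularity of a partition of an unweighted, undirected graph, Q = Σ_c [L_c / m − (d_c / 2m)²], where m is the number of edges, L_c the number of edges inside community c, and d_c the sum of degrees in c. It only scores a given partition; it is not the Louvain algorithm itself, which repeatedly moves nodes and merges communities to increase this score. The function name and the toy graph are illustrative assumptions.

```python
def modularity(edges, communities):
    """Modularity of a partition of an unweighted, undirected graph."""
    m = len(edges)
    label = {node: i for i, comm in enumerate(communities) for node in comm}
    internal = [0] * len(communities)    # L_c: edges with both ends in c
    degree_sum = [0] * len(communities)  # d_c: total degree of nodes in c
    for u, v in edges:
        degree_sum[label[u]] += 1
        degree_sum[label[v]] += 1
        if label[u] == label[v]:
            internal[label[u]] += 1
    return sum(l_c / m - (d_c / (2 * m)) ** 2
               for l_c, d_c in zip(internal, degree_sum))

# Two triangles joined by a single edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
split = modularity(edges, [{0, 1, 2}, {3, 4, 5}])
merged = modularity(edges, [{0, 1, 2, 3, 4, 5}])
print(split, merged)
```

Splitting the graph into its two triangles scores higher than keeping all nodes in one community, which always scores zero.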

New cards

Why might the Louvain algorithm fail to recover the ground-truth communities of an LFR graph?

Maximizing modularity does not always match true communities

New cards

What does Normalized Mutual Information (NMI) measure?

Similarity between two partitions of nodes
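A minimal sketch of the computation, assuming the arithmetic-mean normalization NMI = 2 · I(A; B) / (H(A) + H(B)); other normalizations exist, and returning 1.0 when both partitions consist of a single group is a convention chosen here. The function names are hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (natural log) of a label assignment."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())

def nmi(labels_a, labels_b):
    """Normalized mutual information between two labelings of the same nodes."""
    n = len(labels_a)
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    joint = Counter(zip(labels_a, labels_b))
    mutual_info = sum(
        (n_ab / n) * math.log(n_ab * n / (count_a[a] * count_b[b]))
        for (a, b), n_ab in joint.items()
    )
    denom = entropy(labels_a) + entropy(labels_b)
    return 1.0 if denom == 0 else 2 * mutual_info / denom

same_groups = nmi([0, 0, 0, 1, 1, 1], [1, 1, 1, 0, 0, 0])  # labels swapped
unrelated = nmi([0, 0, 1, 1], [0, 1, 0, 1])                # independent split
print(same_groups, unrelated)
```

Identical groupings score 1 regardless of how the communities are named, and statistically independent groupings score 0.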

New cards

What is the key idea of the Girvan–Newman algorithm?

Removing edges with high betweenness
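The sketch below illustrates a single step of this idea on a toy graph: compute edge betweenness with a Brandes-style accumulation, remove the highest-scoring edge, and read off the connected components. The full algorithm repeats this step and recomputes betweenness after every removal. The helper names and the graph are assumptions for illustration only.

```python
from collections import deque

def edge_betweenness(adj):
    """Number of shortest paths crossing each undirected edge (Brandes-style)."""
    score = {frozenset((u, v)): 0.0 for u in adj for v in adj[u]}
    for source in adj:
        # Forward pass: BFS counting shortest paths (sigma) and predecessors.
        dist = {source: 0}
        sigma = {node: 0.0 for node in adj}
        sigma[source] = 1.0
        preds = {node: [] for node in adj}
        order = []
        queue = deque([source])
        while queue:
            node = queue.popleft()
            order.append(node)
            for nxt in adj[node]:
                if nxt not in dist:
                    dist[nxt] = dist[node] + 1
                    queue.append(nxt)
                if dist[nxt] == dist[node] + 1:
                    sigma[nxt] += sigma[node]
                    preds[nxt].append(node)
        # Backward pass: push path dependencies from the farthest nodes inward.
        delta = {node: 0.0 for node in adj}
        for node in reversed(order):
            for pred in preds[node]:
                share = sigma[pred] / sigma[node] * (1.0 + delta[node])
                score[frozenset((pred, node))] += share
                delta[pred] += share
    # Every unordered pair was counted once from each endpoint.
    return {edge: value / 2.0 for edge, value in score.items()}

def components(adj):
    """Connected components of an adjacency dict, as a list of node sets."""
    seen, parts = set(), []
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        part, stack = set(), [start]
        while stack:
            node = stack.pop()
            part.add(node)
            for nxt in adj[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        parts.append(part)
    return parts

# Two triangles joined by the bridge (2, 3).
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4}}
scores = edge_betweenness(adj)
bridge = max(scores, key=scores.get)

# One Girvan-Newman step: cut the highest-betweenness edge.
u, v = tuple(bridge)
adj[u].discard(v)
adj[v].discard(u)
parts = sorted(components(adj), key=min)
print(sorted(bridge), parts)
```

Every shortest path between the two triangles has to cross the bridge, so it receives the highest score and is removed first.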

New cards

Compared to Louvain, the Girvan–Newman algorithm is often:

More computationally expensive but sometimes more accurate

New cards

What is the goal of graph (node) embeddings such as Node2Vec?

To convert nodes into low-dimensional vector representations
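Node2Vec obtains these vectors in two stages: it samples biased random walks over the graph, then trains a skip-gram model on the walks as if they were sentences. The sketch below covers only the first stage, assuming the usual second-order walk with a return parameter `p` and an in-out parameter `q`; the function name, the toy graph, and the parameter values are illustrative, and the vector-learning stage is not shown.

```python
import random

def node2vec_walk(adj, start, length, p=1.0, q=1.0, rng=random):
    """Sample one second-order biased random walk of at most `length` nodes."""
    walk = [start]
    while len(walk) < length:
        current = walk[-1]
        neighbors = sorted(adj[current])
        if not neighbors:
            break
        if len(walk) == 1:
            # First step has no previous node, so it is uniform.
            walk.append(rng.choice(neighbors))
            continue
        previous = walk[-2]
        weights = []
        for candidate in neighbors:
            if candidate == previous:
                weights.append(1.0 / p)  # step back to the previous node
            elif candidate in adj[previous]:
                weights.append(1.0)      # stay close to the previous node
            else:
                weights.append(1.0 / q)  # move away from the previous node
        walk.append(rng.choices(neighbors, weights=weights, k=1)[0])
    return walk

# Two triangles joined by the edge (2, 3).
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4}}
rng = random.Random(7)
walks = [node2vec_walk(adj, node, length=8, p=1.0, q=0.5, rng=rng) for node in adj]
for walk in walks:
    print(walk)
```

With p = q = 1 the walk is uniform. Values of q below 1 favour exploration away from the start, and values above 1 keep the walk local.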

New cards

What is an Erdős–Rényi random graph?

A graph where each pair of nodes is connected with probability p

New cards

What is an LFR graph mainly used for?

Benchmarking community detection algorithms

New cards

What is community detection?

Finding groups of densely connected nodes

New cards

What is the Louvain algorithm based on?

Modularity maximization

New cards

What is the core principle of the Girvan–Newman algorithm?

Removing edges with high betweenness

New cards

What does Normalized Mutual Information (NMI) measure?

Similarity between two community partitions

New cards

What is the goal of node embeddings?

To map nodes into low-dimensional vector spaces

New cards

LFR graphs

They have a priori known communities and are used to compare different community detection methods

New cards

Which statement best compares Louvain and Girvan–Newman?

Louvain maximizes modularity, Girvan–Newman removes high-betweenness edges

New cards

Why can Girvan–Newman outperform Louvain on LFR graphs?

It explicitly separates communities via edge removal

New cards

What is a key difference between LFR and Erdős–Rényi graphs?

LFR graphs have realistic degree distributions and known communities

New cards

What is the main difference between node embeddings and community detection?

Community detection finds groups; embeddings create vector representations

New cards

Why is NMI preferred over raw accuracy for community detection?

Labels are arbitrary and permutation-invariant
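A short illustration of the problem with raw accuracy, using hypothetical label vectors: the two assignments below describe exactly the same grouping of nodes, yet a label-by-label comparison scores zero because the community identifiers are swapped.

```python
truth = [0, 0, 0, 1, 1, 1]
predicted = [1, 1, 1, 0, 0, 0]  # same grouping, community IDs swapped

# Raw accuracy compares label values position by position.
accuracy = sum(t == p for t, p in zip(truth, predicted)) / len(truth)

def as_partition(labels):
    """Turn a label vector into a set of node groups, discarding label names."""
    groups = {}
    for node, label in enumerate(labels):
        groups.setdefault(label, set()).add(node)
    return {frozenset(group) for group in groups.values()}

same_partition = as_partition(truth) == as_partition(predicted)
print(accuracy, same_partition)
```

A partition-based score such as NMI depends only on which nodes are grouped together, so it is unaffected by this renaming.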

New cards

Access the ground-truth communities of the graph with nx.get_node_attributes(lfr, 'community')

The returned ground truth is a dictionary whose keys are nodes and whose values are the set of nodes forming that node's community. The communities are disjoint, meaning that each node belongs to exactly one community
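Because every member of a community carries the same set, the dictionary repeats each community once per node. The sketch below shows one way to collapse it into a list of distinct communities and a node-to-label mapping. The small `ground_truth` dictionary is a hand-written stand-in with the structure described above, not real LFR output, and the variable names are assumptions.

```python
# Hypothetical stand-in for nx.get_node_attributes(lfr, 'community'):
# each node maps to the full set of nodes in its community.
ground_truth = {
    0: {0, 1, 2},
    1: {0, 1, 2},
    2: {0, 1, 2},
    3: {3, 4},
    4: {3, 4},
}

# Deduplicate: frozensets are hashable, so identical communities collapse.
communities = {frozenset(members) for members in ground_truth.values()}

# Fix an order (by smallest node) so that labels are reproducible.
community_list = sorted(communities, key=min)

# One integer label per node, usable as input to a partition score.
node_to_label = {node: idx
                 for idx, members in enumerate(community_list)
                 for node in members}
print(community_list, node_to_label)
```

Since the communities are disjoint, their sizes add up to the number of nodes, which is a cheap consistency check.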

New cards

Divisive clustering on Edge-Betweenness

You start with the entire network as a single cluster.
Then, you recursively split it into smaller communities by removing the edges with the highest betweenness, until meaningful groups emerge

New cards

Why are Erdős–Rényi graphs often considered unrealistic models of social networks?

They assume uniform edge probability between all node pairs

New cards

What is the degree distribution of an LFR graph?

A heavy-tailed (power-law-like) distribution

New cards

Why is the Karate Club graph commonly used in graph mining?

It has a well-known real community split

New cards

Why can the Louvain algorithm fail to recover the true communities in an LFR graph?

Modularity maximization may not align with planted communities

New cards

Which statement about scalability is correct?

Louvain is generally more scalable than Girvan–Newman

New cards

What is the main conceptual difference between edge betweenness and modularity?

Edge betweenness identifies bridges; modularity evaluates partition quality

New cards

How does community detection differ from finding connected components?

Communities may still be linked by a few edges, whereas distinct connected components share no edges at all

New cards

Why is Normalized Mutual Information (NMI) preferred over accuracy for evaluating communities?

Community labels are arbitrary and unordered

New cards

How do node-embedding-based methods differ from classical community detection?

They transform nodes into vectors before clustering

New cards

Which comparison is correct?

Degree counts neighbors, betweenness counts shortest-path participation
