1/8
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai |
|---|
No analytics yet
Send a link to your students to track their progress
Cluster analysis
A statistical method used to group similar cases (people, countries) together based on multiple variables. The goal is to identify naturally occuring patterns in the data without pre-defined categories.
Instead of testing relationships (like regression does), cluster analysis asks: “Which cases are similar enough to belong together?”
If the data is skewed or contains extreme values, clusters may be distorted.
Variable-oriented analysis
Focuses on relationship between variables across the whole sample.
Person-oriented analysis
Focuses on patterns within individuals, grouping them based on how variables combine. Gives a broader, more holistic understanding of individuals.
Hierarhical clustering
Step by step method where each case starts with its own cluster and gradually merges the most similar clusters, continues until all cases are in one Cluster.
Malahanobis distance
Measures how far a case is from the center of all variables, while considering relationships between the variables.
Silhouette score
Measures how well each case fits into its assigned cluster
Close to +1 - very well matched
Around 0 - Unclear
Negative - Likely in the wrong cluster
Wards method
Clustering method that groups data step by step by always choosing the merge that keeps clusters similar (low variance as possible)
Squared Euclidean distance
Measures how far apart two points are by summing the squared differences between their corresponding values. It is commonly used as a distance measure in hierarchical cluster analysis to assess similarity between cases.
Agglomeration coefficent
Used to determine the optimal number of clusters by evaluating the increase in disimiliarity within the clusters.