Simple Analysis
tells you what is happening or what has happened.
Anything that can be done with SQL, a pivot table, or Tableau Prep is not advanced data analysis
Sum, average, min, max, time trend
Advanced Data Analysis
tells you why it is happening and helps predict what will happen
Decision Trees, Clustering, Association Rules
Classification
Statistical method used to determine what category (class) a new observation belongs to
How does classification work?
Choose a categorical outcome variable
Split the data set into training and validation subsets
Use the training set to find a model that predicts the outcomes as a function of the other attributes
Apply the model to the validation set to check accuracy
Apply the final model to future cases
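The five steps above can be sketched in plain Python. This is only an illustration under stated assumptions: the toy data (hours studied vs. pass/fail), the 70/30 split, and the one-threshold "model" are all hypothetical and stand in for whatever data and classifier are actually used.

```python
import random

# Step 1: choose a categorical outcome variable ("pass" / "fail").
# Hypothetical toy data: (hours_studied, outcome).
data = [(h, "pass" if h >= 5 else "fail") for h in range(1, 11)] * 3

# Step 2: split the data set into training and validation subsets (70/30).
rng = random.Random(0)  # fixed seed so the split is repeatable
shuffled = data[:]
rng.shuffle(shuffled)
cut = int(0.7 * len(shuffled))
train, valid = shuffled[:cut], shuffled[cut:]


def predict(threshold, hours):
    """Classify one observation with a single cutoff on the attribute."""
    return "pass" if hours >= threshold else "fail"


def fit_threshold(rows):
    """Step 3: pick the cutoff that classifies the training rows best."""
    candidates = sorted({x for x, _ in rows})

    def training_accuracy(t):
        return sum(predict(t, x) == y for x, y in rows) / len(rows)

    return max(candidates, key=training_accuracy)


threshold = fit_threshold(train)

# Step 4: apply the model to the validation set to check accuracy.
accuracy = sum(predict(threshold, x) == y for x, y in valid) / len(valid)

# Step 5: apply the final model to a future case.
new_case = predict(threshold, 8)

print(threshold, accuracy, new_case)
```

The validation rows are never used to choose the threshold, which is what makes the accuracy in step 4 an honest check.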
Within-Cluster SSE
measures cohesion: how tightly grouped the points in each cluster are
Lower SSE = better cluster
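A minimal sketch of the calculation, assuming Euclidean distance and 2-D points. The function names and the two sample clusters are hypothetical, chosen only to show that a tighter cluster gets a lower SSE.

```python
def centroid(points):
    """Mean of the points, coordinate by coordinate."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def within_cluster_sse(points):
    """Sum of squared distances from each point to its cluster centroid."""
    c = centroid(points)
    return sum(sum((x - m) ** 2 for x, m in zip(p, c)) for p in points)


# Hypothetical clusters: same shape, different spread.
tight = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (2.0, 2.0)]
loose = [(0.0, 0.0), (0.0, 4.0), (4.0, 0.0), (4.0, 4.0)]

print(within_cluster_sse(tight), within_cluster_sse(loose))
```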
Between-Cluster SSE
measures how far each cluster’s centroid is from the overall (grand) centroid, weighted by cluster size. It describes how well the clusters are separated from one another.
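A minimal sketch of this measure, assuming Euclidean distance. The function names and the two sample clusters are hypothetical; each cluster's squared centroid-to-grand-centroid distance is multiplied by the number of points in that cluster.

```python
def centroid(points):
    """Mean of the points, coordinate by coordinate."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def between_cluster_sse(clusters):
    """Size-weighted squared distance of each centroid from the grand centroid."""
    all_points = [p for cluster in clusters for p in cluster]
    grand = centroid(all_points)
    total = 0.0
    for cluster in clusters:
        c = centroid(cluster)
        total += len(cluster) * sum((a - g) ** 2 for a, g in zip(c, grand))
    return total


# Hypothetical clusters with centroids (0, 1) and (4, 1); grand centroid (2, 1).
clusters = [
    [(0.0, 0.0), (0.0, 2.0)],
    [(4.0, 0.0), (4.0, 2.0)],
]

print(between_cluster_sse(clusters))
```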
Confidence
Measures how often items in Y appear in transactions that contain X
Lift
Takes into account how co-occurrence differs from what is expected by chance
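Confidence and lift for a rule X → Y can be sketched as follows. The four transactions and the function names are hypothetical. The sketch assumes the usual definitions: support is the share of transactions containing an itemset, confidence is support(X and Y) / support(X), and lift is confidence / support(Y).

```python
# Hypothetical market-basket data: one set of items per transaction.
transactions = [
    {"bread", "butter"},
    {"bread", "butter"},
    {"bread"},
    {"milk"},
]


def support(itemset):
    """Share of transactions that contain every item in the itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(x, y):
    """How often Y appears in transactions that contain X."""
    return support(x | y) / support(x)


def lift(x, y):
    """Confidence relative to how often Y appears overall (chance baseline)."""
    return confidence(x, y) / support(y)


x, y = {"bread"}, {"butter"}
print(confidence(x, y), lift(x, y))
```

A lift above 1 means X and Y occur together more often than expected if they were independent; a lift near 1 means the rule adds little over chance.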