Association Analysis
Given a set of transactions, find rules that will predict the occurrence of an item based on the occurrences of other items in the transactions
transaction width
number of items in a transaction
itemset
any collection of 0 or more items
k itemset
an itemset with k items
support count
number of transactions that contain the itemset
frequent itemset
an itemset whose support count is >= some minsup threshold
Association rule
an implication expression of the form X→Y, where X and Y are disjoint itemsets. (X is the antecedent and Y is the consequent). Correlation, not causation. Directionality matters
support
fraction of transactions that contain both X and Y
confidence
measures how often items in Y appear in transactions that contain X
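The two measures above can be computed directly from a transaction list. A minimal sketch (the transactions and the rule {milk, diapers} → {beer} are made up for illustration):

```python
# Transactions as sets of items (illustrative data).
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def support_count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    return sum(1 for t in transactions if itemset <= t)

X, Y = {"milk", "diapers"}, {"beer"}
n = len(transactions)
# support: fraction of transactions containing both X and Y
support = support_count(X | Y, transactions) / n
# confidence: of the transactions containing X, how many also contain Y
confidence = support_count(X | Y, transactions) / support_count(X, transactions)
print(support, confidence)  # 0.4 and 2/3
```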
strong association rule
high s and high c (support >= minsup and confidence >= minconf threshold)
goal of association rule mining
to find all strong rules
Association Rule Mining steps
Frequent itemset generation: find all itemsets that satisfy minsup
strong rule generation: find all rules in the frequent itemsets that satisfy minconf
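The two steps can be sketched by brute force, enumerating all 2^d itemsets; this is only viable for tiny item universes, and the transactions and thresholds are illustrative:

```python
from itertools import combinations

transactions = [frozenset(t) for t in (
    {"a", "b"}, {"a", "c"}, {"a", "b", "c"}, {"b", "c"}, {"a", "b", "c"})]
items = sorted(set().union(*transactions))
n = len(transactions)
minsup, minconf = 0.4, 0.6

def sup(s):
    """Fraction of transactions containing itemset s."""
    return sum(1 for t in transactions if s <= t) / n

# Step 1: frequent itemset generation -- brute force over all 2^d candidates.
frequent = [frozenset(c)
            for k in range(1, len(items) + 1)
            for c in combinations(items, k)
            if sup(frozenset(c)) >= minsup]

# Step 2: strong rule generation -- test every X -> Y split of each frequent itemset.
strong = []
for f in frequent:
    for r in range(1, len(f)):
        for x in combinations(f, r):
            X = frozenset(x)
            if sup(f) / sup(X) >= minconf:  # confidence test
                strong.append((X, f - X))

print(len(frequent), len(strong))  # 7 frequent itemsets, 9 strong rules
```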
how many candidate itemsets are there given d items?
2^d itemsets
Apriori principle
if an itemset is frequent, then all of its subsets must also be frequent. if an itemset is infrequent, then all of its supersets must be infrequent
Candidate Generation Fk-1 x F1
items in each frequent itemset must be sorted
Extend each frequent k-1 itemset with a frequent 1 itemset that comes after all of the items in the k-1 itemset
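The Fk-1 x F1 extension can be sketched like so, assuming itemsets are stored as sorted tuples:

```python
def candidates_fk1_f1(freq_k1, freq_1):
    """Extend each frequent (k-1)-itemset with a frequent item that sorts
    after every item already in it, so each candidate is generated once."""
    out = []
    for s in freq_k1:                 # s is a sorted tuple of k-1 items
        for (item,) in freq_1:        # frequent 1-itemsets as 1-tuples
            if item > s[-1]:          # item must come after all items in s
                out.append(s + (item,))
    return out

# e.g. F2 = {(a,b), (a,c), (b,c)}, F1 = {a, b, c}
print(candidates_fk1_f1([("a", "b"), ("a", "c"), ("b", "c")],
                        [("a",), ("b",), ("c",)]))
# only ('a','b') can be extended (with 'c'), giving [('a', 'b', 'c')]
```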
Candidate Generation Fk-1 x Fk-1 (Apriori gen)
items in each frequent itemset must be sorted
Merge pairs of frequent k-1 itemsets whose first k-2 items are identical (all but the last item the same)
Method for Apriori Pruning
if any k-1 subset of the candidate is not a frequent k-1 itemset, the candidate is pruned
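The Fk-1 x Fk-1 merge with subset-based pruning can be sketched as one function, again assuming itemsets are sorted tuples (the F2 input is illustrative):

```python
from itertools import combinations

def apriori_gen(freq_k1):
    """F(k-1) x F(k-1) candidate generation with Apriori pruning."""
    freq_set = set(freq_k1)
    candidates = []
    for i, a in enumerate(freq_k1):
        for b in freq_k1[i + 1:]:
            # Merge only if all but the last item agree.
            if a[:-1] == b[:-1] and a[-1] < b[-1]:
                cand = a + (b[-1],)
                # Prune: every (k-1)-subset of the candidate must be frequent.
                if all(sub in freq_set
                       for sub in combinations(cand, len(cand) - 1)):
                    candidates.append(cand)
    return candidates

F2 = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c")]
print(apriori_gen(F2))
# [('a', 'b', 'c')] -- ('a','b','d') and ('a','c','d') are pruned because
# ('b','d') and ('c','d') are not frequent
```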
Frequent itemsets
the final list of all frequent itemsets found at every k
Tyranny of Counting Pairs
the most memory is required for counting candidate pairs (k = 2), since that pass has the largest number of candidates
rule generation brute force
all non-empty subsets of a frequent itemset are enumerated and tested against minconf
confidence based pruning
confidence is anti-monotone
as we move items to the right side of the rule, the confidence can only go down, it cannot go up (only rules generated from the same itemset)
look at denominator of confidence equation for explanation
Rule generation algorithm
start with rules that have only one item in the consequent and test against minconf
the high confidence rules that are found are then used to generate the next round of candidate rules by merging consequents
repeat on all frequent itemsets
rules that meet minconf are strong rules
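The level-wise algorithm above can be sketched for one frequent itemset; the support counts here are hypothetical and just need to respect the anti-monotone property:

```python
from itertools import combinations

# Hypothetical support counts for frequent itemset {a,b,c} and its subsets.
sup = {
    frozenset("abc"): 2, frozenset("ab"): 3, frozenset("ac"): 3,
    frozenset("bc"): 3, frozenset("a"): 4, frozenset("b"): 4, frozenset("c"): 4,
}

def gen_rules(f, minconf):
    """Start with 1-item consequents; merge consequents of surviving rules."""
    rules = []
    consequents = [frozenset([i]) for i in f]
    while consequents and len(next(iter(consequents))) < len(f):
        kept = []
        for Y in consequents:
            X = f - Y
            conf = sup[f] / sup[X]
            if conf >= minconf:          # confidence-based pruning
                rules.append((X, Y, conf))
                kept.append(Y)
        # Merge pairs of surviving consequents to build the next level.
        consequents = list({a | b for a, b in combinations(kept, 2)
                            if len(a | b) == len(a) + 1})
    return rules

rules = gen_rules(frozenset("abc"), minconf=0.6)
print(len(rules))  # 3 rules (one-item consequents), each with confidence 2/3
```

Moving a second item to the consequent drops the confidence to 2/4 = 0.5 here, so the second level produces no rules, exactly the anti-monotone behavior described above.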
how to set minsup
a large set of items calls for a small minsup (often < 1%)
maximal frequent itemset
no frequent supersets
closed frequent itemset
no superset with the same support count
maximal and closed frequent itemset algorithm
keep a list of maximal/closed frequent itemsets
each time a frequent itemset is generated, perform the subset check and superset check
subset check for maximal algorithm
is the frequent itemset just found a subset of anything in the maximal list? If so, it is not maximal; stop. Otherwise, add it to the maximal list and do the superset check
superset check for maximal algorithm
Is the frequent itemset just found a superset of anything in the maximal list? If so, remove the itemsets in the maximal list that are subsets of this itemset
subset check for closed algorithm
is the frequent itemset just found a subset of anything in the closed list? If so, is its support higher than its superset's in the list? If no, it is not closed. If yes, add it to the closed list and do the superset check.
superset check for closed algorithm
Is the frequent itemset just found a superset of anything in the closed list? If so, does the subset in the list have the same or higher support? If the subset's support is the same, remove the subset from the closed list; if the subset's support is higher, it remains in the list.
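Given support counts for all frequent itemsets, maximal and closed can also be checked directly from the definitions; a sketch with hypothetical support counts:

```python
# Hypothetical frequent itemsets with their support counts.
sup = {
    frozenset("a"): 4, frozenset("b"): 4, frozenset("c"): 3,
    frozenset("ab"): 4, frozenset("ac"): 2, frozenset("bc"): 3,
    frozenset("abc"): 2,
}
freq = list(sup)

# Maximal: no frequent proper superset.
maximal = [f for f in freq
           if not any(f < g for g in freq)]

# Closed: no proper superset with the same support count.
closed = [f for f in freq
          if not any(f < g and sup[g] == sup[f] for g in freq)]

print(sorted(map(sorted, maximal)))  # only {a,b,c} is maximal
print(sorted(map(sorted, closed)))   # {a,b}, {b,c}, {a,b,c} are closed
```

Every maximal itemset is closed (its frequent supersets don't exist, so none can match its support), but not vice versa, which the output above illustrates.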
cross support patterns
patterns that relate low-frequency items to high-frequency items
identified when the support ratio (lowest item support divided by highest item support) is below a user-specified threshold
limitations of support and confidence
there are times when both support and confidence are high, but the rule produced is not good
Lift
measures the ratio of the observed frequency of co-occurrence to the expected frequency
symmetric metric (directionality doesn't matter)
lift value meanings
1: no correlation
<1: negative correlation
>1: positive correlation
contingency tables
a 2x2 table with rows X and not-X, columns Y and not-Y; each cell is filled in with a support count
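Building the 2x2 table and computing lift from it can be sketched as follows (transactions are made up; X = {bread}, Y = {milk} are illustrative):

```python
transactions = [
    {"bread", "milk"}, {"bread"}, {"milk"}, {"bread", "milk"},
    {"eggs"}, {"bread", "milk"},
]
n = len(transactions)
X, Y = {"bread"}, {"milk"}

# 2x2 contingency table of support counts.
xy   = sum(1 for t in transactions if X <= t and Y <= t)        # X and Y
x_ny = sum(1 for t in transactions if X <= t and not Y <= t)    # X, not Y
nx_y = sum(1 for t in transactions if not X <= t and Y <= t)    # not X, Y
nxny = n - xy - x_ny - nx_y                                     # neither

# Lift: observed co-occurrence over what independence would predict.
# Row and column totals of the table give s(X) and s(Y).
lift = (xy / n) / (((xy + x_ny) / n) * ((xy + nx_y) / n))
print(lift)  # 1.125 here: > 1, a positive correlation
```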