Collaborative Filtering (ML Exam 3)

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/11

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

12 Terms

1
New cards

data

set of users, set of items, some sort of interaction between the users and the items (rating, watch duration, skips, etc.)

represented as a table or matrix

2
New cards

Collaborative Filtering

method for recommender systems that is solely based on feedback that has been recorded between users and items

3
New cards

Neighborhood Models

datapoints are users and axises are items. measure similarity between data points using cosine similarity (can have user based or item based)

4
New cards

Cosine similarity

measures the angle between two datapoints. withstands issues with rating scales

5
New cards

cosine similarity values

-1: 180 angle, exactly opposite

0: 90 angle: no correlation

1: exactly the same

6
New cards

issues with cosine similarity

scaling: can’t predict the rating because rating scales differ

zeroes: fill in missing values with 0, but 0 implies a low rating

7
New cards

data scaling

scale all ratings from a user by their own average (subtract users average rating from all of their ratings). now 0 is average.

8
New cards

centered cosine

the cosine angle between scaled vectors

9
New cards

Predicting ratings

user-user model: find k nearest neighbors using centered cosine similarity. calculate the average of the scaled ratings of the neighbors to find the prediction for the rating for the user.

10
New cards

Latent Factors Model

Assume there exists some unknown set of factors that characterize both the items and the users, and that we can infer these factors from ratings. Can use matrix dependencies to fill holes.

11
New cards

finding matrix dependencies

  1. matrix factorization

  • matrix with items and characteristics x matrix with users and characteristics

  1. multiply to get ratings matrix

12
New cards

how to find the right factorization

machine will guess matrices until their product is close enough (number of latent factors is a hyper parameter). Use SSE to determine how far off the predicted matrix is from the target matrix. take partial derivative of the error with respect to each variable to find downhill slope.