Information Retrieval - Lecture 20 Vocabulary

Description and Tags

Flashcards covering key vocabulary and concepts related to Information Retrieval, including retrieval models, relevance, ranked retrieval, Jaccard coefficient, bag of words model, term frequency, document frequency, and tf-idf weighting.

13 Terms

1

Retrieval Models

Mathematical frameworks for defining the search process. A retrieval model spells out its assumptions, provides the basis for ranking algorithms, and can act as an implicit theory of relevance.

2

Relevance

A complex, long-studied concept that depends on many factors; people often disagree when making relevance judgments.

3

Ranked retrieval models

Retrieval models in which the system returns an ordering over the (top) documents in the collection for a query, rather than an unordered set of matching documents.

4

Free text queries

Queries that consist of just one or more words in a human language, rather than an expression built from query-language operators.

5

Scoring

Assigning a score to each document that measures how well the document and the query match.

6

Jaccard coefficient

A commonly used measure of overlap of two sets A and B, calculated as |A ∩ B| / |A ∪ B|.
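
As an illustration, the coefficient can be computed directly with Python sets. This is a minimal sketch: the function name `jaccard`, the whitespace tokenization, and the example query and documents are assumptions made here, not part of the card. Returning 0.0 when both sets are empty is also a convention chosen for the sketch, since the ratio is undefined in that case.

```python
def jaccard(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B| of two iterables treated as sets."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        # Both sets are empty, so the ratio is undefined.
        # Returning 0.0 is a convention chosen for this sketch.
        return 0.0
    return len(set_a & set_b) / len(union)


# Hypothetical query and documents, tokenized on whitespace.
query = "ides of march".split()
doc1 = "caesar died in march".split()
doc2 = "the long march".split()

print(jaccard(query, doc1))  # 1 shared term out of 6 distinct terms
print(jaccard(query, doc2))  # 1 shared term out of 5 distinct terms
```

Both documents share exactly one term with the query, but the shorter document scores higher because its union with the query is smaller. The score ignores how often a term occurs and how rare it is in the collection.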

7

Bag of words model

Vector representation that doesn’t consider the ordering of words in a document.
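
A small sketch may make the loss of word order concrete. It assumes lowercasing and whitespace tokenization, and the helper name `bag_of_words` is hypothetical; a real system would use a proper tokenizer.

```python
from collections import Counter


def bag_of_words(text):
    """Map each lowercased, whitespace-separated token to its count."""
    return Counter(text.lower().split())


doc_a = bag_of_words("John is quicker than Mary")
doc_b = bag_of_words("Mary is quicker than John")

# The two sentences mean different things but get the same representation.
print(doc_a == doc_b)
```

The counts stored in each bag are the raw term frequencies of the document.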

8

Term frequency (tf)

The number of times that term t occurs in document d.

9

Log-frequency weighting

The log-frequency weight of term t in document d is w_{t,d} = 1 + log10(tf_{t,d}) if tf_{t,d} > 0, and 0 otherwise.
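
The dampening effect of the logarithm is easy to see numerically. The sketch below assumes a hypothetical helper `log_tf_weight` that takes a raw term count.

```python
import math


def log_tf_weight(tf):
    """Log-frequency weight: 1 + log10(tf) for tf > 0, otherwise 0."""
    return 1 + math.log10(tf) if tf > 0 else 0.0


# Print the weight for a range of raw counts.
for tf in (0, 1, 2, 10, 1000):
    print(tf, round(log_tf_weight(tf), 2))
```

A term that occurs 1000 times gets a weight of 4 rather than 1000, so relevance grows with term frequency but not in proportion to it.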

10

Document frequency (df)

The number of documents that contain t.

11

Inverse document frequency (idf)

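A measure of the informativeness of term t, defined as idf_t = log10(N / df_t), where N is the number of documents in the collection and df_t is the document frequency of t. Document frequency is itself an inverse measure of informativeness, so rarer terms get a higher idf.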
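
Using the common definition idf_t = log10(N / df_t), with N the number of documents in the collection, the behaviour can be sketched as follows. The helper name `idf` and the collection size of one million documents are assumptions chosen for illustration.

```python
import math


def idf(df, n_docs):
    """Inverse document frequency log10(N / df) for a term found in df documents."""
    if df <= 0 or df > n_docs:
        raise ValueError("df must satisfy 0 < df <= n_docs")
    return math.log10(n_docs / df)


N = 1_000_000  # hypothetical collection size
for df in (1, 100, 10_000, 1_000_000):
    print(df, idf(df, N))
```

A term that appears in every document gets an idf of 0, while a term that appears in only one document gets the maximum value, log10(N).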

12

Collection frequency

The number of occurrences of t in the collection, counting multiple occurrences.
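
The difference between collection frequency and document frequency shows up on even a tiny collection. The three documents and both helper names below are made up for this sketch, and tokenization is plain whitespace splitting.

```python
# A tiny hypothetical collection.
docs = [
    "insurance insurance insurance try",
    "try to try",
    "insurance policy",
]


def collection_frequency(term, docs):
    """Total occurrences of term across all documents, repeats included."""
    return sum(doc.split().count(term) for doc in docs)


def document_frequency(term, docs):
    """Number of documents that contain term at least once."""
    return sum(1 for doc in docs if term in doc.split())


for term in ("insurance", "try"):
    print(term, collection_frequency(term, docs), document_frequency(term, docs))
```

Here "insurance" and "try" appear in the same number of documents but have different collection frequencies, because repeats inside a single document count toward collection frequency only.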

13

tf-idf weighting

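The tf-idf weight of a term in a document is the product of its tf weight and its idf weight: w_{t,d} = (1 + log10 tf_{t,d}) × log10(N / df_t). It increases with the number of occurrences within the document and with the rarity of the term in the collection.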
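
Putting the pieces together, the sketch below computes tf-idf weights for a tiny collection and ranks the documents for a free text query. Several things here are assumptions rather than content from the cards: the log-frequency tf weight and log10(N / df) idf weight are one common choice, the score is taken to be the sum of the weights of the query terms found in the document, and the function names and example documents are hypothetical.

```python
import math
from collections import Counter


def tf_idf_weights(docs):
    """Return one {term: weight} dict per document.

    Weight used: (1 + log10 tf) * log10(N / df), assumed for this sketch.
    """
    bags = [Counter(doc.lower().split()) for doc in docs]
    n_docs = len(bags)
    df = Counter()
    for bag in bags:
        df.update(bag.keys())  # each document counts once per term
    return [
        {t: (1 + math.log10(tf)) * math.log10(n_docs / df[t]) for t, tf in bag.items()}
        for bag in bags
    ]


def score(query, weights):
    """Sum the tf-idf weights of the distinct query terms present in the document."""
    return sum(weights.get(t, 0.0) for t in set(query.lower().split()))


docs = ["caesar died in march", "the long march", "caesar caesar brutus"]
weights = tf_idf_weights(docs)

# Rank document indices by descending score for the query.
ranking = sorted(
    range(len(docs)),
    key=lambda i: score("brutus caesar", weights[i]),
    reverse=True,
)
print(ranking)  # [2, 0, 1]
```

The third document ranks first because it contains both query terms, including the rarer "brutus" and a repeated "caesar"; the second document contains neither term and scores 0.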