What is sequence labeling in NLP?
The task of assigning a label to each token in a sequence.
E.g.:
Part-of-Speech tagging (POS tagging)
Named Entity Recognition (NER)
Information extraction
Uses and challenges of sequence labelling
Useful: generalizes information retrieval
Challenges:
Understanding meaning from text is difficult
Word ambiguity
Context sensitivity
BIO tagging
Tagging format for sequences.
B: Begin
I: Inside
O: Outside
E.g.:
The New York Times reported
B-ORG: The
I-ORG: New, York, Times
O: reported
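As a worked illustration of the format, a minimal Python sketch (variable names are my own, not from the cards) that stores the example sentence as parallel token/tag lists and recovers the entity span:

```python
# BIO-tagged example from the card: "The New York Times reported"
tokens = ["The", "New", "York", "Times", "reported"]
tags = ["B-ORG", "I-ORG", "I-ORG", "I-ORG", "O"]

# Recover entity spans: B- starts a new entity, I- continues it, O ends it.
entities, current = [], []
for token, tag in zip(tokens, tags):
    if tag.startswith("B-"):
        if current:
            entities.append(" ".join(current))
        current = [token]
    elif tag.startswith("I-") and current:
        current.append(token)
    else:
        if current:
            entities.append(" ".join(current))
        current = []
if current:
    entities.append(" ".join(current))

print(entities)  # ['The New York Times']
```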
Sequence labeling as classification
The classifier uses the words around the target token
E.g.:
"seems like it"
To classify "like", look at "seems" and "it"
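A minimal sketch of that idea as window features for a per-token classifier (the feature names and boundary markers are illustrative assumptions):

```python
# Classify each token using the words immediately around it.
def window_features(tokens, i):
    return {
        "word": tokens[i],
        "prev": tokens[i - 1] if i > 0 else "<s>",                 # start marker
        "next": tokens[i + 1] if i + 1 < len(tokens) else "</s>",  # end marker
    }

print(window_features(["seems", "like", "it"], 1))
# {'word': 'like', 'prev': 'seems', 'next': 'it'}
```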
Cons of sequence labeling as classification
Cannot determine the most likely joint labeling of all tokens,
because it does not model the dependencies between labels
Cannot change earlier decisions once they are made
What is a Hidden Markov Model (HMM)?
Probabilistic model
Predicts tag sequences based on state transitions.
Markov assumption
The next state depends only on the current state
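In symbols: P(q_{t+1} | q_1, …, q_t) = P(q_{t+1} | q_t)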
What are the three main problems solved in HMMs?
Given a model λ and an observation sequence O = O1 O2 … OT:
Evaluation
What is the probability that the observations were generated by the model? P(O|λ)
Decoding
What is the most likely state sequence in the model that produced the observations?
Learning
How should we adjust the model's parameters to maximize P(O|λ)?
HMM Evaluation (Likelihood)
Forward Algorithm Complexity
O(T·N²)
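A minimal forward-algorithm sketch (assuming NumPy arrays: transition matrix A, emission matrix B, initial distribution pi; my notation, not from the cards):

```python
import numpy as np

def forward_likelihood(A, B, pi, obs):
    """P(O | lambda) via the forward algorithm in O(T * N^2) time.

    A:   (N, N) transitions, A[i, j] = P(s_j at t+1 | s_i at t)
    B:   (N, V) emissions,   B[j, k] = P(v_k | s_j)
    pi:  (N,)   initial state distribution
    obs: observation indices o_1 ... o_T
    """
    alpha = pi * B[:, obs[0]]          # initialization
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]  # recursion: sum over previous states
    return alpha.sum()                 # termination: sum over final states
```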
What algorithm is used for decoding in HMMs?
Viterbi Algorithm
Viterbi Algorithm
Find the best path of length t-1 ending in each state.
Extend each path by one step to state s_j.
Take the best option (max) and save a backpointer to recover the best path.
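A minimal Viterbi sketch in the same assumed matrix notation as the forward example (max and backpointers replace the forward sum):

```python
import numpy as np

def viterbi(A, B, pi, obs):
    """Most likely state sequence for obs, also O(T * N^2)."""
    N, T = A.shape[0], len(obs)
    delta = pi * B[:, obs[0]]            # best path score ending in each state
    back = np.zeros((T, N), dtype=int)   # backpointers
    for t in range(1, T):
        scores = delta[:, None] * A      # extend every path by one step
        back[t] = scores.argmax(axis=0)  # best predecessor for each state
        delta = scores.max(axis=0) * B[:, obs[t]]
    path = [int(delta.argmax())]         # best final state
    for t in range(T - 1, 0, -1):        # follow backpointers
        path.append(int(back[t, path[-1]]))
    return path[::-1]
```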
Beam Search
Inexact
Keep only the best k hypotheses at each step
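A minimal beam-search sketch over the same assumed HMM matrices, pruning to the k best hypotheses per step:

```python
import numpy as np

def beam_search(A, B, pi, obs, k=3):
    """Approximate best state sequence; keeps only k hypotheses per step."""
    N = A.shape[0]
    beam = [(pi[s] * B[s, obs[0]], [s]) for s in range(N)]
    beam = sorted(beam, key=lambda h: h[0], reverse=True)[:k]
    for o in obs[1:]:
        candidates = [
            (score * A[path[-1], s] * B[s, o], path + [s])
            for score, path in beam
            for s in range(N)   # extend each surviving hypothesis by one state
        ]
        beam = sorted(candidates, key=lambda h: h[0], reverse=True)[:k]
    return beam[0][1]           # path of the best surviving hypothesis
```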
Difference between Viterbi and Beam Search
Viterbi: performs exact search (under the Markov assumption) by evaluating all options.
Beam Search: faster but inexact; skips evaluating some label sequences.
POS tagging
States = POS tags, observations = words
HMM Learning - Supervised
Training instances with labeled tags
Learning with Maximum Likelihood Estimation (MLE)
Transition probabilities (a_ij):
a_ij = Count(q_t = s_i, q_{t+1} = s_j) / Count(q_t = s_i)
Observation likelihoods (b_j(k)):
b_j(k) = Count(q_t = s_j, o_t = v_k) / Count(q_t = s_j)
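A minimal supervised-MLE sketch matching those counts (assumes tagged sentences given as lists of (word, tag) pairs; no smoothing):

```python
from collections import Counter

def mle_estimate(tagged_sentences):
    """Estimate HMM parameters by relative-frequency counting."""
    trans, emit = Counter(), Counter()
    trans_denom, emit_denom = Counter(), Counter()
    for sent in tagged_sentences:
        tags = [t for _, t in sent]
        for word, tag in sent:
            emit[(tag, word)] += 1   # Count(q_t = s_j, o_t = v_k)
            emit_denom[tag] += 1     # Count(q_t = s_j)
        for t1, t2 in zip(tags, tags[1:]):
            trans[(t1, t2)] += 1     # Count(q_t = s_i, q_{t+1} = s_j)
            trans_denom[t1] += 1     # Count(q_t = s_i) in non-final positions
    a = {(i, j): c / trans_denom[i] for (i, j), c in trans.items()}
    b = {(j, w): c / emit_denom[j] for (j, w), c in emit.items()}
    return a, b

a, b = mle_estimate([[("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")]])
print(a[("DET", "NOUN")], b[("NOUN", "dog")])  # 1.0 1.0
```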