Dependency Syntax Notes

Syntax

Syntax is the study of sentence structure, answering "Who does what to whom?"
Several theories of syntax exist, and they share many commonalities: Government and Binding (GB), the Minimalist Program (MP), Head-driven Phrase Structure Grammar (HPSG), Lexical Functional Grammar (LFG), Categorial Grammar, and Dependency Grammar.

Theoretical syntacticians focus on grammaticality.
Grammaticality is relevant for NLP applications like text generation and grammar checking.
Parsing provides scaffolding for semantic analysis, aiding opinion mining, information extraction, and machine translation.

Form vs. Function
- Syntactic form uses parts of speech and phrases (NP, VP).
- Syntactic function describes roles in a sentence (Subject, Object, Adverbial).
Constituents
- Words are organized into groupings that function as a whole.
- Tested through linguistic tests of constituency.
Phrase Structure Grammar (PSG)
- Captures constituent status and ordering using context-free grammar.
- Example rules: S → NP VP, NP → D N, VP → V NP
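
The example rules can be tried out with a small top-down recognizer. This is only a sketch: the three rules come from the notes, while the lexicon (which words count as D, N, and V) and the function names `parse` and `parse_sentence` are assumptions made for illustration.

```python
# Phrase structure rules from the notes.
RULES = {
    "S": [["NP", "VP"]],
    "NP": [["D", "N"]],
    "VP": [["V", "NP"]],
}
# Lexicon assumed for illustration; it is not part of the notes.
LEXICON = {
    "D": {"the", "my"},
    "N": {"dog", "homework"},
    "V": {"ate"},
}

def parse(symbol, words, start):
    """Yield (tree, end) for every way `symbol` can cover words[start:end]."""
    # Part-of-speech symbol: match exactly one word from the lexicon.
    if symbol in LEXICON:
        if start < len(words) and words[start] in LEXICON[symbol]:
            yield (symbol, words[start]), start + 1
        return
    # Phrase symbol: try each rule, expanding its right-hand side left to right.
    for rhs in RULES.get(symbol, []):
        partials = [([], start)]
        for child in rhs:
            partials = [
                (kids + [tree], end)
                for kids, pos in partials
                for tree, end in parse(child, words, pos)
            ]
        for kids, end in partials:
            yield (symbol, *kids), end

def parse_sentence(sentence):
    """Return all parses rooted in S that cover the whole sentence."""
    words = sentence.lower().split()
    return [tree for tree, end in parse("S", words, 0) if end == len(words)]

trees = parse_sentence("The dog ate my homework")
print(trees[0])
```

The resulting tuple nests the constituents: an NP ("the dog") and a VP ("ate my homework") under S. The recursive approach works here because none of the rules is left-recursive; a grammar with left recursion would need a chart-based method instead.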

Dependency grammar is an alternative to phrase structure grammar.
Syntactic functions are central.
Syntactic structure consists of lexical items linked by binary asymmetric relations called dependencies.
Interest in dependency-based parsing for NLP is increasing.
It is useful in relation extraction, question answering, and sentiment analysis.

In the sentence "The dog ate my homework", relations include:
- ate →subj The dog
- ate →obj my homework
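
One possible way to store these relations is as labeled (head, dependent) arcs over word positions. In this sketch the `subj` and `obj` arcs come from the notes, attached to the head word of each phrase ("dog", "homework"); the `det` arcs and the `Arc` type are assumptions added so that every word except "ate" has a head.

```python
from typing import NamedTuple

class Arc(NamedTuple):
    head: int       # position of the head word (1-based)
    dependent: int  # position of the dependent word (1-based)
    label: str      # dependency type

words = ["The", "dog", "ate", "my", "homework"]

arcs = [
    Arc(3, 2, "subj"),  # ate -> dog (from the notes)
    Arc(3, 5, "obj"),   # ate -> homework (from the notes)
    Arc(2, 1, "det"),   # dog -> The (assumed)
    Arc(5, 4, "det"),   # homework -> my (assumed)
]

def describe(arc):
    """Render an arc as 'head -label-> dependent' using the word forms."""
    return f"{words[arc.head - 1]} -{arc.label}-> {words[arc.dependent - 1]}"

for arc in arcs:
    print(describe(arc))
```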

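Dependency structures explicitly represent head-dependent relations (directed arcs), functional categories (arc labels), and possibly some structural categories (parts of speech).
Phrase structures explicitly represent phrases (nonterminal nodes), structural categories (nonterminal labels), and possibly some functional categories (grammatical functions).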

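A dependency structure is defined as a directed graph G with:
- A set V of nodes (one per word, numbered by position in the sentence).
- A set E of arcs (edges).
- Labels: nodes are labeled with word forms, arcs with dependency types.
Notation:
- i → j ≡ (i, j) ∈ E
- i →∗ j ≡ i = j, or there is a k such that i → k and k →∗ j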

G is (weakly) connected: For every node i, there is a node j such that i → j or j → i; more generally, every node can be reached from every other node when arc direction is ignored.
G is acyclic: If i → j then not j →∗ i.
G obeys the single-head constraint: If i → j, then not k → j, for any k ≠ i.
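
The three conditions can be checked mechanically. The following is a minimal sketch, assuming nodes are integers and arcs are (head, dependent) pairs; the function names are hypothetical. Connectivity is tested in the usual graph-theoretic sense (a path between any two nodes when direction is ignored), which is stronger than the per-node condition stated above.

```python
def is_connected(nodes, arcs):
    """Weak connectivity: one undirected search reaches every node."""
    if not nodes:
        return True
    neighbours = {n: set() for n in nodes}
    for head, dep in arcs:
        neighbours[head].add(dep)
        neighbours[dep].add(head)
    start = next(iter(nodes))
    seen, stack = {start}, [start]
    while stack:
        for nxt in neighbours[stack.pop()] - seen:
            seen.add(nxt)
            stack.append(nxt)
    return seen == set(nodes)

def is_acyclic(nodes, arcs):
    """Repeatedly strip nodes with no incoming arc; a cycle blocks this."""
    alive, remaining = set(nodes), set(arcs)
    while alive:
        with_head = {dep for _, dep in remaining}
        roots = alive - with_head
        if not roots:
            return False  # every remaining node has a head, so a cycle exists
        alive -= roots
        remaining = {(h, d) for h, d in remaining if h in alive}
    return True

def has_single_heads(arcs):
    """No node appears as the dependent of two different arcs."""
    deps = [dep for _, dep in set(arcs)]
    return len(deps) == len(set(deps))

def is_well_formed(nodes, arcs):
    return (is_connected(nodes, arcs)
            and is_acyclic(nodes, arcs)
            and has_single_heads(arcs))

nodes = {1, 2, 3, 4, 5}
tree = {(3, 2), (3, 5), (2, 1), (5, 4)}
print(is_well_formed(nodes, tree))            # True
print(has_single_heads(tree | {(4, 2)}))      # False: node 2 has two heads
print(is_acyclic({1, 2}, {(1, 2), (2, 1)}))   # False: 1 -> 2 -> 1
```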

G is projective: If i → j, then i →∗ k for any k such that i < k < j or j < k < i.
Non-projective structures are needed for long-distance dependencies and free word order.
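
The projectivity condition translates fairly directly into code. This sketch assumes nodes are word positions and arcs are (head, dependent) pairs; the helper names are hypothetical, and the dependency analysis of the second example sentence is an assumed one, chosen to show a long-distance dependency.

```python
def descendants(head, children):
    """All k with head ->* k (the head itself plus everything below it)."""
    seen, stack = {head}, [head]
    while stack:
        for child in children.get(stack.pop(), ()):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen

def is_projective(arcs):
    """Every word strictly between a head and its dependent must descend from the head."""
    children = {}
    for head, dep in arcs:
        children.setdefault(head, []).append(dep)
    for head, dep in arcs:
        reach = descendants(head, children)
        lo, hi = min(head, dep), max(head, dep)
        if any(k not in reach for k in range(lo + 1, hi)):
            return False
    return True

# "The dog ate my homework": 1=The 2=dog 3=ate 4=my 5=homework
projective = [(3, 2), (3, 5), (2, 1), (5, 4)]

# "A hearing is scheduled on the issue today" (assumed analysis):
# 1=A 2=hearing 3=is 4=scheduled 5=on 6=the 7=issue 8=today
# The arc hearing -> on spans "is" and "scheduled", which are not below "hearing".
non_projective = [(3, 2), (2, 1), (3, 4), (2, 5), (5, 7), (7, 6), (4, 8)]

print(is_projective(projective))      # True
print(is_projective(non_projective))  # False
```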

Treebanks are collections of sentences manually annotated with syntactic analyses.
They are used to train data-driven NLP tools.
Examples: Penn Treebank, Prague Dependency Treebank, NEGRA/TüBa-D/Z, Penn Chinese Treebank, Norwegian Dependency Treebank, Universal Dependencies.
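
Universal Dependencies treebanks are distributed in the tab-separated CoNLL-U format, with ten columns per word (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). As a rough sketch of how such annotation can be read, the snippet below extracts labeled arcs from one sentence. The sample rows are hand-written for illustration, not taken from any treebank, and `read_arcs` is a hypothetical helper rather than part of any library.

```python
# Hand-written sample in CoNLL-U layout (illustrative only).
SAMPLE_ROWS = [
    ("1", "The", "the", "DET", "_", "_", "2", "det", "_", "_"),
    ("2", "dog", "dog", "NOUN", "_", "_", "3", "nsubj", "_", "_"),
    ("3", "ate", "eat", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4", "my", "my", "PRON", "_", "_", "5", "nmod:poss", "_", "_"),
    ("5", "homework", "homework", "NOUN", "_", "_", "3", "obj", "_", "_"),
]
SAMPLE = (
    "# text = The dog ate my homework\n"
    + "\n".join("\t".join(row) for row in SAMPLE_ROWS)
    + "\n"
)

def read_arcs(conllu_text):
    """Return (head, dependent, label) triples for one sentence; head 0 is the root."""
    arcs = []
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        arcs.append((int(cols[6]), int(cols[0]), cols[7]))
    return arcs

arcs = read_arcs(SAMPLE)
for head, dep, label in arcs:
    print(head, dep, label)
```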
