Dependency parsing

Dependency parsing is the task of extracting a dependency parse of a sentence, which represents its grammatical structure and defines the relationships between "head" words and the words that modify those heads.

Example:

     root
      |
      | +-------dobj---------+
      | |                    |
nsubj | |   +------det-----+ | +-----nmod------+
+--+  | |   |              | | |               |
|  |  | |   |      +-nmod-+| | |      +-case-+ |
+  |  + |   +      +      || + |      +      | |
I  prefer  the  morning   flight  through  Denver

Relations among the words are illustrated above the sentence with directed, labeled arcs from heads to dependents (+ indicates the dependent).
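
A common way to hold such a parse in code is to store, for every token, the index of its head and the label of the relation. The sketch below encodes the example sentence this way. The `Token` class and the `arcs` helper are illustrative names rather than part of any particular library, and index 0 is assumed to stand for the artificial root.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    index: int  # 1-based position in the sentence
    form: str   # surface word
    head: int   # index of the head token; 0 is the artificial root (assumption)
    label: str  # dependency relation between the head and this token


# "I prefer the morning flight through Denver", following the diagram above.
SENTENCE = [
    Token(1, "I", 2, "nsubj"),
    Token(2, "prefer", 0, "root"),
    Token(3, "the", 5, "det"),
    Token(4, "morning", 5, "nmod"),
    Token(5, "flight", 2, "dobj"),
    Token(6, "through", 7, "case"),
    Token(7, "Denver", 5, "nmod"),
]


def arcs(tokens):
    """Return one (head_form, label, dependent_form) triple per token."""
    forms = {t.index: t.form for t in tokens}
    forms[0] = "ROOT"
    return [(forms[t.head], t.label, t.form) for t in tokens]


if __name__ == "__main__":
    for head, label, dependent in arcs(SENTENCE):
        print(f"{label}({head}, {dependent})")
```

Because every token has exactly one head, a flat list of this kind is enough to recover the whole tree.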

Penn Treebank

Models are evaluated on the Stanford Dependency conversion (v3.3.0) of the Penn Treebank with predicted POS tags. Punctuation symbols are excluded from the evaluation. The evaluation metrics are unlabeled attachment score (UAS) and labeled attachment score (LAS). UAS counts a token as correct when its predicted head is correct and ignores the relation label (e.g. nsubj) on the attachment between the head and the child, while LAS additionally requires the correct label for each attachment. The POS column reports the accuracy of the predicted POS tags.
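
The two metrics can be sketched in a few lines. This is a minimal illustration rather than the evaluation script used by the papers below. The function name `attachment_scores`, its input layout (one `(head, label)` pair per token), and the toy sentence are all assumptions made for the example.

```python
def attachment_scores(gold, pred, is_punct=None):
    """Return (UAS, LAS) in percent.

    gold, pred: lists of (head_index, label) pairs, one per token.
    is_punct:   optional list of booleans; tokens marked True are skipped,
                mirroring the exclusion of punctuation described above.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and predicted parses differ in length")
    if is_punct is None:
        is_punct = [False] * len(gold)

    total = correct_heads = correct_both = 0
    for (g_head, g_label), (p_head, p_label), punct in zip(gold, pred, is_punct):
        if punct:
            continue
        total += 1
        if g_head == p_head:
            correct_heads += 1           # counts towards UAS
            if g_label == p_label:
                correct_both += 1        # counts towards LAS as well
    if total == 0:
        return 0.0, 0.0
    return 100.0 * correct_heads / total, 100.0 * correct_both / total


# Hypothetical toy data for "I prefer the morning flight ."
GOLD = [(2, "nsubj"), (0, "root"), (5, "det"), (5, "nmod"), (2, "dobj"), (2, "punct")]
PRED = [(2, "nsubj"), (0, "root"), (5, "det"), (2, "nmod"), (2, "iobj"), (5, "punct")]
IS_PUNCT = [False, False, False, False, False, True]

if __name__ == "__main__":
    uas, las = attachment_scores(GOLD, PRED, IS_PUNCT)
    print(f"UAS = {uas:.1f}, LAS = {las:.1f}")  # UAS = 80.0, LAS = 60.0
```

In the toy data, "morning" gets the wrong head (an error for both metrics) and "flight" gets the right head with the wrong label (an error for LAS only), so four of five scored tokens count for UAS and three of five for LAS. The wrong head on the final period is ignored.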

| Model | POS | UAS | LAS | Paper / Source | Code |
|---|---|---|---|---|---|
| Label Attention Layer + HPSG + XLNet (Mrini et al., 2019) | 97.3 | 97.42 | 96.26 | Rethinking Self-Attention: Towards Interpretability for Neural Parsing | Official |
| Pre-training + XLNet (Tian et al. 2022) | - | 97.30 | 95.92 | Enhancing Structure-aware Encoder with Extremely Limited Data for Graph-based Dependency Parsing | Official |
| ACE + fine-tune (Wang et al., 2020) | - | 97.20 | 95.80 | Automated Concatenation of Embeddings for Structured Prediction | Official |
| HPSG Parser (Joint) + XLNet (Zhou et al, 2020) | 97.3 | 97.20 | 95.72 | Head-Driven Phrase Structure Grammar Parsing on Penn Treebank | Official |
| Second-Order MFVI + BERT (Wang et al., 2020) | - | 96.91 | 95.34 | Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training | Official |
| CVT + Multi-Task (Clark et al., 2018) | 97.74 | 96.61 | 95.02 | Semi-Supervised Sequence Modeling with Cross-View Training | Official |
| CRF Parser (Zhang et al., 2020) | - | 96.14 | 94.49 | Efficient Second-Order TreeCRF for Neural Dependency Parsing | Official |
| Second-Order MFVI (Wang et al., 2020) | - | 96.12 | 94.47 | Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training | Official |
| Left-to-Right Pointer Network (Fernández-González and Gómez-Rodríguez, 2019) | 97.3 | 96.04 | 94.43 | Left-to-Right Dependency Parsing with Pointer Networks | Official |
| Graph-based parser with GNNs (Ji et al., 2019) | 97.3 | 95.97 | 94.31 | Graph-based Dependency Parsing with Graph Neural Networks | |
| Deep Biaffine (Dozat and Manning, 2017) | 97.3 | 95.74 | 94.08 | Deep Biaffine Attention for Neural Dependency Parsing | Official |
| jPTDP (Nguyen and Verspoor, 2018) | 97.97 | 94.51 | 92.87 | An improved neural network model for joint POS tagging and dependency parsing | Official |
| Andor et al. (2016) | 97.44 | 94.61 | 92.79 | Globally Normalized Transition-Based Neural Networks | |
| Distilled neural FOG (Kuncoro et al., 2016) | 97.3 | 94.26 | 92.06 | Distilling an Ensemble of Greedy Dependency Parsers into One MST Parser | |
| Distilled transition-based parser (Liu et al., 2018) | 97.3 | 94.05 | 92.14 | Distilling Knowledge for Search-based Structured Prediction | Official |
| Weiss et al. (2015) | 97.44 | 93.99 | 92.05 | Structured Training for Neural Network Transition-Based Parsing | |
| BIST transition-based parser (Kiperwasser and Goldberg, 2016) | 97.3 | 93.9 | 91.9 | Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations | Official |
| Arc-hybrid (Ballesteros et al., 2016) | 97.3 | 93.56 | 91.42 | Training with Exploration Improves a Greedy Stack-LSTM Parser | |
| BIST graph-based parser (Kiperwasser and Goldberg, 2016) | 97.3 | 93.1 | 91.0 | Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations | Official |

Universal Dependencies

The focus of the task is learning syntactic dependency parsers that can work in a real-world setting, starting from raw text, and that can work over many typologically different languages, even low-resource languages for which there is little or no training data, by exploiting a common syntactic annotation standard. This task has been made possible by the Universal Dependencies initiative (UD, http://universaldependencies.org), which has developed treebanks for 60+ languages with cross-linguistically consistent annotation and recoverability of the original raw texts.

Participating systems had to find labeled syntactic dependencies between words, i.e. a syntactic head for each word and a label classifying the type of the dependency relation. In addition to syntactic dependencies, prediction of morphology and lemmatization was evaluated. There were multiple test sets in various languages, but all data sets adhered to the common annotation style of UD. Participants were asked to parse raw text for which no gold-standard pre-processing (tokenization, lemmas, morphology) was available. Data preprocessed by a baseline system (UDPipe, https://ufal.mff.cuni.cz/udpipe) was provided so that participants could focus on improving just one part of the processing pipeline. The organizers believed that this made the task reasonably accessible for everyone.
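
UD treebanks are distributed in the CoNLL-U format, which has one token per line and ten tab-separated columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The sketch below is a minimal reader that keeps only the head and relation of each word. It is not the shared task's evaluation tooling; the function name `read_conllu` and the three-word sample are assumptions made for illustration.

```python
def read_conllu(text):
    """Parse CoNLL-U text into sentences of (id, form, head, deprel) tuples."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment, e.g. "# text = ..."
        cols = line.split("\t")
        if len(cols) != 10:
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        tok_id = cols[0]
        if "-" in tok_id or "." in tok_id:
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        head = int(cols[6]) if cols[6] != "_" else None  # "_" means unannotated
        current.append((int(tok_id), cols[1], head, cols[7]))
    if current:
        sentences.append(current)
    return sentences


def _row(*cols):
    return "\t".join(cols)


# Hypothetical sample sentence, written out by hand for this example.
SAMPLE = "\n".join([
    "# text = I prefer flights",
    _row("1", "I", "I", "PRON", "PRP", "_", "2", "nsubj", "_", "_"),
    _row("2", "prefer", "prefer", "VERB", "VBP", "_", "0", "root", "_", "_"),
    _row("3", "flights", "flight", "NOUN", "NNS", "_", "2", "obj", "_", "_"),
    "",
])

if __name__ == "__main__":
    for sentence in read_conllu(SAMPLE):
        for tok_id, form, head, deprel in sentence:
            print(tok_id, form, head, deprel)
```

The LEMMA, UPOS and FEATS columns that the reader discards are the ones the task's lemmatization and morphology evaluation draws on, so a fuller reader would keep them.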

| Model | LAS | MLAS | BLEX | Paper / Source | Code |
|---|---|---|---|---|---|
| Stanford (Qi et al.) | 74.16 | 62.08 | 65.28 | Universal Dependency Parsing from Scratch | Official |
| UDPipe Future (Straka) | 73.11 | 61.25 | 64.49 | UDPipe 2.0 Prototype at CoNLL 2018 UD Shared Task | Official |
| HIT-SCIR (Che et al.) | 75.84 | 59.78 | 65.33 | Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation | |
| TurkuNLP (Kanerva et al.) | 73.28 | 60.99 | 66.09 | Turku Neural Parser Pipeline: An End-to-End System for the CoNLL 2018 Shared Task | Official |

The following results are provided for reference only:

| Model | UAS | LAS | Note | Paper / Source |
|---|---|---|---|---|
| Stack-only RNNG (Kuncoro et al., 2017) | 95.8 | 94.6 | Constituent parser | What Do Recurrent Neural Network Grammars Learn About Syntax? |
| Deep Biaffine (Dozat and Manning, 2017) | 95.75 | 94.22 | Stanford conversion v3.5.0 | Deep Biaffine Attention for Neural Dependency Parsing |
| Semi-supervised LSTM-LM (Choe and Charniak, 2016) (Constituent parser) | 95.9 | 94.1 | Constituent parser | Parsing as Language Modeling |

Cross-lingual zero-shot dependency parsing

Cross-lingual zero-shot parsing is the task of inferring the dependency parse of sentences from one language without any labeled training trees for that language.

Universal Dependency Treebank

Models are evaluated against the Universal Dependency Treebank v2.0. For each of the 6 target languages, models may train on the trees of all other languages plus English and are evaluated by UAS and LAS on the target language. The final score is the average across the 6 target languages. The most common evaluation setup is to use gold POS tags.
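
The final score described above is an unweighted mean over target languages, so every language counts equally regardless of test-set size. A minimal sketch follows. The language keys and the numbers are placeholders chosen for the example, not results from any system, and `average_over_targets` is an illustrative name.

```python
from statistics import mean


def average_over_targets(per_language):
    """per_language maps a language code to (UAS, LAS); returns the mean pair."""
    if not per_language:
        raise ValueError("no target languages given")
    uas = mean(scores[0] for scores in per_language.values())
    las = mean(scores[1] for scores in per_language.values())
    return uas, las


# Placeholder scores for six hypothetical target languages.
SCORES = {
    "lang1": (70.0, 60.0),
    "lang2": (80.0, 70.0),
    "lang3": (75.0, 65.0),
    "lang4": (65.0, 55.0),
    "lang5": (85.0, 75.0),
    "lang6": (75.0, 65.0),
}

if __name__ == "__main__":
    uas, las = average_over_targets(SCORES)
    print(f"average UAS = {uas:.1f}, average LAS = {las:.1f}")
```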

| Model | UAS | LAS | Paper / Source | Code |
|---|---|---|---|---|
| XLM-R + SubDP (Shi et al., 2022) | --- | 79.6* | Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing | Official |
| Cross-Lingual ELMo (Schuster et al., 2019) | 84.2 | 77.3 | Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing | Official |
| MALOPA (Ammar et al., 2016) | | 70.5 | Many Languages, One Parser | Official |
| Guo et al. (2016) | 76.7 | 69.9 | A representation learning framework for multi-source transfer parsing | |

*: Evaluated on four target languages.

Unsupervised dependency parsing

Unsupervised dependency parsing is the task of inferring the dependency parse of sentences without any labeled training data.

Penn Treebank

As with supervised parsing, models are evaluated against the Penn Treebank. The most common evaluation setup is to use gold POS-tags as input and to evaluate systems using the unlabeled attachment score (also called 'directed dependency accuracy').

| Model | UAS | Paper / Source |
|---|---|---|
| Iterative reranking (Le & Zuidema, 2015) | 66.2 | Unsupervised Dependency Parsing - Let’s Use Supervised Parsers |
| Combined System (Spitkovsky et al., 2013) | 64.4 | Breaking Out of Local Optima with Count Transforms and Model Recombination - A Study in Grammar Induction |
| Tree Substitution Grammar DMV (Blunsom & Cohn, 2010) | 55.7 | Unsupervised Induction of Tree Substitution Grammars for Dependency Parsing |
| Shared Logistic Normal DMV (Cohen & Smith, 2009) | 41.4 | Shared Logistic Normal Distributions for Soft Parameter Tying in Unsupervised Grammar Induction |
| DMV (Klein & Manning, 2004) | 35.9 | Corpus-Based Induction of Syntactic Structure - Models of Dependency and Constituency |
