TAG, dynamic programming, and the perceptron for efficient, feature-rich parsing
Document typeConference report
PublisherColing 2008 Organizing Committee
Rights accessOpen Access
We describe a parsing approach that makes use of the perceptron algorithm, in conjunction with dynamic programming methods, to recover full constituent-based parse trees. The formalism allows a rich set of parse-tree features, including PCFGbased features, bigram and trigram dependency features, and surface features. A severe challenge in applying such an approach to full syntactic parsing is the efficiency of the parsing algorithms involved. We show that efficient training is feasible, using a Tree Adjoining Grammar (TAG) based parsing formalism. A lower-order dependency parsing model is used to restrict the search space of the full model, thereby making it efficient. Experiments on the Penn WSJ treebank show that the model achieves state-of-the-art performance, for both constituent and dependency accuracy.
CitationCarreras, X.; Collins, M.; Koo, T. TAG, dynamic programming, and the perceptron for efficient, feature-rich parsing. A: Conference on Computational Natural Language Learning. "12th Conference on Computational Natural Language Learning (CoNLL-2011)". Manchester: Coling 2008 Organizing Committee, 2008, p. 9-16.