Show simple item record

dc.contributor.authorKoo, Terry
dc.contributor.authorCarreras Pérez, Xavier
dc.contributor.authorCollins, Michael
dc.date.accessioned2010-10-19T11:42:07Z
dc.date.available2010-10-19T11:42:07Z
dc.date.created2008
dc.date.issued2008
dc.identifier.citationKoo, T.; Carreras, X.; Collins, M. Simple semi-supervised dependency parsing. A: Annual Meeting of the Association for Computational Linguistics. "46th Annual Meeting of the Association for Computational Linguistics". Columbus, Ohio: 2008, p. 595-603.
dc.identifier.urihttp://hdl.handle.net/2117/9808
dc.description.abstractWe present a simple and effective semisupervised method for training dependency parsers. We focus on the problem of lexical representation, introducing features that incorporate word clusters derived from a large unannotated corpus. We demonstrate the effectiveness of the approach in a series of dependency parsing experiments on the Penn Treebank and Prague Dependency Treebank, and we show that the cluster-based features yield substantial gains in performance across a wide range of conditions. For example, in the case of English unlabeled second-order parsing, we improve from a baseline accuracy of 92:02% to 93:16%, and in the case of Czech unlabeled second-order parsing, we improve from a baseline accuracy of 86:13% to 87:13%. In addition, we demonstrate that our method also improves performance when small amounts of training data are available, and can roughly halve the amount of supervised data required to reach a desired level of performance.
dc.format.extent9 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subject.lcshComputational linguistics
dc.titleSimple semi-supervised dependency parsing
dc.typeConference report
dc.subject.lemacLingüística computacional
dc.contributor.groupUniversitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://aclweb.org/anthology-new/P/P08/P08-1068.pdf
dc.rights.accessOpen Access
local.identifier.drac2752817
dc.description.versionPostprint (author’s final draft)
local.citation.authorKoo, T.; Carreras, X.; Collins, M.
local.citation.contributorAnnual Meeting of the Association for Computational Linguistics
local.citation.pubplaceColumbus, Ohio
local.citation.publicationName46th Annual Meeting of the Association for Computational Linguistics
local.citation.startingPage595
local.citation.endingPage603


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record