Show simple item record

dc.contributor.authorPeris, Aina
dc.contributor.authorTaulé, Mariona
dc.contributor.authorBoleda Torrent, Gemma
dc.contributor.authorRodríguez Hontoria, Horacio
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics
dc.date.accessioned2010-11-23T11:31:32Z
dc.date.available2010-11-23T11:31:32Z
dc.date.created2010
dc.date.issued2010
dc.identifier.citationPeris, A. [et al.]. ADN-classifier: automatically assigning denotation types to nominalizations. A: International Conference on Language Resources and Evaluation. "International Conference on Language Resources and Evaluation". Valletta: 2010.
dc.identifier.isbn2-9517408-6-7
dc.identifier.urihttp://hdl.handle.net/2117/10374
dc.description.abstractThis paper presents the ADN-Classifier, an Automatic classification system of Spanish Deverbal Nominalizations aimed at identifying its semantic denotation (i.e. event, result, underspecified, or lexicalized). The classifier can be used for NLP tasks such as coreference resolution or paraphrase detection. To our knowledge, the ADN-Classifier is the first effort in acquisition of denotations for nominalizations using Machine Learning.We compare the results of the classifier when using a decreasing number of Knowledge Sources, namely (1) the complete nominal lexicon (AnCora-Nom) that includes sense distictions, (2) the nominal lexicon (AnCora-Nom) removing the sense-specific information, (3) nominalizations’ context information obtained from a treebank corpus (AnCora-Es) and (4) the combination of the previous linguistic resources. In a realistic scenario, that is, without sense distinction, the best results achieved are those taking into account the information declared in the lexicon (89.40% accuracy). This shows that the lexicon contains crucial information (such as argument structure) that corpus-derived features cannot substitute for.
dc.format.extent1 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcshSpanish Deverbal Nominalizations (Classification system)
dc.subject.lcshADN-Classifier (Automatic classification system)
dc.subject.lcshNatural language processing (Computer science)
dc.subject.lcshComputational linguistics -- Research
dc.titleADN-classifier: automatically assigning denotation types to nominalizations
dc.typeConference report
dc.subject.lemacLingüística computacional
dc.subject.lemacCorpus (Lingüística)
dc.subject.lemacCastellà -- Lexicografia
dc.contributor.groupUniversitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.description.peerreviewedPeer Reviewed
dc.rights.accessOpen Access
local.identifier.drac3259506
dc.description.versionPostprint (published version)
local.citation.authorPeris, A.; Taulé, M.; Boleda, G.; Rodriguez, H.
local.citation.contributorInternational Conference on Language Resources and Evaluation
local.citation.pubplaceValletta
local.citation.publicationNameInternational Conference on Language Resources and Evaluation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record