Show simple item record

dc.contributor.authorCatala Roig, Neus
dc.contributor.authorCastell Ariño, Núria
dc.contributor.authorMartín Muñoz, Mario
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Ciències de la Computació
dc.identifier.citationCatalà Roig, N., Castell, N., Martin, M. "Using ESSENCE for acquiring information extraction patterns". 2000.
dc.description.abstractOne important issue when constructing Information Extraction systems is how to obtain the knowledge needed for identifying relevant information in a document. In most approaches to this issue, the human expert intervention is necessary in many steps of the acquisition process. In this paper we describe {sc Essence}, a new methodology that reduces significantly the need for human intervention. It is based on ELA, a new algorithm for acquiring information extraction patterns. The distinctive features of {sc Essence} and ELA are that 1) allow to automatically acquire IE patterns from unrestricted text corpus representative of the domain, due to 2) the ability of identifying surrounding context regularities for semantically relevant concept-words for the IE task by using non domain specific lexical knowledge tools and semantic relations from WordNet, and 3) restricting the human intervention to only the definition of the task and the validation and typification of the set of IE patterns obtained. The use of a general purpose ontology and syntactic tools of general application allows the easy portability of the methodology and reduces the expert effort. Results of the application of this methodology for acquiring extraction patterns in a MUC-like task are also shown.
dc.format.extent15 p.
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.otherAcquiring information extraction patterns
dc.titleUsing ESSENCE for acquiring information extraction patterns
dc.typeExternal research report
dc.contributor.groupUniversitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.contributor.groupUniversitat Politècnica de Catalunya. KEMLG - Grup d'Enginyeria del Coneixement i Aprenentatge Automàtic
dc.rights.accessOpen Access
dc.description.versionPostprint (published version)
upcommons.citation.authorCatalà Roig, N., Castell, N., Martin, M.

Files in this item


This item appears in the following Collection(s)

Show simple item record

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder