Mostra el registre d'ítem simple
Using ESSENCE for acquiring information extraction patterns
dc.contributor.author | Catala Roig, Neus |
dc.contributor.author | Castell Ariño, Núria |
dc.contributor.author | Martín Muñoz, Mario |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació |
dc.date.accessioned | 2016-11-10T15:43:30Z |
dc.date.available | 2016-11-10T15:43:30Z |
dc.date.issued | 2000-11 |
dc.identifier.citation | Català Roig, N., Castell, N., Martin, M. "Using ESSENCE for acquiring information extraction patterns". 2000. |
dc.identifier.uri | http://hdl.handle.net/2117/96498 |
dc.description.abstract | One important issue when constructing Information Extraction systems is how to obtain the knowledge needed for identifying relevant information in a document. In most approaches to this issue, the human expert intervention is necessary in many steps of the acquisition process. In this paper we describe {sc Essence}, a new methodology that reduces significantly the need for human intervention. It is based on ELA, a new algorithm for acquiring information extraction patterns. The distinctive features of {sc Essence} and ELA are that 1) allow to automatically acquire IE patterns from unrestricted text corpus representative of the domain, due to 2) the ability of identifying surrounding context regularities for semantically relevant concept-words for the IE task by using non domain specific lexical knowledge tools and semantic relations from WordNet, and 3) restricting the human intervention to only the definition of the task and the validation and typification of the set of IE patterns obtained. The use of a general purpose ontology and syntactic tools of general application allows the easy portability of the methodology and reduces the expert effort. Results of the application of this methodology for acquiring extraction patterns in a MUC-like task are also shown. |
dc.format.extent | 15 p. |
dc.language.iso | eng |
dc.relation.ispartofseries | LSI-00-65-R |
dc.subject | Àrees temàtiques de la UPC::Informàtica |
dc.subject.other | Acquiring information extraction patterns |
dc.subject.other | ESSENCE |
dc.title | Using ESSENCE for acquiring information extraction patterns |
dc.type | External research report |
dc.contributor.group | Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural |
dc.contributor.group | Universitat Politècnica de Catalunya. KEMLG - Grup d'Enginyeria del Coneixement i Aprenentatge Automàtic |
dc.rights.access | Open Access |
local.identifier.drac | 1874987 |
dc.description.version | Postprint (published version) |
local.citation.author | Català Roig, N.; Castell, N.; Martin, M. |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Reports de recerca [96]
-
Reports de recerca [1.107]
-
Reports de recerca [88]