Now showing items 1-20 of 58

    • A Bootstrapping architecture for time expression recognition in unlabelled corpora via syntactic-semantic patterns 

      Poveda Poveda, Jordi; Surdeanu, Mihai; Turmo Borras, Jorge (2007-06)
      Research report
      Open Access
      In this paper we describe a semi-supervised approach to the extraction of time expression mentions in large unlabelled corpora based on bootstrapping. Bootstrapping techniques rely on a relatively small amount of initial ...
    • A Constraint-Based Hypergraph Partitioning Approach to Coreference Resolution 

      Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (2013-12)
      Article
      Open Access
      This work is focused on research in machine learning for coreference resolution. Coreference resolution is a natural language processing task that consists of determining the expressions in a discourse that refer to the ...
    • A global relaxation labeling approach to coreference resolution 

      Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (2010)
      Conference report
      Restricted access - publisher's policy
      This paper presents a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The approach combines the strengths of groupwise classifiers and chain formation methods in ...
    • A graph partitioning approach to coreference resolution 

      Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (2009-01)
      Research report
      Open Access
      This report presents a graph partitioning approach given a set of constraints to resolve coreferences. Coreference resolution is the task of determining which referring expressions in a discourse refer to the same entity. ...
    • A graph partitioning approach to entity disambiguation using uncertain information 

      Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (Springer, 2008-08-31)
      Conference report
      Restricted access - publisher's policy
      This paper presents a method for Entity Disambiguation in Information Extraction from different sources in the web. Once entities and relations between them are extracted, it is needed to determine which ones are referring ...
    • A question-driven information system adaptable via information extraction techniques 

      Sapena Masip, Emilio; González Pérez, Manuel; Padró, Lluís; Turmo Borras, Jorge (2008-06)
      Research report
      Open Access
      This paper presents a system is composed by two main parts: The Information Extraction Process, which crawls the web and extracts all relevant knowledge that is then stored in the database, and the Question Processor which ...
    • A Question-driven Information system adaptable via Information Extraction techniques 

      Sapena Masip, Emilio; González Bermúdez, Meritxell; Padró, Lluís; Turmo Borras, Jorge (2008-06-13)
      Research report
      Open Access
      This paper presents a system is composed by two main parts: The Information Extraction Process, which crawls the web and extracts all relevant knowledge that is then stored in the database, and the Question Processor which ...
    • Alias assignment in information extraction 

      Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (-, 2007-01-31)
      Conference report
      Restricted access - publisher's policy
      This paper presents a general method for alias assignment task in information extraction. We compared two approaches to face the problem and learn a classifier. The first one quantifies a global similarity between the alias ...
    • An evaluation framework based on gold standard models for definition question answering 

      Kanaan Izquierdo, Samir; Turmo Borras, Jorge (2006-05)
      Research report
      Open Access
      This paper presents a weak supervised evaluation framework for definition question answering (DefQA) called Solon. It automatically evaluates a set of DefQA systems using existing human definitions as gold standard models. ...
    • Automatically extracting translation links using a wide coverage semantic taxonomy 

      Rigau Claramunt, German; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge (1995-01-01)
      Research report
      Open Access
      TGE (Tlink Generator Environment) is a system for semi-automatically extracting translation links. The system was developed within the ACQUILEX II project as a tool for supporting the construction of a multi-lingual lexical ...
    • Building a Spanish/Catalan health records corpus with very sparse protected information labelled 

      Medina Herrera, Salvador; Turmo Borras, Jorge (2018)
      Conference lecture
      Open Access
      Electronic Health Records (EHR) are an important resource for the research and study of diseases, treatments and symptoms. However, due to data protection laws, information that could potentially compromise privacy must ...
    • Comparing non-parametric ensemble methods for document clustering 

      González Pellicer, Edgar; Turmo Borras, Jorge (Springer Verlag, 2008)
      Conference report
      Restricted access - publisher's policy
      The biases of individual algorithms for non-parametric document clustering can lead to non-optimal solutions. Ensemble clustering methods may overcome this limitation, but have not been applied to document collections. ...
    • Coreference Resolution in Freeling 4.0 

      Marimon, Montserrat; Padró, Lluís; Turmo Borras, Jorge (2018)
      Conference lecture
      Open Access
      This paper presents the integration of RelaxCor into FreeLing. RelaxCor is a coreference resolution system based on constraint satisfaction that ranked second in the CoNLL-2011 shared task. FreeLing is an open-source library ...
    • Coreference resolution survey 

      Sapena Masip, Emilio; Padró, Lluís; Turmo Borras, Jorge (2008-12)
      Research report
      Open Access
      This survey is an extended summarization of state of the art of coreference resolution. The key concepts related to coreference and anaphora are presented, the most relevant approaches to coreference resolution are discussed, ...
    • Del texto a la información 

      Atserias Batalla, Jordi; Castell Ariño, Núria; Catala Roig, Neus; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge (Asociación de Técnicos de Informática, 1998-05)
      Article
      Open Access
      Las aplicaciones informáticas centradas en el Tratamiento de la Lengua (TL) han experimentado en los últimos años un notable auge sobre todo en el ámbito del acceso a la información textual no restriginda (ni codifica­da). ...
    • Del texto a la información 

      Atserias Batalla, Jordi; Castell Ariño, Núria; Catala Roig, Neus; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge (1998-03)
      Research report
      Open Access
      In the last years, software applications centered on Natural Language Processing (NLP) have experimented a great significance, especially access to neither unrestricted, nor codified, textual information. In this ...
    • DOTT-HEALTH: Desarrollo de tecnología aplicada a textos para el soporte de diagnosis, prevención y gestión de instituciones de salud 

      Araujo Serna, Lourdes; Martinez-Romo, Juan; Turmo Borras, Jorge; Padró, Lluís; Casillas Rubio, Arantza; Gojenola Galletebeitia, Koldo (CEUR-WS.org, 2021)
      Conference report
      Open Access
      La combinación de datos y pautas dirigidas a pacientes individuales se engloba en los Sistemas de Apoyo a la Decisión Clínica. La adopción del Informe Clínico Electrónico de forma sistemática por parte de los sistemas de ...
    • Everything transformers: Recognition, classification and normalisation of professions and family relations 

      Medina Herrera, Salvador; Turmo Borras, Jorge (CEUR-WS.org, 2021)
      Conference report
      Open Access
      This document describes the system submitted by TALP team for IberLEF 2021’s MEDDOPROF Shared Task. The joint occupation mention identification and family relation classification model is composed of a pre-trained DistilBERT ...
    • FEMsum at DUC 2006: Semantic-based approach integrated in a Flexible Eclectic Multitask Summarizer Architecture 

      Fuentes Fort, Maria; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge; Ferrés Domènech, Daniel (2006)
      Conference report
      Open Access
      In order to face different requirements at TALP Research Center we have built a highly parameterized environment allowing to instantiate specific summarizers for different summarization tasks in different languages. This ...
    • FEMsum: A flexible eclectic multitask summarizer architecture evaluated in multidocument tasks 

      Fuentes Fort, Maria; Rodríguez Hontoria, Horacio; Turmo Borras, Jorge (2007-01)
      Research report
      Open Access
      This article describes two types of summarization approaches integrated in a flexible architecture for multitask summarization. The first type is based on the use of lexical features, while the second one is grounded on ...