Recent Submissions

  • POS tagging using relaxation techniques 

    Padró, Lluís (1996-02)
    External research report
    Open Access
    Relaxation labelling is an optimization technique used in many fields to solve constraint satisfaction problems. The algorithm finds a combination of values for a set of variables such that satisfies - to the maximum ...
  • Using bidirectional chart parsing for corpus analysis 

    Ageno Pulido, Alicia; Rodríguez Hontoria, Horacio (1996-02)
    External research report
    Open Access
    Several experiments have been developed around a bidirectional island-driven chart parser. The system follows basically the approach of Stock, Satta and Corazza, and the experiments have been designed and performed with ...
  • Towards learning a constraint grammar from annotated corpora using decision trees 

    Màrquez Villodre, Lluís; Rodríguez Hontoria, Horacio (1996-02)
    External research report
    Open Access
    Inside the framework of robust parsers for the syntactic analysis of unrestricted text, the aim of this work is the construction of a system capable of automatically learning Constraint Grammar rules from a POS annotated ...
  • Fault diagnosis of chemical processes with incomplete observations: A comparative study 

    Askarian, Mahdieh; Escudero Bakx, Gerard; Graells Sobré, Moisès; Zarghami, Reza; Jalali Farahani, Farhang; Mostoufi, Navid (Pergamon Press, 2016-01-04)
    Article
    Restricted access - publisher's policy
    An important problem to be addressed by diagnostic systems in industrial applications is the estimation of faults with incomplete observations. This work discusses different approaches for handling missing data, and ...
  • TweetNorm: a benchmark for lexical normalization of spanish tweets 

    Alegria, Iñaki; Aranberri, Nora; Comas Umbert, Pere Ramon; Fresno, Víctor; Gamallo, Pablo; Padró, Lluís; San Vicente Roncal, Iñaki; Turmo Borras, Jorge; Zubiaga, Arkaitz (2015-12-01)
    Article
    Open Access
    The language used in social media is often characterized by the abundance of informal and non-standard writing. The normalization of this non-standard language can be crucial to facilitate the subsequent textual processing ...
  • An analysis of Twitter corpora and the differences between formal and colloquial tweets 

    González Bermúdez, Meritxell (CEUR-WS.org, 2015)
    Conference report
    Open Access
    This work reviews recent publications addressing the Twitter translation task, and highlights the lack of appropriate corpora that represents the colloquial language used in Twitter. It also discusses the most well-know ...
  • WikiParable -- Data Categorisation Platform (Version 1.0) 

    España Bonet, Cristina (2015-11-16)
    External research report
    Open Access
    This document describes WikiParable, an on-line platform designed for data categorisation. Its purpose is twofold and the tool can be used both to annotate data and to evaluate automatic categorisations. As a main use case ...
  • Unsupervised ensemble minority clustering 

    Gonzàlez Pellicer, Edgar; Turmo Borras, Jorge (2015-01)
    Article
    Restricted access - publisher's policy
    Cluster analysis lies at the core of most unsupervised learning tasks. However, the majority of clustering algorithms depend on the all-in assumption, in which all objects belong to some cluster, and perform poorly on ...
  • ¿Es sostenible la Estrella de la Muerte? 

    Sánchez Carracedo, Fermín; García Almiñana, Jordi; Vidal López, Eva María; López Álvarez, David; Cabré Garcia, José M.; García García, Helena; Alier Forment, Marc; Martín Escofet, Carme (2015-09)
    Article
    Open Access
    El futuro será sostenible o no será. Por eso, es fundamental que todos los Trabajos de Fin de Grado (TFG) de las ingenierías incorporen un estudio de sostenibilidad que analice su impacto ambiental, social y económico. ...
  • AETAS: A system for semanticizing temporal expressions from unstructured contents 

    Ardalan, Zagros; Martín Escofet, Carme; Padró, Lluís (Springer, 2015)
    Conference report
    Restricted access - publisher's policy
    AETAS is an online tool for converting text into RDF linked data with resolution of temporal expressions. AETAS follows fully SOA architecture and is accessible via web-service. It implements a novel approach for semantic ...

View more