Now showing items 1-20 of 45

    • A graph-based strategy to streamline translation quality assessments 

      Pighin, Daniele; Formiga Fanals, Lluís; Màrquez Villodre, Lluís (2012)
      Conference report
      Open Access
      We present a detailed analysis of a graph- based annotation strategy that we employed to annotate a corpus of 11,292 real-world En- glish to Spanish automatic translations with relative (ranking) and absolute ...
    • A graphical interface for MT evaluation and error analysis 

      González Bermúdez, Meritxell; Giménez, J.; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2012)
      Conference lecture
      Open Access
      Error analysis in machine translation is a necessary step in order to investigate the strengths and weaknesses of the MT systems under development and allow fair comparisons among them. This work presents an application ...
    • A hybrid machine translation architecture guided by syntax 

      Labaka, Gorka; España Bonet, Cristina; Màrquez Villodre, Lluís; Sarasola, Kepa (2014-09-16)
      Article
      Restricted access - publisher's policy
      This article presents a hybrid architecture which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT). The hybrid translation system is guided by the rule-based engine. ...
    • A hybrid system for patent translation 

      Enache, Ramona; España Bonet, Cristina; Ranta, Aarne; Màrquez Villodre, Lluís (2012)
      Conference lecture
      Open Access
      This work presents a HMT system for patent translation. The system exploits the high coverage of SMT and the high precision of an RBMT system based on GF to deal with specific issues of the language. The translator is ...
    • A joint model for parsing syntactic and semantic dependencies 

      Lluis Martorell, Xavier; Màrquez Villodre, Lluís (Coling 2008 Organizing Committee, 2008)
      Conference report
      Open Access
      This paper describes a system that jointly parses syntactic and semantic dependencies, presented at the CoNLL-2008 shared task (Surdeanu et al., 2008). It combines online Peceptron learning (Collins, 2002) with a parsing ...
    • A Machine learning approach to POS tagging 

      Màrquez Villodre, Lluís; Padró, Lluís; Rodríguez Hontoria, Horacio (1997-12)
      External research report
      Open Access
      We have applied inductive learning of statistical decision trees and relaxation labelling to the Natural Language Processing (NLP) task of morphosyntactic disambiguation (Part Of Speech Tagging). The learning process ...
    • A Proposal for wide-coverage Spanish named entity recognition 

      Arévalo, M.; Carreras Pérez, Xavier; Màrquez Villodre, Lluís; Martí Antonin, Maria Antònia; Padró, Lluís; Simon, Maria José (2002-04)
      External research report
      Open Access
      This paper presents a proposal for wide--coverage Named Entity Recognition for Spanish. First, a linguistic description of the typology of Named Entities is proposed. Following this definition an architecture of sequential ...
    • A second-order joint Eisner model for syntactic and semantic dependency parsing 

      Lluis Martorell, Xavier; Bott, Stefan Markus; Màrquez Villodre, Lluís (2009)
      Conference lecture
      Restricted access - publisher's policy
      We present a system developed for the CoNLL-2009 Shared Task (Hajic et al., 2009). We extend the Carreras (2007) parser to jointly annotate syntactic and semantic dependencies. This state-of-the-art parser factorizes the ...
    • Boosting applied to word sense disambiguation 

      Escudero Bakx, Gerard; Màrquez Villodre, Lluís; Rigau Claramunt, German (2000-01)
      External research report
      Open Access
      In this paper we apply Schapire and Singer's AdaBoost.MH boosting algorithm to the Word Sense Disambiguation (WSD) problem. Initial experiments on a set of 15 selected polysemous words show that the boosting approach ...
    • Boosting trees for anti-spam email filtering (Extended version) 

      Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2001-10)
      External research report
      Open Access
      In this work, a set of comparative experiments for the problem of automatically filtering unwanted electronic mail messages are performed on two public corpora: PU1 and LingSpam. Several variants of the AdaBoost algorithm ...
    • Context-aware machine translation for software localization 

      Muntés Mulero, Víctor; Paladini Adell, Patricia; España Bonet, Cristina; Màrquez Villodre, Lluís (2012)
      Conference lecture
      Open Access
      Software localization requires translating short text strings appearing in user interfaces (UI) into several languages. These strings are usually unrelated to the other strings in the UI. Due to the lack of semantic ...
    • Deep evaluation of hybrid architectures: simple metrics correlated with human judgments 

      Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa; España Bonet, Cristina; Màrquez Villodre, Lluís (2011)
      Conference lecture
      Open Access
      The process of developing hybrid MT systems is guided by the evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture ...
    • Deep evaluation of hybrid architectures: Use of different metrics in MERT weight optimization 

      España Bonet, Cristina; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Màrquez Villodre, Lluís; Sarasola, Kepa (2013)
      Conference report
      Open Access
      The process of developing hybrid MT systems is usually guided by an evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture, ...
    • Discriminative learning within Arabic statistical machine translation 

      España Bonet, Cristina; Giménez, Jesús; Màrquez Villodre, Lluís (2009-01)
      External research report
      Open Access
      Written Arabic is a especially ambiguous due to the lack of diacritisation of texts, and this makes the translation harder for automatic systems that do not take into account the context of phrases. Here, we use a standard ...
    • Document-level machine translation as a re-translation process 

      Martínez Garcia, Eva; España Bonet, Cristina; Màrquez Villodre, Lluís (2014-09-22)
      Article
      Open Access
      Most of the current Machine Translation systems are designed to translate a document sentence by sentence ignoring discourse information and producing incoherencies in the final translations. In this paper we present some ...
    • Exploiting diversity of margin-based classifiers 

      Romero Merino, Enrique; Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2003-12)
      External research report
      Open Access
      An experimental comparison among Support Vector Machines, AdaBoost and a recently proposed model for maximizing the margin with Feed-forward Neural Networks has been made on a real-world classification problem, namely ...
    • Hybrid machine translation guided by a rule-based system 

      España Bonet, Cristina; Màrquez Villodre, Lluís; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa (2011)
      Conference lecture
      Open Access
      This paper presents a machine translation architecture which hybridizes Matxin, a rulebased system, with regular phrase-based Statistical Machine Translation. In short, the hybrid translation process is guided by the ...
    • Identifying useful human correction feedback from an on-line machine translation service 

      Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís; Henríquez Quintana, Carlos Alberto; Formiga Fanals, Lluís; Romero Merino, Enrique; May, Jonathan (2013)
      Conference report
      Open Access
      Post-editing feedback provided by users of on-line translation services offers an excellent opportunity for automatic improvement of statistical machine translation (SMT) systems. However, feedback provided by casual users ...
    • Identifying useful human feedback from an on-line translation service 

      Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís; Henríquez Quintana, Carlos Alberto; Formiga Fanals, Lluís; Romero Merino, Enrique; May, Jonathan (2013)
      Conference lecture
      Open Access
      Post-editing feedback provided by users of on-line translation services offers an excellent opportunity for automatic improvement of statistical machine translation (SMT) systems. However, feedback provided by casual ...
    • IPA and STOUT: leveraging linguistic and source-based features for machine translation evaluation 

      González Bermúdez, Meritxell; Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2014)
      Conference lecture
      Restricted access - publisher's policy
      This paper describes the UPC submissions to the WMT14 Metrics Shared Task : UPC-IPA and UPC-STOUT. These metrics use a collection of evaluation measures integrated in ASIYA, a toolkit for machine translation evaluation. ...