Now showing items 1-20 of 45

  • A graph-based strategy to streamline translation quality assessments 

    Pighin, Daniele; Formiga Fanals, Lluís; Màrquez Villodre, Lluís (2012)
    Conference report
    Open access
    We present a detailed analysis of a graph-based annotation strategy that we employed to annotate a corpus of 11,292 real-world English to Spanish automatic translations with relative (ranking) and absolute ...
  • A graphical interface for MT evaluation and error analysis 

    González Bermúdez, Meritxell; Giménez, J.; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2012)
    Conference lecture
    Open access
    Error analysis in machine translation is a necessary step in order to investigate the strengths and weaknesses of the MT systems under development and allow fair comparisons among them. This work presents an application ...
  • A hybrid machine translation architecture guided by syntax 

    Labaka, Gorka; España Bonet, Cristina; Màrquez Villodre, Lluís; Sarasola, Kepa (2014-09-16)
    Article
    Restricted access due to publisher policy
    This article presents a hybrid architecture which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT). The hybrid translation system is guided by the rule-based engine. ...
  • A hybrid system for patent translation 

    Enache, Ramona; España Bonet, Cristina; Ranta, Aarne; Màrquez Villodre, Lluís (2012)
    Conference lecture
    Open access
    This work presents an HMT system for patent translation. The system exploits the high coverage of SMT and the high precision of an RBMT system based on GF to deal with specific issues of the language. The translator is ...
  • A joint model for parsing syntactic and semantic dependencies 

    Lluis Martorell, Xavier; Màrquez Villodre, Lluís (Coling 2008 Organizing Committee, 2008)
    Conference report
    Open access
    This paper describes a system that jointly parses syntactic and semantic dependencies, presented at the CoNLL-2008 shared task (Surdeanu et al., 2008). It combines online Perceptron learning (Collins, 2002) with a parsing ...
  • A Machine learning approach to POS tagging 

    Màrquez Villodre, Lluís; Padró, Lluís; Rodríguez Hontoria, Horacio (1997-12)
    Research report
    Open access
    We have applied inductive learning of statistical decision trees and relaxation labelling to the Natural Language Processing (NLP) task of morphosyntactic disambiguation (Part Of Speech Tagging). The learning process ...
  • A Proposal for wide-coverage Spanish named entity recognition 

    Arévalo, M.; Carreras Pérez, Xavier; Màrquez Villodre, Lluís; Martí Antonin, Maria Antònia; Padró, Lluís; Simon, Maria José (2002-04)
    Research report
    Open access
    This paper presents a proposal for wide-coverage Named Entity Recognition for Spanish. First, a linguistic description of the typology of Named Entities is proposed. Following this definition an architecture of sequential ...
  • A second-order joint Eisner model for syntactic and semantic dependency parsing 

    Lluis Martorell, Xavier; Bott, Stefan Markus; Màrquez Villodre, Lluís (2009)
    Conference lecture
    Restricted access due to publisher policy
    We present a system developed for the CoNLL-2009 Shared Task (Hajic et al., 2009). We extend the Carreras (2007) parser to jointly annotate syntactic and semantic dependencies. This state-of-the-art parser factorizes the ...
  • Boosting applied to word sense disambiguation 

    Escudero Bakx, Gerard; Màrquez Villodre, Lluís; Rigau Claramunt, German (2000-01)
    Research report
    Open access
    In this paper we apply Schapire and Singer's AdaBoost.MH boosting algorithm to the Word Sense Disambiguation (WSD) problem. Initial experiments on a set of 15 selected polysemous words show that the boosting approach ...
  • Boosting trees for anti-spam email filtering (Extended version) 

    Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2001-10)
    Research report
    Open access
    In this work, a set of comparative experiments for the problem of automatically filtering unwanted electronic mail messages are performed on two public corpora: PU1 and LingSpam. Several variants of the AdaBoost algorithm ...
  • Context-aware machine translation for software localization 

    Muntés Mulero, Víctor; Paladini Adell, Patricia; España Bonet, Cristina; Màrquez Villodre, Lluís (2012)
    Conference lecture
    Open access
    Software localization requires translating short text strings appearing in user interfaces (UI) into several languages. These strings are usually unrelated to the other strings in the UI. Due to the lack of semantic ...
  • Deep evaluation of hybrid architectures: simple metrics correlated with human judgments 

    Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa; España Bonet, Cristina; Màrquez Villodre, Lluís (2011)
    Conference lecture
    Open access
    The process of developing hybrid MT systems is guided by the evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture ...
  • Deep evaluation of hybrid architectures: Use of different metrics in MERT weight optimization 

    España Bonet, Cristina; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Màrquez Villodre, Lluís; Sarasola, Kepa (2013)
    Conference report
    Open access
    The process of developing hybrid MT systems is usually guided by an evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture, ...
  • Discriminative learning within Arabic statistical machine translation 

    España Bonet, Cristina; Giménez, Jesús; Màrquez Villodre, Lluís (2009-01)
    Research report
    Open access
    Written Arabic is especially ambiguous due to the lack of diacritisation of texts, and this makes the translation harder for automatic systems that do not take into account the context of phrases. Here, we use a standard ...
  • Document-level machine translation as a re-translation process 

    Martínez Garcia, Eva; España Bonet, Cristina; Màrquez Villodre, Lluís (2014-09-22)
    Article
    Open access
    Most of the current Machine Translation systems are designed to translate a document sentence by sentence ignoring discourse information and producing incoherencies in the final translations. In this paper we present some ...
  • Exploiting diversity of margin-based classifiers 

    Romero Merino, Enrique; Carreras Pérez, Xavier; Màrquez Villodre, Lluís (2003-12)
    Research report
    Open access
    An experimental comparison among Support Vector Machines, AdaBoost and a recently proposed model for maximizing the margin with Feed-forward Neural Networks has been made on a real-world classification problem, namely ...
  • Hybrid machine translation guided by a rule-based system 

    España Bonet, Cristina; Màrquez Villodre, Lluís; Labaka, Gorka; Díaz de Ilarraza Sánchez, Arantza; Sarasola Gabiola, Kepa (2011)
    Conference lecture
    Open access
    This paper presents a machine translation architecture which hybridizes Matxin, a rule-based system, with regular phrase-based Statistical Machine Translation. In short, the hybrid translation process is guided by the ...
  • Identifying useful human correction feedback from an on-line machine translation service 

    Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís; Henríquez Quintana, Carlos Alberto; Formiga Fanals, Lluís; Romero Merino, Enrique; May, Jonathan (2013)
    Conference report
    Open access
    Post-editing feedback provided by users of on-line translation services offers an excellent opportunity for automatic improvement of statistical machine translation (SMT) systems. However, feedback provided by casual users ...
  • Identifying useful human feedback from an on-line translation service 

    Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís; Henríquez Quintana, Carlos Alberto; Formiga Fanals, Lluís; Romero Merino, Enrique; May, Jonathan (2013)
    Conference lecture
    Open access
    Post-editing feedback provided by users of on-line translation services offers an excellent opportunity for automatic improvement of statistical machine translation (SMT) systems. However, feedback provided by casual ...
  • IPA and STOUT: leveraging linguistic and source-based features for machine translation evaluation 

    González Bermúdez, Meritxell; Barrón-Cedeño, Alberto; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2014)
    Conference lecture
    Restricted access due to publisher policy
    This paper describes the UPC submissions to the WMT14 Metrics Shared Task: UPC-IPA and UPC-STOUT. These metrics use a collection of evaluation measures integrated in ASIYA, a toolkit for machine translation evaluation. ...