Now showing items 1-20 of 123

    • A Client mobile application for Chinese-Spanish statistical machine translation 

      Centelles, Jordi; Ruiz Costa-Jussà, Marta; Banchs Martínez, Rafael Enrique (2014)
      Conference report
      Open Access
      This show and tell paper describes a client mobile application for Chinese-Spanish machine translation. The system combines a standard server-based statistical machine translation (SMT) system, which requires online ...
    • A Deep source-context feature for lexical selection in statistical machine translation 

      Gupta, Parth; Ruiz Costa-Jussà, Marta; Rosso, Paolo; Banchs Martínez, Rafael Enrique (Elsevier, 2016-05-01)
      Article
      Open Access
      This paper presents a methodology to address lexical disambiguation in a standard phrase-based statistical machine translation system. Similarity among source contexts is used to select appropriate translation units. The ...
    • A graphical interface for MT evaluation and error analysis 

      González Bermúdez, Meritxell; Giménez, J.; Màrquez Villodre, Lluís (Association for Computational Linguistics, 2012)
      Conference lecture
      Open Access
      Error analysis in machine translation is a necessary step in order to investigate the strengths and weaknesses of the MT systems under development and allow fair comparisons among them. This work presents an application ...
    • A hybrid machine translation architecture guided by syntax 

      Labaka, Gorka; España Bonet, Cristina; Màrquez Villodre, Lluís; Sarasola, Kepa (2014-09-16)
      Article
      Restricted access - publisher's policy
      This article presents a hybrid architecture which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT). The hybrid translation system is guided by the rule-based engine. ...
    • A hybrid system for patent translation 

      Enache, Ramona; España Bonet, Cristina; Ranta, Aarne; Màrquez Villodre, Lluís (2012)
      Conference lecture
      Open Access
      This work presents a HMT system for patent translation. The system exploits the high coverage of SMT and the high precision of an RBMT system based on GF to deal with specific issues of the language. The translator is ...
    • A MOOC on approaches to machine translation 

      Ruiz Costa-Jussà, Marta; Formiga, Lluís; Torrillas Tostado, Oriol; Petit Silvestre, Jordi; Rodríguez Fonollosa, José Adrián (2015-12-10)
      Article
      Open Access
      This paper describes the design, development, and analysis of a MOOC entitled “Approaches to Machine Translation: Rule-based, statistical and hybrid”, and provides lessons learned and conclusions to be taken into account ...
    • A new subtree-transfer approach to syntax-based reordering for statistical machine translation 

      Khalilov, Maxim; Rodríguez Fonollosa, José Adrián; Dras, Mark (2009-05)
      Conference lecture
      Open Access
      In this paper we address the problem of translating between languages with word order disparity. The idea of augmenting statistical machine translation (SMT) by using a syntax-based reordering step prior to translation, ...
    • A non-linear semantic mapping technique for cross-language sentence matching 

      Banchs Martínez, Rafael Enrique; Ruiz Costa-Jussà, Marta (2010-08-01)
      Article
      Restricted access - publisher's policy
    • A Richly annotated, multilingual parallel corpus for hybrid machine translation 

      Avramidis, Elefterios; Ruiz Costa-Jussà, Marta; Federmann, Christian; Melero, Maite; Pecina, Pavel; Van Genabith, Josef (European Language Resources Association (ELRA), 2012)
      Conference report
      Open Access
      In recent years, machine translation (MT) research has focused on investigating how hybrid machine translation as well as system combination approachescan bedesigned so that theresulting hybrid translationsshow an improvement ...
    • Abstractive text summarization with attention-based mechanism 

      Sanjabi, Nima (Universitat Politècnica de Catalunya, 2018-04)
      Master thesis
      Open Access
      In this work, we explore the evolution of Sequential Neural Models, and their use as a Summarizer System. Transformer is a recently proposed model with a high potential. We experiment and compare their result in abstractive ...
    • AMALEU: una representación universal del lenguaje basada en aprendizaje automático 

      Ruiz Costa-Jussà, Marta (2020-09)
      Article
      Open Access
      El objetivo del proyecto AMALEU es aprender una representación común para diferentes idiomas. Se pretende tener una representación común para la lengua oral y una para la lengua escrita. AMALEU, de dos años de duración, ...
    • An analysis of Twitter corpora and the differences between formal and colloquial tweets 

      González Bermúdez, Meritxell (CEUR-WS.org, 2015)
      Conference report
      Open Access
      This work reviews recent publications addressing the Twitter translation task, and highlights the lack of appropriate corpora that represents the colloquial language used in Twitter. It also discusses the most well-know ...
    • An IR-based strategy for supporting Chinese-Portuguese translation services in off-line model 

      Centelles, Jordi; Ruiz Costa-Jussà, Marta; Banchs Martínez, Rafael Enrique; Gelbukh, Alexander (2014-04-01)
      Article
      Restricted access - publisher's policy
      This paper describes an Information Retrieval engine that is used to support our Chinese-Portuguese machine translation services when no internet connection is available. Our mobile translation app, which is deployed on ...
    • An overview of the phrase-based statistical machine translation techniques 

      Ruiz Costa-Jussà, Marta (2012-12-01)
      Article
      Restricted access - publisher's policy
    • Automatic normalization of short texts by combining statistical and rule-based techniques 

      Ruiz Costa-Jussà, Marta; Banchs, Rafael E. (2013-03-01)
      Article
      Open Access
      Short texts are typically composed of small number of words, most of which are abbreviations, typos and other kinds of noise. This makes the noise to signal ratio relatively high for this specific category of text. A high ...
    • Automatic Spanish translation of SQuAD dataset for multi-lingual question answering 

      Carrino, Casimiro Pio; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (European Language Resources Association (ELRA), 2020)
      Conference lecture
      Open Access
      Recently, multilingual question answering became a crucial research topic, and it is receiving increased interest in the NLP community.However, the unavailability of large-scale datasets makes it challenging to train ...
    • Automatic translation between layman and HPO terms using machine learning algorithms 

      Manzini, Enrico (Universitat Politècnica de Catalunya, 2019-07-10)
      Master thesis
      Open Access
    • Block-based Speech-to-Speech Translation 

      Roca, Sandra (Universitat Politècnica de Catalunya, 2018-10)
      Bachelor thesis
      Open Access
      Esta tesis explora diferentes maneras de implementar un sistema de bloques de Traducción de Voz con el propósito de generar grandes cantidades de datos para generar un gran corpus paralelo de voz. La primera tarea consiste ...
    • Chinese-Catalan neural machine translation with OpenNMT 

      Wang, Chaofeng (Universitat Politècnica de Catalunya, 2018-07)
      Bachelor thesis
      Restricted access - author's decision
    • Chinese-Catalan: A neural machine translation approach based on pivoting and attention mechanisms 

      Ruiz Costa-Jussà, Marta; Casas Manzanares, Noé; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián (2019-01-01)
      Article
      Open Access
      This article innovatively addresses machine translation from Chinese to Catalan using neural pivot strategies trained without any direct parallel data. The Catalan language is very similar to Spanish from a linguistic point ...