  • Abstractive text summarization with attention-based mechanism 

    Sanjabi, Nima (Universitat Politècnica de Catalunya, 2018-04)
    Master thesis
    Open Access
    In this work, we explore the evolution of Sequential Neural Models, and their use as a Summarizer System. Transformer is a recently proposed model with a high potential. We experiment and compare their result in abstractive ...
  • A MOOC on approaches to machine translation 

    Ruiz Costa-Jussà, Marta; Formiga, Lluís; Torrillas Tostado, Oriol; Petit Silvestre, Jordi; Rodríguez Fonollosa, José Adrián (2015-12-10)
    Article
    Open Access
    This paper describes the design, development, and analysis of a MOOC entitled “Approaches to Machine Translation: Rule-based, statistical and hybrid”, and provides lessons learned and conclusions to be taken into account ...
  • An analysis of Twitter corpora and the differences between formal and colloquial tweets 

    González Bermúdez, Meritxell (CEUR-WS.org, 2015)
    Conference report
    Open Access
    This work reviews recent publications addressing the Twitter translation task, and highlights the lack of appropriate corpora that represent the colloquial language used on Twitter. It also discusses the most well-know ...
  • An IR-based strategy for supporting Chinese-Portuguese translation services in off-line mode 

    Centelles, Jordi; Ruiz Costa-Jussà, Marta; Banchs Martínez, Rafael Enrique; Gelbukh, Alexander (2014-04-01)
    Article
    Restricted access - publisher's policy
    This paper describes an Information Retrieval engine that is used to support our Chinese-Portuguese machine translation services when no internet connection is available. Our mobile translation app, which is deployed on ...
  • A non-linear semantic mapping technique for cross-language sentence matching 

    Banchs Martínez, Rafael Enrique; Ruiz Costa-Jussà, Marta (2010-08-01)
    Article
    Restricted access - publisher's policy
  • An overview of the phrase-based statistical machine translation techniques 

    Ruiz Costa-Jussà, Marta (2012-12-01)
    Article
    Restricted access - publisher's policy
  • Automatic normalization of short texts by combining statistical and rule-based techniques 

    Ruiz Costa-Jussà, Marta; Banchs, Rafael E. (2013-03-01)
    Article
    Open Access
    Short texts are typically composed of a small number of words, most of which are abbreviations, typos and other kinds of noise. This makes the noise-to-signal ratio relatively high for this specific category of text. A high ...
  • Block-based Speech-to-Speech Translation 

    Roca, Sandra (Universitat Politècnica de Catalunya, 2018-10)
    Bachelor thesis
    Open Access
    This thesis explores different ways of implementing a block-based Speech Translation system with the aim of generating large amounts of data to build a large parallel speech corpus. The first task consists ...
  • Chinese-Catalan neural machine translation with OpenNMT 

    Wang, Chaofeng (Universitat Politècnica de Catalunya, 2018-07)
    Bachelor thesis
    Restricted access - author's decision
  • Chinese–Spanish neural machine translation enhanced with character and word bitmap fonts 

    Ruiz Costa-Jussà, Marta; Aldón Mínguez, David; Rodríguez Fonollosa, José Adrián (2017-04-06)
    Article
    Open Access
    Recently, machine translation systems based on neural networks have reached state-of-the-art results for some pairs of languages (e.g., German–English). In this paper, we are investigating the performance of neural machine ...
  • Context-aware machine translation for software localization 

    Muntés Mulero, Víctor; Paladini Adell, Patricia; España Bonet, Cristina; Màrquez Villodre, Lluís (2012)
    Conference lecture
    Open Access
    Software localization requires translating short text strings appearing in user interfaces (UI) into several languages. These strings are usually unrelated to the other strings in the UI. Due to the lack of semantic ...
  • Coupling hierarchical word reordering and decoding in phrase-based statistical machine translation 

    Dras, Mark; Khalilov, Maxim; Rodríguez Fonollosa, José Adrián (2009-06)
    Conference lecture
    Open Access
    In this paper, we start with the existing idea of taking reordering rules automatically derived from syntactic representations, and applying them in a preprocessing step before translation to make the source sentence ...
  • Coverage for character based neural machine translation 

    Kazimi, Bashir; Ruiz Costa-Jussà, Marta (2017-09-22)
    Article
    Open Access
    In recent years, Neural Machine Translation (NMT) has achieved state-of-the-art performance in translating from one language (the source language) to another (the target language). However, many of the proposed methods use word ...
  • Coverage model for character-based neural machine translation 

    Kazimi, Mohammad Bashir (Universitat Politècnica de Catalunya, 2017-05)
    Master thesis
    Open Access
    In recent years, Neural Machine Translation (NMT) has achieved state-of-the-art performance in translating from one language (the source language) to another (the target language). However, many of the proposed methods use word ...
  • Description of the Chinese-to-Spanish rule-based machine translation system developed with a hybrid combination of human annotation and statistical techniques 

    Ruiz Costa-Jussà, Marta; Centelles, Jordi (2015-11-01)
    Article
    Open Access
    Two of the most popular Machine Translation (MT) paradigms are rule-based (RBMT) and corpus-based, the latter including statistical systems (SMT). When parallel corpora are scarce, RBMT becomes particularly attractive. ...
  • Domain adaptation strategies in statistical machine translation: a brief overview 

    Ruiz Costa-Jussà, Marta (2015-11-01)
    Article
    Open Access
    Statistical machine translation (SMT) is gaining interest given that it can easily be adapted to any pair of languages. One of the main challenges in SMT is domain adaptation because the performance in translation drops ...
  • End-to-end speech translation system with attention-based mechanisms 

    Cros Vila, Laura (Universitat Politècnica de Catalunya, 2018-06)
    Bachelor thesis
    Open Access
    Speech Recognition and Text-to-Text Translation systems have been improving significantly in recent decades thanks to the improvement of both hardware and software means. However, Speech Translation is usually done as a ...
  • End-to-end speech translation with the transformer 

    Cros Vila, Laura; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián; Ruiz Costa-Jussà, Marta (Antonio Bonafonte, Jordi Luque and Francesc Alías Pujol, 2018)
    Conference lecture
    Restricted access - publisher's policy
    Speech Translation has been traditionally addressed with the concatenation of two tasks: Speech Recognition and Machine Translation. This approach has the main drawback that errors are concatenated. Recently, neural ...
  • English-to-Hindi system description for WMT 2014: deep source-context features for Moses 

    Ruiz Costa-Jussà, Marta; Gupta, Parth; Banchs, Rafael E.; Rosso, P. (Association for Computational Linguistics, 2014)
    Conference lecture
    Open Access
    This paper describes the IPN-UPV participation in the English-to-Hindi translation task of the WMT 2014 International Evaluation Campaign. The system presented is based on Moses and enhanced with deep learning by means of ...
  • Experiments on document level machine translation 

    Martínez Garcia, Eva; España Bonet, Cristina; Màrquez Villodre, Lluís (2014-03-03)
    External research report
    Open Access
    Most current SMT systems work at the sentence level. They translate a text assuming that sentences are independent, but when one looks at a well-formed document, it is clear that there exist many inter-sentence relations. ...