    • A MOOC on approaches to machine translation 

      Ruiz Costa-Jussà, Marta; Formiga, Lluís; Torrillas Tostado, Oriol; Petit Silvestre, Jordi; Rodríguez Fonollosa, José Adrián (2015-12-10)
      Article
      Open Access
      This paper describes the design, development, and analysis of a MOOC entitled “Approaches to Machine Translation: Rule-based, statistical and hybrid”, and provides lessons learned and conclusions to be taken into account ...
    • A non-linear semantic mapping technique for cross-language sentence matching 

      Banchs Martínez, Rafael Enrique; Ruiz Costa-Jussà, Marta (2010-08-01)
      Article
      Restricted access - publisher's policy
    • Abstractive text summarization with attention-based mechanism 

      Sanjabi, Nima (Universitat Politècnica de Catalunya, 2018-04)
      Master thesis
      Open Access
      In this work, we explore the evolution of Sequential Neural Models, and their use as a Summarizer System. Transformer is a recently proposed model with a high potential. We experiment and compare their result in abstractive ...
    • AMALEU: una representación universal del lenguaje basada en aprendizaje automático 

      Ruiz Costa-Jussà, Marta (2020-09)
      Article
      Open Access
      The goal of the AMALEU project is to learn a common representation for different languages. The aim is to have one common representation for spoken language and one for written language. AMALEU, which lasts two years, ...
    • An analysis of Twitter corpora and the differences between formal and colloquial tweets 

      González Bermúdez, Meritxell (CEUR-WS.org, 2015)
      Conference report
      Open Access
      This work reviews recent publications addressing the Twitter translation task, and highlights the lack of appropriate corpora that represent the colloquial language used on Twitter. It also discusses the most well-know ...
    • An IR-based strategy for supporting Chinese-Portuguese translation services in off-line model 

      Centelles, Jordi; Ruiz Costa-Jussà, Marta; Banchs Martínez, Rafael Enrique; Gelbukh, Alexander (2014-04-01)
      Article
      Restricted access - publisher's policy
      This paper describes an Information Retrieval engine that is used to support our Chinese-Portuguese machine translation services when no internet connection is available. Our mobile translation app, which is deployed on ...
    • An overview of the phrase-based statistical machine translation techniques 

      Ruiz Costa-Jussà, Marta (2012-12-01)
      Article
      Restricted access - publisher's policy
    • Automatic normalization of short texts by combining statistical and rule-based techniques 

      Ruiz Costa-Jussà, Marta; Banchs, Rafael E. (2013-03-01)
      Article
      Open Access
      Short texts are typically composed of a small number of words, most of which are abbreviations, typos and other kinds of noise. This makes the noise-to-signal ratio relatively high for this specific category of text. A high ...
    • Automatic Spanish translation of SQuAD dataset for multi-lingual question answering 

      Carrino, Casimiro Pio; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (European Language Resources Association (ELRA), 2020)
      Conference lecture
      Open Access
      Recently, multilingual question answering has become a crucial research topic, and it is receiving increased interest in the NLP community. However, the unavailability of large-scale datasets makes it challenging to train ...
    • Block-based Speech-to-Speech Translation 

      Roca, Sandra (Universitat Politècnica de Catalunya, 2018-10)
      Bachelor thesis
      Open Access
      This thesis explores different ways of implementing a block-based Speech Translation system in order to generate large amounts of data for building a large parallel speech corpus. The first task consists ...
    • Chinese-Catalan neural machine translation with OpenNMT 

      Wang, Chaofeng (Universitat Politècnica de Catalunya, 2018-07)
      Bachelor thesis
      Restricted access - author's decision
    • Chinese-Catalan: A neural machine translation approach based on pivoting and attention mechanisms 

      Ruiz Costa-Jussà, Marta; Casas Manzanares, Noé; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián (2019-01-01)
      Article
      Open Access
      This article innovatively addresses machine translation from Chinese to Catalan using neural pivot strategies trained without any direct parallel data. The Catalan language is very similar to Spanish from a linguistic point ...
    • Chinese–Spanish neural machine translation enhanced with character and word bitmap fonts 

      Ruiz Costa-Jussà, Marta; Aldón Mínguez, David; Rodríguez Fonollosa, José Adrián (2017-04-06)
      Article
      Open Access
      Recently, machine translation systems based on neural networks have reached state-of-the-art results for some pairs of languages (e.g., German–English). In this paper, we are investigating the performance of neural machine ...
    • Combining subword representations into word-level representations in the transformer architecture 

      Casas Manzanares, Noé; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2020)
      Conference lecture
      Open Access
      In Neural Machine Translation, using word-level tokens leads to degradation in translation quality. The dominant approaches use subword-level tokens, but this increases the length of the sequences and makes it difficult ...
    • Context-aware machine translation for software localization 

      Muntés Mulero, Víctor; Paladini Adell, Patricia; España Bonet, Cristina; Màrquez Villodre, Lluís (2012)
      Conference lecture
      Open Access
      Software localization requires translating short text strings appearing in user interfaces (UI) into several languages. These strings are usually unrelated to the other strings in the UI. Due to the lack of semantic ...
    • Convolutional neural networks for classifying studio/non-studio frames in TV news programs 

      Nevot Ezpeleta, Alexis Dionisio (Universitat Politècnica de Catalunya, 2020-07-15)
      Master thesis
      Open Access
      This project has studied an unusual topic on image recognition and classification that can have numerous utilities on multimedia content. The approach studied on this project has been ...
    • Coupling hierarchical word reordering and decoding in phrase-based statistical machine translation 

      Dras, Mark; Khalilov, Maxim; Rodríguez Fonollosa, José Adrián (2009-06)
      Conference lecture
      Open Access
      In this paper, we start with the existing idea of taking reordering rules automatically derived from syntactic representations, and applying them in a preprocessing step before translation to make the source sentence ...
    • Coverage for character based neural machine translation 

      Kazimi, Bashir; Ruiz Costa-Jussà, Marta (2017-09-22)
      Article
      Open Access
      In recent years, Neural Machine Translation (NMT) has achieved state-of-the-art performance in translating from one language (the source language) to another (the target language). However, many of the proposed methods use word ...
    • Coverage model for character-based neural machine translation 

      Kazimi, Mohammad Bashir (Universitat Politècnica de Catalunya, 2017-05)
      Master thesis
      Open Access
      In recent years, Neural Machine Translation (NMT) has achieved state-of-the-art performance in translating from one language (the source language) to another (the target language). However, many of the proposed methods use word ...
    • Decision, Integration and Deployment of a Translation Management Software 

      Roselló Rosiñol, Aleix (Universitat Politècnica de Catalunya, 2020-06)
      Bachelor thesis
      Open Access
      Covenantee: Andjoy
      In a context of globalization and internationalization, software applications can be used all over the world and in different regions. For this reason, companies are obliged to translate the content ...