Browse by author "Ruiz Costa-Jussà, Marta"
Showing items 96-115 of 135
-
On the locality of attention in direct speech translation
Alastruey Lasheras, Belén; Ferrando Monsonís, Javier; Gallego Olsina, Gerard Ion; Ruiz Costa-Jussà, Marta (Association for Computational Linguistics, 2022)
Conference lecture
Open access
Transformers have achieved state-of-the-art results across multiple NLP tasks. However, the self-attention mechanism complexity scales quadratically with the sequence length, creating an obstacle for tasks involving long ...
-
On-line and off-line Chinese-Portuguese translation service for mobile applications
Centelles, Jordi; Ruiz Costa-Jussà, Marta; Banchs Martínez, Rafael Enrique; Gelbukh, Alexander (2014-07-01)
Article
Open access
We describe a Chinese-Portuguese translation service, which is integrated in an Android application. The application is also enhanced with technologies such as Automatic Speech Recognition, Optical Character Recognition, ...
-
Ongoing study for enhancing Chinese-Spanish translation with morphology strategies
Ruiz Costa-Jussà, Marta (2015)
Conference report
Open access
Chinese and Spanish have different morphology structures, which poses a big challenge for translating between this pair of languages. In this paper, we analyze several strategies to better generalize from the Chinese ...
-
Overcoming statistical machine translation limitations: error analysis and proposed solutions for the Catalan–Spanish language pair
Mariño Acebal, José Bernardo; Farrús Cabeceran, Mireia; Ruiz Costa-Jussà, Marta; Poch, Marc; Hernández Huerta, Adolfo; Herníquez, Carlos; Rodríguez Fonollosa, José Adrián (2011-02-20)
Article
Open access
This work aims to improve an N-gram-based statistical machine translation system between the Catalan and Spanish languages, trained with an aligned Spanish–Catalan parallel corpus consisting of 1.7 million sentences taken ...
-
Plagiarism detection using information retrieval and similarity measures based on image processing techniques
Ruiz Costa-Jussà, Marta; Banchs, Rafael E.; Grivolla, Jens; Codina, Joan (2010)
Conference report
Open access
This paper describes the Barcelona Media Innovation Center participation in the 2nd International Competition on Plagiarism Detection. Particularly, our system focused on the external plagiarism detection task, which assumes ...
-
Refinement of unsupervised cross-lingual word embeddings
Biesialska, Magdalena Marta; Ruiz Costa-Jussà, Marta (Ios Press, 2020)
Conference lecture
Open access
Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing to learn multilingual word representations even without using any direct bilingual signal. The lion's share ...
-
Search engine for multilingual audiovisual contents
Pérez, José David; Bonafonte Cávez, Antonio; Ruiz Costa-Jussà, Marta; Cardenal, Antonio; Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción; Navas, Eva; Rodríguez Banga, Eduardo (2012)
Conference lecture
Open access
This paper describes the BUCEADOR search engine, a web server that allows retrieving multimedia documents (text, audio, video) in different languages. All the documents are translated into the user language and are ...
-
Segmentation strategies to face morphology challenges in Brazilian-Portuguese/English statistical machine translation and its integration in cross-language information retrieval
Ruiz Costa-Jussà, Marta (2015-06-01)
Article
Open access
The use of morphology is particularly interesting in the context of statistical machine translation in order to reduce data sparseness and compensate any lack of training corpus. In this work, we propose several approaches ...
-
Selection of correction candidates for the normalization of Spanish user generated content
Melero, Maite; Ruiz Costa-Jussà, Marta; Lambert, Patrik; Quixal, Martí (2016-01-01)
Article
Open access
We present research aiming to build tools for the normalization of User-Generated Content (UGC). We argue that processing this type of text requires the revisiting of the initial steps of Natural Language Processing, since ...
-
Semantic and syntactic information for neural machine translation: Injecting features to the transformer
Armengol Estapé, Jordi; Ruiz Costa-Jussà, Marta (2021-05-18)
Article
Open access
Introducing factors such as linguistic features has long been proposed in machine translation to improve the quality of translations. More recently, factored machine translation has proven to still be useful in the case ...
-
SHAS: approaching optimal segmentation for end-to-end speech translation
Tsiamas, Ioannis; Gallego Olsina, Gerard Ion; Fonollosa, José A. R.; Ruiz Costa-Jussà, Marta (2022-02)
Research report
Open access
Speech translation models are unable to directly process long audios, like TED talks, which have to be split into shorter segments. Speech translation datasets provide manual segmentations of the audios, which are not ...
-
SHAS: approaching optimal segmentation for end-to-end speech translation
Tsiamas, Ioannis; Gallego Olsina, Gerard Ion; Fonollosa, José A. R.; Ruiz Costa-Jussà, Marta (International Speech Communication Association (ISCA), 2022)
Conference report
Open access
Speech translation models are unable to directly process long audios, like TED talks, which have to be split into shorter segments. Speech translation datasets provide manual segmentations of the audios, which are not ...
-
State-of-the-Art word reordering approaches in statistical machine translation: a survey
Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (2009-11-01)
Article
Open access
This paper surveys several state-of-the-art reordering techniques employed in Statistical Machine Translation systems. Reordering is understood as the word-order redistribution of the translated words. In original SMT ...
-
Statistical machine translation enhancements through linguistic levels: a survey
Ruiz Costa-Jussà, Marta; Farrus, Mireia (2014-01)
Article
Restricted access - publisher's policy
Machine translation can be considered a highly interdisciplinary and multidisciplinary field because it is approached from the point of view of human translators, engineers, computer scientists, mathematicians, and linguists. ...
-
Study and correlation analysis of linguistic, perceptual and automatic machine translation evaluations
Farrus, Mireia; Ruiz Costa-Jussà, Marta; Popovic, Maya; Henriquez, Carlos A (2012-01-01)
Article
Open access
Evaluation of machine translation output is an important task. Various human evaluation techniques as well as automatic metrics have been proposed and investigated in the last decade. However, very few evaluation methods ...
-
Syntax-driven iterative expansion language models for controllable text generation
Casas Manzanares, Noé; Rodríguez Fonollosa, José Adrián; Ruiz Costa-Jussà, Marta (Association for Computational Linguistics, 2020)
Conference lecture
Open access
The dominant language modeling paradigm handles text as a sequence of discrete tokens. While that approach can capture the latent structure of the text, it is inherently constrained to sequential dynamics for text generation. ...
-
Terminology-aware segmentation and domain feature for the WMT19 biomedical translation task
Carrino, Casimiro Pio; Rafieian, Bardia; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2019)
Conference report
Restricted access - publisher's policy
In this work, we give a description of the TALP-UPC systems submitted for the WMT19 Biomedical Translation Task. Our proposed strategy is NMT model-independent and relies only on one ingredient, a biomedical terminology ...
-
The IPN-CIC team system submission for the WMT 2020 similar language task
Menéndez-Salazar, Luis A.; Sidorov, Grigori; Ruiz Costa-Jussà, Marta (Association for Computational Linguistics, 2020)
Conference lecture
Open access
This paper describes the participation of the NLP research team of the IPN Computer Research center in the WMT 2020 Similar Language Translation Task. We have submitted systems for the Spanish-Portuguese language pair (in ...
-
The TALP & I2R SMT Systems for IWSLT 2008
Li, H.; Aw, A.; Zhang, Ming; Khalilov, Maxim; Ruiz Costa-Jussà, Marta; Henríquez Quintana, Carlos Alberto; Rodríguez Fonollosa, José Adrián; Hernández, A.; Mariño Acebal, José Bernardo; Banchs Martínez, Rafael Enrique; Chen, B. (NICT/ATR, 2008-10-31)
Conference lecture
Open access
This paper gives a description of the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Politècnica de Catalunya) for our participation in the IWSLT’08 evaluation ...
-
The TALP on-line Spanish-Catalan machine-translation system
Poch, M; Farrús Cabeceran, Mireia; Ruiz Costa-Jussà, Marta; Mariño Acebal, José Bernardo; Hernández, Adolfo; Henríquez Quintana, Carlos Alberto; Rodríguez Fonollosa, José Adrián (2009-09)
Conference lecture
Open access
In this paper the statistical machine translator (SMT) between Catalan and Spanish developed at the TALP research center (UPC) and its web demonstration are described.