Browsing by author "Rodríguez Fonollosa, José Adrián"

Combining subword representations into word-level representations in the transformer architecture

Casas Manzanares, Noé; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2020)
Conference lecture
Open access

In Neural Machine Translation, using word-level tokens leads to degradation in translation quality. The dominant approaches use subword-level tokens, but this increases the length of the sequences and makes it difficult ...

Conditional distribution variability measures for causality detection

Rodríguez Fonollosa, José Adrián (Springer, 2019)
Book chapter
Restricted access (publisher's policy)

In this paper we derive variability measures for the conditional probability distributions of a pair of random variables, and we study their application in the inference of cause-effect relationships. We also study the ...

Correcting input noise in SMT as a char-based translation problem

Formiga Fanals, Lluís; Rodríguez Fonollosa, José Adrián (2012-10-31)
Research report
Open access

Misspelled words have a direct impact on the final quality obtained by Statistical Machine Translation (SMT) systems as the input becomes noisy and unpredictable. This paper presents some improvement strategies for translating ...

Coupling hierarchical word reordering and decoding in phrase-based statistical machine translation

Dras, Mark; Khalilov, Maxim; Rodríguez Fonollosa, José Adrián (2009-06)
Conference lecture
Open access

In this paper, we start with the existing idea of taking reordering rules automatically derived from syntactic representations, and applying them in a preprocessing step before translation to make the source sentence ...

Cuantificación vectorial adaptativa de la voz

Rodríguez Fonollosa, José Adrián; Masgrau Gómez, Enrique José; Carbonell, Rafael M. (1989)
Conference report
Open access

Vector quantization (VQ) is the simultaneous quantization of a sequence of samples, or vector. This process allows effective use of the interrelations among the different vector components and performance arbitrarily ...

Cuantificación vectorial en codificación de voz por excitación multipulso

Moreno Bilbao, M. Asunción; Rodríguez Fonollosa, José Adrián (1991)
Conference report
Open access

This paper compares some quantizers applied in a multipulse speech coder. Vector quantization is compared against scalar quantization of the LPC parameters in the short-term predictor. Adaptive Multistage ...

Dealing with input noise in statistical machine translation

Formiga Fanals, Lluís; Rodríguez Fonollosa, José Adrián (2012)
Conference lecture
Open access

Misspelled words have a direct impact on the final quality obtained by Statistical Machine Translation (SMT) systems as the input becomes noisy and unpredictable. This paper presents some improvement strategies for ...

DeepVoice: tecnologías de aprendizaje profundo aplicadas al procesado de voz y audio

Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (2017-09-01)
Article
Open access

This project proposes the development of new deep learning methods for speech and audio processing, exploring new applications and continuing the initial work of the research team and the international community. Research ...

DeepVoice: tecnologías de aprendizaje profundo aplicadas al procesado de voz y audio

Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (2017-09-22)
Article
Open access

This project proposes the development of new architectures for speech and audio processing using deep learning methods, also exploring new applications and continuing the initial work of the ...

Determining CPU and memory requirements for real-time speech recognition systems using the TMS320C3x/C4x

Batlle Mont, Eloi; Rodríguez Fonollosa, José Adrián (Texas Instruments (TI), 1996)
Conference report
Restricted access (publisher's policy)

Developing a computer system using real-time speech recognition previously required a workstation using non-specialized CPUs. Limits to the system were imposed by the amount of memory and hardware required. The Texas ...

End-to-end speech translation with pre-trained models and adapters: UPC at IWSLT 2021

Gallego Olsina, Gerard Ion; Tsiamas, Ioannis; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián; Ruiz Costa-Jussà, Marta (Association for Computational Linguistics, 2021)
Conference report
Open access

This paper describes the submission to the IWSLT 2021 offline speech translation task by the UPC Machine Translation group. The task consists of building a system capable of translating English audio recordings extracted ...

End-to-end speech translation with the transformer

Cross Vila, Laura; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián; Ruiz Costa-Jussà, Marta (Antonio Bonafonte, Jordi Luque and Francesc Alías Pujol, 2018)
Conference lecture
Restricted access (publisher's policy)

Speech Translation has been traditionally addressed with the concatenation of two tasks: Speech Recognition and Machine Translation. This approach has the main drawback that errors are concatenated. Recently, neural ...

English-Latvian SMT: the challenge of translating into a free word order language

Khalilov, Maxim; Rodríguez Fonollosa, José Adrián; Skadina, Inguna; Bralitis, Edgar; Pretkalnina, Lauma (2010)
Conference lecture
Open access

This paper presents a comparative study of two approaches to statistical machine translation (SMT) and their application to a task of English-to-Latvian translation, which is still an open research line in the field of ...

Enhancing sequence-to-sequence modeling for RDF triples to natural text

Domingo Roig, Oriol; Bergés Lladó, David; Cantenys Sabà, Roser; Creus Castanyer, Roger; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2020)
Conference report
Open access

This work establishes key guidelines on how, which and when Machine Translation (MT) techniques are worth applying to the RDF-to-Text task. Not only do we apply and compare the most prominent MT architecture, the Transformer, but we ...

Estimation of the modulation index of CPM signals using HOS

Rodríguez Fonollosa, Javier; Rodríguez Fonollosa, José Adrián (Institute of Electrical and Electronics Engineers (IEEE), 1993)
Conference report
Open access

Three simple methods are proposed for the estimation of the modulation index of continuous phase modulated signals in noise. These methods employ the estimated autocorrelation and fourth-order cumulant sequences of the ...

Feature decorrelation methods in speech recognition. A comparative study

Batlle Mont, Eloi; Nadeu Camprubí, Climent; Rodríguez Fonollosa, José Adrián (International Speech Communication Association (ISCA), 1998)
Conference report
Open access

In this paper we study various decorrelation methods for the features used in speech recognition and we compare the performance of each one by running several tests with a speech database. First of all we study the ...

FIR system identification using a linear combination of cumulants

Rodríguez Fonollosa, José Adrián; Vidal Manzano, José; Moreno Bilbao, M. Asunción (IEEE INT. CONF. ON ACOUSTICS, SPEECH & SIGNAL PROC, 1992)
Conference report
Open access

A general linear approach to identifying the parameters of a moving average (MA) model from the statistics of the output is developed. It is shown that, under some constraints, the impulse response of the system can be ...

First experiments on an HMM based double layer framework for automatic continuous speech recognition

Nogueiras Rodríguez, Albino; Casar López, Marta; Rodríguez Fonollosa, José Adrián; Caballero Galeote, Mónica (2006)
Conference lecture
Restricted access (publisher's policy)

The usual approach to automatic continuous speech recognition is what can be called the acoustic-phonetic modelling approach. In this approach, voice is considered to hold two different kinds of information: acoustic and ...

From bilingual to multilingual neural machine translation by incremental training

Escolano Peinado, Carlos; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (Association for Computational Linguistics, 2019)
Conference lecture
Open access

Multilingual Neural Machine Translation approaches are based on the use of task specific models and the addition of one more language can only be done by retraining the whole system. In this work, we propose a new training ...

From bilingual to multilingual neural-based machine translation by incremental training

Escolano Peinado, Carlos; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (2020-08-02)
Article
Open access

A common intermediate language representation in neural machine translation can be used to extend bilingual systems by incremental training. We propose a new architecture based on introducing an interlingual loss as an ...