Browse by subject "Automatic speech recognition"

A bilingual Spanish-Catalan database of units for concatenative synthesis

Esquerra Llucià, Ignasi; Bonafonte Cávez, Antonio; Vallverdú Bayés, Sisco; Febrer Godayol, Albert (1998)
Conference report
Open access

Different databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text in either Catalan or Spanish, with ...

A Catalan broadcast conversational speech database

Schulz, Henrik; Rodríguez Fonollosa, José Adrián (2009-09)
Conference lecture
Restricted access (publisher's policy)

Data-driven methods in speech and linguistic research, and system development, require appropriate speech databases. A new Catalan speech database has been developed with a particular emphasis on broadcast conversational ...

A conversation analysis framework using speech recognition and naïve bayes classification for construction process monitoring

Zhang, T.; Lee, Y. C.; Zhu, Y.; Hernando Pericás, Francisco Javier (American Society of Civil Engineers (ASCE), 2018)
Conference report
Restricted access (publisher's policy)

At a dynamic construction site, conversations convey vital information including construction activities, operation status, and task performance. Even though because of information security, recording the entire conversations ...

A low-power, high-performance speech recognition accelerator

Yazdani, Reza; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019-12-01)
Article
Open access

Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. ...

A multi-microphone approach to speech processing in a smart-room environment

Abad Gareta, Alberto (Universitat Politècnica de Catalunya, 2007-06-29)
Thesis
Open access

Recent advances in computer technology and in speech and language processing, among others, have made new ways of communication between people and machines begin to seem feasible. ...

A programmable accelerator for streaming automatic speech recognition on edge devices

Pinto Rivero, Dennis; Arnau Montañés, José María; González Colás, Antonio María (2022)
Conference report
Open access

Automatic Speech Recognition (ASR) is quickly becoming a mainstream technology, mainly driven by the outstanding accuracy achieved by modern systems based on machine learning. However, these systems often require billions ...

A Speech-based Dialogue System for Household Robots

Pons Rueda, Susana (Universitat Politècnica de Catalunya / Technische Universiteit Delft, 2011)
Final degree project/thesis
Restricted access (author's decision)

This thesis studies mechanisms to improve human-robot interaction through a spoken dialogue for household robots. Therefore, a full dialogue system, in which the semantics of the words play an important role, is ...

Action plan for dissemination

Cristea, Dan; Trandabăț, Diana; Branco, Antonio; Mendes, Amalia; Pellegrini, Thomas; Thompson, Paul; Irimia, Elena; Tufis, Dan; Gilmenau, Georgiana; Rosner, Mike; Moreno Bilbao, M. Asunción; Bel, Núria (2012-07-29)
Research report
Open access

The central objective of the Metanet4u project is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, ...

Action plan for dissemination updated

Cristea, Dan; Trandabăț, Diana; Branco, Antonio; Mendes, Amalia; Pellegrini, Thomas; Thompson, Paul; Tufis, Dan; Gilmenau, Georgiana; Rosner, Mike; Moreno Bilbao, M. Asunción; Bel, Núria (2012-07-04)
Research report
Open access

Deliverable D5.3 of the METANET4U project (Project CIP #270893)

Adaptación del sistema texto a voz "Festival" al catalán

Jaén Gómez, Alejandro (Universitat Politècnica de Catalunya, 2007-01-08)
Final degree project/thesis
Open access

Age prediction by voice using deep learning

Linde Martínez, David (Universitat Politècnica de Catalunya, 2023-01-30)
Official master's thesis
Open access

One of the main topics in artificial intelligence is speech characterization. Moreover, it is a field that has received minimal study where the Catalan language is involved. In this project, we try to perform an age ...

AI-Vocie: intel·ligència artificial aplicada al reconeixement de la veu

Gil Aguilar, Aleix (Universitat Politècnica de Catalunya, 2019-10)
Bachelor's thesis
Open access

The aim of this bachelor's thesis is to design and implement a smart speaker, Lima, that is simple yet practical and responds to commands, simulating the human way of thinking. The purpose is that ...

Alzheimer disease diagnosis based on automatic spontaneous speech analysis

Lopez de Ipiña Peña, Karmele; Alonso Hernandez, Jesus Bernardino; Sole Casals, Jordi; Barroso Moreno, Nora; Faúndez Zanuy, Marcos; Ecay Torres, Mirian; Travieso Gonzalez, Carlos Manuel; Ezeiza Ramos, Aitzol; Estanga Alustiza, Ainara (SciTePress, 2012)
Conference report
Restricted access (publisher's policy)

Alzheimer's disease (AD) is the most prevalent form of progressive degenerative dementia and has a high socio-economic impact in Western countries; it is therefore one of the most active research areas today. Its diagnosis ...

An ASR prototype for Spanish dictation

Cosano Serra, Marta (Universitat Politècnica de Catalunya, 2020-01)
Bachelor's thesis
Open access

Automatic Speech Recognition (ASR), or speech-to-text conversion, has been a subject of study for many researchers for decades due to its various applications. In this project I propose to implement an ASR based on Hidden Markov Model ...

An information-theoretic string matching approach for spoken utterance verification and keyword spotting

Quer Romeo, Guillem (Universitat Politècnica de Catalunya, 2016)
Final degree project/thesis
Restricted access (author's decision)

The goal of this project is to develop an information-theoretic acoustic-phonetic approach to detect the presence of words or phrases in an utterance. Specifically, the project focuses on two types of detection tasks in ...

An ultra low-power hardware accelerator for acoustic scoring in speech recognition

Tabani, Hamid; Arnau Montañés, José María; Tubella Murgadas, Jordi; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2017)
Conference report
Restricted access (publisher's policy)

Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy often has to be sacrificed in order to fit the strict power constraints of mobile systems. However, accuracy is extremely ...

Analisis estadistico de orden superior de la voz

Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción (1991)
Conference report
Open access

Most of the speech analysis methods developed to date have been based on the autocorrelation function or power spectrum, i.e., the second-order statistics of the signal. In this paper it is shown that higher order ...

Análisis de los servicios y sistemas de comunicaciones de voz y datos para la implantación de un sistema de telefonía IP, en un entorno editorial

Giné Figueras, Francesc (Universitat Politècnica de Catalunya, 2011-04-27)
Final degree project/thesis
Restricted access (author's decision)

The main objective of this document is to analyze the current voice communications system and, in particular, the quality of the service offered, to study the requirements of the voice communication services ...

Aplicació de la lectura de llavis automatitzada a l'accessibilitat: escriptura per imatge

Guevara Moran, Meritxell (Universitat Politècnica de Catalunya, 2023-06-28)
Bachelor's thesis
Open access

In recent years, significant advances in artificial intelligence have opened new avenues for promoting diversity and inclusion in society. This progress has provided powerful tools that can ...

Automatic Spanish translation of SQuAD dataset for multi-lingual question answering

Carrino, Casimiro Pio; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián (European Language Resources Association (ELRA), 2020)
Conference lecture
Open access

Recently, multilingual question answering has become a crucial research topic, and it is receiving increased interest in the NLP community. However, the unavailability of large-scale datasets makes it challenging to train ...