Browsing by Subject "Automatic speech recognition"
-
A bilingual Spanish-Catalan database of units for concatenative synthesis
(1998)
Conference report
Open Access
Different databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text in either Catalan or Spanish, with ...
-
A conversation analysis framework using speech recognition and naïve bayes classification for construction process monitoring
(American Society of Civil Engineers (ASCE), 2018)
Conference report
Restricted access - publisher's policy
At a dynamic construction site, conversations convey vital information including construction activities, operation status, and task performance. Even though because of information security, recording the entire conversations ...
-
A low-power, high-performance speech recognition accelerator
(Institute of Electrical and Electronics Engineers (IEEE), 2019-12-01)
Article
Open Access
Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at a high energy cost, which is not affordable for mobile devices with tiny power budgets. ...
-
A programmable accelerator for streaming automatic speech recognition on edge devices
(2022)
Conference report
Open Access
Automatic Speech Recognition (ASR) is quickly becoming a mainstream technology, mainly driven by the outstanding accuracy achieved by modern systems based on machine learning. However, these systems often require billions ...
-
A Speech-based Dialogue System for Household Robots
(Universitat Politècnica de Catalunya / Technische Universiteit Delft, 2011)
Master thesis (pre-Bologna period)
Restricted access - author's decision
This thesis studies mechanisms to improve human-robot interaction through spoken dialogue for household robots. Therefore, a full dialogue system, in which the semantics of the words play an important role, is ...
-
Action plan for dissemination
(2012-07-29)
Research report
Open Access
The central objective of the Metanet4u project is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, ...
-
Action plan for dissemination updated
(2012-07-04)
Research report
Open Access
Deliverable D5.3 of the METANET4U project (Project CIP #270893)
-
Adaptación del sistema texto a voz "Festival" al catalán
(Universitat Politècnica de Catalunya, 2007-01-08)
Master thesis (pre-Bologna period)
Open Access
-
Age prediction by voice using deep learning
(Universitat Politècnica de Catalunya, 2023-01-30)
Master thesis
Open Access
One of the main topics in artificial intelligence is speech characterization. Moreover, it is a field that has received minimal study where the Catalan language is concerned. In this project, we try to perform an age ...
-
AI-Vocie: intel·ligència artificial aplicada al reconeixement de la veu
(Universitat Politècnica de Catalunya, 2019-10)
Bachelor thesis
Open Access
The goal of this bachelor's thesis is to design and implement a smart speaker, Lima, that is simple yet practical and responds to commands, simulating the way humans think. The aim is that ...
-
Alzheimer disease diagnosis based on automatic spontaneous speech analysis
(SciTePress, 2012)
Conference report
Restricted access - publisher's policy
Alzheimer's disease (AD) is the most prevalent form of progressive degenerative dementia and has a high socio-economic impact in Western countries; it is therefore one of the most active research areas today. Its diagnosis ...
-
An ASR prototype for Spanish dictation
(Universitat Politècnica de Catalunya, 2020-01)
Bachelor thesis
Open Access
Automatic Speech Recognition (ASR), or speech-to-text conversion, has been studied by many researchers for decades due to its various applications. In this project I propose to implement an ASR based on Hidden Markov Model ...
-
An information-theoretic string matching approach for spoken utterance verification and keyword spotting
(Universitat Politècnica de Catalunya, 2016)
Master thesis (pre-Bologna period)
Restricted access - author's decision
The goal of this project is to develop an information-theoretic acoustic-phonetic approach to detect the presence of words or phrases in an utterance. Specifically, the project focuses on two types of detection tasks in ...
-
An ultra low-power hardware accelerator for acoustic scoring in speech recognition
(Institute of Electrical and Electronics Engineers (IEEE), 2017)
Conference report
Restricted access - publisher's policy
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy often has to be sacrificed in order to fit the strict power constraints of mobile systems. However, accuracy is extremely ...
-
An ultra low-power hardware accelerator for automatic speech recognition
(IEEE Press, 2016)
Conference report
Open Access
Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at a high energy cost which is not affordable for the tiny power budget of mobile devices. ...
-
Analisis estadistico de orden superior de la voz
(1991)
Conference report
Open Access
Most of the speech analysis methods developed to date have been based on the autocorrelation function or power spectrum, i.e., the second-order statistics of the signal. In this paper it is shown that higher order ...
-
Análisis de los servicios y sistemas de comunicaciones de voz y datos para la implantación de un sistema de telefonía IP, en un entorno editorial
(Universitat Politècnica de Catalunya, 2011-04-27)
Master thesis (pre-Bologna period)
Restricted access - author's decision
The main objective of this document is to analyze the current voice communications system and, in particular, the quality of the service offered, and to study the requirements of the voice communications services ...
-
Aplicació de la lectura de llavis automatitzada a l'accessibilitat: escriptura per imatge
(Universitat Politècnica de Catalunya, 2023-06-28)
Bachelor thesis
Open Access
In recent years, significant advances in artificial intelligence have opened new avenues for promoting diversity and inclusion in society. This progress has provided powerful tools that can ...
-
Automatic Spanish translation of SQuAD dataset for multi-lingual question answering
(European Language Resources Association (ELRA), 2020)
Conference lecture
Open Access
Recently, multilingual question answering has become a crucial research topic, and it is receiving increased interest in the NLP community. However, the unavailability of large-scale datasets makes it challenging to train ...
-
Automatic speech recognition with deep neural networks for impaired speech
(Springer, 2016)
Conference report
Open Access
Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. ...