Exploració per tema "Reconeixement automàtic de la parla"
Ara es mostren els items 1-20 de 206
-
A bilingual Spanish-Catalan database of units for concatenative synthesis
(1998)
Text en actes de congrés
Accés obertDifferent databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text either in Catalan and Spanish, with ... -
A Catalan broadcast conversational speech database
(2009-09)
Comunicació de congrés
Accés restringit per política de l'editorialData driven methods in speech and linguistic research, and system develoment require appropriate speech databases. A new Catalan speech database has been developed with a particular emphasis on broadcast conversational ... -
A conversation analysis framework using speech recognition and naïve bayes classification for construction process monitoring
(American Society of Civil Engineers (ASCE), 2018)
Text en actes de congrés
Accés restringit per política de l'editorialAt a dynamic construction site, conversations convey vital information including construction activities, operation status, and task performance. Even though because of information security, recording the entire conversations ... -
A low-power, high-performance speech recognition accelerator
(Institute of Electrical and Electronics Engineers (IEEE), 2019-12-01)
Article
Accés obertAutomatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. ... -
A multi-microphone approach to speech processing in a smart-room environment
(Universitat Politècnica de Catalunya, 2007-06-29)
Tesi
Accés obertEls avenços recents en tecnologia informàtica i processament de la parla i del llenguatge, entre altres, han fet possible que noves maneres de comunicació entre les persones i les màquines comencin a semblar factibles. ... -
A programmable accelerator for streaming automatic speech recognition on edge devices
(2022)
Text en actes de congrés
Accés obertAutomatic Speech Recognition (ASR) is quickly becoming a mainstream technology, mainly driven by the outstanding accuracy achieved by modern systems based on machine learning. However, these systems often require billions ... -
A Speech-based Dialogue System for Household Robots
(Universitat Politècnica de Catalunya / Technische Universiteit Delft, 2011)
Projecte/Treball Final de Carrera
Accés restringit per decisió de l'autorThis thesis studies mechanisms to improve human-robot-interaction through a spoken dialogue for household robots. Therefore, a full dialogue system, in which the semantics of the words play an important role, is ... -
Action plan for dissemination
(2012-07-29)
Report de recerca
Accés obertThe central objective of the Metanet4u project is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, ... -
Action plan for dissemination updated
(2012-07-04)
Report de recerca
Accés obertDeliverable D5.3 del projecte METANET4U (Project CIP #270893) -
Adaptación del sistema texto a voz "Festival" al catalán
(Universitat Politècnica de Catalunya, 2007-01-08)
Projecte/Treball Final de Carrera
Accés obert -
Age prediction by voice using deep learning
(Universitat Politècnica de Catalunya, 2023-01-30)
Projecte Final de Màster Oficial
Accés obertOne of the main topics in artificial intelligence is the speech characterization. Moreover, it is a field of study with the minimal scope when the Catalan language is involved in. In this project, we try to perform an age ... -
AI-Vocie: intel·ligència artificial aplicada al reconeixement de la veu
(Universitat Politècnica de Catalunya, 2019-10)
Treball Final de Grau
Accés obertL’objectiu d’aquest treball de final de grau és dissenyar i implementar un Altaveu Intel·ligent, Lima, senzill, però pràctic, i que respongui a ordres, simulant la manera de pensar de l’ésser humà. La finalitat és que ... -
Alzheimer disease diagnosis based on automatic spontaneous speech analysis
(SciTePress, 2012)
Text en actes de congrés
Accés restringit per política de l'editorialAlzheimer's disease (AD) is the most prevalent form of progressive degenerative dementia and it has a high socio-economic impact in Western countries, therefore is one of the most active research areas today. Its diagnosis ... -
An ASR prototype for Spanish dictation
(Universitat Politècnica de Catalunya, 2020-01)
Treball Final de Grau
Accés obertAutomatic Speech Recognition (ASR), or speech to text conversion, has been subject to many researchers for decades due to its various applications. In this project I propose to implement an ASR based on Hidden Markov Model ... -
An information-theoretic string matching approach for spoken utterance verification and keyword spotting
(Universitat Politècnica de Catalunya, 2016)
Projecte/Treball Final de Carrera
Accés restringit per decisió de l'autorThe goal of this project is to develop an information-theoretic acoustic-phonetic approach to detect the presence of words or phrases in an utterance. Specifically, the project focuses on two types of detection tasks in ... -
An ultra low-power hardware accelerator for acoustic scoring in speech recognition
(Institute of Electrical and Electronics Engineers (IEEE), 2017)
Text en actes de congrés
Accés restringit per política de l'editorialAccurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has often to be sacrificed in order to fit the strict power constraints of mobile systems. However, accuracy is extremely ... -
Analisis estadistico de orden superior de la voz
(1991)
Text en actes de congrés
Accés obertMost of the speech analysis methods developed up to date have been based on the autocorrelation function or power spectrum, i. e., the second order statistics of the signa!. In this paper it is shown that higher order ... -
Análisis de los servicios y sistemas de comunicaciones de voz y datos para la implantación de un sistema de telefonía IP, en un entorno editorial
(Universitat Politècnica de Catalunya, 2011-04-27)
Projecte/Treball Final de Carrera
Accés restringit per decisió de l'autorCatalà: L'objectiu principal del present document és el d'analitzar el sistema de comunicacions de veu actual i, en especial, la qualitat del servei ofertat, estudiar els requeriments dels serveis de comunicacions de veu ... -
Aplicació de la lectura de llavis automatitzada a l'accessibilitat: escriptura per imatge
(Universitat Politècnica de Catalunya, 2023-06-28)
Treball Final de Grau
Accés obertEn els darrers anys, els avenços significatius en la intel·ligència artificial han obert noves vies per a promoure la diversitat i la integració a la societat. Aquests progressos han proporcionat eines potents que es poden ... -
Automatic Spanish translation of SQuAD dataset for multi-lingual question answering
(European Language Resources Association (ELRA), 2020)
Comunicació de congrés
Accés obertRecently, multilingual question answering became a crucial research topic, and it is receiving increased interest in the NLP community.However, the unavailability of large-scale datasets makes it challenging to train ...