Browse by subject "Reconeixement automàtic de la parla" (Automatic speech recognition)
Showing items 21-40 of 206
-
Automatic speech recognition with deep neural networks for impaired speech
(Springer, 2016)
Conference report
Open access
Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. ...
-
Automatic speech recognition with Kaldi toolkit
(Universitat Politècnica de Catalunya, 2016-02-08)
Bachelor's thesis
Open access
Carried out at/with: Akademia Górniczo-Hutnicza im. S. Staszica w Krakowie
The topic of this thesis is to build an accurate automatic speech recognition system able to recognize speech using Kaldi, an open-source toolkit for speech recognition written in C++, and with free data. First of ...
-
Awareness, mobilisation and dissemination actions
(2012-01-31)
Research report
Open access
The central objective of the Metanet4u project is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, ...
-
BaNa: a noise resilient fundamental frequency detection algorithm for speech and music
(2014-08-27)
Article
Open access
Fundamental frequency (F0) is one of the essential features in many acoustic related applications. Although numerous F0 detection algorithms have been developed, the detection accuracy in noisy environments still needs ...
-
Blind channel equalization using weighted subspace methods
(Institute of Electrical and Electronics Engineers (IEEE), 1999)
Conference report
Open access
This paper addresses the problems of blind channel estimation and symbol detection with second order statistics methods from the received data. It can be shown that this problem is similar to direction of arrival (DOA) ...
-
Block-based Speech-to-Speech Translation
(Universitat Politècnica de Catalunya, 2018-10)
Bachelor's thesis
Open access
This thesis explores different ways of implementing a block-based speech translation system with the aim of generating large amounts of data to build a large parallel speech corpus. The first task consists ...
-
BUCEADOR hybrid TTS for blizzard challenge 2011
(2011)
Conference report
Open access
This paper describes the Text-to-Speech (TTS) systems presented by the Buceador Consortium in the Blizzard Challenge 2011 evaluation campaign. The main system is a concatenative hybrid one that tries to combine the strong ...
-
BUCEADOR, a multi-language search engine for digital libraries
(2012)
Conference lecture
Open access
This paper presents a web-based multimedia search engine built within the Buceador (www.buceador.org) research project. A proof-of-concept tool has been implemented which is able to retrieve information from a digital ...
-
Building synthetic voices in the METANET framework
(2012)
Conference lecture
Open access
METANET4U is a European project aiming at supporting language technology for European languages and multilingualism. It is a project in the META-NET Network of Excellence, a cluster of projects aiming at fostering the ...
-
Casc per a personal de serveis d'emergència controlat per veu
(Universitat Politècnica de Catalunya, 2017-07)
Bachelor's thesis
Open access
This project set out to create a prototype of personal protective equipment that allows interaction with different actuators through voice control, making the wearer's actions easier and safer in ...
-
Catalan Accent Classification by Voice using Deep Learning
(Universitat Politècnica de Catalunya, 2023-05-25)
Master's thesis
Open access
Speech characterization is a vital field in artificial intelligence, yet accent classification is often overlooked, particularly for the Catalan language. This project is centered on the classification of Catalan accents ...
-
Channel selection measures for multi-microphone speech recognition
(2014-02-01)
Article
Restricted access (publisher's policy)
Automatic speech recognition in a room with distant microphones is strongly affected by noise and reverberation. In scenarios where the speech signal is captured by several arbitrarily located microphones the degree of ...
-
Channel selection using N-best hypothesis for multi-microphone ASR
(2013)
Conference report
Restricted access (publisher's policy)
If speech is captured by several arbitrarily-located microphones in a room, the degree of distortion by noise and reverberation may vary strongly from one channel to another. Channel selection for automatic speech recognition ...
-
Characterization of Speech Recognition Systems on GPU Architectures
(Universitat Politècnica de Catalunya, 2016-07-04)
Master's thesis
Open access
This master thesis characterizes the performance and energy bottlenecks of speech recognition systems when running on modern GPUs, with the aim of providing useful information for designing future GPU architectures, as well ...
-
Collaborative voting of 3D features for robust gesture estimation
(Institute of Electrical and Electronics Engineers (IEEE), 2017)
Conference lecture
Open access
Human body analysis raises special interest because it enables a wide range of interactive applications. In this paper we present a gesture estimator that discriminates body poses in depth images. A novel collaborative ...
-
Combining phrase and neural-based machine translation: what worked and did not
(2017)
Article
Restricted access (publisher's policy)
Phrase-based machine translation assumes that all words are at the same distance and translates them using feature functions that approximate the probability at different levels. On the other hand, neural machine translation ...
-
Computation reuse in DNNs by exploiting input similarity
(Institute of Electrical and Electronics Engineers (IEEE), 2018)
Conference report
Restricted access (publisher's policy)
In recent years, Deep Neural Networks (DNNs) have achieved tremendous success for diverse problems such as classification and decision making. Efficient support for DNNs on CPUs, GPUs and accelerators has become a prolific ...
-
Control remot per veu d'un robot Open Source
(Universitat Politècnica de Catalunya, 2019-06)
Bachelor's thesis
Open access
Nowadays, interacting with machines through speech recognition is very much in vogue. This way of interacting with machines has been under development for many years. This work will focus ...
-
Controlador de dispositivos por reconocimiento de voz (CDRV)
(Universitat Politècnica de Catalunya, 2014-12-11)
Final degree project/thesis
Open access
CDRV (Controlador de dispositivos por reconocimiento de voz) is a device capable of controlling other devices by voice. Specifically, for this project, it has been adapted to control a reclining armchair.
-
Controlling 3D holographic contents by personal devices
(Universitat Politècnica de Catalunya, 2014-12)
Final degree project/thesis
Open access
Carried out at/with: Politecnico di Torino
Foremost, this project explains the different sensors of a personal device (e.g. smartphone). After that, this study shows how to interact with holographic scenes. These scenes have been created by Blender. Blender ...