Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition.
dc.contributor.author | Nadeu Camprubí, Climent |
dc.contributor.author | Pachès Leal, Pau |
dc.contributor.author | Juang, B H |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2017-03-09T15:10:54Z |
dc.date.available | 2017-03-09T15:10:54Z |
dc.date.issued | 1995 |
dc.identifier.citation | Nadeu, C., Paches, P., Juang, B. Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition. In: 4th European Conference on Speech Communication and Technology. "Proceedings of the 4th EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY". Madrid: ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM, 1995, p. 923-926. |
dc.identifier.isbn | 1018-4074 |
dc.identifier.uri | http://hdl.handle.net/2117/102224 |
dc.description.abstract | In this work, we show how speaker-independent CDHMM word recognition performance can be significantly improved for clean speech by filtering the time sequence of spectral parameters to enhance its time dynamics. Experimental results with the standard TI connected digits database show that the filter can achieve more than a 30% reduction in string recognition error. As shown in this paper, that improvement is partially due to the reduction in speaker variability obtained by attenuating the very low modulation frequencies. The widely used cepstral mean subtraction technique also improves the recognition rate, but it cannot achieve as noticeable an improvement as the parameter filter. In fact, the best results are obtained when the peak of the long-term spectrum of the filter output is at around 3 Hz, a frequency which corresponds to the average syllable rate of the employed database. |
dc.format.extent | 4 p. |
dc.language.iso | eng |
dc.publisher | ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | UPC subject areas::Telecommunications engineering |
dc.subject.lcsh | Telecommunication |
dc.title | Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition. |
dc.type | Conference report |
dc.subject.lemac | Telecommunication |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.description.peerreviewed | Peer Reviewed |
dc.rights.access | Open Access |
local.identifier.drac | 2415437 |
dc.description.version | Postprint (published version) |
local.citation.author | Nadeu, C.; Paches, P.; Juang, B. |
local.citation.contributor | 4th European Conference on Speech Communication and Technology |
local.citation.pubplace | Madrid |
local.citation.publicationName | Proceedings of the 4th EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY |
local.citation.startingPage | 923 |
local.citation.endingPage | 926 |
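
The abstract contrasts two operations on the time sequence of spectral parameters: cepstral mean subtraction, which removes only the constant (0 Hz) component of each parameter trajectory, and a filter applied along the time axis of each parameter. The following is a minimal sketch of both operations, not the filter used in the paper. The function names, the `Frames` layout, and the tap values in `HYPOTHETICAL_TAPS` are assumptions made for illustration; the record does not give the filter design. The taps are chosen only so that they sum to zero, which gives zero gain at 0 Hz.

```python
from typing import List, Sequence

# Assumed layout: frames[t][k] is spectral parameter k at frame t.
Frames = List[List[float]]


def cepstral_mean_subtraction(frames: Frames) -> Frames:
    """Subtract each parameter's mean over time (removes the 0 Hz component)."""
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[k] for f in frames) / n for k in range(dims)]
    return [[f[k] - means[k] for k in range(dims)] for f in frames]


def filter_trajectories(frames: Frames, taps: Sequence[float]) -> Frames:
    """Apply a causal FIR filter along the time axis of every parameter.

    Frames before the start of the sequence are treated as zeros.
    """
    dims = len(frames[0])
    out = []
    for t in range(len(frames)):
        row = []
        for k in range(dims):
            acc = 0.0
            for i, h in enumerate(taps):
                if t - i >= 0:
                    acc += h * frames[t - i][k]
            row.append(acc)
        out.append(row)
    return out


# Hypothetical taps (not from the paper): they sum to zero, so a constant
# trajectory is mapped to zero once the start-up transient has passed.
HYPOTHETICAL_TAPS = (0.5, 0.0, -0.5)
```

With these taps, a parameter that is constant over time is suppressed by both operations, while a parameter that changes from frame to frame is passed by the filter in proportion to its rate of change. Placing the peak of the filter response near a chosen modulation frequency would require a filter designed for the actual frame rate, which this sketch does not attempt.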