Mostra el registre d'ítem simple
Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR
dc.contributor.author | Macho, D |
dc.contributor.author | Nadeu Camprubí, Climent |
dc.contributor.author | Jancovic, P |
dc.contributor.author | Rozinaj, G |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2016-07-14T08:21:23Z |
dc.date.available | 2016-07-14T08:21:23Z |
dc.date.issued | 1999 |
dc.identifier.citation | Macho, D., Nadeu, C., Jancovic, P., Rozinaj, G., Hernando, J. Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR. A: European Conference on Speech Communication and Technology. "EUROSPEECH '99: 6th European Conference on Speech Communication and Technology: September 5-9, 1999: Budapest, Hungary". Budapest: 1999, p. 77-80. |
dc.identifier.uri | http://hdl.handle.net/2117/88763 |
dc.description.abstract | In current speech recognition systems, speech is represented by a 2-D sequence of parameters that model the temporal evolution of the spectral envelope of speech. Linear transformation or filtering along both time and frequency axes of that 2-D sequence are used to enhance the discriminative ability and robustness of speech parameters in the HMM pattern-matching formalism. In this paper, we compared two recently reported approaches which operate on the sequence of logarithmically compressed mel-scaled filter-bank energies: the first approach - TIFFING (TIme and Frequency FilterING) - applies FIR filters to that 2-D sequence along both axes, while the second one - CTM (Cepstral Time Matrix) - uses the DCT to compute a set of parameters in the 2-D transformed domain. They are compared in several ways: (1) analytically, using Fourier transformation, (2) statistically and (3) performing recognition tests with clean and noisy speech. |
dc.format.extent | 4 p. |
dc.language.iso | eng |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic |
dc.subject.lcsh | Speech processing systems |
dc.subject.lcsh | Filters and filtration |
dc.title | Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR |
dc.type | Conference report |
dc.subject.lemac | Processament de la parla |
dc.subject.lemac | Filtres i filtració |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.description.peerreviewed | Peer Reviewed |
dc.rights.access | Open Access |
local.identifier.drac | 2346627 |
dc.description.version | Postprint (published version) |
local.citation.author | Macho, D.; Nadeu, C.; Jancovic, P.; Rozinaj, G.; Hernando, J. |
local.citation.contributor | European Conference on Speech Communication and Technology |
local.citation.pubplace | Budapest |
local.citation.publicationName | EUROSPEECH '99: 6th European Conference on Speech Communication and Technology: September 5-9, 1999: Budapest, Hungary |
local.citation.startingPage | 77 |
local.citation.endingPage | 80 |