Explicit segmentation of speech using gaussian models

Bonafonte Cávez, Antonio; Nogueiras Rodríguez, Albino; Rodriguez-Garrido, A

doi:10.1109/ICSLP.1996.607841

Visualitza/Obre

Explicit segmentation of speech using Gaussian models.pdf (428,3Kb) (Accés restringit) Sol·licita una còpia a l'autor

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Bonafonte Cávez, Antonio

Nogueiras Rodríguez, Albino

Rodriguez-Garrido, A

Tipus de documentText en actes de congrés

Data publicació1996

EditorH. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE

Condicions d'accésAccés restringit per política de l'editorial

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

The authors investigate an automatic method to segment labeled speech. The method needs an initial estimation of the segmentation which is provided by an alignment based on HMM. Afterwards, the boundaries are refined moving the frontier frames to the segment which is more similar to the speech frame. Gaussian PDFs are used as a similarity measure. The performance of the method is evaluated using the TIMIT database. If boundary deviations (from the reference position) larger than 20 ms are counted as errors, then the replacement of the boundaries reduces the error by 30%. Additional experiments show how the proposed method makes the performance independent of the speaker dependent or speaker independent data used to estimate the HMM.

CitacióBonafonte, A.; Nogueiras, A.; Rodriguez-Garrido, A. Explicit segmentation of speech using gaussian models. A: ICSLP'96. INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. "Proceedings Fourth International Conference on Spoken Language Processing". Philadelphia: H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996, p. 1269-1272.

URIhttp://hdl.handle.net/2117/15674

DOI10.1109/ICSLP.1996.607841

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
Explicit segmen ... using Gaussian models.pdf		428,3Kb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Explicit segmentation of speech using gaussian models

Visualitza/Obre

Explora