dc.contributor.author: Bonafonte Cávez, Antonio
dc.contributor.author: Nogueiras Rodríguez, Albino
dc.contributor.author: Rodriguez-Garrido, A
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned: 2012-03-27T16:44:00Z
dc.date.available: 2012-03-27T16:44:00Z
dc.date.created: 1996
dc.date.issued: 1996
dc.identifier.citation: Bonafonte, A.; Nogueiras, A.; Rodriguez-Garrido, A. Explicit segmentation of speech using gaussian models. In: ICSLP'96. INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. "Proceedings Fourth International Conference on Spoken Language Processing". Philadelphia: H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996, p. 1269-1272.
dc.identifier.uri: http://hdl.handle.net/2117/15674
dc.description.abstract: The authors investigate an automatic method for segmenting labeled speech. The method needs an initial estimate of the segmentation, which is provided by an HMM-based alignment. The boundaries are then refined by moving each frontier frame to the adjacent segment to which it is more similar. Gaussian PDFs are used as the similarity measure. The performance of the method is evaluated on the TIMIT database. If boundary deviations (from the reference position) larger than 20 ms are counted as errors, the repositioning of the boundaries reduces the error by 30%. Additional experiments show that the proposed method makes the performance independent of whether speaker-dependent or speaker-independent data are used to estimate the HMM.
dc.format.extent: 4 p.
dc.language.iso: eng
dc.publisher: H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE
dc.subject: Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subject.lcsh: Natural language processing (Computer science)
dc.title: Explicit segmentation of speech using gaussian models
dc.type: Conference report
dc.subject.lemac: Processament en llenguatge natural (Informàtica)
dc.contributor.group: Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.identifier.doi: 10.1109/ICSLP.1996.607841
dc.description.peerreviewed: Peer Reviewed
dc.rights.access: Restricted access - publisher's policy
drac.iddocument: 2414255
dc.description.version: Postprint (published version)
upcommons.citation.author: Bonafonte, A.; Nogueiras, A.; Rodriguez-Garrido, A.
upcommons.citation.contributor: ICSLP'96. INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING
upcommons.citation.pubplace: Philadelphia
upcommons.citation.published: true
upcommons.citation.publicationName: Proceedings Fourth International Conference on Spoken Language Processing
upcommons.citation.startingPage: 1269
upcommons.citation.endingPage: 1272
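
Illustrative note on the abstract: the boundary refinement it describes can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation. The record does not give the feature set, the covariance structure, or the stopping rule, so the diagonal-covariance Gaussians, the variance floor, the iteration cap, and the names fit_gaussian, log_pdf, and refine_boundary are all hypothetical choices made for illustration. The initial boundary is assumed to come from an HMM alignment, which is not reproduced here.

```python
import math

VAR_FLOOR = 1e-3  # assumed variance floor; keeps the log-density finite


def fit_gaussian(frames):
    """Estimate a per-dimension (mean, variance) pair from a list of frames."""
    n = len(frames)
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, VAR_FLOOR)
           for d in range(dim)]
    return mean, var


def log_pdf(frame, model):
    """Log-density of one frame under a diagonal-covariance Gaussian."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
               for x, m, v in zip(frame, mean, var))


def refine_boundary(frames, start, boundary, end, max_iter=50):
    """Refine the boundary between frames[start:boundary] and frames[boundary:end].

    At each step the frame on either side of the boundary is reassigned to
    the neighbouring segment if that segment's Gaussian scores it higher.
    Both Gaussians are re-estimated after every move (an assumption).
    """
    for _ in range(max_iter):
        left = fit_gaussian(frames[start:boundary])
        right = fit_gaussian(frames[boundary:end])
        last_left = frames[boundary - 1]
        first_right = frames[boundary]
        if (boundary - start > 1
                and log_pdf(last_left, right) > log_pdf(last_left, left)):
            boundary -= 1  # frontier frame fits the right segment better
        elif (end - boundary > 1
                and log_pdf(first_right, left) > log_pdf(first_right, right)):
            boundary += 1  # frontier frame fits the left segment better
        else:
            break  # neither frontier frame prefers the other segment
    return boundary


# Toy one-dimensional data with a change point between frames 5 and 6.
frames = [[0.0], [0.1], [-0.1], [0.05], [-0.05], [0.0],
          [5.0], [5.1], [4.9], [5.05], [4.95], [5.0]]
refined = refine_boundary(frames, start=0, boundary=4, end=len(frames))
```

On the toy data, an initial boundary placed too early or too late should drift toward the change point. Converting frame indices to milliseconds, for instance to apply the 20 ms tolerance mentioned in the abstract, would need the frame shift, which the record does not state.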



All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder.