Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR
Document type: Conference proceedings text
Publication date: 2008
Access conditions: Open access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to existing legal exemptions, its reproduction,
distribution, public communication or transformation without the authorization of the rights holder is prohibited.
Abstract
This paper presents a novel approach to speaker orientation
estimation in a SmartRoom environment equipped with
multiple microphones. The ratio between the high and low
band energies (HLBR) received at each microphone has been
shown in our previous work to be a potential approach for estimating
the direction of the voice produced by a speaker. In this
work, for each microphone pair, a smoothed cross-power spectrum (CPS) phase is obtained
by a proper windowing of the main peak of the crosscorrelation
sequence estimated with the GCC-PHAT method,
and a HLBR is computed from the processed CPS. The proposed
method keeps the computational simplicity of the HLBR
algorithm while adding the robustness offered by the GCC-PHAT
technique. Preliminary experiments were conducted
on a database recorded for this purpose in the UPC Smart
room, and on the CLEAR head pose database. The proposed
method performs consistently better than other state-of-the-art
techniques with both databases.
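
The processing chain described in the abstract can be illustrated with a minimal sketch for a single microphone pair. This is not the authors' implementation: the function name `peak_windowed_hlbr`, the Hann window shape, the window half-width, the 3500 Hz band split, and the use of squared magnitude as band energy are all assumptions made for illustration, since the abstract does not state them.

```python
import numpy as np


def peak_windowed_hlbr(x1, x2, fs, half_width=8, split_hz=3500.0):
    """Sketch: GCC-PHAT, window the main peak, then a high/low band ratio.

    half_width (samples) and split_hz are hypothetical values,
    not parameters taken from the paper.
    """
    n_fft = 2 * max(len(x1), len(x2))  # zero-pad to avoid circular wrap-around

    # GCC-PHAT: keep only the phase of the cross-power spectrum (CPS).
    cps = np.fft.rfft(x1, n_fft) * np.conj(np.fft.rfft(x2, n_fft))
    cps /= np.maximum(np.abs(cps), 1e-12)
    cc = np.fft.irfft(cps, n_fft)

    # Locate the main peak and convert its index to a signed lag in samples.
    peak = int(np.argmax(cc))
    lag = peak if peak < n_fft // 2 else peak - n_fft

    # Keep only a short window around the main peak (assumed Hann shape).
    offsets = np.arange(-half_width, half_width + 1)
    idx = (peak + offsets) % n_fft
    window = np.hanning(2 * half_width + 3)[1:-1]  # drop the zero end points
    windowed = np.zeros_like(cc)
    windowed[idx] = cc[idx] * window

    # Back to the frequency domain: a smoothed CPS for this pair.
    smoothed = np.fft.rfft(windowed)
    energy = np.abs(smoothed) ** 2  # assumed definition of band energy
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)

    high = energy[freqs >= split_hz].sum()
    low = energy[(freqs > 0) & (freqs < split_hz)].sum()
    return high / max(low, 1e-12), lag
```

Calling `peak_windowed_hlbr(x1, x2, fs=16000.0)` on two equal-length frames returns the band ratio and the estimated lag in samples. With this sign convention, a negative lag means the second signal is a delayed copy of the first. How the per-pair ratios are combined into an orientation estimate is not described in the abstract and is left out of the sketch.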
Citation: Segura, C. [et al.]. Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR. In: International Speech Communication Association. Conference. "9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION". Brisbane: 2008, p. 1325-1328.
ISBN: 978-1-61567-378-0
Publisher's version: http://www.lsi.upc.edu/~nlp/papers/hernando_orient.pdf
Files | Description | Size | Format | View |
---|---|---|---|---|
87.pdf | Main article | 285.9 KB | PDF | View/Open |