Mostra el registre d'ítem simple
Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR
dc.contributor.author | Segura Perales, Carlos |
dc.contributor.author | Abad, Alberto |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.author | Nadeu Camprubí, Climent |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2011-01-14T11:54:28Z |
dc.date.available | 2011-01-14T11:54:28Z |
dc.date.created | 2008 |
dc.date.issued | 2008 |
dc.identifier.citation | Segura, C. [et al.]. Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR. A: International Speech Communication Association. Conference. "9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION". Brisbane: 2008, p. 1325-1328. |
dc.identifier.isbn | 978-1-61567-378-0 |
dc.identifier.uri | http://hdl.handle.net/2117/11030 |
dc.description.abstract | This paper presents a novel approach to speaker orientation estimation in a SmartRoom environment equipped with multiple microphones. The ratio between the high and low band energies (HLBR) received at each microphone has been shown in our previous work to be a potentially approach to estimate the direction of the voice produced by a speaker. In this work, for each microphone pair, a smoothed CPS phase is obtained by a proper windowing of the main peak of the crosscorrelation sequence estimated with the GCC-PHAT method, and a HLBR is computed from the processed CPS. The proposed method keeps the computational simplicity of the HLBR algorithm while adding the robustness offered by the GCCPHAT technique. Experimental preliminary results were conducted over a database recorded purposely in the UPC Smart room, and over the CLEAR head pose database. The proposed method performs consistently better than other state-of-the-art techniques with both databases. |
dc.format.extent | 4 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació |
dc.subject.lcsh | High/Low Band Ratio |
dc.subject.lcsh | Speaker orientation |
dc.subject.lcsh | Natural language processing |
dc.subject.lcsh | Signal theory (Telecommunication) |
dc.title | Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR |
dc.type | Conference report |
dc.subject.lemac | Senyal, Teoria del (Telecomunicació) |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.relation.publisherversion | http://www.lsi.upc.edu/~nlp/papers/hernando_orient.pdf |
dc.rights.access | Open Access |
local.identifier.drac | 2544141 |
dc.description.version | Postprint (published version) |
local.citation.author | Segura, C.; Abad, A.; Hernando, J.; Nadeu, C. |
local.citation.contributor | International Speech Communication Association. Conference |
local.citation.pubplace | Brisbane |
local.citation.publicationName | 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION |
local.citation.startingPage | 1325 |
local.citation.endingPage | 1328 |