Mostra el registre d'ítem simple
Speech-based emotion classification using multiclass SVM with hybrid kernel and thresholding fusion
dc.contributor.author | Yang, N. |
dc.contributor.author | Muraleedharan, Rajani |
dc.contributor.author | Khol, J. |
dc.contributor.author | Demirkol, Ilker Seyfettin |
dc.contributor.author | Heinzelman, Wendi |
dc.contributor.author | Sturge Apple, Melissa L. |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Enginyeria Telemàtica |
dc.date.accessioned | 2014-04-10T14:27:14Z |
dc.date.created | 2012 |
dc.date.issued | 2012 |
dc.identifier.citation | Yang, N. [et al.]. Speech-based emotion classification using multiclass SVM with hybrid kernel and thresholding fusion. A: IEEE Workshop on Spoken Language Technology. "2012 IEEE Workshop on Spoken Language Technology: SLT 2012: proceedings: December 2-5, 2012: Miami, Florida, USA". Miami, FL: 2012, p. 455-460. |
dc.identifier.isbn | 978-146735126-3 |
dc.identifier.uri | http://hdl.handle.net/2117/22610 |
dc.description.abstract | Emotion classification is essential for understanding human interactions and hence is a vital component of behavioral studies. Although numerous algorithms have been developed, the emotion classification accuracy is still short of what is desired for the algorithms to be used in real systems. In this paper, we evaluate an approach where basic acoustic features are extracted from speech samples, and the One-Against-All (OAA) Support Vector Machine (SVM) learning algorithm is used. We use a novel hybrid kernel, where we choose the optimal kernel functions for the individual OAA classifiers. Outputs from the OAA classifiers are normalized and combined using a thresholding fusion mechanism to finally classify the emotion. Samples with low ‘relative confidence’ are left as ‘unclassified’ to further improve the classification accuracy. Results show that the decision-level recall of our approach for six-class emotion classification is 80.5%, outperforming a state-of-the-art approach that uses the same dataset. |
dc.format.extent | 6 p. |
dc.language.iso | eng |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic |
dc.subject | Àrees temàtiques de la UPC::Informàtica |
dc.subject.lcsh | Automatic speech recognition |
dc.subject.other | Emotion classification |
dc.subject.other | Hybrid kernel |
dc.subject.other | Speaker independent |
dc.subject.other | Support vector machine |
dc.subject.other | Thresholding fusion |
dc.title | Speech-based emotion classification using multiclass SVM with hybrid kernel and thresholding fusion |
dc.type | Conference report |
dc.subject.lemac | Reconeixement automàtic de la parla |
dc.identifier.doi | 10.1109/SLT.2012.6424267 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6424267&queryText%3DSpeech-based+emotion+classification+using+multiclass+SVM+with+hybrid+kernel+and+thresholding+fusion |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 11888049 |
dc.description.version | Postprint (published version) |
dc.date.lift | 10000-01-01 |
local.citation.author | Yang, N.; Muraleedharan, R.; Khol, J.; Demirkol, I.; Heinzelman, W.; Sturge-Apple, M. |
local.citation.contributor | IEEE Workshop on Spoken Language Technology |
local.citation.pubplace | Miami, FL |
local.citation.publicationName | 2012 IEEE Workshop on Spoken Language Technology: SLT 2012: proceedings: December 2-5, 2012: Miami, Florida, USA |
local.citation.startingPage | 455 |
local.citation.endingPage | 460 |