Mostra el registre d'ítem simple

dc.contributor.authorYazdani, Reza
dc.contributor.authorArnau Montañés, José María
dc.contributor.authorGonzález Colás, Antonio María
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2020-01-20T16:18:47Z
dc.date.available2020-01-20T16:18:47Z
dc.date.issued2019-12-01
dc.identifier.citationYazdani, R.; Arnau, J.; Gonzalez, A. A low-power, high-performance speech recognition accelerator. "IEEE transactions on computers", 1 Desembre 2019, vol. 68, núm. 12, p. 1817-1831.
dc.identifier.issn0018-9340
dc.identifier.urihttp://hdl.handle.net/2117/175332
dc.description© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.description.abstractAutomatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. Hardware acceleration reduces energy-consumption of ASR systems, while delivering high-performance. In this paper, we present an accelerator for largevocabulary, speaker-independent, continuous speech-recognition. It focuses on the Viterbi search algorithm representing the main bottleneck in an ASR system. The proposed design consists of innovative techniques to improve the memory subsystem, since memory is the main bottleneck for performance and power in these accelerators' design. It includes a prefetching scheme tailored to the needs of ASR systems that hides main memory latency for a large fraction of the memory accesses, negligibly impacting area. Additionally, we introduce a novel bandwidth-saving technique that removes off-chip memory accesses by 20 percent. Finally, we present a power saving technique that significantly reduces the leakage power of the accelerators scratchpad memories, providing between 8.5 and 29.2 percent reduction in entire power dissipation. Overall, the proposed design outperforms implementations running on the CPU by orders of magnitude, and achieves speedups between 1.7x and 5.9x for different speech decoders over a highly optimized CUDA implementation running on Geforce-GTX-980 GPU, while reducing the energy by 123-454x.
dc.format.extent15 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcshAutomatic speech recognition
dc.subject.otherViterbi algorithm
dc.subject.otherSpeech recognition
dc.subject.otherGraphics processing units
dc.subject.otherAcoustics
dc.subject.otherCentral Processing Unit
dc.subject.otherHardware
dc.subject.otherDecoding
dc.subject.otherAutomatic Speech Recognition (ASR)
dc.subject.otherViterbi search
dc.subject.otherhardware accelerator
dc.subject.otherWFST
dc.subject.otherlow-power architecture
dc.titleA low-power, high-performance speech recognition accelerator
dc.typeArticle
dc.subject.lemacReconeixement automàtic de la parla
dc.contributor.groupUniversitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors
dc.identifier.doi10.1109/TC.2019.2937075
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/8812893
dc.rights.accessOpen Access
local.identifier.drac26417166
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/H2020/833057/EU/CoCoUnit: An Energy-Efficient Processing Unit for Cognitive Computing/CoCoUnit
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO/1PE/TIN2016-75344-R
local.citation.authorYazdani, R.; Arnau, J.; Gonzalez, A.
local.citation.publicationNameIEEE transactions on computers
local.citation.volume68
local.citation.number12
local.citation.startingPage1817
local.citation.endingPage1831


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple