A programmable accelerator for streaming automatic speech recognition on edge devices

Cite as:
hdl:2117/373474
Document type: Conference report
Defense date: 2022
Rights access: Open Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Project: CoCoUnit - CoCoUnit: An Energy-Efficient Processing Unit for Cognitive Computing (EC-H2020-833057)
ARQUITECTURAS DE DOMINIO ESPECIFICO PARA SISTEMAS DE COMPUTACION ENERGETICAMENTE EFICIENTES (AEI-PID2020-113172RB-I00)
Abstract
Automatic Speech Recognition (ASR) is quickly becoming a mainstream technology, driven mainly by the outstanding accuracy of modern systems based on machine learning. However, these systems often require billions of arithmetic operations to decode a second of audio, and relying on cloud services for ASR is usually inconvenient. Although deploying ASR systems directly on the edge is highly desirable, the requirements for high performance and low energy consumption, combined with the fast pace of evolution and the heterogeneity of existing ASR systems, make effective deployment on edge devices challenging. In this work, we propose a programmable accelerator that efficiently supports a variety of ASR implementations. We estimate the performance of our system by implementing a recently proposed streaming ASR system and show that it can perform real-time streaming decoding within a tight power budget and with a low area footprint, while remaining flexible enough to implement a variety of different models.
Citation: Pinto, D.; Arnau, J.; González, A. A programmable accelerator for streaming automatic speech recognition on edge devices. In: Workshop on Cognitive Architectures. "COGARCH 2022, Sixth Workshop on Cognitive Architectures: Data-secure AI and the rise of homomorphic encryption: April, 3rd 2022, Seoul, South Korea (virtual)". 2022.
Publisher version: https://cogarchworkshop.org/assets/pdfs/1.pdf
Files | Description | Size | Format |
---|---|---|---|
A Programmable Accelerator CogArch2022.pdf | | 161,8Kb | PDF |