Implement ONNX AI kernels for semidynamics' RISC-V atrevido core

View/Open
Cita com:
hdl:2117/418201
Document typeMaster thesis
Date2024-07-01
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
Artificial Intelligence (AI) has become a fundamental element of contemporary technological progress, with Convolutional Neural Networks (CNNs) being a pivotal breakthrough in the field of image and pattern recognition, thanks to their ability to automatically learn spatial hierarchies of features from input data. In the context of AI and CNNs, interoperability and model optimization are crit- ical. The Open Neural Network Exchange Runtime (ONNX Runtime) provides a robust, open-source format that enables models to be used across vari- ous platforms seamlessly. ONNX Runtime facilitates the inference process, which is the use of pre-trained models with new input data, and optimizes the execution across different hardware, making AI more accessible and efficient. The convolution kernel, fundamental function to CNNs, involves the processing of input data through filters to extract features, and it is essential for tasks such as image classification. The conv operator in ONNX represents this operation, while QLinearConv extends it by incorporating quantization, which reduces model size and computational requirements without significantly compromising accuracy. In the pursuit of enhancing the computational efficiency of Convolutional Neu- ral Networks (CNNs), this thesis presents the implementation of the conv and QLinearConv operators from the Open Neural Network Exchange (ONNX), specif- ically optimized for Semidynamics' Atrevido core. Leveraging RISC-V vector instructions, the code has been optimized to significantly improve the inference time of CNN models.
SubjectsInference, Neural networks (Computer science), RISC microprocessors, Inferència, Xarxes neuronals (Informàtica), RISC (Microprocessadors)
DegreeMÀSTER UNIVERSITARI EN INNOVACIÓ I RECERCA EN INFORMÀTICA (Pla 2012)
Files | Description | Size | Format | View |
---|---|---|---|---|
188853.pdf | 2,951Mb | View/Open |