Universitat Politècnica de Catalunya

UPCommons. Global access to UPC knowledge

A generator of numerically-tailored and high-throughput accelerators for batched GEMMs

fccm_2022.pdf (2,730Mb)
Authors: Ledoux Pardo, Luis Eduardo; Casas Guix, Marc
Document type: Conference report
Defense date: 2022
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Rights access: Open Access
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder
Project: DEEP-SEA - DEEP – SOFTWARE FOR EXASCALE ARCHITECTURES (EC-H2020-955606)
Abstract
We propose a hardware generator of GEMM accelerators. Our generator produces vendor-agnostic HDL describing highly customizable systolic arrays guided by accuracy and energy efficiency goals. The generated arrays have three main novel aspects. First, the accelerators handle a large variety of computer number formats using intermediate representations based on our Sign Scale Significand (S3) format. Second, the processing elements perform all intermediate dot-product arithmetic operations required by the GEMM kernel without any intermediate rounding, which makes it possible to deliver better energy efficiency than state-of-the-art approaches while offering more accurate and reproducible results. Third, our accelerators feature the Half-Speed Sink Down (HSSD) mechanism, which maximizes the overlap of host-accelerator data transfers with GEMM computations.

We evaluate our automatically generated designs in a cutting-edge setup composed of a POWER9 host, a CAPI (Coherent Accelerator Processor Interface) link, and a Virtex Ultrascale Plus FPGA. Arrays can operate at the speed of the link and saturate it to reach a throughput of 13 GB/s. Our fine-grained customization approach covers a wide range of accuracy versus efficiency scenarios, reaching 0.65 GOps/s/W while producing 1024 accurate bits or 148.7 GOps/s/W with 6 accurate bits. Our configurations achieve up to 1613 GOps/s system performance and power efficiencies of up to 240 GOps/s/W for the FPGA. This automatic generator is the first able to produce such a variety of designs. We improve the single-precision energy efficiency of state-of-the-art FPGA GEMM accelerators by 1.86×.
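The idea of a dot product with no intermediate rounding can be illustrated in software. The following is only a minimal Python sketch, not the authors' hardware design: the abstract does not define the S3 encoding, so the `decompose` helper below is a hypothetical sign/scale/significand split of an IEEE 754 double, and `exact_dot` uses Python's arbitrary-precision `Fraction` as a stand-in for a wide accumulator. All function names are assumptions introduced for illustration.

```python
from fractions import Fraction
import math


def decompose(x):
    """Hypothetical split of a finite float into (sign, scale, significand)
    such that x == (-1)**sign * significand * 2**scale, with an integer
    significand. This is an illustration, not the paper's S3 encoding."""
    if not math.isfinite(x):
        raise ValueError("only finite values are supported")
    sign = 1 if math.copysign(1.0, x) < 0 else 0
    m, e = math.frexp(abs(x))       # abs(x) == m * 2**e, with 0.5 <= m < 1 or m == 0
    significand = int(m * 2 ** 53)  # exact: a double carries at most 53 significand bits
    return sign, e - 53, significand


def exact_dot(a, b):
    """Dot product accumulated exactly, rounded once at the very end."""
    acc = Fraction(0)
    for x, y in zip(a, b):
        sx, ex, mx = decompose(x)
        sy, ey, my = decompose(y)
        # Product of two triples: XOR the signs, add the scales,
        # multiply the integer significands. No rounding happens here.
        term = Fraction(mx * my) * Fraction(2) ** (ex + ey)
        acc += -term if sx ^ sy else term
    return float(acc)               # the only rounding step


def naive_dot(a, b):
    """Conventional floating-point dot product: rounds after every operation."""
    acc = 0.0
    for x, y in zip(a, b):
        acc += x * y
    return acc


if __name__ == "__main__":
    a = [1e16, 1.0, -1e16]
    b = [1.0, 1.0, 1.0]
    print(naive_dot(a, b))  # 0.0: the 1.0 is absorbed when added to 1e16
    print(exact_dot(a, b))  # 1.0: exact accumulation keeps it
```

Because the exact version rounds only once, its result does not depend on the order in which the terms are accumulated, which is one way to read the reproducibility claim in the abstract.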
Citation: Ledoux, L.; Casas, M. A generator of numerically-tailored and high-throughput accelerators for batched GEMMs. In: IEEE Symposium on Field Programmable Custom Computing Machines. "2022 IEEE 30th International Symposium on Field-Programmable Custom Computing Machines, FCCM 2022: 15-18 May, 2022, New York, NY, USA: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2022, ISBN 978-1-6654-8332-2. DOI 10.1109/FCCM53951.2022.9786164.
URI: http://hdl.handle.net/2117/368563
DOI: 10.1109/FCCM53951.2022.9786164
ISBN: 978-1-6654-8332-2
Publisher version: https://ieeexplore.ieee.org/document/9786164
Collections
  • Doctorat en Arquitectura de Computadors - Ponències/Comunicacions de congressos [196]
  • Computer Sciences - Ponències/Comunicacions de congressos [459]