Temporal vectorization: a compiler approach to automatic multi-pumping

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

The multi-pumping resource sharing technique can overcome the limitations commonly found in single-clocked FPGA designs by allowing hardware components to operate at a higher clock frequency than the surrounding system. However, this optimization cannot be expressed in high levels of abstraction, such as HLS, requiring the use of hand-optimized RTL. In this paper we show how to leverage multiple clock domains for computational subdomains on reconfigurable devices through data movement analysis on high-level programs.We offer a novel view on multi-pumping as a compiler optimization - a superclass of traditional vectorization. As multiple data elements are fed and consumed, the computations are packed temporally rather than spatially. The optimization is applied automatically using an intermediate representation that maps high-level code to HLS. Internally, the optimization injects modules into the generated designs, incorporating RTL for finegrained control over the clock domains. We obtain a reduction of resource consumption by up to 50% on critical components and 23% on average. For scalable designs, this can enable further parallelism, increasing overall performance.

OriginalsprogEngelsk
TitelProceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2022
ForlagInstitute of Electrical and Electronics Engineers Inc.
Publikationsdato2022
Sider1-9
Artikelnummer85
ISBN (Elektronisk)9781450392174
DOI
StatusUdgivet - 2022
Begivenhed41st IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2022 - San Diego, USA
Varighed: 30 okt. 20224 nov. 2022

Konference

Konference41st IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2022
LandUSA
BySan Diego
Periode30/10/202204/11/2022
SponsorACM Special Interest Group on Design Automation (SIGDA), et al., IEEE Circuits and Systems Society (CASS), IEEE Council on Electronic Design Automation (CEDA), IEEE Electron Devices Society (EDS), International Conference on Computer-Aided Design (ICCAD)

Bibliografisk note

Funding Information:
This project received funding from the European Research Council (ERC) grant PSAP, grant agreement No. 101002047, and the European Union s Horizon Europe programme DEEP-SEA, grant agreement No. 955606

Funding Information:
This project received funding from the European Research Council (ERC) grant PSAP, grant agreement No. 101002047, and the European Union’s Horizon Europe programme DEEP-SEA, grant agreement No. 955606. C.J. was funded by the Innovation Fund Denmark (IFD) under File No. 8057-00012B, the IFD Grand Solutions project “Adaptive X-ray InSpection”. T.B.N. is supported by the Swiss National Science Foundation (Am-bizione Project #185778). We acknowledge the Xilinx University Program for providing access to the AMD Xilinx Heterogeneous Accelerated Compute Cluster (HACC) at ETH Zurich.

Publisher Copyright:
© 2022 Association for Computing Machinery.

ID: 336299632