We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
AKGF: Automatic Kernel Generation for DNN on CPU-FPGA.
- Authors
Dong, Dong; Jiang, Hongxu; Diao, Boyu
- Abstract
While tensor accelerated compilers have proven effective in deploying deep neural networks (DNN) on general-purpose hardware, optimizing for FPGA remains challenging due to the complex DNN architectures and the heterogeneous, semi-open compute units. This paper introduces the Automatic Kernel Generation for DNN on CPU-FPGA (AKGF) framework for efficient deployment of DNN on heterogeneous CPU-FPGA platforms. AKGF generates an intermediate representation (IR) of the DNN using TVM's Halide IR, annotates the operators of model layers in the IR to compute them on the corresponding hardware cores, and further optimizes the operator code for CPU and FPGA using ARM's function library and the polyhedral model to enhance model inference speed and power consumption. The experimental tests conducted on a CPU-FPGA board validate the effectiveness of AKGF, demonstrating significant acceleration ratios (up to 6.7x) compared to state-of-the-art accelerators while achieving a 2x power optimization. AKGF effectively leverages the computational capabilities of both CPU and FPGA for high-performance deployment of DNN on CPU-FPGA platforms.
- Subjects
ARTIFICIAL neural networks; FIELD programmable gate arrays; CENTRAL processing units; ELECTRIC power consumption; HETEROGENEOUS computing
- Publication
Computer Journal, 2024, Vol 67, Issue 5, p1619
- ISSN
0010-4620
- Publication type
Article
- DOI
10.1093/comjnl/bxad086