TY - JOUR
T1 - Design of an energy-efficient binarized convolutional neural network accelerator using a nonvolatile field-programmable gate array with only-once-write shifting
AU - Suzuki, Daisuke
AU - Oka, Takahiro
AU - Hanyu, Takahiro
N1 - Publisher Copyright:
© 2021 The Japan Society of Applied Physics.
PY - 2021/5
Y1 - 2021/5
N2 - This paper presents an energy-efficient hardware accelerator for binarized convolutional neural networks (BCNNs). In this BCNN accelerator, a data-shift operation becomes dominant to effectively control input/weight-data streams under limited memory bandwidth. A magnetic-tunnel-junction (MTJ)-based nonvolatile field-programmable gate array (NV-FPGA), where the amount of stored-data updating is minimized in a configurable logic block, is a well-suited hardware platform for implementing such a BCNN accelerator. Owing to the nonvolatile storage capability of the NV-FPGA, not only power consumption in the data-shift operation but also standby power consumption in the idle function block is reduced without losing internal data. It is demonstrated under 45 nm complementary metal-oxide-semiconductor/MTJ process technologies that the energy consumption of the proposed BCNN accelerator is 50.7% lower than that of a BCNN accelerator using a conventional static-random-access-memory-based FPGA.
AB - This paper presents an energy-efficient hardware accelerator for binarized convolutional neural networks (BCNNs). In this BCNN accelerator, a data-shift operation becomes dominant to effectively control input/weight-data streams under limited memory bandwidth. A magnetic-tunnel-junction (MTJ)-based nonvolatile field-programmable gate array (NV-FPGA), where the amount of stored-data updating is minimized in a configurable logic block, is a well-suited hardware platform for implementing such a BCNN accelerator. Owing to the nonvolatile storage capability of the NV-FPGA, not only power consumption in the data-shift operation but also standby power consumption in the idle function block is reduced without losing internal data. It is demonstrated under 45 nm complementary metal-oxide-semiconductor/MTJ process technologies that the energy consumption of the proposed BCNN accelerator is 50.7% lower than that of a BCNN accelerator using a conventional static-random-access-memory-based FPGA.
UR - http://www.scopus.com/inward/record.url?scp=85103852188&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103852188&partnerID=8YFLogxK
U2 - 10.35848/1347-4065/abe682
DO - 10.35848/1347-4065/abe682
M3 - Article
AN - SCOPUS:85103852188
SN - 0021-4922
VL - 60
JO - Japanese Journal of Applied Physics
JF - Japanese Journal of Applied Physics
IS - SB
M1 - SBBB07
ER -