A Light-Weight Vision Transformer Toward Near Memory Computation on an FPGA

Takeshi Senoo, Ryota Kayanoma, Akira Jinguji, Hiroki Nakahara

研究成果: 書籍の章/レポート/Proceedings会議への寄与査読

1 被引用数 (Scopus)

抄録

Computer Vision AI is making remarkable advances in image recognition, object detection, and segmentation tasks. However, the model size continuously expands, necessitating dedicated hardware acceleration for the real-time processing of these tasks on embedded systems. The Vision Transformer (ViT) is gaining attention as a new approach to replace Convolutional Neural Networks (CNN) in image recognition tasks. However, ViT, while achieving high recognition accuracy, requires a complex structure and many parameters, making it difficult to implement in real time. Near-memory computing allows faster processing by closely placing data processing and memory access together. We are optimizing ViT for near-memory computation. We design a distributed on-chip memory suitable for near-memory computing and a calculation flow that closely integrates with it on an FPGA. This allows us to achieve more real-time image AI processing with higher recognition accuracy. With ImageNet2012 test images, the recognition accuracy of LW-ViT was 78.38% in the Top-1 category and 94.12% in the Top-5 category. Our implementation was 1.6 times faster than an embedded GPU while maintaining the same recognition accuracy. Compared with other FPGA implementations, while achieving a real-time processing time of 29.97 fps for camera images, the recognition accuracy was 6.6–10.2 points higher. Therefore, our implementation is suitable for real-time image recognition with high recognition accuracy.

本文言語英語
ホスト出版物のタイトルApplied Reconfigurable Computing. Architectures, Tools, and Applications - 19th International Symposium, ARC 2023, Proceedings
編集者Francesca Palumbo, Georgios Keramidas, Nikolaos Voros, Pedro C. Diniz
出版社Springer Science and Business Media Deutschland GmbH
ページ338-353
ページ数16
ISBN(印刷版)9783031429200
DOI
出版ステータス出版済み - 2023
イベント19th International Symposium on Applied Reconfigurable Computing, ARC 2023 - Cottbus, ドイツ
継続期間: 2023 9月 272023 9月 29

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14251 LNCS
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

会議

会議19th International Symposium on Applied Reconfigurable Computing, ARC 2023
国/地域ドイツ
CityCottbus
Period23/9/2723/9/29

フィンガープリント

「A Light-Weight Vision Transformer Toward Near Memory Computation on an FPGA」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル