TY - JOUR
T1 - Bandwidth compression of floating-point numerical data streams for FPGA-based high-performance computing
AU - Ueno, Tomohiro
AU - Sano, Kentaro
AU - Yamamoto, Satoru
N1 - Publisher Copyright:
©2017 ACM.
PY - 2017/5
Y1 - 2017/5
N2 - Although computational performance is often limited by insufficient bandwidth to/from an external memory, it is not easy to physically increase off-chip memory bandwidth. In this study, we propose a hardware-based bandwidth compression technique that can be applied to field-programmable gate array- (FPGA) based high-performance computation with a logically wider effective memory bandwidth. Our proposed hardware approach can boost the performance of FPGA-based stream computations by applying a data compression technique to effectively transfer more data streams. To apply this data compression technique to bandwidth compression via hardware, several requirements must first be satisfied, including an acceptable level of compression performance and a sufficiently small hardware footprint. Our proposed hardware-based bandwidth compressor utilizes an efficient prediction-based data compression algorithm. Moreover, we propose a multichannel serializer and deserializer that enable applications to use multiple channels of computational data with the bandwidth compression. The serializer encodes compressed data blocks of multiple channels into a data stream, which is efficiently written to an external memory. Based on preliminary evaluation, we define an encoding format considering both high compression ratio and small hardware area. As a result, we demonstrate that our area saving bandwidth compressor increases performance of an FPGA-based fluid dynamics simulation by deployingmore processing elements to exploit spatial parallelism with the enhanced memory bandwidth.
AB - Although computational performance is often limited by insufficient bandwidth to/from an external memory, it is not easy to physically increase off-chip memory bandwidth. In this study, we propose a hardware-based bandwidth compression technique that can be applied to field-programmable gate array- (FPGA) based high-performance computation with a logically wider effective memory bandwidth. Our proposed hardware approach can boost the performance of FPGA-based stream computations by applying a data compression technique to effectively transfer more data streams. To apply this data compression technique to bandwidth compression via hardware, several requirements must first be satisfied, including an acceptable level of compression performance and a sufficiently small hardware footprint. Our proposed hardware-based bandwidth compressor utilizes an efficient prediction-based data compression algorithm. Moreover, we propose a multichannel serializer and deserializer that enable applications to use multiple channels of computational data with the bandwidth compression. The serializer encodes compressed data blocks of multiple channels into a data stream, which is efficiently written to an external memory. Based on preliminary evaluation, we define an encoding format considering both high compression ratio and small hardware area. As a result, we demonstrate that our area saving bandwidth compressor increases performance of an FPGA-based fluid dynamics simulation by deployingmore processing elements to exploit spatial parallelism with the enhanced memory bandwidth.
KW - Data compression
KW - Dedicated circuit
KW - FPGA-based accelerator
KW - Memory bandwidth
KW - Numerical data stream
KW - Stream computing
UR - http://www.scopus.com/inward/record.url?scp=85020217765&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85020217765&partnerID=8YFLogxK
U2 - 10.1145/3053688
DO - 10.1145/3053688
M3 - Article
AN - SCOPUS:85020217765
SN - 1936-7406
VL - 10
JO - ACM Transactions on Reconfigurable Technology and Systems
JF - ACM Transactions on Reconfigurable Technology and Systems
IS - 3
M1 - 18
ER -