TY - JOUR
T1 - Design of high-performance asynchronous pipeline using synchronizing logic gates
AU - Xia, Zhengfan
AU - Ishihara, Shota
AU - Hariyama, Masanori
AU - Kameyama, Michitaka
PY - 2012/8
Y1 - 2012/8
N2 - This paper introduces a novel design method of an asynchronous pipeline based on dual-rail dynamic logic. The overhead of handshake control logic is greatly reduced by constructing a reliable critical datapath, which offers the pipeline high throughput as well as low power consumption. Synchronizing Logic Gates (SLGs), which have no data dependency problem, are used in the design to construct the reliable critical datapath. The design targets latch-free and extremely fine-grain or gatelevel pipeline, where the depth of every pipeline stage is only one dual-rail dynamic logic. HSPICE simulation results, in a 65 nm design technology, indicate that the proposed design increases the throughput by 120% and decreases the power consumption by 54% compared with PS0, a classic dual-rail asynchronous pipeline implementation style, in 4-bit wide FIFOs. Moreover, this method is applied to design an array style multiplier. It shows that the proposed design reduces power by 37.9% compared to classic synchronous design when the workloads are 55%. A chip has been fabricated with a 4×4 multiplier function, which works well at 2.16G dataset/ s (Post-layout simulation).
AB - This paper introduces a novel design method of an asynchronous pipeline based on dual-rail dynamic logic. The overhead of handshake control logic is greatly reduced by constructing a reliable critical datapath, which offers the pipeline high throughput as well as low power consumption. Synchronizing Logic Gates (SLGs), which have no data dependency problem, are used in the design to construct the reliable critical datapath. The design targets latch-free and extremely fine-grain or gatelevel pipeline, where the depth of every pipeline stage is only one dual-rail dynamic logic. HSPICE simulation results, in a 65 nm design technology, indicate that the proposed design increases the throughput by 120% and decreases the power consumption by 54% compared with PS0, a classic dual-rail asynchronous pipeline implementation style, in 4-bit wide FIFOs. Moreover, this method is applied to design an array style multiplier. It shows that the proposed design reduces power by 37.9% compared to classic synchronous design when the workloads are 55%. A chip has been fabricated with a 4×4 multiplier function, which works well at 2.16G dataset/ s (Post-layout simulation).
KW - Asynchronous pipeline
KW - Critical datapath
KW - Dual-rail
UR - http://www.scopus.com/inward/record.url?scp=84864536164&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864536164&partnerID=8YFLogxK
U2 - 10.1587/transele.E95.C.1434
DO - 10.1587/transele.E95.C.1434
M3 - Article
AN - SCOPUS:84864536164
SN - 0916-8524
VL - E95-C
SP - 1434
EP - 1443
JO - IEICE Transactions on Electronics
JF - IEICE Transactions on Electronics
IS - 8
ER -