TY - JOUR
T1 - A free-extendible and ultralow-power nonvolatile multi-core associative coprocessor based on MRAM with inter-core pipeline scheme for large-scale full-adaptive nearest pattern searching
AU - Ma, Yitao
AU - Miura, Sadahiko
AU - Honjo, Hiroaki
AU - Ikeda, Shoji
AU - Endoh, Tetsuo
N1 - Publisher Copyright:
© 2020 The Japan Society of Applied Physics.
PY - 2020/4/1
Y1 - 2020/4/1
N2 - A nonvolatile multi-core associative coprocessor based on the MRAM is developed with open-end design, which achieves the higher power efficiency maintaining the high circuit density, full adaptivity and high-speed at the same time. This proposed coprocessor is proposed applicable for searching nearest pattern from the large-scale normal/pre-clustered data sets. An inter-core pipeline operation scheme is implemented, which absorbs the delay of in-core time-domain minimum searching to ensure the high-speed and flexibly extends the coprocessor by increasing the core count. A self-optimized power gating scheme is also employed, which minimizes the operation power and further reduces it when the cluster number (K) becomes larger. The prototype chip with 12-core is designed and fabricated under 90 nm CMOS/70 nm perpendicular-MTJ hybrid technology, and the chip operation at 100 MHz is demonstrated by measurement results. The average operation power is only 68 μW @K = 24, and more than 40-time higher power efficiency is achieved comparing to the latest conventional researches.
AB - A nonvolatile multi-core associative coprocessor based on the MRAM is developed with open-end design, which achieves the higher power efficiency maintaining the high circuit density, full adaptivity and high-speed at the same time. This proposed coprocessor is proposed applicable for searching nearest pattern from the large-scale normal/pre-clustered data sets. An inter-core pipeline operation scheme is implemented, which absorbs the delay of in-core time-domain minimum searching to ensure the high-speed and flexibly extends the coprocessor by increasing the core count. A self-optimized power gating scheme is also employed, which minimizes the operation power and further reduces it when the cluster number (K) becomes larger. The prototype chip with 12-core is designed and fabricated under 90 nm CMOS/70 nm perpendicular-MTJ hybrid technology, and the chip operation at 100 MHz is demonstrated by measurement results. The average operation power is only 68 μW @K = 24, and more than 40-time higher power efficiency is achieved comparing to the latest conventional researches.
UR - http://www.scopus.com/inward/record.url?scp=85083334065&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85083334065&partnerID=8YFLogxK
U2 - 10.35848/1347-4065/ab72d0
DO - 10.35848/1347-4065/ab72d0
M3 - Article
AN - SCOPUS:85083334065
SN - 0021-4922
VL - 59
JO - Japanese Journal of Applied Physics, Part 1: Regular Papers & Short Notes
JF - Japanese Journal of Applied Physics, Part 1: Regular Papers & Short Notes
IS - SG
M1 - SGGB18
ER -