TY - GEN
T1 - Glyph-Based Data Augmentation for Accurate Kanji Character Recognition
AU - Ofusa, Kenichiro
AU - Miyazaki, Tomo
AU - Sugaya, Yoshihiro
AU - Omachi, Shinichiro
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - In this paper, we address a problem of data augmentation for character recognition. Particularly, we focus on incorporating variation in glyph into data augmentation of character images, which is a simple approach for data augmentation. Generally, existing methods increase data size by distorting images, whereas the proposed method applies noise injection into glyphs, resulting in data with radical variation in glyph. The proposed method exploits public database of glyphs for kanji and augments glyphs by injecting noise into glyphs. Then, we generate images of kanji automatically by deploying stroke images on the augmented glyphs. We carried out experiments for kanji character recognition using augmented data. The results show the effectiveness of the proposed method.
AB - In this paper, we address a problem of data augmentation for character recognition. Particularly, we focus on incorporating variation in glyph into data augmentation of character images, which is a simple approach for data augmentation. Generally, existing methods increase data size by distorting images, whereas the proposed method applies noise injection into glyphs, resulting in data with radical variation in glyph. The proposed method exploits public database of glyphs for kanji and augments glyphs by injecting noise into glyphs. Then, we generate images of kanji automatically by deploying stroke images on the augmented glyphs. We carried out experiments for kanji character recognition using augmented data. The results show the effectiveness of the proposed method.
KW - Character recognition
KW - Data augmentation
KW - Glyph
UR - http://www.scopus.com/inward/record.url?scp=85045215454&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85045215454&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2017.103
DO - 10.1109/ICDAR.2017.103
M3 - Conference contribution
AN - SCOPUS:85045215454
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 597
EP - 602
BT - Proceedings - 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017
PB - IEEE Computer Society
T2 - 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017
Y2 - 9 November 2017 through 15 November 2017
ER -