TY - GEN
T1 - Fast and accurate candidate reduction using the multiclass LDA for Japanese/Chinese character recognition
AU - Odate, Ryosuke
AU - Goto, Hideaki
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/12/9
Y1 - 2015/12/9
N2 - Acceleration of Optical Character Recognition (OCR) algorithms is quite important for developing real-time applications on mobile devices with limited computational performances. Multilingual scene text recognition is becoming more important for mobile and wearable devices. Since Japanese and Chinese have thousands of characters, a fast and accurate character recognition algorithm is required. We developed and proposed a tree-based clustering technique combined with Linear Discriminant Analysis (LDA), and it worked fine with ETL9B dataset consisting of Japanese handwritten characters. However, a significant performance degradation with HCL2000 Chinese handwritten character dataset was found. In this paper, we formalize the candidate reduction technique for the Nearest Neighbor (NN) problems, and propose an improved method that works fine with both Japanese and Chinese character sets. Experimental results show that our method is faster and more accurate than the existing acceleration techniques such as Approximate Nearest Neighbor (ANN) search and Locality Sensitive Hashing (LSH).
AB - Acceleration of Optical Character Recognition (OCR) algorithms is quite important for developing real-time applications on mobile devices with limited computational performances. Multilingual scene text recognition is becoming more important for mobile and wearable devices. Since Japanese and Chinese have thousands of characters, a fast and accurate character recognition algorithm is required. We developed and proposed a tree-based clustering technique combined with Linear Discriminant Analysis (LDA), and it worked fine with ETL9B dataset consisting of Japanese handwritten characters. However, a significant performance degradation with HCL2000 Chinese handwritten character dataset was found. In this paper, we formalize the candidate reduction technique for the Nearest Neighbor (NN) problems, and propose an improved method that works fine with both Japanese and Chinese character sets. Experimental results show that our method is faster and more accurate than the existing acceleration techniques such as Approximate Nearest Neighbor (ANN) search and Locality Sensitive Hashing (LSH).
KW - Approximate Nearest Neighbor (ANN) search
KW - Fast Nearest Neighbor search
KW - Linear Discriminant Analysis (LDA)
KW - multilingual OCR
KW - real-time character recognition
UR - http://www.scopus.com/inward/record.url?scp=84956698507&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84956698507&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2015.7350940
DO - 10.1109/ICIP.2015.7350940
M3 - Conference contribution
AN - SCOPUS:84956698507
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 951
EP - 955
BT - 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings
PB - IEEE Computer Society
T2 - IEEE International Conference on Image Processing, ICIP 2015
Y2 - 27 September 2015 through 30 September 2015
ER -