Fast and accurate tree-based clustering for Japanese/Chinese character recognition

Yuichi Abe, Takahiro Sasaki, Hideaki Goto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)


Recognizing text in natural scene images is very important to develop various systems such as an assistant device for visually-impaired people. Multilingual scene text recognition is also becoming important for wearable camera devices with language translation feature. Since computational resources are limited on such mobile devices, fast and accurate Optical Character Recognition (OCR) algorithm is needed. Nearest Neighbor (NN) search is quite popular in feature vector-based OCR systems, and its speed improvement is required. In this paper, we develop an OCR scheme with tree-based clustering technique with LDA (Linear Discriminant Analysis) aiming at real-time Japanese/Chinese character recognition. The experimental results using ETL9B dataset show that our proposed method is 94.6% faster than our previous method, also beating other techniques, at mere 0.24% accuracy drop from the full linear search.

Original languageEnglish
Title of host publicationImage Analysis and Processing, ICIAP 2013 - 17th International Conference, Proceedings
Number of pages10
EditionPART 2
Publication statusPublished - 2013
Event17th International Conference on Image Analysis and Processing, ICIAP 2013 - Naples, Italy
Duration: 2013 Sept 92013 Sept 13

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume8157 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Other17th International Conference on Image Analysis and Processing, ICIAP 2013


  • Approximate Nearest Neighbor (ANN) search
  • Fast Nearest Neighbor search
  • Linear Discriminant Analysis (LDA)
  • multilingual OCR
  • real-time character recognition

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)


Dive into the research topics of 'Fast and accurate tree-based clustering for Japanese/Chinese character recognition'. Together they form a unique fingerprint.

Cite this