Recent remarkable progress in computer systems and printing devices makes it easier to produce printed documents with various designs. Text characters are often printed on colored backgrounds, and sometimes on complex backgrounds. Some methods have been developed for character extraction from document images and scene images with complex backgrounds. However, those methods are designed to extract rather large characters, and often fails to extract small characters. This paper proposes a new method by which character patterns can be extracted from document images with complex background. The method is based on the local multilevel thresholding and pixel labeling, and the region growing. This framework is very useful for extracting character patterns from badly illuminated document images. The performance of extracting small character patterns has also been improved by suppressing the influence of mixed-color pixels around character edges.
|Number of pages||4|
|Journal||Proceedings - International Conference on Pattern Recognition|
|Publication status||Published - 2000|