TY - JOUR
T1 - Robust and fast text‐line extraction using local linearity of the text‐line
AU - Goto, Hideaki
AU - Aso, Hirotomo
PY - 1995
Y1 - 1995
N2 - Text region extraction is a necessary process before character recognition is done for document images. This paper describes a new algorithm, Linear Segment Linking (LSL), for text‐line extraction from document images. The algorithm groups together the piecewise linear elements in the document images, which may be assumed to be text lines, and then extracts them from the images. The algorithm requires less knowledge about document structure and is robust for distortion of the image. The primitive rectangles are introduced for the intermediate representation of image. It is easier and faster to create them than the usual circumscribing rectangles. A method of splitting the bridges between neighboring text lines is proposed. Combining the bridge splitting process with the text line extraction, the locally touching text lines will be extracted as individual ones.
AB - Text region extraction is a necessary process before character recognition is done for document images. This paper describes a new algorithm, Linear Segment Linking (LSL), for text‐line extraction from document images. The algorithm groups together the piecewise linear elements in the document images, which may be assumed to be text lines, and then extracts them from the images. The algorithm requires less knowledge about document structure and is robust for distortion of the image. The primitive rectangles are introduced for the intermediate representation of image. It is easier and faster to create them than the usual circumscribing rectangles. A method of splitting the bridges between neighboring text lines is proposed. Combining the bridge splitting process with the text line extraction, the locally touching text lines will be extracted as individual ones.
KW - Linear segment linking
KW - bridge splitting
KW - document image analysis
KW - primitive rectangle
KW - text line extraction
UR - http://www.scopus.com/inward/record.url?scp=0029405235&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029405235&partnerID=8YFLogxK
U2 - 10.1002/scj.4690261303
DO - 10.1002/scj.4690261303
M3 - Article
AN - SCOPUS:0029405235
SN - 0882-1666
VL - 26
SP - 21
EP - 31
JO - Systems and Computers in Japan
JF - Systems and Computers in Japan
IS - 13
ER -