Paper
7 January 1999 Text segmentation for automatic document processing
Dinesh P. Mital, Wee Leng Goh
Author Affiliations +
Proceedings Volume 3651, Document Recognition and Retrieval VI; (1999) https://doi.org/10.1117/12.335819
Event: Electronic Imaging '99, 1999, San Jose, CA, United States
Abstract
There is a considerable interest in designing automatic systems that can scan a given paper document and store it on electronic media for easier storage, manipulation and access. Most documents contain graphics and images, in addition to text. Thus, the document image has to be segmented to identify text and image regions, so that appropriate techniques may be applied to those regions. In this paper, we have presented a new technique for image segmentation in which text and image regions, in a given document image, are automatically identified. The technique is based on the differential processing text extraction concept. The proposed technique is capable of analyzing complex document image layouts. The document image is processed by using textural feature analysis. Results of the proposed method are presented with test images which demonstrate the robustness of the technique.
© (1999) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Dinesh P. Mital and Wee Leng Goh "Text segmentation for automatic document processing", Proc. SPIE 3651, Document Recognition and Retrieval VI, (7 January 1999); https://doi.org/10.1117/12.335819
Lens.org Logo
CITATIONS
Cited by 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Image processing

Image processing algorithms and systems

Algorithm development

Detection and tracking algorithms

Optical character recognition

Visualization

RELATED CONTENT

Non-Manhattan layout extraction algorithm
Proceedings of SPIE (March 21 2013)
Archiving of line-drawing images
Proceedings of SPIE (November 21 1995)
Machine-printed Arabic OCR
Proceedings of SPIE (February 25 1994)
Benchmarking of document page segmentation
Proceedings of SPIE (December 22 1999)

Back to Top