Document Imaging and Stenography

Low-complexity comprehensive labeling and enhancement algorithm for compound documents

Author Affiliations
Onur G. Guleryuz

DoCoMo Communications Laboratories USA, Inc., 181 Metro Drive Ste 300, San Jose, California 95110 E-mail: guleryuz@docomolabs-usa.com

J. Electron. Imaging. 13(4), 832-859 (Oct 01, 2004). doi:10.1117/1.1790509
History: Received Mar. 12, 2003; Revised Feb. 24, 2004; Accepted Mar. 3, 2004; Online September 30, 2004

We present a multiresolutional algorithm that segments a compound document and uses the results of the segmentation for document enhancement in copier applications. The document is initially segmented into halftone and nonhalftone areas. Based on this segmentation, the locations of edges due to text, graphics, and images (and not due to halftone dots) are detected on halftone as well as nonhalftone portions. We further detect constant-tone regions within nonhalftone areas for subsequent bleed-through removal applications. Edge enhancement is carried out on detected edges and descreening on detected halftones. The algorithm can detect general halftones over regions of arbitrary sizes and shapes, and it can be straightforwardly adjusted for operation at various dpi resolutions. We obtain high detection probabilities on compound multilingual documents containing halftones and fine text. The proposed enhancement stage is tolerant of segmentation errors, providing robust performance for the remaining problem cases. Our main contribution is the accomplishment of these tasks with a single-pass algorithm that is computationally very simple and that requires less than 1% of full page memory, with active memory requirements of less than 0.02% of full page memory. The operation of the algorithm can be imagined as a very thin line (about as thick as a full stop in 11 pt text) that rapidly scans an input page while simultaneously producing an output page. © 2004 SPIE and IS&T.
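To give a sense of scale for the memory figures quoted in the abstract, the following is a minimal arithmetic sketch, not taken from the paper. The page size (US Letter), the 600 dpi resolution, the 3 bytes per pixel, and the function name `page_memory_budget` are all assumptions chosen for illustration; the abstract states only the percentages.

```python
def page_memory_budget(width_in=8.5, height_in=11.0, dpi=600, bytes_per_pixel=3):
    """Convert the abstract's memory percentages into bytes.

    All defaults are illustrative assumptions (US Letter page, 600 dpi,
    24-bit color); the abstract gives only the 1% and 0.02% bounds.
    """
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    row_bytes = width_px * bytes_per_pixel      # one scan line of the page
    full_page_bytes = row_bytes * height_px     # uncompressed full page

    # Upper bounds stated in the abstract, computed with integer arithmetic.
    total_cap_bytes = full_page_bytes // 100            # less than 1%
    active_cap_bytes = full_page_bytes * 2 // 10_000    # less than 0.02%

    return {
        "row_bytes": row_bytes,
        "full_page_bytes": full_page_bytes,
        "total_cap_bytes": total_cap_bytes,
        "active_cap_bytes": active_cap_bytes,
        # The same caps expressed as a number of full-width scan lines.
        "total_cap_rows": total_cap_bytes / row_bytes,
        "active_cap_rows": active_cap_bytes / row_bytes,
    }


if __name__ == "__main__":
    for name, value in page_memory_budget().items():
        print(f"{name}: {value}")
```

Under these assumed settings the full page occupies about 101 MB, so the 1% bound corresponds to roughly 1 MB and the 0.02% bound to roughly 20 KB. Changing `dpi` or `bytes_per_pixel` shows how the absolute budget would scale at other scanner settings.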


Citation

Onur G. Guleryuz
"Low-complexity comprehensive labeling and enhancement algorithm for compound documents", J. Electron. Imaging. 13(4), 832-859 (Oct 01, 2004); http://dx.doi.org/10.1117/1.1790509

