Paper
24 March 2014 Video text localization using wavelet and shearlet transforms
Purnendu Banerjee, B. B. Chaudhuri
Author Affiliations +
Proceedings Volume 9021, Document Recognition and Retrieval XXI; 90210B (2014) https://doi.org/10.1117/12.2036077
Event: IS&T/SPIE Electronic Imaging, 2014, San Francisco, California, United States
Abstract
Text in video is useful and important in indexing and retrieving the video documents efficiently and accurately. In this paper, we present a new method of text detection using a combined dictionary consisting of wavelets and a recently introduced transform called shearlets. Wavelets provide optimally sparse expansion for point-like structures and shearlets provide optimally sparse expansions for curve-like structures. By combining these two features we have computed a high frequency sub-band to brighten the text part. Then K-means clustering is used for obtaining text pixels from the Standard Deviation (SD) of combined coefficient of wavelets and shearlets as well as the union of wavelets and shearlets features. Text parts are obtained by grouping neighboring regions based on geometric properties of the classified output frame of unsupervised K-means classification. The proposed method tested on a standard as well as newly collected database shows to be superior to some of the existing methods.
© (2014) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Purnendu Banerjee and B. B. Chaudhuri "Video text localization using wavelet and shearlet transforms", Proc. SPIE 9021, Document Recognition and Retrieval XXI, 90210B (24 March 2014); https://doi.org/10.1117/12.2036077
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Wavelets

Transform theory

Video surveillance

Databases

Associative arrays

Visualization

RELATED CONTENT

Image and video indexing in the compressed domain
Proceedings of SPIE (October 06 1997)
Text-based search of TV news stories
Proceedings of SPIE (November 01 1996)
Mosaic-based video compression
Proceedings of SPIE (April 17 1995)

Back to Top