Regular Articles

Video genre categorization and representation using audio-visual information

[+] Author Affiliations
Bogdan Ionescu

University Politehnica of Bucharest, The Image Processing and Analysis Laboratory, Bucharest, 061071, Romania

University of Savoie, LISTIC, Polytech Annecy-Chambery, Annecy, 74944, France

Klaus Seyerlehner

Johannes Kepler University, Department of Computational Perception, Linz, A-4040, Austria

Christoph Rasche

University Politehnica of Bucharest, The Image Processing and Analysis Laboratory, Bucharest, 061071, Romania

Constantin Vertan

University Politehnica of Bucharest, The Image Processing and Analysis Laboratory, Bucharest, 061071, Romania

Patrick Lambert

University of Savoie, LISTIC, Polytech Annecy-Chambery, Annecy, 74944, France

J. Electron. Imaging. 21(2), 023017 (Jun 22, 2012). doi:10.1117/1.JEI.21.2.023017
History: Received October 16, 2011; Revised February 26, 2012; Accepted April 10, 2012
Text Size: A A A

Abstract.  We propose an audio-visual approach to video genre classification using content descriptors that exploit audio, color, temporal, and contour information. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At the temporal structure level, we consider action content in relation to human perception. Color perception is quantified using statistics of color distribution, elementary hues, color properties, and relationships between colors. Further, we compute statistics of contour geometry and relationships. The main contribution of our work lies in harnessing the descriptive power of the combination of these descriptors in genre classification. Validation was carried out on over 91 h of video footage encompassing 7 common video genres, yielding average precision and recall ratios of 87% to 100% and 77% to 100%, respectively, and an overall average correct classification of up to 97%. Also, experimental comparison as part of the MediaEval 2011 benchmarking campaign demonstrated the efficiency of the proposed audio-visual descriptors over other existing approaches. Finally, we discuss a 3-D video browsing platform that displays movies using feature-based coordinates and thus regroups them according to genre.

Figures in this Article
© 2012 SPIE and IS&T

Citation

Bogdan Ionescu ; Klaus Seyerlehner ; Christoph Rasche ; Constantin Vertan and Patrick Lambert
"Video genre categorization and representation using audio-visual information", J. Electron. Imaging. 21(2), 023017 (Jun 22, 2012). ; http://dx.doi.org/10.1117/1.JEI.21.2.023017


Access This Article
Sign in or Create a personal account to Buy this article ($20 for members, $25 for non-members).

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

Advertisement
  • Don't have an account?
  • Subscribe to the SPIE Digital Library
  • Create a FREE account to sign up for Digital Library content alerts and gain access to institutional subscriptions remotely.
Access This Article
Sign in or Create a personal account to Buy this article ($20 for members, $25 for non-members).
Access This Proceeding
Sign in or Create a personal account to Buy this article ($15 for members, $18 for non-members).
Access This Chapter

Access to SPIE eBooks is limited to subscribing institutions and is not available as part of a personal subscription. Print or electronic versions of individual SPIE books may be purchased via SPIE.org.