Special Section on Image Processing for Cultural Heritage

New public dataset for spotting patterns in medieval document images

[+] Author Affiliations
Sovann En, Stéphane Nicolas, Caroline Petitjean, Laurent Heutte

Normandie University, UNIROUEN, UNIHAVRE, INSA Rouen, LITIS, Rouen 76000, France

Frédéric Jurie

Normandie University, UNICAEN, CNRS, GREYC, Caen 14000, France

J. Electron. Imaging. 26(1), 011010 (Nov 23, 2016). doi:10.1117/1.JEI.26.1.011010
History: Received July 1, 2016; Accepted October 18, 2016
Text Size: A A A

Abstract.  With advances in technology, a large part of our cultural heritage is becoming digitally available. In particular, in the field of historical document image analysis, there is now a growing need for indexing and data mining tools, thus allowing us to spot and retrieve the occurrences of an object of interest, called a pattern, in a large database of document images. Patterns may present some variability in terms of color, shape, or context, making the spotting of patterns a challenging task. Pattern spotting is a relatively new field of research, still hampered by the lack of available annotated resources. We present a new publicly available dataset named DocExplore dedicated to spotting patterns in historical document images. The dataset contains 1500 images and 1464 queries, and allows the evaluation of two tasks: image retrieval and pattern localization. A standardized benchmark protocol along with ad hoc metrics is provided for a fair comparison of the submitted approaches. We also provide some first results obtained with our baseline system on this new dataset, which show that there is room for improvement and that should encourage researchers of the document image analysis community to design new systems and submit improved results.

© 2016 SPIE and IS&T

Citation

Sovann En ; Stéphane Nicolas ; Caroline Petitjean ; Frédéric Jurie and Laurent Heutte
"New public dataset for spotting patterns in medieval document images", J. Electron. Imaging. 26(1), 011010 (Nov 23, 2016). ; http://dx.doi.org/10.1117/1.JEI.26.1.011010


Access This Article
Sign in or Create a personal account to Buy this article ($20 for members, $25 for non-members).

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

Related Book Chapters

Topic Collections

Advertisement
  • Don't have an account?
  • Subscribe to the SPIE Digital Library
  • Create a FREE account to sign up for Digital Library content alerts and gain access to institutional subscriptions remotely.
Access This Article
Sign in or Create a personal account to Buy this article ($20 for members, $25 for non-members).
Access This Proceeding
Sign in or Create a personal account to Buy this article ($15 for members, $18 for non-members).
Access This Chapter

Access to SPIE eBooks is limited to subscribing institutions and is not available as part of a personal subscription. Print or electronic versions of individual SPIE books may be purchased via SPIE.org.