Robust frame and text extraction from comic books

Christophe Rigaud, Norbert Tsopze, Jean-Christophe Burie, Jean-Marc Ogier

Graphics Recognition. New Trends and Challenges (LNCS), vol. 7423, pp. 129-138, 2013

Image not found


Comic books constitute an important heritage in many countries. Nowadays, digitization allows to search directly from content instead of metadata only (e.g. album title or author name). Few studies have been done in this direction. Only frame and speech balloon extraction have been experimented in the case of simple page structure. In fact, the page structure depends on the author which is why many different structures and drawings exist. Despite the differences, drawings have a common characteristic because of design process: they are all surrounded by a black line. In this paper, we propose to rely on this particularity of comic books to automatically extract frame and text using a connected-component labeling analysis. The approach is compared with some existing methods found in the literature and results are presented.

Citation BibTeX

booktitle={Graphics Recognition. New Trends and Challenges},
series={Lecture Notes in Computer Science},
editor={Kwon, Young-Bin and Ogier, Jean-Marc},
title={Robust Frame and Text Extraction from Comic Books},
publisher={Springer Berlin Heidelberg},
keywords={comic books; comics frame extraction; comics text extraction; segmentation; connected-component labeling; k-means},
author={Rigaud, Christophe and Tsopze, Norbert and Burie, Jean-Christophe and Ogier, Jean-Marc},


See online

Download PDF version