Automatic comic page segmentation based on polygon detection

Luyuan Li¹,
Yongtao Wang¹,
Zhi Tang^1,2 &
…
Liangcai Gao¹

705 Accesses
6 Altmetric
Explore all metrics

Abstract

Comic page segmentation aims to automatically decompose scanned comic images into storyboards (frames), which is the key technique to produce digital comic documents that are suitable for reading on mobile devices. In this paper, we propose a novel method for comic page segmentation by finding the quadrilateral enclosing box of each storyboard. We first acquire the edge image of the input comic image, and then extract line segments with a heuristic line segment detection algorithm. We perform line clustering to further merge the overlapped line segments and remove the redundancy line segments. Finally, we perform another round of line clustering and post-processing to compose the obtained line segments into complete quadrilateral enclosing boxes of the storyboards. The proposed method is tested on 2,237 comic images from 12 different printed comic series, and the experimental results demonstrate that our method is effective for comic image segmentation and outperforms the existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

References

Arai K, Tolle H (2010) Automatic e-comic content adaptation. Int J Ubiquit Comput 1(1):1–11
Google Scholar
Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit 13(2):111–122
Article MATH Google Scholar
Burns JB, Hanson AR, Riseman EM (1986) Extracting straight lines. IEEE Trans Pattern Anal Mach Intell 8(4):425–455
Article Google Scholar
Canny J (1986) A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8(6):679–698
Article Google Scholar
Chung KL, Lin ZW, Huang ST, Huang YH, Liao HYM (2010) New orientation-based elimination approach for accurate line-detection. Pattern Recognit 31:11–19
Article Google Scholar
Forsyth DA, Ponce J (2002) Computer vision: a modern approach, 1st edn. Prentice Hall 467–490
Gioi RG, Jakubowicz J, Morel JM, Randall G (2010) LSD: a fast line segment detector with a false detection control. IEEE Trans Pattern Anal Mach Intell 32(4):722–732
Article Google Scholar
Grana C, Borghesani D, Cucchiara R (2010) Optimized block-based connected components labeling with decision trees. IEEE Trans Image Process 19(6):1596–1609
Article MathSciNet Google Scholar
Ho CT, Chen LH (1996) A high-speed algorithm for line detection. Pattern Recognit Lett 17:467–473
Article Google Scholar
In Y, Oie T, Higuchi M, Kawasaki S et al (2010) Fast frame decomposition and sorting by contour tracing for mobile phone comic images. Proc. International Conference on Visualization, imaging and simulation (VIS), Wisconsin, 2010:23–28
Ishii D, Watanabe H (2010) A study on frame position detection of digitized comic images. Workshop on Picture Coding and Image Processing (PCSJ), Nagoya, 2010:124–125
Jain AK, Yu B (1998) Document representation and its application to page decomposition. IEEE Trans Pattern Anal Mach Intell 20(12):294–308
Article Google Scholar
Lo RC, Tsai WH (1995) Gray-scale Hough transform for thick line detection in gray-scale images. Pattern Recognit 28(5):647–661
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Article Google Scholar
Tanaka T, Shoji K, Toyama F, Miyamichi J (2007) Layout analysis of tree-structured scene frames in comic images. Proc. International Joint Conferences on Artificial Intelligence (IJCAI), Hyderabad, January 2007: 2885–2890
Theodoridis S, Koutroumbas K (2008) Pattern recognition, 4th edn. Academic Press 20–50
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Article Google Scholar
Xi J, Hu J, Wu L (2002) Page segmentation of Chinese newspapers. Pattern Recognit 35(12):2695–2704
Article MATH Google Scholar
Yamada M, Budiarto R, Endoo M, Miyazaki S (2004) Comic image decomposition for reading comics on cellular phones. IEICE Trans Inf Syst E87-D(6):1370–1376
Google Scholar

Download references

Acknowledgments

This work is supported by National Basic Research Program of China, also Named “973 Program” (No. 2010CB735908).

Author information

Authors and Affiliations

Institute of Computer Science & Technology, Peking University, Beijing, China
Luyuan Li, Yongtao Wang, Zhi Tang & Liangcai Gao
State Key Laboratory of Digital Publishing Technology, Beijing, China
Zhi Tang

Authors

Luyuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Yongtao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhi Tang
View author publications
You can also search for this author in PubMed Google Scholar
Liangcai Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Yongtao Wang or Zhi Tang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, L., Wang, Y., Tang, Z. et al. Automatic comic page segmentation based on polygon detection. Multimed Tools Appl 69, 171–197 (2014). https://doi.org/10.1007/s11042-012-1241-7

Download citation

Published: 27 September 2012
Issue Date: March 2014
DOI: https://doi.org/10.1007/s11042-012-1241-7

Automatic comic page segmentation based on polygon detection

Abstract

Access this article

Subscribe and save

Buy Now

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Automatic comic page segmentation based on polygon detection

Abstract

Access this article

Subscribe and save

Buy Now

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation