DOI: 10.5555/827140.827204
Article

Correcting broken characters in the recognition of historical printed documents

Published: 27 May 2003

Abstract

This paper presents a new technique for dealing with broken characters, one of the major challenges in the optical character recognition (OCR) of degraded historical printed documents. A technique based on graph combinatorics is used to rejoin the appropriate connected components. It has been applied to real data with successful results.
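
As a rough illustration of the idea in the abstract, the sketch below treats each connected component as a graph node, links components whose bounding boxes lie within a small gap of each other, and enumerates every connected group of up to a few components as a candidate rejoined character. The bounding-box representation, the `max_gap` threshold, the `max_size` limit, and all function names are assumptions made for illustration, not details taken from the paper. A complete system would also need to score each candidate group (for example with a classifier) and choose the best combination, which is not shown here.

```python
from itertools import combinations

# Assumption for this sketch: each connected component is summarised by its
# bounding box (x0, y0, x1, y1) in pixel coordinates.

def box_gap(a, b):
    """Gap between two boxes along the worse axis (0 if they touch or overlap)."""
    dx = max(a[0] - b[2], b[0] - a[2], 0)
    dy = max(a[1] - b[3], b[1] - a[3], 0)
    return max(dx, dy)

def build_graph(boxes, max_gap):
    """Link every pair of components whose boxes are at most max_gap apart."""
    adj = {i: set() for i in range(len(boxes))}
    for i, j in combinations(range(len(boxes)), 2):
        if box_gap(boxes[i], boxes[j]) <= max_gap:
            adj[i].add(j)
            adj[j].add(i)
    return adj

def connected_groups(adj, max_size):
    """Enumerate every connected set of nodes containing at most max_size nodes."""
    found = set()
    frontier = {frozenset([n]) for n in adj}
    while frontier:
        found |= frontier
        grown = set()
        for group in frontier:
            if len(group) == max_size:
                continue
            # Grow the group by one neighbouring node at a time, so every
            # set produced is connected by construction.
            for n in group:
                for m in adj[n]:
                    if m not in group:
                        grown.add(group | {m})
        frontier = grown - found
    return found

if __name__ == "__main__":
    # Two fragments of one broken glyph, plus a separate glyph further right.
    boxes = [(0, 0, 4, 10), (5, 0, 9, 10), (30, 0, 34, 10)]
    graph = build_graph(boxes, max_gap=2)
    for group in sorted(sorted(g) for g in connected_groups(graph, max_size=2)):
        print(group)
```

Limiting the group size keeps the enumeration tractable, since the number of connected subgraphs can grow quickly on dense pages.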



Published In

JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
May 2003
393 pages
ISBN: 0769519393

Publisher

IEEE Computer Society

United States

Qualifiers

  • Article

Conference

JCDL03
Acceptance Rates

JCDL '03 Paper Acceptance Rate 54 of 216 submissions, 25%;
Overall Acceptance Rate 415 of 1,482 submissions, 28%

Cited By

  • (2015) Recognition of Machine Printed Broken Oriya Characters Using Sift Features. Proceedings of the Sixth International Conference on Computer and Communication Technology 2015, pp. 106-109. DOI: 10.1145/2818567.2818587. Online publication date: 25-Sep-2015
  • (2013) Query representation for cross-temporal information retrieval. Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, pp. 383-392. DOI: 10.1145/2484028.2484054. Online publication date: 28-Jul-2013
  • (2009) Adaptive shape prior for recognition and variational segmentation of degraded historical characters. Pattern Recognition 42:12, pp. 3348-3354. DOI: 10.1016/j.patcog.2008.10.005. Online publication date: 1-Dec-2009
  • (2008) Recognition of degraded characters using dynamic Bayesian networks. Pattern Recognition 41:10, pp. 3092-3103. DOI: 10.1016/j.patcog.2008.03.022. Online publication date: 1-Oct-2008
