[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Document analysis system

Published: 01 November 1982 Publication History

Abstract

This paper outlines the requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing. Several critical functions have been investigated and the technical approaches are discussed. The first is the segmentation and classification of digitized printed documents into regions of text and images. A nonlinear, run-length smoothing algorithm has been used for this purpose. By using the regular features of text lines, a linear adaptive classification scheme discriminates text regions from others. The second technique studied is an adaptive approach to the recognition of the hundreds of font styles and sizes that can occur on printed documents. A preclassifier is constructed during the input process and used to speed up a well-known pattern-matching method for clustering characters from an arbitrary print source into a small sample of prototypes. Experimental results are included.

References

[1]
R. N. Ascher, G. M. Koppelman, M. J. Miller, G. Nagy, and G. L. Shelton, Jr., "An Interactive System for Reading Unformatted Printed Text," IEEE Trans. Computers C-20, 1527- 1543 (December 1971).
[2]
A. Steinbach and K. Y. Wong, "An Understanding of Moiré Patterns in the Reproduction of Halftone Images," Proceedings of the Pattern Recognition and Image Processing Conference, Chicago, Aug. 6-8, 1979, pp. 545-552.
[3]
G. Nagy, "Preliminary Investigation of Techniques for Automated Reading of Unformatted Text," Commun. ACM 11, 480-487 (July 1968).
[4]
E. Johnstone, "Printed Text Discrimination," Computer Graph. & Image Process. 3, 83-89 (1974).
[5]
F. M. Wahl, K. Y. Wong, and R. G. Casey, "Block Segmentation and Text Extraction in Mixed Text/Image Documents," Research Report RJ 3356, IBM Research Laboratory, San Jose, CA, 1981.
[6]
F. Wahl, L. Abele, and W. Scherl, "Merkmale fuer die Segmentation von Dokumenten zur Automatischen Textverarbeitung," Proceedings of the 4th DAGM-Symposium, Hamburg, Federal Republic of Germany, Springer-Verlag, Berlin, 1981.
[7]
L. Abele, F. Wahl, and W. Scherl, "Procedures for an Automatic Segmentation of Text Graphic and Halftone Regions in Documents," Proceedings of the 2nd Scandinavian Conference on Image Analysis, Helsinki, 1981.
[8]
F. M. Wahl and K. Y. Wong: "An Efficient Method of Running a Constrained Run Length Algorithm (CRLA) in Vertical and Horizontal Directions on Binary Image Data," Research Report RJ3438, IBM Research Laboratory, San Jose, CA, 1982.
[9]
A. Rosenfeld and A. C. Kak, "Digital Picture Processing," Academic Press, Inc., New York, 1976, pp. 347-348.
[10]
F. M. Wahl, "A New Distance Mapping and its Use for Shape Measurement on Binary Patterns," Research Report RJ 3361, IBM Research Laboratory, San Jose, CA, 1982.
[11]
R. G. Casey and G. Nagy, "Decision Tree Design Using a Probabilistic Model," Research Report RJ 3358, IBM Research Laboratory, San Jose, CA, 1981.
[12]
W. H. Chen, J. L. Douglas, W. K. Pratt, and R. H. Wallis, "Dual-mode Hybrid Compressor for Facsimile Images," SPIE J. 207, 226-232 (1979).
[13]
R. G. Casey and K. Y. Wong, "Unsupervised Construction of Decision Networks for Pattern Classification," IBM Research Report, to appear.
[14]
R. G. Casey and G. Nagy, "Recursive Segmentation and Classification of Composite Character Patterns," presented at the 6th International Conference on Pattern Recognition, Munich, October 1982.

Cited By

View all
  • (2024)Continuous document layout analysisInformation Fusion10.1016/j.inffus.2024.102398108:COnline publication date: 1-Aug-2024
  • (2024)Doc-DINO: A Transformer Model for Complex Logical Document Layout AnalysisDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70546-5_5(76-89)Online publication date: 30-Aug-2024
  • (2024)Text Line Segmentation on Ancient Egyptian Papyri: Layout Analysis with Object Detection Networks and Connected ComponentsDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70543-4_13(215-232)Online publication date: 30-Aug-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IBM Journal of Research and Development
IBM Journal of Research and Development  Volume 26, Issue 6
November 1982
125 pages

Publisher

IBM Corp.

United States

Publication History

Published: 01 November 1982
Revised: 07 July 1982
Received: 03 May 1982

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Continuous document layout analysisInformation Fusion10.1016/j.inffus.2024.102398108:COnline publication date: 1-Aug-2024
  • (2024)Doc-DINO: A Transformer Model for Complex Logical Document Layout AnalysisDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70546-5_5(76-89)Online publication date: 30-Aug-2024
  • (2024)Text Line Segmentation on Ancient Egyptian Papyri: Layout Analysis with Object Detection Networks and Connected ComponentsDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70543-4_13(215-232)Online publication date: 30-Aug-2024
  • (2023)Semantic Navigation of PowerPoint-Based Lecture Video for AutoNote GenerationIEEE Transactions on Learning Technologies10.1109/TLT.2022.321653516:1(1-17)Online publication date: 1-Feb-2023
  • (2023)Document analysis by crosscount approachJournal of Computer Science and Technology10.1007/BF0294661213:1(32-40)Online publication date: 22-Mar-2023
  • (2023)Visual Information Extraction in the Wild: Practical Dataset and End-to-End SolutionDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41731-3_3(36-53)Online publication date: 21-Aug-2023
  • (2023)Document Layout Annotation: Database and Benchmark in the Domain of Public AffairsDocument Analysis and Recognition – ICDAR 2023 Workshops10.1007/978-3-031-41501-2_9(123-138)Online publication date: 21-Aug-2023
  • (2022)Text and metadata extraction from scanned Arabic documents using support vector machinesJournal of Information Science10.1177/016555152096125648:2(268-279)Online publication date: 1-Apr-2022
  • (2022)Robust Detection of Tables in Documents Using Scores from Table Cell CoresSN Computer Science10.1007/s42979-022-01041-z3:2Online publication date: 12-Feb-2022
  • (2022)Extracting Variable-Depth Logical Document Hierarchy from Long Documents: Method, Evaluation, and ApplicationJournal of Computer Science and Technology10.1007/s11390-021-1076-737:3(699-718)Online publication date: 1-Jun-2022
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media