Training Set Expansion in Handwritten Character Recognition

Published: 06 August 2002

Abstract

In this paper, expanding the training set with synthetic handwritten uppercase letters, generated by deforming natural images, is tested in combination with an approximate k-Nearest Neighbor (k-NN) classifier. It has been previously shown [10, 11] that approximate nearest-neighbor search in large databases can be used successfully in an OCR task, and that significant performance improvements can be obtained consistently simply by increasing the size of the training set. In this work, extensive experiments that add distorted characters to the training set are performed, and the results are compared with those obtained by directly adding new natural samples to the set of prototypes.
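The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the deformations used here (one-pixel shifts and a crude slant), the 5x5 toy images, and the helper names `shift`, `slant`, `expand`, and `knn_classify` are all assumptions made for illustration, and the classifier is an exact brute-force k-NN rather than the approximate search the paper relies on.

```python
from collections import Counter

def shift(img, dx, dy):
    """Translate a 2D grid by (dx, dy); vacated cells are filled with 0."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sy, sx = y - dy, x - dx
            if 0 <= sy < h and 0 <= sx < w:
                out[y][x] = img[sy][sx]
    return out

def slant(img, slope):
    """Crude shear: each row is shifted horizontally in proportion to its
    distance from the vertical centre of the image."""
    cy = (len(img) - 1) / 2
    return [shift([row], round(slope * (cy - y)), 0)[0]
            for y, row in enumerate(img)]

# Hypothetical deformation set; a real system would use richer distortions.
DEFORMATIONS = [
    lambda im: shift(im, 1, 0), lambda im: shift(im, -1, 0),
    lambda im: shift(im, 0, 1), lambda im: shift(im, 0, -1),
    lambda im: slant(im, 0.5), lambda im: slant(im, -0.5),
]

def expand(prototypes):
    """Return the natural samples plus one distorted copy per deformation."""
    out = list(prototypes)
    for img, label in prototypes:
        out.extend((deform(img), label) for deform in DEFORMATIONS)
    return out

def sq_dist(a, b):
    """Squared Euclidean distance between two grids of equal size."""
    return sum((pa - pb) ** 2
               for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))

def knn_classify(prototypes, query, k=1):
    """Exact brute-force k-NN with majority vote (not approximate search)."""
    nearest = sorted(prototypes, key=lambda p: sq_dist(p[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

# Toy "natural" samples: a vertical bar for I, a bar with a foot for L.
LETTER_I = [[0, 0, 1, 0, 0] for _ in range(5)]
LETTER_L = [[0, 1, 0, 0, 0] for _ in range(4)] + [[0, 1, 1, 1, 0]]

train = [(LETTER_I, "I"), (LETTER_L, "L")]
expanded = expand(train)

# An "I" written one pixel to the left of the training sample.
query = shift(LETTER_I, -1, 0)
print(knn_classify(train, query))     # L (misclassified)
print(knn_classify(expanded, query))  # I
```

In this toy case the displaced "I" is closer to the natural "L" than to the natural "I", so the two-prototype set misclassifies it, while the expanded set contains a matching distorted copy. The example only shows the mechanism; it says nothing about the gains reported in the paper.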

References

[1] S. Arya, D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Wu. An optimal algorithm for approximate nearest neighbor searching. Journal of the ACM, 45:891-923, 1998.
[2] J. L. Bentley, B. W. Weide, and A. C. Yao. Optimal expected time algorithms for closest point problems. ACM Trans. on Math. Software, 6:563-580, 1980.
[3] P. A. Devijver and J. Kittler. On the edited nearest neighbour rule. In Proceedings of the 5th International Conference on Pattern Recognition, pages 72-80. IEEE Computer Society Press, Los Alamitos, CA, 1980.
[4] J. H. Friedman, J. L. Bentley, and R. A. Finkel. An algorithm for finding best matches in logarithmic expected time. ACM Trans. on Math. Software, 3:209-226, 1977.
[5] T. M. Ha and H. Bunke. Off-line, handwritten numeral recognition by perturbation method. IEEE Trans. on PAMI, 19(5):535-539, May 1997.
[6] P. E. Hart. The condensed nearest neighbor rule. IEEE Trans. on Information Theory, 14:515-516, 1968.
[7] A. Jain. Object matching using deformable templates. IEEE Trans. on PAMI, 18:268-273, 1996.
[8] B. S. Kim and S. B. Park. A fast k nearest neighbor finding algorithm based on the ordered partition. IEEE Trans. on PAMI, 8:761-766, 1986.
[9] J. Mao. Improving OCR performance using character degradation models and boosting algorithm. Pattern Recognition Letters, 18:1415-1419, 1997.
[10] J. C. Perez-Cortes, J. Arlandis, and R. Llobet. Fast and accurate handwritten character recognition using approximate nearest neighbours search on large databases. In Workshop on Statistical Pattern Recognition SPR-2000, Alicante (Spain), 2000.
[11] S. J. Smith. Handwritten character classification using nearest neighbor in large databases. IEEE Trans. on PAMI, 16(9):915-919, September 1994.
[12] D. L. Wilson. Asymptotic properties of nearest neighbor rules using edited data. IEEE Trans. on Systems, Man and Cybernetics, 2:408-420, 1972.

Published In

Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
August 2002
865 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Cited By

  • (2018) A new iterative synthetic data generation method for CNN based stroke gesture recognition. Multimedia Tools and Applications 77:13 (17181-17205). DOI: 10.1007/s11042-017-5285-6. Online publication date: 1-Jul-2018
  • (2016) A Rapid Prototyping Approach to Synthetic Data Generation for Improved 2D Gesture Recognition. Proceedings of the 29th Annual Symposium on User Interface Software and Technology (873-885). DOI: 10.1145/2984511.2984525. Online publication date: 16-Oct-2016
  • (2015) An Arabic handwriting synthesis system. Pattern Recognition 48:3 (849-861). DOI: 10.1016/j.patcog.2014.09.013. Online publication date: 1-Mar-2015
  • (2015) Toward automatic development of handwritten personal Farsi/Arabic OpenType® fonts. International Journal on Document Analysis and Recognition 18:3 (249-262). DOI: 10.1007/s10032-015-0241-3. Online publication date: 1-Sep-2015
  • (2015) Semi-supervised Learning for Image Modality Classification. Revised Selected Papers from the First International Workshop on Multimodal Retrieval in the Medical Domain - Volume 9059 (85-98). DOI: 10.1007/978-3-319-24471-6_8. Online publication date: 29-Mar-2015
  • (2008) Analogical Dissimilarity. Journal of Artificial Intelligence Research 32:1 (793-824). DOI: 10.5555/1622673.1622693. Online publication date: 1-Aug-2008
  • (2007) Learning a Classifier with Very Few Examples. Proceedings of the 18th European conference on Machine Learning (527-534). DOI: 10.1007/978-3-540-74958-5_49. Online publication date: 17-Sep-2007
  • (2005) An Editor Labeling Model for Training Set Expansion in Web Categorization. Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence (165-171). DOI: 10.1109/WI.2005.27. Online publication date: 19-Sep-2005
  • (2003) Using tree-grammars for training set expansion in page classification. Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2. DOI: 10.5555/938980.939542. Online publication date: 3-Aug-2003
  • (2003) Generation of Synthetic Training Data for an HMM-based Handwriting Recognition System. Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1. DOI: 10.5555/938979.939265. Online publication date: 3-Aug-2003
