[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3038884.3038905acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedpraiConference Proceedingsconference-collections
research-article

A New Database for Writer Demographics Attributes Detection Based on Off-Line Persian and English Handwriting

Published: 22 November 2016 Publication History

Abstract

This paper describes a database of multi-script (Persian and English) for typical and new aspects and challenges of offline handwriting automatic analysis field. This database can be used for typical aspects such as different levels of segmentation and recognition and writer identification in text-dependent and text-independent modes. Also, new aspects can be used such as writer identification and gender, age and handedness detection based on script-dependent and script-independent. Two pages of forms in three regions were designed and collected from 200 native Persian writers of different age, handednesses, genders and education level. To the best of our knowledge, so far no attempt has been conducted on providing Persian database for writer identification and gender, age and handedness detection based on script-dependent and script-independent in multi-script environment.

References

[1]
Basilis G. Gatos, Nikolaos Stamatopoulos, and Georgios Louloudis. 2009. Handwriting Segmentation Contest. In: Proceedings of 10th ICDAR. 1393--1397.
[2]
D.H. Kim, Y.S. Hwang, S.T. Park, E.J. Kim, S.H. Paek, and S.Y. Bang. 1993. Handwritten Korean Character Image Database PE92. In: Proceedings of 2th ICDAR. 470--473.
[3]
T. Saito, H. Yamada, and K. Yamamoto. 1985. On the Database ETL 9 of Handprinted Characters in JIS Chinese Characters and its Analysis. IEICE Transactions. J68-D, 4, 757--764.
[4]
Tonghua Su, Tianwen Zhang, and Dejun Guan. 2009. Corpus-Based HIT-MW Database for Off-line Recognition of GeneralPurpose Chinese Handwritten Text. Int. J. Document Analysis and Recognition. 10, 1, 27--38.
[5]
Yousef Al-Ohali,MohamedCheriet, and Ching Suen. 2003. Databases for Recognition of Handwritten Arabic Cheques. Pattern Recognition. 36, 111--121.
[6]
Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri, and Dipak Kumar Basu. A Database of Unconstrained Handwritten Bangla and Bangla-English Mixed Script Document Image. Int. J. Document Analysis and Recognition. 15, 71--83.
[7]
Alireza Alaei, P. Nagabhushan, Umapada Pal. 2011. A Benchmark Kannada Handwritten Document Dataset and its Segmentation. In: Proceedings of 11th ICDAR. 141--145.
[8]
Hossein Khosravi, Ehsanollah Kabir. 2007. Introducing a Very Large Dataset of Handwritten Farsi Digits and a Study on their Varieties. Pattern Recognition Letter, 28, 10, 1133--1141.
[9]
Saeed Mozaffari, Karim Faez, Farhad Faradji, Majid Ziaratban and S.Mohamad Golzan. 2006. A Comprehensive Isolated Farsi/Arabic Character Database for Handwritten OCR Research. In: Proceedings of 10th IWFHR, 23--26.
[10]
Farshid Solimanpour, Javad Sadri, and Ching Y. Suen. 2006. Standard Databases for Recognition of Handwritten Digits, Numerical Strings, Legal Amounts, Letters and Dates in Farsi Language. In: Proceedings of 10th IWFHR. 3--7.
[11]
Nikolaos Stamatopoulos, Basilis Gatos, Georgios Louloudis, Umapada Pal, Alireza Alaei. 2013. ICDAR 2013 Handwriting Segmentation Contest. In: Proceedings of 12th ICDAR. 1402--1406.
[12]
Amir M Bidgoli, and Mehdi Sarhadi. 2008. IAUT/PHCN: Azad University of Tehran/Persian Handwritten City Names, a Very Large Database of Handwritten Persian Word. In: Proceedings of 11th ICFHR. 192--197.
[13]
Alireza Alaei, P. Nagabhushan, Umapada Pal. 2011. A New Dataset of Persian Handwritten Documents and its Segmentation. In: Proceedings of 7th Iranian Conference on Machine Vision and Image Processing.
[14]
Alireza Alaei, Umapada Pal, P. Nagabhushan. 2012. Dataset and Ground Truth for Handwritten Text in Four Different Script. Int. J. Pattern Recognition and Artificial Intelligence. 26, 4, 1--25.
[15]
Raashid Hussain, Ahsen Raza, Imran Siddiqi, Khurram Khurshid, and Chawki Djeddi. 2015. A Comprehensive Survey of Handwritten Document Benchmarks: Structure, Usage and Evaluation. EURASIP Journal on Image and Video Processing. DOI 10.1186/s13640-015-0102-5.
[16]
Imran Siddiqi, Chawki Djeddi, Ahsen Raza, and Labiba Souicimeslati. 2014. Automatic Analysis of Handwriting for Gender Classification. Pattern Analysis and Applications. 18, 887899.
[17]
Chawki Djeddi, Imran Siddiqi, Labiba Souici-Meslati, and Abdellatif Ennaji. 2013. Text-independent Writer Recognition using Multi-script Handwritten Texts. Pattern Recognition Letters. 34, 11961202.
[18]
Chawki Djeddi, Imran Siddiqi, Abdeljalil Gattal, Youcef Chibani, Labiba Souici-Meslati, and Haikal El Abed. 2014. LAMIS-MSHD: A Multi-script Offline Handwriting Database. In: Proceedings of 14th ICFHR. 93--97.
[19]
Somaya Al Maadeed, Wael Ayouby, Abdelaali Hassaine, and Jihad Mohamad Aljaam. QUWI: An Arabic and English Handwriting Dataset for Offline Writer Identification. In: Proceedings of 13th ICFHR. 746--751.
[20]
Urs-Viktor Marti, and H. Bunke. 1999. A full English Sentence Database for Off-line Handwriting Recognition. In: Proceedings of 5th ICDAR. 705--708.
[21]
Emmanule Grosicki, Matthieu Carr, Jean-Marie Brodin, and Edouard Geoffrois. 2009. Results of the RIMES Evaluation Campaign for Handwritten Mail Processing. In: Proceedings of 10th ICDAR. 941--945.
[22]
Alicia Fornes, Anjan Dutta, Albert Gordo and Josep Llados. 2011. The ICDAR 2011 Music Scores Competition: Staff Removal and Writer Identification. In: Proceedings of 11th ICDAR. 1511--1515.
[23]
Haikal El Abed, Volker Margner. 2007. The IFN/ENIT-Database - a Tool to Develop Arabic Handwriting Recognition Systems. In: Proceeding of 9th International Symposium on Signal Processing and Its Applications. 1--4.
[24]
Puntis Jifroodian Haghighi, Nicola Nobile, Chun Lei He, and Ching Y. Suen. 2009. A New Large-Scale Multi-purpose Handwritten Farsi Database. In: Proceedings of 6th Int. Conference on Image Analysis and Recognition. 278--286.
[25]
F. Shahabi nejad, and Mohammad Rahmati. 2007. A New Method for Writer Identification and Verification Based on Farsi/Arabic Handwritten Texts. In: Proceedings of 9th ICDAR. 829--833.
[26]
Majid Ziaratban, Karim Faez, and Fatemeh Bagheri. 2009. FHT: An Unconstraint Farsi Handwritten Text Database. In: Proceedings of 10th ICDAR. 281--285.
[27]
Javad Sadri, Mohammad Reza Yeganehzad, and Javad Saghi. 2016. A Novel Comprehensive Database for Offline Persian Handwriting Recognition. Pattern Recognition. 60, 378393.

Cited By

View all
  • (2022)Feature learning and encoding for multi-script writer identificationInternational Journal on Document Analysis and Recognition (IJDAR)10.1007/s10032-022-00394-825:2(79-93)Online publication date: 14-Feb-2022
  • (2021)Texture feature column scheme for single‐ and multi‐script writer identificationIET Biometrics10.1049/bme2.1201010:2(179-193)Online publication date: 14-Feb-2021
  • (2018)ICFHR 2018 Competition on Multi-Script Writer Identification2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)10.1109/ICFHR-2018.2018.00094(506-510)Online publication date: Aug-2018

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
MedPRAI-2016: Proceedings of the Mediterranean Conference on Pattern Recognition and Artificial Intelligence
November 2016
163 pages
ISBN:9781450348768
DOI:10.1145/3038884
  • General Chairs:
  • Chawki Djeddi,
  • Imran Siddiqi,
  • Akram Bennour,
  • Program Chairs:
  • Youcef Chibani,
  • Haikal El Abed
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 November 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Age and Handedness Detection
  2. Gender
  3. Persian and English Handwriting Database
  4. Script-Dependent and Script-Independent
  5. Text-dependent and Text-Independent
  6. Writer Identification

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MedPRAI-2016

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)2
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Feature learning and encoding for multi-script writer identificationInternational Journal on Document Analysis and Recognition (IJDAR)10.1007/s10032-022-00394-825:2(79-93)Online publication date: 14-Feb-2022
  • (2021)Texture feature column scheme for single‐ and multi‐script writer identificationIET Biometrics10.1049/bme2.1201010:2(179-193)Online publication date: 14-Feb-2021
  • (2018)ICFHR 2018 Competition on Multi-Script Writer Identification2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)10.1109/ICFHR-2018.2018.00094(506-510)Online publication date: Aug-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media