Abstract
Identifying the writer of a handwritten document has remained an interesting pattern classification problem for document examiners, forensic experts, and paleographers. While mature identification systems have been developed for handwriting in contemporary documents, the problem remains challenging from the viewpoint of historical manuscripts. Design and development of expert systems that can identify the writer of a questioned manuscript or retrieve samples belonging to a given writer can greatly help the paleographers in their practices. In this context, the current study exploits the textural information in handwriting to characterize writer from historical documents. More specifically, we employ oBIF(oriented Basic Image Features) and hinge features and introduce a novel moment-based matching method to compare the feature vectors extracted from writing samples. Classification is based on minimization of a similarity criterion using the proposed moment distance. A comprehensive series of experiments using the International Conference on Document Analysis and Recognition 2017 historical writer identification dataset reported promising results and validated the ideas put forward in this study.
1 Introduction
Identifying the writer of a handwritten document is an established task in forensic analysis, document examination, paleography, and information retrieval problems. Along with physical biometric identifiers such as fingerprints, face, and deoxyribonucleic acid, handwriting in human identification is considered a special case of behavioral biometrics [1]. With the recent advancements in pattern classification and machine learning, automatic writer identification systems have matured substantially in recent years [2–8]. These systems aim to capture the visual differences in the handwriting of different individuals. These differences typically include the variations in allographs, the slope of lines, the slant of characters, line spacing, inter and intra-word spacings, legibility, cursiveness, and so on. These writer-specific attributes are extracted through computational structural or statistical features at paragraph, line, word, or sub-word levels. In recent years, data-driven feature learning using convolutional neural networks has also emerged as a popular choice of researchers in characterizing the writer from handwriting [9–11].
Among various features capturing the writing style, textural measures have remained an attractive as well as an effective choice for many researchers. Commonly employed textural features include different variants of local binary patterns [12,13], the hinge features [14], curvature-free cloud of line distribution (COLD) features [15,16], histogram of oriented gradients [17], gray-level co-occurrence matrices [18], and the run-length features [16]. Writer identification using a codebook of graphemes [19,20], small writing strokes [21], and small windows around keypoints in handwriting [22] has also been investigated. Learning features from writing samples using ConvNets has also gained popularity in recent years [23,24].
Contrary to the contemporary documents, the complexity of identifying the hand that produced a manuscript becomes much more challenging in the case of historical documents [25]. Historical manuscripts often degrade over time and commonly suffer from noise, holes, ripping, tearing, and stains. In most cases, such documents are photographed using high-resolution cameras and are made available for research and development of computational methods. Identifying the scribe of historical documents can also be exploited to estimate the date and/or geographical information of the origin of the manuscript [26]. Furthermore, information on scribes can also be employed to match different fragments of the same manuscript and combine them together into a complete document [27].
A number of writer identification competitions have been organized in conjunction with the International Conference on Frontiers in Handwriting Recognition [28–30] and the International Conference on Document Analysis and Recognition (ICDAR) [31–33]. In addition to these competitions targeting writer identification on modern datasets, dedicated competitions on the identification of scribes from historical documents have also been organized in conjunction with ICDAR 2017 [34] and ICDAR 2019 [35]. The winning system [25] of the ICDAR 2017 competition employed oriented Basic Image Features (oBIFs) [36] to characterize the writer. For the 2109 competition, the system based on pathlet and scale-invariant feature transform (SIFT) features outperformed the other submitted systems.
This article targets the problem of writer identification from historical manuscripts extending our previous findings on this problem [13,25]. While most of the studies on such problems focus on the feature extraction part and employ standard matching techniques, we introduce a new moment-based distance to compare two writing samples. A combination of oBIF column histograms [25] and hinge features [14] is employed to map writing samples to feature vectors, which are subsequently compared using the proposed distance measure. An experimental study on the publicly available ICDAR 2017 dataset and a comparison with the existing techniques validate the effectiveness of the proposed method. The key highlights of this study are outlined as follows:
Writer characterizing from challenging historical manuscripts using oBIFs and hinge features.
Introduction of a novel moment-based measure to compute the distance between two feature vectors.
A comprehensive experimental study on publicly available historical documents.
Promising performance in terms of writer identification rates outperforming the existing methods.
It is pertinent to mention that data-driven feature extraction using deep learning methods has emerged as a popular method for this problem in recent years [9–11]. These methods jointly train the feature extractor and the classifier typically using different variants of ConvNets. We, on the other hand, employ the standard pattern classification pipeline of feature extraction followed by matching the query document with those in the reference base. A major motivation for this choice is the fact that although for experimental datasets, the amount of handwriting available per writer (class) is sufficient to learn writer-specific features using deep learning methods, for most practical applications, the amount of text per writer is fairly limited. In some cases, it could be a single line or a few words only. Standard visual features with conventional processing pipelines are more effective in such situation and also drive the current research.
The content of this article is organized as follows. In Section 2, we discuss the recent advancements in writer identification with a prime focus on historical manuscripts. Section 3 introduces the textural features employed to characterize the writer along with the proposed moment distance. Experimental study, quantitative performance, and a detailed analysis of the reported results are presented in Section 4. Section 5 concludes the article with a discussion on key findings and insights into open research problems on this subject.
2 Related works
The problem of writer identification has been thoroughly investigated by the handwriting recognition community. A major contributing factor to this research’s attention was the public availability of large handwriting datasets like IAM [37], RIMES [38], CVL [39], KHATT [40], and QUWI [41]. Despite these advancements, identifying the scribe from historical manuscripts remains an open problem as discussed in Section 1. In recent years, however, several joint projects [42–44] between paleographers and researchers in pattern classification have resulted in increased acceptability of computerized solutions by domain experts [45].
A major challenge in the automatic characterizing of writers is to identify the set of computational features that are able to capture writer-specific information from the samples under study. These can be a set of pre-defined hand-crafted features or can be learned through data using convolutional neural networks. With respect to historical documents, a number of studies investigate the features primarily employed for modern documents on historical manuscripts [46]. Gattal et al. [25], for instance, captured the textural information in handwriting by combining the oBIFs computed at multiple scales from binarized historical documents. Identification is carried out in the nearest-neighbor framework using a number of distance metrics, and an experimental study on the ICDAR 2017 Historical WI dataset reported an accuracy of 77.39%. Likewise, Lai et al. [47] proposed pathlet and SIFT features for writer identification in historical documents. Pathlet and unidirectional SIFT features are extracted to capture rich shape (slant and curvature) and structural information (corners and junctions) in the handwriting. The extracted features are then encoded using a newly proposed bagged-vector of locally aggregated descriptors (VLAD) scheme. The methods reported state-of-the-art performance on ICDAR 2017 Historical-WI dataset and achieved the top performance in the ICDAR 2019 historical document reading challenges-image retrieval (HDRC-IR) competition.
Among other methods, Chammas et al. [48] used a deep convolutional neural network using small patches of handwriting extracted through SIFT keypoints. Features learned by the convolutional neural network (CNN) are encoded through multi-VLAD and are normalized using the L2 norm. Classification with an exemplar support vector machine reported an accuracy of 97% on the ICDAR2019 HDRC-IR dataset. In another study, Christlein et al. [49] used a deep residual CNN to learn effective feature representations using surrogate classes. These classes are obtained by applying clustering on the samples in the training set. Features learned by the CNN are subsequently employed for classification. In an extension of this study, Jordan et al. [50] employed the same features and introduced a re-ranking method to improve the retrieval performance. The re-ranking relies on k-reciprocal nearest neighbor relationships and was shown to significantly improve the performance on the ICDAR 2107 dataset.
In other notable works on historical documents, transfer learning on pretrained CNNs is employed in ref. [51] to identify writer from the images of the twelfth century Bible. An extension of this study was the evaluation of pretrained CNNs on medieval documents [52]. Similarly, a number of pretrained CNNs are evaluated for multiple tasks, including dating and identification of writing styles from historical manuscripts. In another series of related studies, a detailed analysis of writer identification in historical documents was carried out in ref. [53], and the study was extended to handwriting on papyrus in ref. [54]. To handle the problem of data scarcity, Nasir et al. [55] proposed a two-step fine-tuning by first tuning the weights of pretrained CNN on modern handwriting images in the IAM dataset and subsequently tuning it on the limited samples of handwriting on papyrus. While the work in ref. [55] extracts features from rectangular windows of handwriting obtained with dense sampling, patches around keypoints in handwriting are considered in ref. [56], resulting in enhanced performance. An overall writer identification rate of 64% is reported in ref. [56], as opposed to 54% in [55] on a challenging set of 50 writing samples from ten different scribes.
A summary of notable studies on writer identification from historical manuscripts, primarily targeting the ICDAR datasets, is presented in Table 1. It can be observed that among hand-crafted features, textural features represent an attractive choice to capture the information on writing style and, hence, identify the writers [25,34]. Among machine-learned features, a common recent trend is to identify keypoints in handwriting (e.g., using SIFT), extract small patches using these keypoints, and employ these patches for feature learning through a CNN. It is also common to encode the extracted features where different variants of VLAD encoding have been investigated [47–49]. In our study, since the primary focus is on proposing an effective distance measure, we employ textural measures to extract features and employ a number of distance metrics to validate the superiority of the proposed metric. These details are presented in the next section.
Study | Features | Dataset | Top-1 | mAP |
---|---|---|---|---|
Winning system of ICDAR 2017 competition [34] | oBIFs columns | ICDAR 2017 | 76.40 | 55.60 |
Christlein et al. (2017) [49] | VLAD-encoded CNN features | ICDAR 2017 | 88.9 | 76.2 |
CLaMM16 | 84.1 | — | ||
Gattal et al. (2018) [25] | oBIFs columns | ICDAR 2017 | 77.39 | 56.82 |
Jordan et al. (2020) [50] | VLAD-encoded CNN features with re-ranking | ICDAR 2017 | 89.43 | 78.20 |
CzByChron | 98.04 | 80.10 | ||
MusicDocs | 98.62 | 78.64 | ||
Chammas et al. (2020) [48] | CNN features with multi-VLAD encoding | ICDAR 2019 | 97.0 | 91.2 |
Lai et al. (2020) [47] | SIFT and pathlet features with bagged-VLAD | ICDAR 2017 | 90.1 | 77.2 |
ICDAR 2019 | 97.4 | 92.5 |
3 Methods
This section introduces the proposed writer identification technique that relies upon extracting a set of features from the writing samples in the reference base and comparing those with the questioned document. An overview of the system is presented in Figure 1 while the feature extraction and the proposed moment-based matching method are discussed further.
3.1 Feature extraction
Feature extraction is an important step in any image classification task. It allows for the mapping of given images to points in the feature space, so that images of the same class (writer in our case) cluster together in the feature space. In our study, we have chosen to capture the curvature, contour, and texture information in the handwriting to characterize the writer. The corresponding computational features include oBIFs column histograms, and hinge features, detailed in the following.
3.1.1 oBIFs column histograms
In one of our recent studies [25], we investigated the effectiveness of oBIFs in identifying writers from historical documents and obtained promising results. We, therefore, employ oBIFs as one of the features to evaluate the performance of the newly proposed matching scheme. oBIFs are an extension of the Basic Image Features (BIFs) introduced in ref. [57]. oBIFs include the application of a bank of derivative-of-Gaussian filters at multiples scales (controlled by scale parameter
3.1.2 Hinge features
Among the various contour-based features reported in the literature, hinge [20] and delta-n hinge [14] features, designed to capture the ink-trace curvature, are known to be highly discriminative for different writers. The hinge feature [20] computes the joint probability distribution of orientations of the two arms of a hinge, considering each pixel on the writing contour as the reference point. Parameters involved in the calculation of hinge feature include the length of the arm
where
Decision on the identity of the writer of a query document is made separately using oBIFs column histogram and hinge features and the individual decisions are subsequently combined to arrive at the final output. Details of classification using the newly proposed moment matching are presented in the following sections.
3.2 Moment-based distance
Distance metric plays an important role in comparing the feature vectors and eventually performing classification. A number of distance measures have been proposed in the literature to compute the dis-similarity between feature vectors [59]. Commonly employed metrics include Euclidean distance, city block distance, correlation, cosine distance, and spearman distance. In the current study, in addition to the investigation of standard metrics, we also propose a novel matching method and the moment distance, which is elaborated further below.
Moments have been widely employed in image analysis, pattern classification, object recognition, and image coding [60]. If a feature vector is considered as a discrete function
The first moment is the expected value of a random variable, and the second central moment is its variance. Likewise, the third moment is the skewness, and the fourth moment is the kurtosis. The moments about mean are the mean of deviations from the mean after raising them to integer powers. The
If
If
In a similar fashion,
For matching, we extract the feature vector
where
Using equations (8) and (9), the
In our study, we have (empirically) chosen the values of
The distance between the query feature vector and all those in the reference base is computed using equation (11), and the writer of the query document is identified as the writer of the document in the reference collection, which reports the minimum distance (nearest neighbor framework). Moment distances are computed separately for the two sets of features (oBIF column histograms and hinge features) and the final decision is made by combining the individual decisions. Decisions can be combined using the product (Prod), sum (Sum), average (Avr), or minimum (Min) rules, and based on the findings of our previous study [25], in the present work, we combine the individual decisions using the Min rule.
4 Experiments and results
The experimental study of our system is carried out on the ICDAR 2017 Historical Writer Identification dataset [34]. The test set of this competition contains a total of 3,600 historical manuscripts with 720 unique writers, i.e., each writer contributed five pages. For consistency, we employ the same experimental protocol as that of the competition and quantify the performance using Top-1, Top-5, and Top-10 identification rates along with the mean average precision (mAP). Top-k refers to retrieving the best
In the first experiment, we aim to evaluate the performance of well-known handcrafted features on the challenging set of historical manuscripts. Using the competition experimental protocol, we implemented a number of well-known features applied to the writer identification problem on modern manuscripts to select the best subset of features. The investigated features include the oBIFs column histogram, Delta Hinge features, local binary patterns (LBP), LBP column histogram, complete local binary patterns (CLBP), local binary pattern variance (LBPV), run length features, edge direction and edge hinge features, and COLD features. The results of these experiments are summarized in Table 2.
Features | Parameters | Dim. | Average identification rates (%) | |||
---|---|---|---|---|---|---|
Top 1 | Top 5 | Top 10 | Map | |||
|
oBIF at
|
484 | 74.56 | 83.33 | 86.36 | 53.26 |
|
oBIF at
|
484 | 72.69 | 80.92 | 84.14 | 52.34 |
|
oBIF at
|
968 | 76.17 | 84.25 | 86.94 | 55.27 |
oBIF at
|
||||||
|
|
5184 | 71.14 | 79.69 | 83.33 | 50.50 |
LBP [13] |
|
529 | 49.31 | 55.72 | 58.14 | 29.40 |
LBP column histogram [13] | LBP at
|
14,337 | 37.81 | 43.22 | 45.83 | 21.67 |
LBP at
|
||||||
CLBP [12] |
|
270 | 54.00 | 60.61 | 63.42 | 33.43 |
LBPV [12] |
|
529 | 56.06 | 63.75 | 65.89 | 34.00 |
Run-length distribution on white pixels [61] | — | 200 | 39.92 | 47.56 | 51.17 | 25.18 |
Run-length distribution on black pixels [61] | — | 400 | 27.14 | 32.67 | 35.03 | 15.96 |
Run-length distribution on white and black pixels [61] | — | 600 | 42.11 | 49.36 | 52.39 | 26.08 |
Edge-direction distribution with 16 angles [61] | — | 16 | 29.03 | 35.53 | 39.08 | 17.72 |
Edge-hinge with fragment of length of 7 pixels [61] | — | 2,304 | 61.11 | 68.64 | 72.03 | 40.40 |
COLD features [15] |
|
84
|
63.44 | 72.36 | 75.42 | 41.67 |
It can be observed in Table 2 that among the investigated features, the oBIF column histograms at
Once the top-performing features are identified, we evaluate the performance of these features using different distance metrics, including the proposed moment-based distance. The corresponding results are summarized in Table 3. It can be seen that by using the proposed moment-based distance, Top-1 identification rates of as high as 77.36% and 75% are reported with the oBIF column histogram (
Features | Distance metrics | Average identification rates (%) | ||||
---|---|---|---|---|---|---|
Top1 | Top5 | Top10 | mAP | |||
|
NMD | New moment distance | 74.61 | 82.83 | 86.08 | 53.44 |
Eu | Euclidean distance | 73.53 | 81.81 | 85.14 | 52.33 | |
CB | City-Block distance | 74.33 | 82.33 | 85.00 | 52.46 | |
Corr | Correlation distance | 74.56 | 83.33 | 86.36 | 53.26 | |
Cos | Cosine distance | 74.17 | 82.78 | 85.42 | 53.18 | |
Sp | Spearman distance | 73.89 | 82.72 | 86.19 | 52.87 | |
|
NMD | New moment distance | 74.17 | 83.56 | 86.58 | 53.20 |
Eu | Euclidean distance | 72.83 | 81.42 | 84.36 | 51.30 | |
CB | City-Block distance | 72.64 | 82.03 | 84.72 | 51.08 | |
Corr | Correlation distance | 72.69 | 80.92 | 84.14 | 52.34 | |
Cos | Cosine distance | 73.22 | 82.67 | 86.03 | 52.67 | |
Sp | Spearman distance | 72.19 | 80.25 | 83.17 | 51.68 | |
|
NMD | New moment distance | 77.36 | 86.42 | 89.39 | 55.88 |
Eu | Euclidean distance | 76.03 | 84.17 | 87.11 | 54.39 | |
CB | City-Block distance | 75.72 | 84.53 | 86.97 | 54.32 | |
Corr | Correlation distance | 76.17 | 84.25 | 86.94 | 55.27 | |
Cos | Cosine distance | 75.97 | 84.39 | 87.28 | 55.14 | |
Sp | Spearman distance | 75.42 | 83.33 | 86.08 | 54.72 | |
|
NMD | New moment distance | 75.00 | 84.00 | 86.47 | 54.06 |
Eu | Euclidean distance | 72.11 | 80.72 | 83.69 | 50.51 | |
CB | City-Block distance | 72.94 | 82.19 | 85.11 | 51.30 | |
Corr | Correlation distance | 71.14 | 79.69 | 83.33 | 50.50 | |
Cos | Cosine distance | 72.25 | 80.33 | 83.47 | 51.08 | |
Sp | Spearman distance | 69.94 | 78.19 | 81.22 | 50.49 |
Bold text indicates a significant performance compared to other performances.
In addition to studying the performance of the features with respect to different distance metrics, we also investigated the combination of decisions of individual features to study how the proposed metric behaves when decisions are combined. Decisions are combined using a minimum of the sum distances from features (Sum-Min), minimum of the product (Prod-Min), minimum of the average (Avg-Min), and minimum of the minimum (Min-Min) distances. Performance with the combination of decisions is summarized in Table 4, where it can be observed that, in general, classification performance of the combination scheme based on the minimum of the product (Prod-Min) is relatively better than other combination methods. A Top-1 identification rate of 78.75% and mAp of 58.62% is reported. Among the investigated combinations, combination of the oBIF column histogram (
Combination decision | Average identification rates (%) | |||
---|---|---|---|---|
Top1 | Top5 | Top10 | mAP | |
Sum-Min(f1,f2) | 77.25 | 86.08 | 88.39 | 56.07 |
Sum-Min(f3,f4) | 78.72 | 88.33 | 91.00 | 58.45 |
Sum-Min(f1,f2, f4) | 78.69 | 87.06 | 89.89 | 58.37 |
Sum-Min(f1, f2, f3,f4) | 78.42 | 86.75 | 89.67 | 57.99 |
Prod-Min(f1,f2) | 77.22 | 86.58 | 89.28 | 56.11 |
Prod-Min(f3,f4) | 78.75 | 88.31 | 91.08 | 58.62 |
Prod-Min(f1,f2, f4) | 78.64 | 86.86 | 89.92 | 58.48 |
Prod-Min(f1, f2, f3,f4) | 78.53 | 86.97 | 90.08 | 58.03 |
Bold text indicates a significant performance compared to other performances.
In the end, we also provide a performance comparison with methods (hand-crafted features) evaluated on the same dataset and using the same experimental protocol as that of our study. Although higher identification rates are reported in studies employing machine-learned features (using ConvNets), it is important to recall that the objective of this study is not to introduce novel features but to enhance the matching step. Consequently, to show the effectiveness of the proposed moment-based distance, we investigated two well-known hand-crafted features and studied the performance evolution as a function of the distance metric. Consequently, the comparison is also made with techniques employing hand-crafted features. Table 5 shows that using the proposed moment-based matching, we achieve better performance as compared to those reported in ref. [25] as well as the winning system of the ICDAR 2017 competition [34]. Given the complexity of the problem, the reported identification rates are indeed very promising validating the effectiveness of the employed features and the proposed moment-matching method.
5 Conclusion
We presented an effective technique for characterizing writer from historical manuscripts. The technique relies on extracting oBIF column histograms and delta-hinge features from writing samples, and these features are matched using the newly proposed moment-based distance. A number of existing hand-crafted features are evaluated on the ICDAR 2017 Historical Document Writer Identification (Historical-WI) dataset, and the best-performing features are selected. A comprehensive study with different distance metrics and different decision combination schemes is also carried out. The reported results validated the effectiveness of the moment-based matching method in identifying writers from historical manuscripts. In our further study on this subject, we plan to investigate other categories of features and employ a formal feature selection strategy to identify the most discriminate subset of features for this problem. Furthermore, we also plan to extend this study to an unsupervised framework where manuscripts do not have writer labels and need to be grouped into clusters as a function of similarity in the writing style.
-
Funding information: The authors state no funding involved.
-
Conflict of interest: The authors declare no conflict of interest.
References
[1] Bensefia A, Tamimi H. Validity of handwriting in biometric systems. PRAI 2018. New York, NY, USA: Association for Computing Machinery; 2018. p. 5–10. 10.1145/3243250.3243261. Search in Google Scholar
[2] Srihari SN, Cha SH, Arora H, Lee S. Individuality of handwriting. J Forensic Sci. 2002;47(4):856–72. 10.1037/e515312006-001Search in Google Scholar
[3] Saunders CP, Davis LJ, Buscaglia J. Using automated comparisons to quantify handwriting individuality. J Forensic Sci. 2011;56(3):683–9. 10.1111/j.1556-4029.2011.01713.xSearch in Google Scholar PubMed
[4] Kumar P, Sharma A. DCWI: distribution descriptive curve and cellular automata based writer identification. Expert Syst Appl. 2019;128:187–200. 10.1016/j.eswa.2019.03.037Search in Google Scholar
[5] Chahi A, Ruichek Y, Touahni R. An effective and conceptually simple feature representation for off-line text-independent writer identification. Expert Syst Appl. 2019;123:357–76. 10.1016/j.eswa.2019.01.045Search in Google Scholar
[6] Tan GJ, Sulong G, Rahim MSM. Writer identification: a comparative study across three world major languages. Forensic Sci Int. 2017;279:41–52. 10.1016/j.forsciint.2017.07.034Search in Google Scholar PubMed
[7] Rehman A, Naz S, Razzak MI. Writer identification using machine learning approaches: a comprehensive review. Multimedia Tools Appl. 2019;78(8):10889–931. 10.1007/s11042-018-6577-1Search in Google Scholar
[8] Rasoulzadeh S, Baba Ali B. Writer identification and writer retrieval based on NetVLAD with re-ranking. IET Biometrics. 2022;11(1):10–22. 10.1049/bme2.12039Search in Google Scholar
[9] Chammas M, Makhoul A, Demerjian J, Dannaoui E. A deep learning based system for writer identification in handwritten Arabic historical manuscripts. Multimedia Tools Appl. 2022;81:30769–84. 10.1007/s11042-022-12673-xSearch in Google Scholar
[10] He S, Schomaker L. GR-RNN: Global-context residual recurrent neural networks for writer identification. Pattern Recognition 2021;117:107975. 10.1016/j.patcog.2021.107975Search in Google Scholar
[11] Semma A, Hannad Y, Siddiqi I, Djeddi C, El Kettani MEY. Writer identification using deep learning with FAST keypoints and Harris corner detector. Expert Syst Appl. 2021;184:115473. 10.1016/j.eswa.2021.115473Search in Google Scholar
[12] Abbas F, Gattal A, Djeddi C, Bensefia A, Jamil A, Saoudi K. Offline writer identification based on CLBP and VLBP. In: Mediterranean Conference on Pattern Recognition and Artificial Intelligence. Switzerland: Springer; 2020. p. 188–99. 10.1007/978-3-030-71804-6_14Search in Google Scholar
[13] Abbas F, Gattal A, Djeddi C, Siddiqi I, Bensefia A, Saoudi K. Texture feature column scheme for single-and multi-script writer identification. IET Biometrics. 2021;10(2):179–93. 10.1049/bme2.12010Search in Google Scholar
[14] He S, Schomaker L. Delta-n hinge: rotation-invariant features for writer identification. In: 2014 22nd International Conference on Pattern Recognition. USA: IEEE; 2014. p. 2023–8. 10.1109/ICPR.2014.353Search in Google Scholar
[15] He S, Schomaker L. Writer identification using curvature-free features. Pattern Recognition. 2017;63:451–64. 10.1016/j.patcog.2016.09.044Search in Google Scholar
[16] Gattal A, Djeddi C, Bensefia A, Ennaji A. Handwriting based gender classification using COLD and hinge features. In: International Conference on Image and Signal Processing. Switzerland: Springer; 2020. p. 233–42. 10.1007/978-3-030-51935-3_25Search in Google Scholar
[17] Hannad Y, Siddiqi I, Djeddi C, El-Kettani MEY. Improving Arabic writer identification using score-level fusion of textural descriptors. IET Biometrics. 2019;8(3):221–9. 10.1049/iet-bmt.2018.5009Search in Google Scholar
[18] Chawki D, Labiba SM. A texture based approach for Arabic writer identification and verification. In: 2010 International Conference on Machine and Web Intelligence. USA: IEEE; 2010. p. 115–20. 10.1109/ICMWI.2010.5648130Search in Google Scholar
[19] Bensefia A, Paquet T, Heutte L. A writer identification and verification system. Pattern Recognition Lett. 2005;26(13):2080–92. 10.1016/j.patrec.2005.03.024Search in Google Scholar
[20] Bulacu M, Schomaker L. Text-independent writer identification and verification using textural and allographic features. IEEE Trans Pattern Anal Machine Intell. 2007;29(4):701–17. 10.1109/TPAMI.2007.1009Search in Google Scholar PubMed
[21] Siddiqi I, Vincent N. Text independent writer recognition using redundant writing patterns with contour-based orientation and curvature features. Pattern Recognition. 2010;43(11):3853–65. 10.1016/j.patcog.2010.05.019Search in Google Scholar
[22] Bennour A, Djeddi C, Gattal A, Siddiqi I, Mekhaznia T. Handwriting based writer recognition using implicit shape codebook. Forensic Sci Int. 2019;301:91–100. 10.1016/j.forsciint.2019.05.014Search in Google Scholar PubMed
[23] Nguyen HT, Nguyen CT, Ino T, Indurkhya B, Nakagawa M. Text-independent writer identification using convolutional neural network. Pattern Recognition Lett. 2019;121:104–12. 10.1016/j.patrec.2018.07.022Search in Google Scholar
[24] He S, Schomaker L. FragNet: Writer identification using deep fragment networks. IEEE Trans Inform Forensics Security. 2020;15:3013–22. 10.1109/TIFS.2020.2981236Search in Google Scholar
[25] Gattal A, Djeddi C, Siddiqi I, Al-Maadeed S. Writer identification on historical documents using oriented basic image features. In: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR). USA: IEEE; 2018. p. 369–73. 10.1109/ICFHR-2018.2018.00071Search in Google Scholar
[26] He S, Samara P, Burgers J, Schomaker L. Image-based historical manuscript dating using contour and stroke fragments. Pattern Recognition. 2016;58:159–71. 10.1016/j.patcog.2016.03.032Search in Google Scholar
[27] Pirrone A, Aimar MB, Journet N. Papy-S-Net: a Siamese network to match papyrus fragments. In: Proceedings of the 5th International Workshop on Historical Document Imaging and Processing; 2019. p. 78–83. 10.1145/3352631.3352646Search in Google Scholar
[28] Slimane F, Awaida S, Mezghani A, Parvez MT, Kanoun S, Mahmoud SA, et al. ICFHR2014 competition on Arabic writer identification using AHTID/MW and KHATT databases. In: 2014 14th International Conference on Frontiers in Handwriting Recognition. IEEE; 2014. p. 797–802. 10.1109/ICFHR.2014.139Search in Google Scholar
[29] Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Ennaji A, El Abed H. ICFHR2016 Competition on multi-script writer demographics classification using QUWI database. In: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE; 2016. p. 602–6. 10.1109/ICFHR.2016.0115Search in Google Scholar
[30] Djeddi C, Al-Maadeed S, Siddiqi I, Abdeljalil G, He S, Akbari Y. ICFHR 2018 competition on multi-script writer identification. In: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE; 2018. p. 506–10. 10.1109/ICFHR-2018.2018.00094Search in Google Scholar
[31] Louloudis G, Gatos B, Stamatopoulos N, Papandreou A. Icdar 2013 competition on writer identification. In: 2013 12th International Conference on Document Analysis and Recognition. USA: IEEE; 2013. p. 1397–401. 10.1109/ICDAR.2013.282Search in Google Scholar
[32] Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Souici-Meslati L, ElAbed H. ICDAR 2015 competition on multi-script writer identification and gender classification using QUWI database. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR). USA: IEEE; 2015. p. 1191–5. 10.1109/ICDAR.2015.7333949Search in Google Scholar
[33] Malik MI, Ahmed S, Marcelli A, Pal U, Blumenstein M, Alewijns L, et al. ICDAR 2015 competition on signature verification and writer identification for on-and off-line skilled forgeries (SigWIcomp2015). In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR). USA: IEEE; 2015. p. 1186–90. 10.1109/ICDAR.2015.7333948Search in Google Scholar
[34] Fiel S, Kleber F, Diem M, Christlein V, Louloudis G, Nikos S, et al. ICDAR2017 Competition on Historical Document Writer Identification (Historical-WI). In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). vol. 1. USA: IEEE; 2017. p. 1377–82. 10.1109/ICDAR.2017.225Search in Google Scholar
[35] Christlein V, Nicolaou A, Seuret M, Stutzmann D, Maier A. ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents. In: 2019 International Conference on Document Analysis and Recognition (ICDAR). USA: IEEE; 2019. p. 1505–9. 10.1109/ICDAR.2019.00242Search in Google Scholar
[36] Gattal A, Djeddi C, Siddiqi I, Chibani Y. Gender classification from offline multi-script handwriting images using oriented Basic Image Features (oBIFs). Expert Syst Appl. 2018;99:155–67. 10.1016/j.eswa.2018.01.038Search in Google Scholar
[37] Marti UV, Bunke H. The IAM-database: an English sentence database for offline handwriting recognition. Int J Document Anal Recognition. 2002;5(1):39–46. 10.1007/s100320200071Search in Google Scholar
[38] Grosicki E, Carré M, Brodin JM, Geoffrois E. Results of the RIMES evaluation campaign for handwritten mail processing. In: 2009 10th International Conference on Document Analysis and Recognition. USA: IEEE; 2019. p. 941–5. 10.1109/ICDAR.2009.224Search in Google Scholar
[39] Kleber F, Fiel S, Diem M, Sablatnig R. Cvl-database: An off-line database for writer retrieval, writer identification and word spotting. In: 2013 12th International Conference on Document Analysis and Recognition. USA: IEEE; 2013. p. 560–4. 10.1109/ICDAR.2013.117Search in Google Scholar
[40] Mahmoud SA, Ahmad I, Al-Khatib WG, Alshayeb M, Parvez MT, Märgner V, et al. KHATT: An open Arabic offline handwritten text database. Pattern Recognition. 2014;47(3):1096–112. 10.1016/j.patcog.2013.08.009Search in Google Scholar
[41] AlMaadeed S, Ayouby W, Hassaine A, Aljaam JM. QUWI: an Arabic and English handwriting dataset for offline writer identification. In: 2012 International Conference on Frontiers in Handwriting Recognition. USA: IEEE; 2012. p. 746–51. 10.1109/ICFHR.2012.256Search in Google Scholar
[42] Sankar KP, Ambati V, Pratha L, Jawahar C. Digitizing a million books: Challenges for document analysis. In: International Workshop on Document Analysis Systems. Berlin, Heidelberg, Germany: Springer; 2006. p. 425–36. 10.1007/11669487_38Search in Google Scholar
[43] Klemme A. International Dunhuang Project: The Silk Road Online. Reference Reviews. 2014. Search in Google Scholar
[44] Jouili S, Coustaty M, Tabbone S, Ogier JM. NAVIDOMASS: structural-based approaches towards handling historical documents. In: 2010 20th International Conference on Pattern Recognition. USA: IEEE; 2010. p. 946–9. 10.1109/ICPR.2010.237Search in Google Scholar
[45] Hamid A, Bibi M, Siddiqi I, Moetesum M. Historical manuscript dating using textural measures. In: 2018 International Conference on Frontiers of Information Technology (FIT). USA: IEEE; 2018. p. 235–40. 10.1109/FIT.2018.00048Search in Google Scholar
[46] Schomaker L, Franke K, Bulacu M. Using codebooks of fragmented connected-component contours in forensic and historic writer identification. Pattern Recognition Lett. 2007;28(6):719–27. 10.1016/j.patrec.2006.08.005Search in Google Scholar
[47] Lai S, Zhu Y, Jin L. Encoding pathlet and SIFT features with bagged VLAD for historical writer identification. IEEE Trans Inform Forensics Security. 2020;15:3553–66. 10.1109/TIFS.2020.2991880Search in Google Scholar
[48] Chammas M, Makhoul A, Demerjian J. Writer identification for historical handwritten documents using a single feature extraction method. In: 19th International Conference on Machine Learning and Applications (ICMLA 2020). USA: IEEE; 2020. 10.1109/ICMLA51294.2020.00010Search in Google Scholar
[49] Christlein V, Gropp M, Fiel S, Maier A. Unsupervised feature learning for writer identification and writer retrieval. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). vol. 1. USA: IEEE; 2017. p. 991–7. 10.1109/ICDAR.2017.165Search in Google Scholar
[50] Jordan S, Seuret M, Král P, Lenc L, Martínek J, Wiermann B, et al. Re-ranking for writer identification and writer retrieval. In: International Workshop on Document Analysis Systems. Switzerland: Springer; 2020. p. 572–86. 10.1007/978-3-030-57058-3_40Search in Google Scholar
[51] Cilia ND, De Stefano C, Fontanella F, Marrocco C, Molinara M, DiFreca AS. A two-step system based on deep transfer learning for writer identification in medieval books. In: International Conference on Computer Analysis of Images and Patterns. Springer; 2019. p. 305–16. 10.1007/978-3-030-29891-3_27Search in Google Scholar
[52] Cilia N, De Stefano C, Fontanella F, Marrocco C, Molinara M, DiFreca AS. An end-to-end deep learning system for medieval writer identification. Pattern Recognition Lett. 2020;129:137–43. 10.1016/j.patrec.2019.11.025Search in Google Scholar
[53] Mohammed H, Märgner V, Stiehl HS. Writer identification for historical manuscripts: analysis and optimisation of a classifier as an easy-to-use tool for scholars from the humanities. In: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE; 2018. p. 534–9. 10.1109/ICFHR-2018.2018.00099Search in Google Scholar
[54] Mohammed H, Marthot-Santaniello I, Märgner V. GRK-Papyri: a dataset of Greek handwriting on Papyri for the task of writer identification. In: 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE; 2019. p. 726–31. 10.1109/ICDAR.2019.00121Search in Google Scholar
[55] Nasir S, Siddiqi I. Learning features for writer identification from handwriting on Papyri. In: 4th Mediterranean Conference on Pattern Recognition and Artificial Intelligence. Switzerland: Springer; 2020. 10.1007/978-3-030-71804-6_17Search in Google Scholar
[56] Nasir S, Siddiqi I, Moetesum M. Writer characterization from handwriting on Papyri using multi-step feature learning. In: International Conference on Document Analysis and Recognition. Switzerland: Springer; 2021. p. 451–65. 10.1007/978-3-030-86198-8_32Search in Google Scholar
[57] Griffin LD, Lillholm M. Symmetry sensitivities of derivative-of-gaussian filters. IEEE Trans Pattern Anal Machine Intell. 2009;32(6):1072–83. 10.1109/TPAMI.2009.91Search in Google Scholar PubMed
[58] Brink AA, Smit J, Bulacu M, Schomaker L. Writer identification using directional ink-trace width measurements. Pattern Recognition. 2012;45(1):162–71. 10.1016/j.patcog.2011.07.005Search in Google Scholar
[59] Hafner J, Sawhney HS, Equitz W, Flickner M, Niblack W. Efficient color histogram indexing for quadratic form distance functions. IEEE Trans Pattern Anal Machine Intell. 1995;17(7):729–36. 10.1109/34.391417Search in Google Scholar
[60] Mukundan R, Ramakrishnan K. Moment functions in image analysis: theory and applications. Singapore: World Scientific; 1998. 10.1142/3838Search in Google Scholar
[61] Djeddi C, Siddiqi I, Souici-Meslati L, Ennaji A. Multi-script writer identification optimized with retrieval mechanism. In: 2012 International Conference on Frontiers in Handwriting Recognition. USA: IEEE; 2012. p. 509–14. 10.1109/ICFHR.2012.239Search in Google Scholar
© 2023 the author(s), published by De Gruyter
This work is licensed under the Creative Commons Attribution 4.0 International License.