[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Next Article in Journal
Machine Learning for Modeling the Singular Multi-Pantograph Equations
Next Article in Special Issue
DSP-Assisted Nonlinear Impairments Tolerant 100 Gbps Optical Backhaul Network for Long-Haul Transmission
Previous Article in Journal
A Novel Hybrid Approach for Partial Discharge Signal Detection Based on Complete Ensemble Empirical Mode Decomposition with Adaptive Noise and Approximate Entropy
Previous Article in Special Issue
Partitioned Stator Switched Flux Machine: Novel Winding Configurations
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Reliable Auto-Robust Analysis of Blood Smear Images for Classification of Microcytic Hypochromic Anemia Using Gray Level Matrices and Gabor Feature Bank

1
Department of Computer Science and IT, University of Malakand, Chakdara 18801, Pakistan
2
College of Engineering, Electrical Engineering Department, Najran University, Najran 61441, Saudi Arabia
3
School of Computing and Communications, Lancaster University, Bailrigg, Lancaster LA1 4YW, UK
4
Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Najran University, Najran 61441, Saudi Arabia
5
Department of Medicinal Chemistry, Pharmacy School, Najran University, Najran 61441, Saudi Arabia
*
Author to whom correspondence should be addressed.
Entropy 2020, 22(9), 1040; https://doi.org/10.3390/e22091040
Submission received: 15 July 2020 / Revised: 15 September 2020 / Accepted: 15 September 2020 / Published: 17 September 2020
(This article belongs to the Special Issue Reliability of Modern Electro-Mechanical Systems)
Figure 1
<p>(<b>a</b>) A normal blood smear image and (<b>b</b>) microcytic hypochromic, (<b>c</b>) normocytic hypochromic, (<b>d</b>) macrocytic hypochromic, and (<b>e</b>) microcytic hyperchromic anemia.</p> ">
Figure 2
<p>A step-by-step proposed plan of work.</p> ">
Figure 3
<p>Output images after the preprocessing step: (<b>a</b>) original RGB image, (<b>b</b>) red channel, (<b>c</b>) green channel, (<b>d</b>) blue channel, (<b>e</b>) enhanced green channel, and (<b>f</b>) quantized image leaving behind RBCs.</p> ">
Figure 4
<p>(<b>a</b>) Binarized original image of a sample blood smear, (<b>b</b>) binarized image of a sample blood smear quantized, and (<b>c</b>) exclusive OR of (<b>a</b>,<b>b</b>).</p> ">
Figure 5
<p>(<b>a</b>) Orientation of angles overlayed on a sample red blood cell (RBC) and (<b>b</b>) scanning through 0<math display="inline"><semantics> <msup> <mrow/> <mo>∘</mo> </msup> </semantics></math>, 45<math display="inline"><semantics> <msup> <mrow/> <mo>∘</mo> </msup> </semantics></math>, 90<math display="inline"><semantics> <msup> <mrow/> <mo>∘</mo> </msup> </semantics></math>, and 135<math display="inline"><semantics> <msup> <mrow/> <mo>∘</mo> </msup> </semantics></math>.</p> ">
Figure 6
<p>(<b>a</b>) Original image of hyperchromic macrocytic RBC, (<b>b</b>) gray level image of hyperchromic macrocytic RBC, (<b>c</b>) Gabor filter bank of hyperchromic macrocytic RBC, (<b>d</b>) original image of hypochromic microcytic RBC, (<b>e</b>) gray level image of hypochromic microcytic RBC, (<b>f</b>) Gabor filter bank features of hypochromic microcytic RBC, (<b>g</b>) original image of hyperchromic microcytic RBC, (<b>h</b>) gray level image of hyperchromic microcytic RBC, and (<b>i</b>) a hyperchromic microcytic RBC with its Gabor filter bank features.</p> ">
Figure 7
<p>Feature reduction using Locality Sensitive Discriminant Analysis (LSDA).</p> ">
Figure 8
<p>(<b>a</b>,<b>d</b>,<b>g</b>) Original RGB images of a sample blood smear, (<b>b</b>,<b>e</b>,<b>h</b>) removal of white blood cells (WBCs) from the blood smear images, and (<b>c</b>,<b>f</b>,<b>i</b>) enhanced gray-scale images of RBCs in blood smears.</p> ">
Figure 9
<p>Classes of cells among RBCs (<b>a</b>,<b>b</b>) microcytic hypochromic cells, (<b>c</b>) microcytic narmochromic cells, (<b>d</b>) macrocytic hyperchromic cells (<b>e</b>,<b>f</b>) microcytic hyperchromic cells (<b>g</b>) macrocytic Hyperchromic cells, and (<b>h</b>,<b>j</b>) codocytes or taget cells.</p> ">
Figure 10
<p>Graphs showing the results of four types of texture features, (<b>a</b>) GLRLM (<b>b</b>) GLCM (<b>c</b>) GMSE (<b>d</b>) GMA.</p> ">
Versions Notes

Abstract

:
Accurate blood smear quantification with various blood cell samples is of great clinical importance. The conventional manual process of blood smear quantification is quite time consuming and is prone to errors. Therefore, this paper presents automatic detection of the most frequently occurring condition in human blood—microcytic hyperchromic anemia—which is the cause of various life-threatening diseases. This task has been done with segmentation of blood contents, i.e., Red Blood Cells (RBCs), White Blood Cells (WBCs), and platelets, in the first step. Then, the most influential features like geometric shape descriptors, Gray Level Co-occurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), and Gabor features (mean squared energy and mean amplitude) are extracted from each of the RBCs. To discriminate the cells as hypochromic microcytes among other RBC classes, scanning is done at angles (0 , 45 , 90 , and 135 ). To achieve high-level accuracy, Adaptive Synthetic (AdaSyn) sampling for imbalance learning is used to balance the datasets and locality sensitive discriminant analysis (LSDA) technique is used for feature reduction. Finally, upon using these features, classification of blood cells is done using the multilayer perceptual model and random forest learning algorithms. Performance in terms of accuracy was 96%, which is better than the performance of existing techniques. The final outcome of this work may be useful in the efforts to produce a cost-effective screening scheme that could make inexpensive screening for blood smear analysis available globally, thus providing early detection of these diseases.

1. Introduction

Anemia is an abnormal condition in human blood in which the amount of red blood cells (and therefore their oxygen-carrying capacity) is inadequate to fulfill the physiologic needs of the body. The World Health Organization (WHO) estimates that 42% of children less than 5 years of age and 40% of pregnant women worldwide are anaemic [1,2]. According to the translated prevalence by [3], 496 million are nonpregnant women, 32 million are pregnant women, and 273 million are children with anemia in 2011. When a peripheral blood smear slide is viewed under a microscope, it is found to be composed of three main cell types, i.e., Red Blood Cells (RBCs), White Blood Cells (WBCs), and platelets. Normal RBCs are round and reddish in color, having a central pallor area in the center indicating hemoglobin level; see Figure 1. The diameter of normal RBCs ranges from about 6.2 to 8.2 micrometer ( μ m ), and its thickness ranges from about 2 to 2.5 micrometers ( μ m ) [4]. magenta The amount of hemoglobin indicates whether magenta a human is healthy or has a potential disease that affects RBC production or function. This indication is through several parameters of whole RBC (size, shape, and number) and the amount of hemoglobin it contains. The size of RBCs indicates three main defects in human blood: narmocytosis, microcytosis, and macrocytosis. Narmocytosis is an anemia of normocytes, where the size of RBCs is normal. Microcytosis occurs when the size of the RBC is less than that of the normal reference range and is expressed by a blood index called Mean Corpuscular Volume (MCV) of less than 80 femtoliters, whereas macrocytosis is when the size of the RBC is greater than the normal reference range and is expressed by an MCV greater than the normal range [5].
Patients of anemia may be categorized into mild, moderate, and severe according to WHO classification, i.e., mild anemia (hemoglobin 9.0–10.9 g/dL) moderate anemia (hemoglobin 7.0–8.9 g/dL), and severe anemia (hemoglobin <7.0 g/dL) [1]. The amount of hemoglobin on the other hand specifies two other abnormalities: hypochromic, if the mean concentration of hemoglobin in a single cell is lower than the normal reference range, i.e., 27 to 32 picogram ( p g ) (approximately) and is stated by Mean Corpuscular Hemoglobin Concentration (MCHC), and hyperchromic, if the range is above the said reference range [6].
The above discussed terms are parameters used in distinguishing the primary types of anemic conditions. When both cell size and hemoglobin defects are combined, e.g., if microcytic is taken from the first category and combined with hypochromic from the second category, then it will define the features of microcytic hypochromic anemia, whereas if macrocytic and hyperchromic are combined, it will give macrocytic hyperchromic anemia, etc. [7,8]. The diseases which are most frequently diagnosed in South Asian countries especially in Pakistan are iron deficiency anemia, sideroblastic anemia, and thalassemia, which are categorized and defined under microcytic hypochromic anemia [9]. These parameters may be obtained from peripheral blood smear slides prepared by hematologists in clinical labs of hospitals proceeded by grabbing digital images, which is an area of interest.
This work is organized as follows: related work is explained in Section 2, the preparation and visualization of the blood smear is explained in Section 3. To better understand the nature of improvement, we carried out a thorough study that builds up a methodology in which blood smear after following the necessary procedures is processed with image processing techniques for segmentation. The primary steps of image processing are image acquisition, preprocessing (conversion into color spaces, splitting channels, and enhancement techniques), binarization, segmentation, etc. [10]. Then, the classification step explained in Section 4 is conducted with Adaptive Synthetic (AdaSyn) sampling to balance the datasets and with Locality Sensitive Discriminant Analysis (LSDA) for the reduction of unnecessary features of all the contents visible in a blood smear. The rest of paper presents the results (Section 5) and finally the conclusion and discussion (Section 6).

2. Related Work

Various state-of-the-art image processing-based techniques for computer-aided disease detection are present excessively, some of which are the most relevant to our work presented here. Moallem et al. [11] used a three-step algorithm for segmentation of overlapped cells in blood smear images by extracting a binary mask in the first step, then an adaptive mean shift algorithm was used to centrally localize a cell, and finally the gradient vector flow algorithm was used to draw boundaries for separation of cells. In the given article, there is no shape analysis done, which fails to classify the significant classes of RBCs. Tomari et al. [12] used some geometric features for the classification of RBCs through Artificial Neural Network (ANN). Alomari et al. [13] worked on the automatic quantification of WBCs and RBCs by using an iterative structured circle detection algorithm, but in the given technique, it was found that it is restricted to circular cells. The technique suggested by Aggarwal et al. [14] is intensity-based Otsu thresholding where segmentation of RBCs infected with a parasite was done, but the segmentation by intensity may suffer extremely due to luminance variations and other photographic conditions. Tek et al. [15] suggested a technique for recognition and classification of malarial RBCs’ parasites and species in peripheral blood smears. Shape, color, and local granulometry features were extracted from the area of interest, and k Nearest Neighbor (kNN) classifier was applied to classify them from extracted features. Chen et al. [16] evaluated blood smear slides having hemolytic anemia by determining the chain codes for finding the edges of cells, separated the cells with the help of concavity measurement, and classified cells with a bank of classifiers. A work done by Xu et al. [17] for the detection of sickle cells is segmentation of red cells in the first step, then separation of overlapped cells using random walk algorithm in the next step, and classification through Deep Convolutional Network in the last step. Sharma et al. [18] used a median filter for image smoothing and watershed for overlapped cells separation. Then, they implemented a three-feature vector, circularity matric, aspect ratio, and radial signature, trained with KNN classifier for the recognition of three types of RBCs called sickle cells, elliptocytes, and dacrocytes. A work for the detection and classification of parasites in RBCs was done by Ahirwar et al. [19]. They generated geometric, color attributes and gray level texture feature sets and used artificial neural network for their classification. A fuzzy logic technique was used by Bhagavathi et al. [20] for segmentation of RBCs and WBCs. Morphological operations and hough transform method for circle detection were used by Chandrasiri et al. [21] for the detection and analysis of red blood cells, but the proposed system was unable to determine and analyze the extensive number of clumped or overlapped regions. The above presented approaches can do better in a situation where the population of clumped cells is low. In highly populated clumped cells, accuracy suffers. In our proposed approach, the issue is fixed to much extent by leveraging the concavities and texture-based features in each cell as explained in the following Materials and Methods section.

3. Materials and Methods

Our proposed plan of work for the detection of hypochromic microcytic cells in sample blood smear images consisted of the following series of steps shown in Figure 2.

3.1. Blood Smear Slide Preparation

A consistent blood distribution and proper lucidity are required for reliable blood smear analysis. It can be done by starting with a drop of sample blood at one end of glass slide, which is smeared quickly and gently with a wedge technique to form a thin edge, where all cells are able to be analyzed separately, especially RBCs [22]. This whole process was done by an expert laboratory technician in the local hospital.

3.2. Image Acquisition

After the staining process was completed, the image acquisition step started by using a 400× field of a microscope with oil immersion, keeping the horizontal and vertical resolution at 180 dpi and the image dimension at 2592 × 1944 pixels. The images were captured and labelled properly.

3.3. Preprocessing

A green channel of RGB blood smear was selected, enhanced, and smoothed using Balance Contrast Equalization Technique (BCET) and median filter. After getting a fine gray-scale image shown in Figure 3, it was quantized with a scale factor F g , calculated from mean intensity value of g ( x , y ) calculated in Equations (1) and (2), where Q ( x , y ) is the resultant quantized image.
F g = 1 N 1 N g ( x , y )
Q ( x , y ) = F g r o u n d ( g ( x , y ) F g )

3.4. Segmentation

A global or automatic thresholding technique was used in this work for the segmentation of only RBCs (leaving behind WBCs and platelets). A two-step binarization technique was followed for getting noise-free binary images. In the first step, the quantized image was binarized containing white blood cells; in the second step, the whole original image was binarized. Finally, we used the XOR gate (Exclusive OR) operation to remove white blood cells. The output along with other processed images are shown in Figure 4.

3.5. Feature Extraction

A three-way feature extraction plan was implemented. The first was geometric morphological feature extraction, the second was intensity features and texture features using (GLCM and GLRLM), and the third was texture features using the Gabor Filter Bank.

3.5.1. Geometric Morphology of Microcytic Hypochromic RBCs

Keeping in view the visual size and shape of hypochromic microcytes, the useful features were calculated as follows:
  • Area: Area is an important geometrical feature for the detection of microcytes, being small in size compared to other blood cells.
  • Circularity: A size-invariant shape descriptor given in Equation (3) which describes a shape to be circular, if the value is closer to 1 and noncircular if the value is closer to 0, where A is Area and P is parameter of a cell.
    C i r c u l a r i t y = 4 π A P 2
  • Rectangularity: It determines the degree of elongation with respect to a rectangle. Equation (4) shows its calculation, where A s is area of a shape and A r is the area of minimum bounding rectangle.
    R e c t a n g u l a r i t y = A s A r
  • Concavity: This property is used to determine how much an object is concave; we applied it on the shapes for identification of the amount of central pallor area occupied in an RBC, given by Equation (5)
    C o n c a v i t y = A c A H
  • Convexity: A cell convexity can be determined by Equation (6), which identifies a shape through its boundary convexity.
    C o n v e x i t y = P C H P C

3.5.2. RBC Texture Feature Calculation

Features other than geometry include texture information on RBCs. In this research work, we extracted RBG intensity-based features (mean and variances), Gray Level Co-occurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), and Gabor feature bank. This set of features associated with the dispersal of chromatin matter in the RBCs is helpful in the classification of hypochromic, hyperchromic, and normochromic cells given in Figure 1 above. The measure will indicate that either the RBC has a deep red color compared to the others (cells having more haemoglobin) or are less red, with more central pallor area and having a smaller haemoglobin ratio. The texture features calculated are presented below:
  • RGB mean and variance of hypochromic microcytic RBCs: The mean values μ r , μ g , and μ b of pixels of each RBC in R, G, and B, respectively, were calculated in Equation (7).
    μ r = 1 N 1 N r ( x , y ) μ g = 1 N 1 N g ( x , y ) μ b = 1 N 1 N b ( x , y )
    The variances ( σ r ) 2 , ( σ g ) 2 , and ( σ b ) 2 in the channels R, G, and B, respectively, are calculated in Equation (8)).
    ( σ r ) 2 = ( X μ r ) 2 N ( σ g ) 2 = ( X μ g ) 2 N ( σ b ) 2 = ( X μ b ) 2 N
  • GLCM features of Hypochromic Microcytic RBCs: GLCM is the distribution of cooccurring pixel values defined over an N × N image P at a specific offset, or every P’s element determines the occurrences of a pixel with value of gray level, i, lifted by a certain distance to a pixel with value j. Our next six textural features are GLCM features. The mean of 6 GLCM features were determined for offset values conforming to 0 , 45 , 90 , and 135 consuming 8 gray levels (see Figure 5).
    Maximum Probability: It measures the strongest response of the cooccurrence matrix. The range of values is [0, 1] as given in Equation (9), where P i j is the pixels of gray image.
    P = m a x i j ( P i j )
    Correlation: The degree of correlation of a pixel to its neighbor is determined by the correlation factor of the cooccurrence matrix, ranging from 1 to −1 given by Equation (10). This measure cannot be defined if any of the standard deviation σ is 0 for the two existing correlations, perfect positive and perfect negative correlation.
    c o r r e l a t i o n = i , j = 1 K ( i m r ) ( j m c ) p ( i j ) σ r σ c
    Pixels intensity contrast: It is a measure of intensity contrast between a pixel and its neighbor over the entire image (calculated in Equation (11)).
    c o n t r a s t = i , j = 1 k ( i j ) 2 p i j
    Energy: It is the measurement of uniformity in the intensities of an image (as given in Equation (12)). Its value is 1, if an image is constant and 0 if the intensities are variable.
    E n e r g y = i , j = 1 k ( P ( i , j ) ) 2
    Homogeneity: It measures the spatial closeness of the distribution of elements in the cooccurrence matrix to the diagonal given by (13). The values range is [0, 1], and the maximum value is attained when the matrix is a diagonal.
    h o m o g e n e i t y = i , j = 1 k P i j 1 + | i j |
    Entropy: It measure the degree of variability of the elements of the cooccurrence matrix. Its value is 0 if all intensities of P i j are 0 and is maximum when all P i j are equal. It may be calculated by (14).
    e n t r o p y = i , j = 1 k P i j l o g 2 P i j
  • Run length matrix features of each RBC: The other textural features are created on the gray-level run length matrix (calculated in Equations (15)–(24). The l K matrix p, where l is the number of gray levels and k is the maximum run length, is defined for a certain image as the total runs with pixels of gray level i and run length j. Likewise, as in the GLCM, the run length matrices were calculated using 8 gray-levels for 30 , 60 , 90 , and 135 .
    Short Run Emphasis (SRE):
    S R E = 1 R i = 1 k j = 1 k P i j j 2
    Long Run Emphasis (LRE):
    L R E = 1 R i = 1 k j = 1 k ( p i j ) j 2
    Gray-Level Nonuniformity (GLN):
    G L N = 1 R i = 1 k ( j = 1 l ( p i j ) ) 2
    Run Length Nonuniformity (RLN):
    R L N = 1 R j = 1 l ( i = 1 k p i j ) 2
    Low Gray-level Run Emphasis (LGRE):
    L G R E = 1 R i = 1 k j = 1 l p i j i 2
    High Gray-level Run Emphasis (HGRE):
    H G R E = 1 R i = 1 k j = 1 l ( p i j ) i 2
    Short Run Low Gray-level Emphasis (SRLGE):
    S R L G E = 1 R i = 1 k j = 1 l p i j i 2 j 2
    Short Run High Gray-level Emphasis (SRHGE):
    S R H G E = 1 R i = 1 k j = 1 l p ( i j ) i 2 j 2
    Long Run Low Gray-level Emphasis (LRLGE):
    L R L G E = 1 R i = 1 k j = 1 l p ( i j ) j 2 i 2
    Long Run High Gray-level Emphasis (LRHGE):
    L R H G L E = 1 R i = 1 k j = 1 l ( p i j ) i 2 j 2
  • Gabor Feature Extraction: like a human visual processing system, the Gabor filter extracts features at different amplitudes and orientation.
    ψ ( x , y ) = f 2 π γ η e ( f 2 γ 2 x 2 ) + ( f 2 η 2 y 2 ) e j 2 π f x x = x cos θ + y sin θ y = x sin θ + y cos θ
    The Gabor filter is the product of a 2D Fourier basis function and origin-centred Gaussian given in Equation (25), where f is the central frequency of the filter, γ and η are the sharpness or bandwidth measure along the minor and major axes of Gaussian respectively, θ is the angle of rotation, and ( η / γ ) is the aspect ratio. The analytical form of this function in frequency domain is given in Equation (26) as follow:
    ψ ( u , v ) = e π 2 f 2 ( γ 2 ( u f ) 2 + η 2 v 2 ) u = u cos θ + v sin θ v = u sin θ + v cos θ
    In the frequency domain given by Equation (27), the function is a single real-valued Gaussian centered at f. A simplified version of a general 2D Gabor filter function in Equations (25) and (26) was formulated by [23], which implements a set of self-similar filters, i.e., Gabor wavelets (rotated and scaled forms of each other, irrespective of the frequency f and orientation θ .
    Gabor bank or Gabor features were created from responses of Gabor filters in Equations (25) and (26) by using multiple filters on several frequencies f m and orientations θ n . Frequency in this case corresponds to scale information and is thus drawn from [23]
    f m = k m f m a x , m = { 0 , , M 1 }
    where fm is the mth frequency, f θ = f m a x is the highest frequency desired, and k > 1 is the frequency scaling factor. The filter orientations are drawn from [24]. Gabor features were calculated at 4 wavelengths (3, 6, 9, and 12) and 3 orientations θ (30 , 60 , and 90 ); see Figure 6a–c. Then, each filter was convolved with the real image, and the response image of the same image was produced; here, each image gave us a feature vector. Each feature vector consisted of mean amplitude and mean squared energy. Finally, two matrices were obtained, that were of [1 × 12] each. The matrices were appended to each other, and a [1 × 24] matrix was produced for one image having a [n × 24] vector for n images for supplementary training purpose in the preceding step of classification (as shown in Figure 6).

3.6. ADASYN Sampling

A significant aspect in classification and learning is to show a reasonable dataset to guarantee that no inclination is presented by an imbalanced information distribution. A method that has been used in previous works is Adaptive Synthetic Sampling (ADASYN) to enhance the classification accuracy by balancing the datasets, thus decreasing bias factors [25]. Table 1 shows that the original dataset is partially imbalance; therefore, we applied ADASYN to overcome this problem and to balance the dataset. After applying ADASYN sampling, the database then consisted of 354 microcytic, 327 normocytic, 312 macrocytic, 340 hypochromic, and 380 normochromic images of blood smears.

3.7. Features Reduction

To maintain variation among interclass data samples, it is necessary to reduce the dimensionality of an original dataset. We used Gray Level Cooccurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), and Gabor filter bank, which collectively produced 52 features for a single cell image in a blood smear image. Therefore, to reduce the dimensionality, a Locality Sensitive Discriminant Analysis (LSDA) [26] approach was applied separately to the features extracted from each cell (shown in Figure 7). We use LSDA because it is significant in the case where there are no sufficient training samples. LSDA uses local structures, and it is generally more important than global structure for discriminant analysis. LSDA determines a projection using the local manifold structure, which results in the maximization of the margin among data points from different classes at every local area. Various experiments on the existing datasets showed an improvement over the Linear Discriminant Analysis (LDA).

4. Classification

In different situations, varied instances show a tendency to a specific classification tool. Therefore, iterative experiments have been performed for the selection and determination of an optimal tool. The tools determined during the process were random forest and multilayer perceptual modal. These are ideal classification tools in this situation. For classification, the training data set was prepared with the extracted features mentioned above. The images were labelled with ground truth from the existing dataset images, containing Iron Deficiency Anaemia (IDA) images, mostly. The instances of dataset comprised of geometric morphological feature, GLCM, GLRL, and Gabor texture features are given in the features below. We used the classical machine learning methods as the overfitting and underfitting anomalies of pretrained deep learning algorithms suffer in accuracy of results due to less sufficient data in datasets. The performance of our classification technique is given in the Results Section 5.

5. Results

The results of the proposed work are as follows.

5.1. Dataset

The images we used in our work were collected from a local hospital. At the selected hospital, a high frequency of microcytic hypochromic patients were observed. The available datasets were also searched for such images, and a set of 150 images (comprising of about 80 hypochromic microcytes per image) were created based on ground truth, labeled by expert hematologists.

5.2. Qualitative Results

Hypochromic microcytes are the cells we are interested in for segmentation, the cell with a more pallor area and smaller than normal mean sizes of RBCs (Figure 8). These types of cells have less pixel area with more hole area proportionality. A hole containing cells was assumed to be a pallor cell and was less chromatic than the cell having no hole. The chromaticity factor is inversely proportional to the proportionality of central pallor: the bigger the central pallor, the less chromatic the cell will be and the cell will have a big hole in it after carrying out binarization operations, and hence, the concavity of the cell will also be more. The qualitative results of the GLCM, GLRL, Gabor mean amplitude, and Gabor mean square energy are given in Table 2, Table 3, Table 4 and Table 5. The qualitative results are also demonstrated in Figure 8 and Figure 9. The sample cells for which the features are extracted and demonstrated in Table 2, Table 3, Table 4 and Table 5 are shown in Figure 10.

5.3. Quantitative Results

This work has two main purposes: one is to segment the red blood cells accurately and then to detect the microcytic hypochromic cell. To evaluate the detection algorithm, the precision, recall, F1-score and accuracy metrices were calculated (using Equations (28)–(31)), where TP, TN, FP, FN are true positive, true negative, false positive, and false negative, respectively. To evaluate the detection algorithm, the precision, accuracy, recall, and F1-score matrices were calculated. The performance-wise results of different algorithms are given in Table 6.
P r e c i s i o n = F P T P + F P
R e c a l l = T P T P + F N
F 1 M e a s u r e = 2 p r e c i s i o n R e c a l l p r e c i s i o n + R e c a l l
A c c u r a c y = T P + T N T P + T N + F P + F N

6. Conclusions

The goal of this study is to develop and improve a robust algorithm for the analysis of blood smear images for the classification of microcytic hypochromic anaemia. Many studies are present in the literature that focus on the classification of blood cells, but there are very rare studies found on the classification of normal and abnormal blood slides as a specific anemic disease. Moreover, the existing state-of-the-art techniques are very expensive and their operation is very difficult. Our proposed system has a reasonable accuracy rate and processing time. The proposed system is capable of detecting various chromatic status of blood and accurately estimates the boundary pixels of RBCs at diverse photographic conditions. The results showed that our algorithm is a better automatic segmentation methods for blood smear images. The geometric and three potential texture features of RBCs, i.e., GLCM, GLRL, and Gabor texture features at 4 different degrees of scan (3, 6, 9, and 12) and 3 orientations θ (30 , 60 , and 90 ) were extracted. After feature extraction and the feature reduction technique, the features were then put in two powerful machine learning algorithms (random forest and multilayer perceptron) using ensemble learning technique. The overall classification accuracy was 96%. Which was compared to the existing techniques and were found better for classification. The present work may be extended to a 3D volume estimation of blood cells, which is necessary in finding the accurate blood indices like Mean Corpuscular Volume (MCV), Mean Corpuscular Haemoglobin (MCH), and Mean Corpuscular Haemoglobin Concentration (MCHC). The proposed system may facilitate pathologists by getting quicker results with high True Positive (TP) and True Negative (TN) rates in the initial stage of diagnosis. It is user-friendly and easily operable with less expenses in terms of cost. Furthermore, the system may be made available on the web without any association of special equipment and user knowledge. Therefore, it can be considered an added value to the existing automated analysis blood smear tools.

Author Contributions

The research conceptualization and methodology were done by B.A., S.U.R., and M.I. The technical and theoretical framework was prepared by M.A., O.M.A., and A.S. The technical review and improvement were performed by M.H.N. and M.A. The overall technical support, guidance, and project administration were done by M.H.M. and O.M.A. Finally, responses to the queries of the reviewers were done by B.A. and M.A. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge the Deanship of Scientific Research, Najran University, Kingdom of Saudi Arabia for their support.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. World Health Organization. Haemoglobin Concentrations for the Diagnosis of Anaemia and Assessment of Severity; World Health Organization: Geneva, Switzerland, 2011. [Google Scholar]
  2. Farooqi, N.; Naqvi, M.; Jaffery, S.; Anemia, H.F. Anemia; moderate to severe during pregnancy. Prof. Med. J. 2014, 21, 247–252. [Google Scholar]
  3. Stevens, G.A.; Finucane, M.M.; De-Regil, L.M.; Paciorek, C.J.; Flaxman, S.R.; Branca, F.; Peña-Rosas, J.P.; Bhutta, Z.A.; Ezzati, M. Global, regional, and national trends in haemoglobin concentration and prevalence of total and severe anaemia in children and pregnant and non-pregnant women for 1995–2011: A systematic analysis of population-representative data. Lancet Glob. Health 2013, 1, e16–e25. [Google Scholar] [CrossRef] [Green Version]
  4. Jandl, J.H. Blood: Textbook of hematology. Blood 1996, 4, 3. [Google Scholar]
  5. Brecher, G.; Jakobek, E.F.; Schneiderman, M.A.; Williams, G.Z.; Schmidt, P.J. Size distribution of erythrocytes. Ann. N. Y. Acad. Sci. 1962, 99, 242–261. [Google Scholar] [CrossRef] [PubMed]
  6. Vreugdenhil, G.; Baltus, C.; Eijk, H.V.; Swakk, A. Anaemia of chronic disease: Diagnostic significance of erythrocyte and serological parameters in iron deficient rhuematiod arthritis pateints. Rheumatology 1990, 29, 105–110. [Google Scholar] [CrossRef] [PubMed]
  7. Aulakh, R.; Sohi, I.; Singh, T.; Kakkar, N. Red cell distribution width (rdw) in the diagnosis of iron deficiency with microcytic hypochromic anemia. Indian J. Pediatr. 2009, 76, 265–268. [Google Scholar] [CrossRef] [PubMed]
  8. Bachman, A.L. Macrocytic hyperchromic anemia in early infancy: Report of a case and review of the literature. Am. J. Dis. Child. 1936, 52, 633–647. [Google Scholar] [CrossRef]
  9. Habib, M.A.; Black, K.; Soofi, S.B.; Hussain, I.; Bhatti, Z.; Bhutta, Z.A.; Raynes-Greenow, C. Prevalence and predictors of iron deficiency anemia in children under five years of age in pakistan, a secondary analysis of national nutrition survey data 2011–2012. PLoS ONE 2016, 11, e0155051. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Gonzalez, R.C.; Woods, R.E.; Eddins, S.L. Digital Image Processing Using MATLAB; Pearson-Prentice-Hall: Upper Saddle River, NJ, USA, 2004; Volume 624. [Google Scholar]
  11. Moallem, G.; Sari-Sarraf, H.; Poostchi, M.; Maude, R.J.; Silamut, K.; Hossain, M.A.; Antani, S.; Jaeger, S.; Thoma, G. Detecting and segmenting overlapping red blood cells in microscopic images of thin blood smears. In Proceedings of the Medical Imaging 2018: Digital Pathology, Houston, TX, USA, 10–15 February 2018. [Google Scholar]
  12. Tomari, R.; Zakaria, W.N.W.; Jamil, M.M.A.; Nor, F.M.; Fuad, N.F.N. Computer aided system for red blood cell classification in blood smear image. Procedia Comput. Sci. 2014, 42, 206–213. [Google Scholar] [CrossRef] [Green Version]
  13. Alomari, Y.M.; Abdullah, S.; Huda, S.N.; Azma, R.Z.; Omar, K. Automatic detection and quantification of wbcs and rbcs using iterative structured circle detection algorithm. Comput. Math. Methods Med. 2014, 2014, 979302. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Aggarwal, P.; Khatter, A.; Vyas, G. An intensity threshold based image segmentation of malaria infected cells. In Proceedings of the 2018 Second International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, 15–16 February 2018; pp. 549–553. [Google Scholar]
  15. Tek, F.B.; Dempster, A.G.; Kale, Z. Parasite detection and identification for automated thin blood film malaria diagnosis. Comput. Vis. Image Underst. 2010, 114, 21–32. [Google Scholar] [CrossRef]
  16. Chen, H.-M.; Tsao, Y.-T.; Tsai, S.-N. Automatic image segmentation and classification based on direction texton technique for hemolytic anemia in thin blood smears. Mach. Vis. Appl. 2014, 25, 501–510. [Google Scholar] [CrossRef]
  17. Xu, M.; Papageorgiou, D.P.; Abidi, S.Z.; Dao, M.; Zhao, H.; Karniadakis, G.E. A deep convolutional neural network for classification of red blood cells in sickle cell anemia. PLoS Comput. Biol. 2017, 13, e1005746. [Google Scholar] [CrossRef] [PubMed]
  18. Sharma, V.; Rathore, A.; Vyas, G. Detection of sickle cell anaemia and thalassaemia causing abnormalities in thin smear of human blood sample using image processing. In Proceedings of the 2016 International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, 26–27 August 2016; pp. 1–5. [Google Scholar]
  19. Ahirwar, N.; Pattnaik, S.; Acharya, B. Advanced image analysis based system for automatic detection and classification of malarial parasite in blood images. Int. J. Inf. Technol. Knowl. Manag. 2012, 5, 59–64. [Google Scholar]
  20. Bhagavathi, S.; Niba, S.T. An automatic system for detecting and counting rbc and wbc using fuzzy logic. ARPN J. Eng. Appl. Sci. 2016, 11, 6891–6894. [Google Scholar]
  21. Chandrasiri, S.; Samarasinghe, P. Morphology based automatic disease analysis through evaluation of red blood cells. In Proceedings of the 2014 5th International Conference on Intelligent Systems, Modelling and Simulation, Langkawi, Malaysia, 27–29 January 2014; pp. 318–323. [Google Scholar]
  22. Adewoyin, A. Peripheral blood film-a review. Ann. Ib. Postgrad. Med. 2014, 12, 71–79. [Google Scholar] [PubMed]
  23. Daugman, J.G. Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. JOSA A 1985, 2, 1160–1169. [Google Scholar] [CrossRef] [PubMed]
  24. Ilonen, J.; Kamarainen, J.-K.; Paalanen, P.; Hamouz, M.; Kittler, J.; Kalviainen, H. Image feature localization by multiple hypothesis testing of gabor features. IEEE Trans. Image Process. 2008, 17, 311–325. [Google Scholar] [CrossRef] [PubMed]
  25. Yung, M.; Brown, E.T.; Rasin, A.; Furst, J.D.; Raicu, D.S. Synthetic sampling for multi-class malignancy prediction. arXiv 2018, arXiv:1807.02608. [Google Scholar]
  26. Gao, Y.-L.; Luo, S.-Z.; Wang, Z.-H.; Chen, C.-C.; Pan, J.-Y. Locality sensitive discriminative unsupervised dimensionality reduction. Symmetry 2019, 11, 1036. [Google Scholar] [CrossRef] [Green Version]
Figure 1. (a) A normal blood smear image and (b) microcytic hypochromic, (c) normocytic hypochromic, (d) macrocytic hypochromic, and (e) microcytic hyperchromic anemia.
Figure 1. (a) A normal blood smear image and (b) microcytic hypochromic, (c) normocytic hypochromic, (d) macrocytic hypochromic, and (e) microcytic hyperchromic anemia.
Entropy 22 01040 g001
Figure 2. A step-by-step proposed plan of work.
Figure 2. A step-by-step proposed plan of work.
Entropy 22 01040 g002
Figure 3. Output images after the preprocessing step: (a) original RGB image, (b) red channel, (c) green channel, (d) blue channel, (e) enhanced green channel, and (f) quantized image leaving behind RBCs.
Figure 3. Output images after the preprocessing step: (a) original RGB image, (b) red channel, (c) green channel, (d) blue channel, (e) enhanced green channel, and (f) quantized image leaving behind RBCs.
Entropy 22 01040 g003
Figure 4. (a) Binarized original image of a sample blood smear, (b) binarized image of a sample blood smear quantized, and (c) exclusive OR of (a,b).
Figure 4. (a) Binarized original image of a sample blood smear, (b) binarized image of a sample blood smear quantized, and (c) exclusive OR of (a,b).
Entropy 22 01040 g004
Figure 5. (a) Orientation of angles overlayed on a sample red blood cell (RBC) and (b) scanning through 0 , 45 , 90 , and 135 .
Figure 5. (a) Orientation of angles overlayed on a sample red blood cell (RBC) and (b) scanning through 0 , 45 , 90 , and 135 .
Entropy 22 01040 g005
Figure 6. (a) Original image of hyperchromic macrocytic RBC, (b) gray level image of hyperchromic macrocytic RBC, (c) Gabor filter bank of hyperchromic macrocytic RBC, (d) original image of hypochromic microcytic RBC, (e) gray level image of hypochromic microcytic RBC, (f) Gabor filter bank features of hypochromic microcytic RBC, (g) original image of hyperchromic microcytic RBC, (h) gray level image of hyperchromic microcytic RBC, and (i) a hyperchromic microcytic RBC with its Gabor filter bank features.
Figure 6. (a) Original image of hyperchromic macrocytic RBC, (b) gray level image of hyperchromic macrocytic RBC, (c) Gabor filter bank of hyperchromic macrocytic RBC, (d) original image of hypochromic microcytic RBC, (e) gray level image of hypochromic microcytic RBC, (f) Gabor filter bank features of hypochromic microcytic RBC, (g) original image of hyperchromic microcytic RBC, (h) gray level image of hyperchromic microcytic RBC, and (i) a hyperchromic microcytic RBC with its Gabor filter bank features.
Entropy 22 01040 g006
Figure 7. Feature reduction using Locality Sensitive Discriminant Analysis (LSDA).
Figure 7. Feature reduction using Locality Sensitive Discriminant Analysis (LSDA).
Entropy 22 01040 g007
Figure 8. (a,d,g) Original RGB images of a sample blood smear, (b,e,h) removal of white blood cells (WBCs) from the blood smear images, and (c,f,i) enhanced gray-scale images of RBCs in blood smears.
Figure 8. (a,d,g) Original RGB images of a sample blood smear, (b,e,h) removal of white blood cells (WBCs) from the blood smear images, and (c,f,i) enhanced gray-scale images of RBCs in blood smears.
Entropy 22 01040 g008
Figure 9. Classes of cells among RBCs (a,b) microcytic hypochromic cells, (c) microcytic narmochromic cells, (d) macrocytic hyperchromic cells (e,f) microcytic hyperchromic cells (g) macrocytic Hyperchromic cells, and (h,j) codocytes or taget cells.
Figure 9. Classes of cells among RBCs (a,b) microcytic hypochromic cells, (c) microcytic narmochromic cells, (d) macrocytic hyperchromic cells (e,f) microcytic hyperchromic cells (g) macrocytic Hyperchromic cells, and (h,j) codocytes or taget cells.
Entropy 22 01040 g009
Figure 10. Graphs showing the results of four types of texture features, (a) GLRLM (b) GLCM (c) GMSE (d) GMA.
Figure 10. Graphs showing the results of four types of texture features, (a) GLRLM (b) GLCM (c) GMSE (d) GMA.
Entropy 22 01040 g010
Table 1. Balanced datasets after applying Adaptive Synthetic Sampling (ADASYN).
Table 1. Balanced datasets after applying Adaptive Synthetic Sampling (ADASYN).
Cell TypeOriginalSyntheticTotal
Microcyte157197354
Normocytes27057327
Macrocytes101211312
Hypochromic157150340
Narmochromic3800380
Table 2. Gray Level Cooccurrence Matrix (GLCM) features extracted from each cell in a sample image.
Table 2. Gray Level Cooccurrence Matrix (GLCM) features extracted from each cell in a sample image.
Cell No.ContrastCorrelationEnergyHomogeneity
Cell185.40406.62123.52105.8
Cell280.02758−112.132136.3636108.52
Cell3116.9394.5910093.68
Cell496.1894.716117.85100.0
Cell5121.1−79.498092.77
Cell6106.2−148.78098.03
Cell786.6−201.2120104.07
Cell888.1288.12120104
Cell9106.197.4710098
Cell10100100100100
Table 3. Gray Level Run Length (GLRL) features.
Table 3. Gray Level Run Length (GLRL) features.
Cell No.SRELREGLNRLNRPLGREHGRESRLGESRHGELRLGELRHGE
Cell10.5644.24116.35113.278.742.0950.110.2212.4624.79209.57
Cell20.740.25126.78108.018.692.2639.370.2914.1225.91127.59
Cell30.4344.88135.02139.618.472.0251.450.1810.5924.89268.0
Cell40.565.18137.38115.3512.932.6248.20.237.8345.05134.07
Cell588.24106.6294.77106.14102.6893.53110.2980.2386.0796.51118.61
Cell6115.4100.8594.5198.77103.2692.7399.82120.7773.0288.5120.18
Cell70.3961.57132.04111.7311.222.643.640.197.942.7130.44
Cell81.2589.76103.12103.3629.927.0181.330.5919.370.39130.52
Cell90.3347.64309.2216.511.12.4428.860.156.0232.81254.73
Cell100.3745.41355.65206.2910.462.5628.170.185.9931.69325.68
Table 4. Gabor’s Mean Squared Energy (MSE).
Table 4. Gabor’s Mean Squared Energy (MSE).
Cell No.MSE1MSE2MSE3MSE4MSE5MSE6MSE7MSE8MSE9MSE10MSE11MSE12
Cell1376263165.7155.2414.9301.7216.3204.5424.4313.3226.3197
Cell2258350274.6316.8436.3380.9264.9306.7439.7382.9273.3299
Cell3269403419.8554.9328.8457.5386.6493.4314.5440.3377.1462
Cell4234205207.9153.3177.8157.8186.5174.7173.5153200.3172
Cell56296121.3164.972.1110.4157.8173.673.3109.7155.8167
Cell6457091.5113.540.870.593.3104.441.572.4102105
Cell7120115149.813679.1140201.2149.770.6102.6147.2156
Cell850101138.612051.8101149.714654.2113.4163.6131
Cell9701682737.9862.766959269278470653.5685.5820
Cell107306957008506705907017707151373.91396.11397
Table 5. Gabor’s Mean Amplitude (MA).
Table 5. Gabor’s Mean Amplitude (MA).
Cell No.MA1MA2MA3MA4MA5MA6MA7MA8MA9MA10MA11MA12
Cell1251120261321261321
Cell2151223151221151222
Cell3141123141121141021
Cell4251624251624151624
Cell5141325151526151526
Cell6141121141221141221
Cell7141322151623141323
Cell8141422151625151624
Cell9141218141217141117
Cell10141216141216141216
Table 6. Performance evaluation of each classification algorithm with statistical measurement tools.
Table 6. Performance evaluation of each classification algorithm with statistical measurement tools.
StatisticalK-means Clustering (%)Logistic Regression (%)Naive Bayes (%)Proposed Classifier (%)
Precision81.186.487.192.3
Accuracy80.786.288.393.2
Recall80.383.284.395.4
F1-Score79.882.184.194.1

Share and Cite

MDPI and ACS Style

Azam, B.; Ur Rahman, S.; Irfan, M.; Awais, M.; Alshehri, O.M.; Saif, A.; Nahari, M.H.; Mahnashi, M.H. A Reliable Auto-Robust Analysis of Blood Smear Images for Classification of Microcytic Hypochromic Anemia Using Gray Level Matrices and Gabor Feature Bank. Entropy 2020, 22, 1040. https://doi.org/10.3390/e22091040

AMA Style

Azam B, Ur Rahman S, Irfan M, Awais M, Alshehri OM, Saif A, Nahari MH, Mahnashi MH. A Reliable Auto-Robust Analysis of Blood Smear Images for Classification of Microcytic Hypochromic Anemia Using Gray Level Matrices and Gabor Feature Bank. Entropy. 2020; 22(9):1040. https://doi.org/10.3390/e22091040

Chicago/Turabian Style

Azam, Bakht, Sami Ur Rahman, Muhammad Irfan, Muhammad Awais, Osama Mohammed Alshehri, Ahmed Saif, Mohammed Hassan Nahari, and Mater H. Mahnashi. 2020. "A Reliable Auto-Robust Analysis of Blood Smear Images for Classification of Microcytic Hypochromic Anemia Using Gray Level Matrices and Gabor Feature Bank" Entropy 22, no. 9: 1040. https://doi.org/10.3390/e22091040

APA Style

Azam, B., Ur Rahman, S., Irfan, M., Awais, M., Alshehri, O. M., Saif, A., Nahari, M. H., & Mahnashi, M. H. (2020). A Reliable Auto-Robust Analysis of Blood Smear Images for Classification of Microcytic Hypochromic Anemia Using Gray Level Matrices and Gabor Feature Bank. Entropy, 22(9), 1040. https://doi.org/10.3390/e22091040

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop