AI Dermatochroma Analytica (AIDA): Smart Technology for Robust Skin Color Classification and Segmentation
<p>Image transformation to the LAB color space.</p> "> Figure 2
<p>Systematic evaluation and visualization of cluster configurations.</p> "> Figure 3
<p>Illustrative scheme of the color matching methodology.</p> "> Figure 4
<p>Color alignment visualization between segmented skin colors and Fitzpatrick Skin Type palette.</p> "> Figure 5
<p>Sample of top-foot skin and FST scale palette imagery used in the comparative study of clustering models.</p> "> Figure 6
<p>Comparative visualization of clustering model performances in AIDA system.</p> "> Figure 7
<p>Bar chart for the evaluation of K-means AIDA vs. CNN performances in skin color classification.</p> "> Figure 8
<p>Confusion matrices for the comparative analysis of (<b>a</b>) AIDA and (<b>b</b>) CNN predicted outcomes vs. ground-truth FST classes.</p> "> Figure 9
<p>Skin regions using AIDA algorithm with two, three, and four cluster segments, matched to FST categories.</p> "> Figure A1
<p>Confusion matrix comparison of three ground-truth methods that could be used for testing the unsupervised convolutional network model from 48 images. Red coloration signifies a mismatch, while blue is a match for FST class. (<b>A</b>) Visual classification by a clinician versus visual classification by researcher, both using an FST sticker within image. (<b>B</b>) Commercial device FST classification using individual topological angle (ITA) measurements versus visual classification by researcher using an FST sticker within image. (<b>C</b>) Commercial device FST classification using individual topological angle (ITA) measurements versus visual classification by clinician using an FST sticker within image.</p> "> Figure A2
<p>Architecture of convolutional neural network (CNN) model used in the comparative analysis.</p> ">
Abstract
:1. Introduction
Background and Related Work
2. Materials and Methods
2.1. Overview of AIDA System
- Start: Initiate the AIDA system process.
- Import libraries: In the development of the AIDA system, a crucial step involved the importation of various libraries essential for machine learning, image processing, and data visualization. The specific libraries imported and their primary uses in the context of this project are outlined in Appendix A (Table A1).
- Load and preprocess data: The initial phase of loading and preprocessing skin color and FST palette data [23] was essential for the success of subsequent machine learning tasks. This process involved importing the image data and converting these into a more analytically suitable format. The images were transformed from their original color space (R-G-B) to the LAB color space (Figure 1), which is particularly beneficial for skin color analysis due to its ability to provide a nuanced representation of color variations.
- Configure, train, and evaluate clustering model: A methodical approach was adopted for configuring, training, and evaluating the clustering model for image segmentation in the LAB color space. Initially, the parameters of the clustering model, including the number of clusters and the initialization method, were accurately configured. Subsequently, the clustering algorithm was applied to the prepared data. This involved resizing the LAB color space images, reshaping them for the clustering process, and iteratively applying the clustering algorithm until the clusters were optimally formed. The resulting labels and cluster centers were then calculated to provide a detailed segmentation of the image (Figure 2). The quality of clustering was rigorously evaluated using established metrics such as the silhouette score, Calinski–Harabasz Index, and Davies–Bouldin Index. These metrics provided quantitative assessments of the clustering quality, evaluating aspects such as cluster cohesion, separation, and compactness.
- Match cluster centers with FST palette: An essential phase involved the alignment of cluster centers from segmented skin images with the cluster centers of the FST color palette. This key process aimed to determine the closest correspondences between the identified cluster centers of skin colors and those of the FST palette (Figure 3). The first step involved quantifying the perceptual differences between each color in the skin palette (represented by cluster centers) and the colors in the FST palette. This was achieved by calculating the color distance using a standard metric in colorimetry (CIE76 Delta-E color distance), which effectively measures the differences between two colors, i.e., cluster centers. Subsequently, each cluster center from the skin palette was matched with the nearest cluster center in the FST palette based on the calculated color distances. This matching process was fundamental in identifying the most similar FST color for each identified skin color.
- Visualize results: The visualization of results, specifically the alignment of cluster centers with the FST palette, was executed with a specific approach. This process entailed creating visual representations that illustrated the relationship between the segmented skin colors and the FST color palette. The visualization (Figure 4) was designed to display each color from the skin palette alongside its closest match in the FST palette. To enhance the interpretability of these results, the visualizations included the paired colors and annotations indicating the percentage of each skin color within the image and the distance metrics, which quantified the similarity between the skin and FST colors.
- Validation: FST ground-truth classification was determined by corelating ITA measurements from a colorimetry-based tool (Delfin Skin ColorCatch) to the FST skin color scale [24]. This tool was utilized for the validation of the clustering results against real-world skin color measurements.
- End: Conclude the process with validated and calibrated clustering results ready for practical application or further analysis.
2.2. Unsupervised Clustering Models
3. Results
3.1. Data Collection and Preparation
3.2. Performance Metrics
3.3. Comparative Analysis of Clustering Models
- K-means-type models: K-means demonstrated superior performance, with a higher silhouette score (0.47) and C-H Index (465,790) compared to K-means mini-batch and K-means-PCA, indicating better cluster quality and separation. However, K-means mini-batch had a shorter training time (0.27 s), suggesting greater computational efficiency, albeit at the cost of clustering quality. K-means-PCA, an extension of K-means with dimensionality reduction, showed a moderate silhouette score and C-H Index (0.28 and 15,307, respectively), suggesting decent clustering but not as effective as standard K-means. The incorporation of PCA appeared to slightly increase the training time (0.7 s) compared to basic K-means.
- DBSCAN-type models: DBSCAN and HDBSCAN, both density-based models, exhibited lower scores across all performance metrics compared to K-means. Their lower silhouette scores (0.16) indicate less distinct clustering, which might be due to the complex nature of skin color data not conforming well to density-based clustering. OPTICS-DBSCAN performed poorly in comparison to other models, with the lowest silhouette score and the highest D-B Index (0.08 and 4.25, respectively), indicating poor clustering quality and separation. Its significantly longer training time (26.3 s) also makes it less desirable for real-time application.
- Agglomerative hierarchical clustering (AHC): AHC showed moderate performance (silhouette score of 0.28) but required significantly more time for training (16.91 s), making it less suitable for scenarios where time is crucial.
- Gaussian mixture models (GMMs): GMMs presented a balance between cluster quality (with a silhouette score of 0.23) and training time (0.31 s) but did not excel in any metric.
- Fuzzy C-means: Fuzzy C-means, allowing for overlapping clusters, showed reasonable performance (with a silhouette score of 0.3), suggesting its potential applicability in situations where skin colors do not distinctly belong to separate categories.
- Affinity propagation and mean shift: Both of these models demonstrated moderate to high silhouette scores (with a silhouette score of 0.37 for affinity propagation and 0.29 for mean shift) but were not as effective as K-means in overall clustering performance.
- Spectral clustering: Spectral clustering was found to be the least suitable for this application, evidenced by its negative silhouette score (−0.45) and the longest training time (35.27 s), indicating poor clustering effectiveness and computational inefficiency.
3.4. Comparison with Supervised Learning Model
Evaluation of K-Means AIDA Versus CNN Performance in Skin Color Classification
3.5. Performance Analysis of AIDA Versus CNN with Tolerance
3.6. Spatial Mapping of Skin Regions Using AIDA
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
Task | Tool Description |
---|---|
Image processing (OpenCV) | OpenCV (Open-Source Computer Vision Library) [48] was integral for image processing and computer vision tasks. It facilitated crucial operations such as reading, resizing, and transforming images, as well as converting them between color spaces. |
Data processing (NumPy/Pandas) | The incorporation of the NumPy and Pandas libraries [49] was critical in managing and processing data for machine learning applications. NumPy, renowned for its capabilities in numerical computing, was primarily utilized for its efficient array (vector, matrix) operations. |
Machine learning (Scikit-Learn) | The integration of Scikit-Learn and Scikit-Image libraries played a crucial role in both machine learning and image processing aspects. Scikit-Learn [50], a prominent machine learning library, was utilized to implement the clustering algorithm. |
Plotting and visualization (Seaborn/Matplotlib) | Seaborn and Matplotlib libraries were utilized to facilitate advanced data visualization. Seaborn [51] offered a high-level interface for creating aesthetically pleasing and informative statistical graphics. |
Appendix A.1. Details About Performance Metrics
- Silhouette score is calculated using the following formula [35]:
- Calinski–Harabasz (C-H) Index is defined by the following formula [38]:
- Davies–Bouldin (D-B) Index is determined using [37]:
- Training time: This metric is measured as the elapsed time required for the system to train the model. It is recorded in seconds. The computational efficiency of AIDA was evaluated through the measurement of the time elapsed, ensuring its suitability for real-time applications in clinical environments.
Appendix B
Appendix B.1. Data Preprocessing and Augmentation for CNN
- Data preprocessing techniques: Two key image preprocessing steps were employed to enhance skin color analysis. First, a function cropped out white borders by converting images to grayscale, thresholding for white regions, and then cropping to the largest contour’s bounding box. Second, white frames were removed using advanced morphological operations and contour detection, ensuring only relevant skin color information was retained. These steps were pivotal in focusing on essential skin areas and eliminating irrelevant content.
- Data augmentation techniques: To increase the dataset’s variability and simulate different real-world conditions, several augmentation techniques were applied. These included rotating images at random angles and random horizontal and vertical shifts for variability in color positioning. Additionally, random zooming simulated varying camera-subject distances, while horizontal and vertical flipping diversified the dataset by mirroring skin presentations. A batch augmentation process was developed to systematically apply these techniques, generating multiple augmented versions of each image. This expanded the dataset significantly, with the augmented images saved for use in model training and validation.
- Data oversampling technique: To tackle the issue of uneven FST distribution in the dataset, we used the oversampling technique to ensure a fair representation for each FST. This expanded the dataset to include 1000 subject images for every FST. This approach addressed the class imbalance in the training data. The combination of these preprocessing and augmentation techniques resulted in a richly varied and high-quality dataset, crucial for the effective training of the CNN model.
Appendix B.2. Architecture and Training of CNN Model
- Model architecture and hyperparameter tuning (Figure A2):
- Training and evaluation:
- Performance evaluation:
Fitzpatrick Scale (FST) | Index of Subject Samples Utilized for CNN Training/Testing | Index of Subject Samples Utilized for Comparison Analysis of CNN vs. AIDA |
---|---|---|
FST—1 | 17, 29 | 32, 35 |
FST—2 | 4, 5, 6, 34 | 36, 38, 39 |
FST—3 | 7, 8, 9, 10, 15 | 21, 22, 26, 30, 33, 37 |
FST—4 | 1, 2, 3, 12, 16, 18 | 19, 23, 27, 28, 41 |
FST—5 | 13, 14, 25, 31, 40 | 42, 43, 44, 45, 48 |
FST—6 | 11, 24 | 46, 47 |
References
- Adelekun, A.; Onyekaba, G.; Lipoff, J.B. Skin Color in Dermatology Textbooks: An Updated Evaluation and Analysis. J. Am. Acad. Dermatol. 2021, 84, 194–196. [Google Scholar] [CrossRef] [PubMed]
- Eilers, S.; Bach, D.Q.; Gaber, R.; Blatt, H.; Guevara, Y.; Nitsche, K.; Kundu, R.V.; Robinson, J.K. Accuracy of Self-Report in Assessing Fitzpatrick Skin Phototypes I Through VI. JAMA Dermatol. 2013, 149, 1289–1294. [Google Scholar] [CrossRef] [PubMed]
- Kinyanjui, N.M.; Odonga, T.; Cintas, C.; Codella, N.C.F.; Panda, R.; Sattigeri, P.; Varshney, K.R. Fairness of Classifiers Across Skin Tones in Dermatology. In International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer International Publishing: Cham, Switzerland, 2020; pp. 320–329. [Google Scholar] [CrossRef]
- Marguier, J.; Bhatti, N.; Baker, H.; Harville, M.; Süsstrunk, S. Assessing Human Skin Color from Uncalibrated Images. Int. J. Imaging Syst. Technol. 2007, 17, 143–151. [Google Scholar] [CrossRef]
- Bratchenko, I.A.; Bratchenko, L.A.; Khristoforova, Y.A.; Moryatov, A.A.; Kozlov, S.V.; Zakharov, V.P. Classification of Skin Cancer Using Convolutional Neural Networks Analysis of Raman Spectra. Comput. Methods Programs Biomed. 2022, 219, 106755. [Google Scholar] [CrossRef] [PubMed]
- Qian, S.; Ren, K.; Zhang, W.; Ning, H. Skin Lesion Classification Using CNNs with Grouping of Multi-Scale Attention and Class-Specific Loss Weighting. Comput. Methods Programs Biomed. 2022, 226, 107166. [Google Scholar] [CrossRef]
- Leizaola, D.; Sobhan, M.; Kaile, K.; Mondal, A.M.; Godavarty, A. Deep Learning Algorithms to Classify Fitzpatrick Skin Types for Smartphone-Based NIRS Imaging Device. In Next-Generation Spectroscopic Technologies XV; SPIE: Bellingham, WA, USA, 2023; Volume 12516, pp. 12–17. [Google Scholar] [CrossRef]
- Kaile, K.; Sobhan, M.; Mondal, A.; Godavarty, A. Machine Learning Algorithms to Classify Fitzpatrick Skin Types during Tissue Oxygenation Mapping. In Optical Tomography and Spectroscopy; Optica Publishing Group: Washington, DC, USA, 2022; p. JM3A.4. [Google Scholar] [CrossRef]
- Sobhan, M.; Leizaola, D.; Godavarty, A.; Mondal, A.M. Subject Skin Tone Classification with Implications in Wound Imaging Using Deep Learning. In Proceedings of the 2022 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 14–16 December 2022; pp. 1640–1645. [Google Scholar] [CrossRef]
- Marchetti, M.A.; Liopyris, K.; Dusza, S.W.; Codella, N.C.F.; Gutman, D.A.; Helba, B.; Kalloo, A.; Halpern, A.C.; Soyer, H.P.; Curiel-Lewandrowski, C.; et al. Computer Algorithms Show Potential for Improving Dermatologists’ Accuracy to Diagnose Cutaneous Melanoma: Results of the International Skin Imaging Collaboration 2017. J. Am. Acad. Dermatol. 2020, 82, 622–627. [Google Scholar] [CrossRef]
- Rezk, E.; Eltorki, M.; El-Dakhakhni, W. Leveraging Artificial Intelligence to Improve the Diversity of Dermatological Skin Color Pathology: Protocol for an Algorithm Development and Validation Study. JMIR Res. Protoc. 2022, 11, e34896. [Google Scholar] [CrossRef]
- Rezk, E.; Eltorki, M.; El-Dakhakhni, W. Improving Skin Color Diversity in Cancer Detection: Deep Learning Approach. JMIR Dermatol. 2022, 5, e39143. [Google Scholar] [CrossRef]
- Fitzpatrick, T.B. The Validity and Practicality of Sun-Reactive Skin Types I Through VI. Arch. Dermatol. 1988, 124, 869–871. [Google Scholar] [CrossRef]
- Alam, M. Fitzpatrick’s Dermatology in General Medicine,6th Ed. Arch. Dermatol. 2004, 140, 372. [Google Scholar] [CrossRef]
- Gupta, V.; Sharma, V.K. Skin Typing: Fitzpatrick Grading and Others. Clin. Dermatol. 2019, 37, 430–436. [Google Scholar] [CrossRef] [PubMed]
- Roberts, W.E. Skin Type Classification Systems Old and New. Dermatol. Clin. 2009, 27, 529–533. [Google Scholar] [CrossRef] [PubMed]
- Sachdeva, S. Fitzpatrick Skin Typing: Applications in Dermatology. Indian J. Dermatol. Venereol. Leprol. 2009, 75, 93. [Google Scholar] [CrossRef] [PubMed]
- Zarit, B.D.; Super, B.J.; Quek, F.K.H. Comparison of Five Color Models in Skin Pixel Classification. In Proceedings of the Conjunction with ICCV’99 (Cat. No. PR00378), Corfu, Greece, 26–27 September 1999; pp. 58–63. [Google Scholar] [CrossRef]
- Kakumanu, P.; Makrogiannis, S.; Bourbakis, N. A Survey of Skin-Color Modeling and Detection Methods. Pattern Recognit. 2007, 40, 1106–1122. [Google Scholar] [CrossRef]
- Ly, B.C.K.; Dyer, E.B.; Feig, J.L.; Chien, A.L.; Del Bino, S. Research Techniques Made Simple: Cutaneous Colorimetry: A Reliable Technique for Objective Skin Color Measurement. J. Investig. Dermatol. 2020, 140, 3–12.e1. [Google Scholar] [CrossRef]
- Odeh, S.M.; Baareh, A.K.M. A Comparison of Classification Methods as Diagnostic System: A Case Study on Skin Lesions. Comput. Methods Programs Biomed. 2016, 137, 311–319. [Google Scholar] [CrossRef]
- Chan, S.; Reddy, V.; Myers, B.; Thibodeaux, Q.; Brownstone, N.; Liao, W. Machine Learning in Dermatology: Current Applications, Opportunities, and Limitations. Dermatol. Ther. 2020, 10, 365–386. [Google Scholar] [CrossRef]
- Ho, B.K.; Robinson, J.K. Color Bar Tool for Skin Type Self-Identification: A Cross-Sectional Study. J. Am. Acad. Dermatol. 2015, 73, 312–313.e1. [Google Scholar] [CrossRef]
- Del Bino, S.; Bernerd, F. Variations in Skin Colour and the Biological Consequences of Ultraviolet Radiation Exposure. Br. J. Dermatol. 2013, 169 (Suppl. S3), 33–40. [Google Scholar] [CrossRef]
- Jain, A.K. Data Clustering: 50 Years beyond K-Means. Pattern Recognit. Lett. 2010, 31, 651–666. [Google Scholar] [CrossRef]
- Sinaga, K.P.; Yang, M.S. Unsupervised K-Means Clustering Algorithm. IEEE Access 2020, 8, 80716–80727. [Google Scholar] [CrossRef]
- Kriegel, H.P.; Kröger, P.; Sander, J.; Zimek, A. Density-Based Clustering. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2011, 1, 231–240. [Google Scholar] [CrossRef]
- Murtagh, F.; Contreras, P. Algorithms for Hierarchical Clustering: An Overview. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2012, 2, 86–97. [Google Scholar] [CrossRef]
- Bouveyron, C.; Brunet-Saumard, C. Model-Based Clustering of High-Dimensional Data: A Review. Comput. Stat. Data Anal. 2014, 71, 52–78. [Google Scholar] [CrossRef]
- Nayak, J.; Naik, B.; Behera, H.S. Fuzzy C-Means (FCM) Clustering Algorithm: A Decade Review from 2000 to 2014. Smart Innov. Syst. Technol. 2015, 32, 133–149. [Google Scholar] [CrossRef]
- Bodenhofer, U.; Kothmeier, A.; Hochreiter, S. APCluster: An R Package for Affinity Propagation Clustering. Bioinformatics 2011, 27, 2463–2464. [Google Scholar] [CrossRef]
- Cheng, Y. Mean Shift, Mode Seeking, and Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 1995, 17, 790–799. [Google Scholar] [CrossRef]
- Von Luxburg, U. A Tutorial on Spectral Clustering. Stat. Comput. 2007, 17, 395–416. [Google Scholar] [CrossRef]
- Kaile, K.; Fernandez, C.; Godavarty, A. Development of a smartphone-based optical device to measure hemoglobin concentration changes in remote monitoring of wounds. Biosensors 2021, 11, 165. [Google Scholar] [CrossRef]
- Rousseeuw, P.J. Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef]
- Wang, X.; Xu, Y. An Improved Index for Clustering Validation Based on Silhouette Index and Calinski-Harabasz Index. IOP Conf. Ser. Mater. Sci. Eng. 2019, 569, 052024. [Google Scholar] [CrossRef]
- Tomašev, N.; Radovanović, M. Clustering Evaluation in High-Dimensional Data. In Unsupervised Learning Algorithms; Springer International Publishing: Cham, Switzerland, 2016; pp. 71–107. [Google Scholar] [CrossRef]
- Konovalenko, I.A.; Smagina, A.A.; Nikolaev, D.P.; Nikolaev, P.P. Prolab: A Perceptually Uniform Projective Color Coordinate System. IEEE Access 2021, 9, 133023–133042. [Google Scholar] [CrossRef]
- De Craen, S.; Commandeur, J.J.F.; Frank, L.E.; Heiser, W.J. Effects of Group Size and Lack of Sphericity on the Recovery of Clusters in K-Means Cluster Analysis. Multivariate Behav. Res. 2006, 41, 127–145. [Google Scholar] [CrossRef]
- DeGol, J.; Bretl, T.; Hoiem, D. ChromaTag: A Colored Marker and Fast Detection Algorithm. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 1472–1481. [Google Scholar]
- Zhao, W.L.; Deng, C.H.; Ngo, C.W. K-Means: A Revisit. Neurocomputing 2018, 291, 195–206. [Google Scholar] [CrossRef]
- Ding, C.; He, X. K-Means Clustering via Principal Component Analysis. In Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada, 4–8 July 2004; Association for Computing Machinery: New York, NY, USA, 2004; pp. 225–232. [Google Scholar] [CrossRef]
- Tremblay, N.; Gilles Puy, I.; Gribonval, R.; Vandergheynst, P. Compressive Spectral Clustering. In Proceedings of the 33rd International Conference on Machine Learning, PMLR 48; PMLR: New York, NY, USA, 2016; pp. 1002–1011. [Google Scholar]
- Jia, H.; Ding, S.; Xu, X.; Nie, R. The Latest Research Progress on Spectral Clustering. Neural Comput. Appl. 2014, 24, 1477–1486. [Google Scholar] [CrossRef]
- Chuang, K.S.; Tzeng, H.L.; Chen, S.; Wu, J.; Chen, T.J. Fuzzy C-Means Clustering with Spatial Information for Image Segmentation. Comput. Med. Imaging Graph. 2006, 30, 9–15. [Google Scholar] [CrossRef]
- Goon, P.; Banfield, C.; Bello, O.; Levell, N.J. Skin Cancers in Skin Types IV–VI: Does the Fitzpatrick Scale Give a False Sense of Security? Skin Health Dis. 2021, 1, e40. [Google Scholar] [CrossRef] [PubMed]
- Wilkes, M.; Wright, C.Y.; Du Plessis, J.L.; Reeder, A. Fitzpatrick Skin Type, Individual Typology Angle, and Melanin Index in an African Population: Steps Toward Universally Applicable Skin Photosensitivity Assessments. JAMA Dermatol. 2015, 151, 902–903. [Google Scholar] [CrossRef]
- Bradski, G.; Kaehler, A. Learning OpenCV: Computer Vision with the OpenCV Library; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2008; ISBN 978-0-596-51613-0. [Google Scholar]
- McKinney, W. Python for Data Analysis Data Wrangling with Pandas, NumPy, and IPython; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2017; ISBN 9781491957660. [Google Scholar]
- Pedregosa, F.; Michel, V.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Vanderplas, J.; Cournapeau, D.; Pedregosa, F.; Varoquaux, G.; et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Waskom, M.L. Seaborn: Statistical Data Visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
Clustering Model | Description | Refs. |
---|---|---|
K-means models | K-means and its variants divide data into clusters by iteratively minimizing the sum of squared distances between data points and their assigned cluster centroids. Variants like K-means-PCA reduce dimensionality, while K-means mini-batch optimizes for efficiency. | [25,26] |
Density-based models | Models like DBSCAN, HDBSCAN, and OPTICS identify clusters by grouping data points with sufficient density, effectively detecting arbitrarily shaped clusters and outliers in sparse data. | [27] |
Hierarchical methods | Agglomerative hierarchical clustering (AHC) builds a tree-like structure by iteratively merging or splitting clusters based on their similarity, enabling the exploration of data at multiple granularity levels. | [28] |
Probabilistic techniques | Gaussian mixture models (GMM) use a probabilistic approach to model data as a mixture of multiple Gaussian distributions, assigning probabilities for data point membership in overlapping clusters. | [29] |
Fuzzy logic approaches | Fuzzy C-means assigns data points to multiple clusters with varying degrees of membership, reflecting the inherent ambiguity in boundaries between certain skin color categories. | [30] |
Other clustering methods | Models such as affinity propagation identify exemplars for clusters by passing messages between data points, mean shift locates cluster centers by maximizing density, and spectral clustering partitions data using eigenvalues of a similarity matrix. | [31,32,33] |
Aspect | Details |
---|---|
Study location and approval | Conducted at Florida International University (FIU) under IRB-13-0092, focusing on capturing white light data using the smartphone oxygenation tool (SPOT) device [7,34] for skin color classification [23]. |
Subjects | A total of 48 control subjects across FST I to VI were recruited. Subjects were seated or supine with feet exposed for imaging. |
Imaging setup | A reference sticker with six FST colors was placed within the imaging field of view. A black curtain provided a consistent background, isolating the foot. |
Imaging process | Images were captured at 7 foot locations under three lighting conditions. The top-foot location under a controlled lighting condition (4100 K) was used for the proof-of-concept. |
Ground-truth comparisons | Skin color classifications by a researcher, a clinician, and a commercial colorimetry-based tool (Delfin Skin ColorCatch) were compared for consistency and variability. Researcher and clinician classifications were completed by visual comparison on the FST scale and showed significant subjectivity, with variability between researcher and clinician classifications. The commercial device provided predominantly consistent results, unaffected by external lighting, and was chosen as the ground truth (Appendix A Figure A1). |
Metric | Definition | Dermatological Relevance |
---|---|---|
Silhouette score [35] | Measures the degree of similarity of an object within its own cluster compared to others. Values range from −1 to +1, with higher values indicating better cohesion and separation. | Ensures each skin tone cluster aligns distinctly with an FST category, aiding in the evaluation of cohesion and separation. |
Calinski–Harabasz (C-H) Index [36] | Known as the variance ratio criterion, it measures dispersion between and within clusters. Higher scores indicate more distinct clustering. | Quantifies the distinctiveness of skin tone clusters, ensuring well-defined boundaries between FST categories. |
Davies–Bouldin (D-B) Index [37] | A ratio of within-cluster to between-cluster distances, indicating compactness and separation. Lower values suggest better clustering. | Helps assess partitioning effectiveness, reducing overlap between FST categories for precise classifications. |
Training Time | Elapsed time required for the system to train the model, recorded in seconds to evaluate computational efficiency. | Evaluates computational efficiency, ensuring suitability for real-time clinical applications. |
Details of the performance metrics utilized in this study are outlined in Appendix A. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hamrani, A.; Leizaola, D.; Reddy Vedere, N.K.; Kirsner, R.S.; Kaile, K.; Trinidad, A.L.; Godavarty, A. AI Dermatochroma Analytica (AIDA): Smart Technology for Robust Skin Color Classification and Segmentation. Cosmetics 2024, 11, 218. https://doi.org/10.3390/cosmetics11060218
Hamrani A, Leizaola D, Reddy Vedere NK, Kirsner RS, Kaile K, Trinidad AL, Godavarty A. AI Dermatochroma Analytica (AIDA): Smart Technology for Robust Skin Color Classification and Segmentation. Cosmetics. 2024; 11(6):218. https://doi.org/10.3390/cosmetics11060218
Chicago/Turabian StyleHamrani, Abderrachid, Daniela Leizaola, Nikhil Kumar Reddy Vedere, Robert S. Kirsner, Kacie Kaile, Alexander Lee Trinidad, and Anuradha Godavarty. 2024. "AI Dermatochroma Analytica (AIDA): Smart Technology for Robust Skin Color Classification and Segmentation" Cosmetics 11, no. 6: 218. https://doi.org/10.3390/cosmetics11060218
APA StyleHamrani, A., Leizaola, D., Reddy Vedere, N. K., Kirsner, R. S., Kaile, K., Trinidad, A. L., & Godavarty, A. (2024). AI Dermatochroma Analytica (AIDA): Smart Technology for Robust Skin Color Classification and Segmentation. Cosmetics, 11(6), 218. https://doi.org/10.3390/cosmetics11060218