Abstract
Skin disease is one of the global burdens of disease, and affects around 30% to 70% individuals worldwide. Effective automatic diagnosis is indispensable for doctors and patients. Compared with dermoscopic imaging, using clinical images captured by a portable electronic device (e.g. a mobile phone) is more available and low-cost. However, the existing large clinical skin-disease image datasets do not have the spatial annotation information, thus posing challenges for localizing the skin-disease regions and learning detailed features. To address the problem, we propose the Interactive Attention Sampling Network (IASN) which automatically localizes the target skin-disease regions and highlight the regions into high resolution. Specifically, the top-K local peaks of the class activation maps are collected, which indicate the key clues of skin-disease images. Then the features of the local peaks are interacted with the features of the surrounding context. With the guidance of the interactive attention maps, the non-uniform sampled images are generated, which facilitate the model to learn more discriminative features. Experimental results demonstrate that the proposed IASN outperforms the state-of-the-art methods on the SD-198 benchmark dataset.
This work was supported by the Guangdong Basic and Applied Basic Research Foundation No. 2021A1515011867, the Taishan Young Scholars Program of Shandong Province, the Key Development Program for Basic Research of Shandong Province No. ZR2020ZD44, and the National Natural Science Foundation of China No. 61503084 and No. 61976123.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Lim, H.W., et al.: The burden of skin disease in the united states. J. Am. Acad. Dermatol. 76(5), 958–972 (2017)
Sun, X., Yang, J., Sun, M., Wang, K.: A benchmark for automatic visual classification of clinical skin disease images. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9910, pp. 206–222. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46466-4_13
Yang, J., Sun, X., Liang, J., Rosin, P.L.: Clinical skin lesion diagnosis using representations inspired by dermatologist criteria. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 2018, pp. 1258–1266 (2018)
Yang, J., Sun, X., Lai, Y., Zheng, L., Cheng, M.: Recognition from web data: a progressive filtering approach. IEEE Trans. Image Process. 27(11), 5303–5315 (2018)
Yang, J., et al.: Self-paced balance learning for clinical skin disease recognition. IEEE Trans. Neural Netw. Learn. Syst. 31, 2832–2846 (2019)
Rensink, R.A.: The dynamic representation of scenes. Vis. Cogn. 7(1–3), 17–42 (2000)
Mnih, V., Heess, N., Graves, A., Kavukcuoglu, K.: Recurrent models of visual attention. In: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2, ser, NIPS 2014, pp. 2204–2212. MIT Press, Cambridge (2014). http://dl.acm.org/citation.cfm?id=2969033.2969073
Yang, Z., Luo, T., Wang, D., Hu, Z., Gao, J., Wang, L.: Learning to navigate for fine-grained classification. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 438–454. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_26
Recasens, A., Kellnhofer, P., Stent, S., Matusik, W., Torralba, A.: Learning to zoom: a saliency-based sampling layer for neural networks. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11213, pp. 52–67. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01240-3_4
Ding, Y., Zhou, Y., Zhu, Y., Ye, Q., Jiao, J.: Selective sparse sampling for fine-grained image recognition. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6598–6607 (2019)
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Learning deep features for discriminative localization. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2921–2929 (2016)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016, pp. 770–778 (2016)
Sermanet, P., Frome, A., Real, E.: Attention for fine-grained categorization. In: International Conference on Learning Representations (2015)
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. CoRR, vol. abs/1506.02025 (2015). http://arxiv.org/abs/1506.02025
Lin, J., Guo, Z., Li, D., Hu, X., Zhang, Y.: Automatic classification of clinical skin disease images with additional high-level position information. In: 2019 Chinese Control Conference (CCC), pp. 8606–8610 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, X., Li, D., Zhang, Y., Jian, M. (2021). Interactive Attention Sampling Network for Clinical Skin Disease Image Classification. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science(), vol 13021. Springer, Cham. https://doi.org/10.1007/978-3-030-88010-1_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-88010-1_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88009-5
Online ISBN: 978-3-030-88010-1
eBook Packages: Computer ScienceComputer Science (R0)