Abstract
X-ray baggage inspection is an essential task to detect threat objects at important controlled access places, which can guard personal safety and prevent crime. Generally, it is carried out by screeners to visually determine whether or not a bag contains threat objects. Whereas, manual detection exhibits distinct shortcomings, from high detection errors to different detection results produced by screeners. These limitations can be addressed by introducing automated detection model of threat objects for X-ray baggage inspection. However, existing automated detection methods cannot realize end-to-end detection and the detection results include only classification without location. In this paper, we propose an automated detection model of threat objects based on depthwise separable convolution. Our model is able to not only categorize the threat object but also locate it simultaneously. The network model has the advantage of high detection accuracy, fast computational speed, and a few parameters. Meanwhile, the precision of threat object regions is enhanced with the help of multi-scale prediction. A deformation layer is added in our model, which can provide invariance to affine warping. The experiments on the GDXray database (Mery et al. in J Nondestr Eval 34(4):42, 2015) demonstrate that the overall performance of our proposed model is superior to YOLOv3 (Redmon J and Farhadi A in YOLOv3: an incremental improvement, 2018) model, SSD (Liu et al. in SSD: single shot multibox detector. In: European Conference on Computer Vision (ECCV), pp. 21–37, 2016) model, and Tiny_YOLO (Redmon et al. in You only look once: unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788, 2015) model.
Similar content being viewed by others
References
Bastan, M., Yousefi, M.R., Breuel, T.M.: Visual Words on Baggage X-ray Images. Springer, New York (2011)
Bodla, N., Singh, B., Chellappa, R., Davis, L.S.: Soft-NMS-improving object detection with one line of code. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 5561–5569 (2017)
Bolfing, A., Halbherr, T., Schwaninger, A.: How Image Based Factors and Human Factors Contribute to Threat Detection Performance in X-ray Aviation Security Screening. Springer, New York (2008)
Chen, Y., Kang, X., Shi, Y.Q., Wang, Z.J.: A multi-purpose image forensic method using densely connected convolutional neural networks. J. Real Time Image Process. 16(3), 725–740 (2019)
Chollet, F.: Xception: Deep learning with depthwise separable convolutions. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1800–1807 (2016)
Christ, P.F., Ettlinger, F., Kaissis, G., Schlecht, S., Ahmaddy, F., Grun, F., Valentinitsch, A., Ahmadi, S.A., Braren, R., Menze, B.: SurvivalNet: Predicting patient survival from diffusion weighted magnetic resonance images using cascaded fully convolutional and 3D convolutional neural networks. In: IEEE International Symposium on Biomedical Imaging, pp. 839–843 (2017)
Flitton, G., Breckon, T.P., Megherbi, N.: A comparison of 3D interest point descriptors with application to airport baggage object detection in complex CT imagery. Pattern Recogn. 46(9), 2420–2436 (2013)
Franzel, T., Schmidt, U., Roth, S.: Object Detection in Multi-view X-ray Images. Springer, New York (2012)
Gao, X., Lin, S., Wong, T.Y.: Automatic feature learning to grade nuclear cataracts based on deep learning. IEEE Trans. Biomed. Eng. 62(11), 2693–2701 (2015)
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., Adam, H.: MobileNets: efficient convolutional neural networks for mobile vision applications (2017)
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Proceedings of the 28th International Conference on Neural Information Processing Systems. pp. 2017–2025 (2015)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014)
Krishnaraj, N., Elhoseny, M., Thenmozhi, M., Selim, M.M., Shankar, K.: Deep learning model for real-time image compression in internet of underwater things (IoUT). J. Real Time Image Process. 17, 2097–2111 (2020)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 936–944 (2017)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A.C.: SSD: single shot multibox detector. In: European Conference on Computer Vision (ECCV), pp. 21–37 (2016)
Megherbi, N., Han, J., Breckon, T.P., Flitton, G.T.: A comparison of classification approaches for threat detection in CT based baggage screening. In: IEEE International Conference on Image Processing, pp. 3109–3112 (2013)
Mery, D., Riffo, V., Zscherpel, U., Mondragón, G., Lillo, I., Zuccar, I., Lobel, H., Carrasco, M.: GDXray: The database of X-ray images for nondestructive testing. J. Nondestr. Eval. 34(4), 42 (2015)
Mery, D., Svec, E., Arias, M.: Object recognition in baggage inspection using adaptive sparse representations of X-ray images. In: Pacific-rim Symposium on Image and Video Technology, pp. 709–720 (2015)
Mery, D., Svec, E., Arias, M., Riffo, V., Saavedra, J.M., Banerjee, S.: Modern computer vision techniques for X-ray testing in baggage inspection. IEEE Trans. Syst. Man Cyberne. Syst. 47(4), 682–692 (2017)
Michel, S., Ruiter, J.C.D., Hogervorst, M., Koller, S.M., Schwaninger, A.: Computer-based training increases efficiency in X-ray image interpretation by aviation security screeners. In: IEEE International Carnahan Conference on Security Technology, pp. 201–206 (2007)
Payan, A., Montana, G.: Predicting Alzheimer’s disease: a neuroimaging study with 3D convolutional neural networks. In: 2015 International Conference on Pattern Recognition Applications and Methods (ICPRAM), pp. 355–362 (2015)
Pozzo, F.R.D.: Aviation Security. Springer, New York (2015)
Redmon, J., Divvala, S.K., Girshick, R.B., Farhadi, A.: You only look once: unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788 (2015)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525 (2016)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement (2018)
Riffo, V., Mery, D.: Automated detection of threat objects using adapted implicit shape model. IEEE Trans. Syst. Man Cybern. Syst. 46(4), 472–482 (2017)
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M.: Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2014)
Sajjad, M., Khan, S., Hussain, T., Muhammad, K., Sangaiah, A.K., Castiglione, A., Esposito, C., Baik, S.W.: CNN-based anti-spoofing two-tier multi-factor authentication system. Pattern Recogn. Lett. 126(1), 123–131 (2019)
Sandler, M., Howard, A.G., Zhu, M., Zhmoginov, A., Chen, L.C.: MobileNetV2: inverted residuals and linear bottlenecks. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–4520 (2018)
Turcsany, D., Mouton, A., Breckon, T.P.: Improving feature-based object recognition for X-ray baggage security screening using primed visual words. In: IEEE International Conference on Industrial Technology, pp. 1140–1145 (2013)
Uroukov, I., Speller, R.: A preliminary approach to intelligent X-ray imaging for baggage inspection at airports. Signal Process. Res. 4(5), 1–11 (2015)
Zentai, G.: X-ray imaging for homeland security. In: IEEE International Workshop on Imaging Systems & Techniques, pp. 1–6 (2008)
Zheng, H.T., Chen, J.Y., Yao, X., Sangaiah, A.K., Zhao, C.Z.: Clickbait convolutional neural network. Symmetry 10(5), 138 (2018)
Acknowledgements
This research is supported by the National Natural Science Foundation of China (Grant Nos. 61374178, 61402092, 61603182), the Fundamental Research Funds for the Central Universities (N171704004), the online education research fund of MOE research center for online education, China (Qtone education, Grant No.2016ZD306), and the Ph.D. Start-Up Foundation of Liaoning Province, China (Grant No. 201501141).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wei, Y., Zhu, Z., Yu, H. et al. An automated detection model of threat objects for X-ray baggage inspection based on depthwise separable convolution. J Real-Time Image Proc 18, 923–935 (2021). https://doi.org/10.1007/s11554-020-01051-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-020-01051-1