Abstract
This paper presents an unique approach for extracting the features of an image using a ResNet50 Convolutional Neural Network architecture and to predict the caption of the image using a novel ensemble of GRU (Gated Recurrent Units) and Recurrent Neural Network algorithms namely Long Short-Term Memory (LSTM). This approach aims to combine the faster training and performance from GRU and greater volatility throughout its gradient descent from LSTM. This technique has been implemented assuring the major factors being capacity, robustness, and visual quality. The proposed method for image captioning has been evaluated using metrics such as BLEU, CIDEr, METEOR and it outperforms when compared to conventional systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hoxha, G., Melgani, F., Slaghenauffi, J.: A new CNN-RNN framework for remote sensing image captioning. In: 2020 Mediterranean and Middle-East Geoscience and Remote Sensing Symposium (M2GARSS), Tunis, Tunisia, pp. 1––4 (2020)
Nithya, K.C., Kumar, V.V.: A review on automatic image captioning techniques. In: 2020 International Conference on Communication and Signal Processing (ICCSP), Chennai, India, pp. 0432––0437 (2020)
Bhatia, Y., Bajpayee, A., Raghuvanshi, D., Mittal, H.: Image captioning using google’’s inception-resnet-v2 and recurrent neural network. In: 2019 Twelfth International Conference on Contemporary Computing (IC3), Noida, India, pp. 1–6 (2019)
Karpathy, A., Fei-Fei, L.: Deep Visual-Semantic Alignments for Generating Image Descriptions. IEEE Trans. Patt. Anal. Mach. Intell. 39(4), 664––676 (2017). https://doi.org/10.1109/TPAMI.2016.2598339
Rennie, S.J., Marcheret, E., Mroueh, Y., Ross, J., Goel, V.: Self-critical sequence training for image captioning. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp. 1179––1195 (2017)
Image Captioning using Deep Learning Bhamidi Haripriya Srushti G M, Syed Haseeb, Mrs. Madhura Prakash
Kaushik, R., Kumar, S.: Image segmentation using convolutional neural network. Int. J. Sci. Technol. Res. 8(11), 667–675 (2019)
Jain, C., Sharma, N.: Image Captioning Using CNN Long Short-Term Memory Network. SSRN 3576373 (2020). https://doi.org/10.2139/ssrn.3576373
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Agilandeeswari, L., Sharma, K., Srivastava, S. (2022). Intelligent Image Captioning Approach with Novel Ensembled Recurrent Neural Network Model. In: Abraham, A., Gandhi, N., Hanne, T., Hong, TP., Nogueira Rios, T., Ding, W. (eds) Intelligent Systems Design and Applications. ISDA 2021. Lecture Notes in Networks and Systems, vol 418. Springer, Cham. https://doi.org/10.1007/978-3-030-96308-8_114
Download citation
DOI: https://doi.org/10.1007/978-3-030-96308-8_114
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96307-1
Online ISBN: 978-3-030-96308-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)