Cited By
View all- Hossen MYe ZAbdussalam AHossain M(2024)GVA: guided visual attention approach for automatic image caption generationMultimedia Systems10.1007/s00530-023-01249-w30:1Online publication date: 29-Jan-2024
Understanding different semantic concepts, such as objects and their relationships in an image, and integrating them to produce a natural language description is the goal of the image captioning task. Thus, it needs an algorithm to understand the ...
Self-attention based Transformer has been successfully introduced in the encoder-decoder framework of image captioning, which is superior in modeling the inner relations of inputs, i.e., image regions or semantic words. However, ...
The attention mechanism has been established as an effective method for generating caption words in image captioning; it explores one noticed subregion in an image to predict a related caption word. However, even though the attention mechanism could ...
Kluwer Academic Publishers
United States
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in