Cited By
- Shi Y, Xu H, Yuan C, Li B, Hu W, Zha Z (2023). Learning Video-Text Aligned Representations for Video Captioning. ACM Transactions on Multimedia Computing, Communications, and Applications 19:2 (1-21). DOI: 10.1145/3546828. Online publication date: 6-Feb-2023.
- Li Y, Fan J, Pan Y, Yao T, Lin W, Mei T (2022). Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training. ACM Transactions on Multimedia Computing, Communications, and Applications 18:2 (1-16). DOI: 10.1145/3473140. Online publication date: 16-Feb-2022.
- Sharma H, Srivastava S (2022). A Framework for Image Captioning Based on Relation Network and Multilevel Attention Mechanism. Neural Processing Letters 55:5 (5693-5715). DOI: 10.1007/s11063-022-11106-y. Online publication date: 17-Dec-2022.