Cited By
View all- Sanders KVan Durme B(2024)A Survey of Video Datasets for Grounded Event Understanding2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW63382.2024.00727(7314-7327)Online publication date: 17-Jun-2024
- Liu RFang YYu FTian RRen TWu GEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)Deep Video Understanding with Video-Language ModelProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612863(9551-9555)Online publication date: 26-Oct-2023
- Li RGuo JLi MWu ZLiang CEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)A Hierarchical Deep Video Understanding Method with Shot-Based Instance Search and Large Language ModelProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612838(9425-9429)Online publication date: 26-Oct-2023
- Show More Cited By