Cited By
- Pittaras N, Giannakopoulos G, Stamatopoulos P, Karkaletsis V (2023). Content-based and Knowledge-enriched Representations for Classification Across Modalities: A Survey. ACM Computing Surveys 55:14s (1-40). doi:10.1145/3583682. Online publication date: 13-Feb-2023
- Xu X, Zhang Z, Zhou Z, Zhang P, Xie Z, Wu M, Zhu K, El Saddik A, Mei T, Cucchiara R, Bertini M, Tobon Vallejo D, Atrey P, Hossain M (2023). BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data. Proceedings of the 31st ACM International Conference on Multimedia (2756-2764). doi:10.1145/3581783.3613820. Online publication date: 26-Oct-2023
- Zhou Z, Zhang Z, Xu X, Xie Z, Wu M, Zhu K (2022). Can Audio Captions Be Evaluated With Image Caption Metrics? ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (981-985). doi:10.1109/ICASSP43922.2022.9746427. Online publication date: 23-May-2022