MMM 2024, Amsterdam, The Netherlands - Part III
- Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata:
MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part III. Lecture Notes in Computer Science 14556, Springer 2024, ISBN 978-3-031-53310-5
- Qiang Chen, Fuxiao He, Guoqiang Xiao:
Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification. 1-13
- Lu Chen, Jiawei Tan, Pingan Yang, Hongxing Wang:
Semantic Transition Detection for Self-supervised Video Scene Segmentation. 14-27
- Xueyang Qin, Lishuang Li, Jing Hao, Meiling Ge, Jiayi Huang, Guangyao Pang:
Multi-task Collaborative Network for Image-Text Retrieval. 28-42
- Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei:
FGENet: Fine-Grained Extraction Network for Congested Crowd Counting. 43-56
- Jingjing Xie, Jixuan Hong, Manjin Sheng, Chenhui Yang:
MSMV-UNet: A 2.5D Stroke Lesion Segmentation Method Based on Multi-slice Feature Fusion. 57-69
- Xiang Gao, Sining Wu, Fan Wang, Xiaopeng Hu:
Non-Local Spatial-Wise and Global Channel-Wise Transformer for Efficient Image Super-Resolution. 70-85
- Ting Peng, Yihang Zhou, Rong Sun, Yizhi Luo, Yuqi Li:
MobileViT-FocR: MobileViT with Fixed-One-Centre Loss and Gradient Reversal for Generalised Fake Face Detection. 86-100
- Xiran Zhang, Haiyan Liu, Caixia Liu, Haiyang Zhang, Zhiwei Huo:
ASF-Conformer: Audio Scoring Conformer with FFC for Speaker Verification in Noisy Environments. 101-111
- Yuanjian He, Weile Zhang, Junyuan Deng, Yulai Cong:
Prior-Knowledge-Free Video Frame Interpolation with Bidirectional Regularized Implicit Neural Representations. 112-126
- Shengrong Ling, Sisi You, Bing-Kun Bao:
Two-Stage Reasoning Network with Modality Decomposition for Text VQA. 127-140
- Honglei Zheng, Wenkang Fan, Yinran Chen, Xiongbiao Luo:
Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos. 141-154
- Shinichi Ka, Koichi Shinoda:
Co-speech Gesture Generation with Variational Auto Encoder. 155-168
- Chunyin Sheng, Xiang Gao, Xiaopeng Hu, Fan Wang:
Differentiable Neural Architecture Search Based on Efficient Architecture for Lightweight Image Super-Resolution. 169-183
- Zhengwei Yang, Yange Wang, Lei Ma, Xiangzheng Li:
Learning Collaborative Reinforcement Attention for 3D Face Reconstruction and Dense Alignment. 184-197
- Konstantinos Triaridis, Vasileios Mezaris:
Exploring Multi-modal Fusion for Image Manipulation Detection and Localization. 198-211
- Feifei Xu, Zheng Zhong, Yitao Zhu, Yingchen Zhou, Guangzhen Li:
Appearance-Motion Dual-Stream Heterogeneous Network for VideoQA. 212-227
- Xiang Li, Ming Lu, Ziming Guo, Xiaoming Zhang:
Adaptive Token Selection and Fusion Network for Multimodal Sentiment Analysis. 228-241
- Pei Chen, Zhiyong Feng, Meng Xing, Yiming Zhang, Jinqing Zheng:
Exploring Imperceptible Adversarial Examples in YCbCr Color Space. 242-256
- Liyun Xu, Min Zhang:
Fractional-Order Image Moments and Applications. 257-269
- Maria Pegia, Ferran Agullo Lopez, Anastasia Moumtzidou, Alberto Gutierrez-Torre, Björn Þór Jónsson, Josep Lluis Berral-Garcia, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris:
Time-Quality Tradeoff of MuseHash Query Processing Performance. 270-283
- Zhanjie Jin, Anming Dong, Jiguo Yu, Shuxiang Dong, You Zhou:
Dual-Fisheye Image Stitching via Unsupervised Deep Learning. 284-298
- Junpeng Liu, Hengkang Bao:
CA-GAN: Conditional Adaptive Generative Adversarial Network for Text-to-Image Synthesis. 299-312
- Dexu Yao, Aimin Li, Deqi Liu, Mengfan Cheng:
RDC-YOLOv5: Improved Safety Helmet Detection in Adverse Weather. 313-326
- Aril Bernhard Ovesen, Tor-Arne Schmidt Nordmo, Michael Alexander Riegler, Pål Halvorsen, Dag Johansen:
Sustainable Commercial Fishery Control Using Multimedia Forensics Data from Non-trusted, Mobile Edge Nodes. 327-340
- Shan Cao, Qingfeng Wu:
MC-TCMNER: A Multi-modal Fusion Model Combining Contrast Learning Method for Traditional Chinese Medicine NER. 341-354
- Xiangyu Chen, Md Ayshik Rahman Khan, Md. Rakibul Hasan, Tom Gedeon, Md. Zakir Hossain:
C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds. 355-368
- Mingyuan Ge, Jianan Shui, Junyu Chen, Mingyong Li:
Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval. 369-380
- Jianbo Xiong, Shinan Zou, Jin Tang:
DFGait: Decomposition Fusion Representation Learning for Multimodal Gait Recognition. 381-395
- Jiangfeng Li, Bowen Wang, Yongrui Qin, Chenxi Zhang, Gang Yu, Qinpei Zhao:
MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval. 396-409
- Linzi Xing, Quan Hung Tran, Fabian Caba, Franck Dernoncourt, Seunghyun Yoon, Zhaowen Wang, Trung Bui, Giuseppe Carenini:
Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation. 410-424
- Wenlong Lu, Suping Wu, Xitie Zhang, Shengjia Zhang:
Unsupervised Multi-collaborative Learning Network for 3D Face Reconstruction. 425-436
- Yiru Zhang, Zeke Li, Bijing Liu, Haiwei Fan, Yong Yang, Qun Yang:
A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction. 437-447
- Pan Li, Suping Wu, Xitie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang:
Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization. 448-461
- Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring:
Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks. 462-476
- Ali Abdari, Alex Falcon, Giuseppe Serra:
A Language-Based Solution to Enable Metaverse Retrieval. 477-488
- Chenlin Zhao, Jiabo Ye, Yaguang Song, Ming Yan, Xiaoshan Yang, Changsheng Xu:
Part-Aware Prompt Tuning for Weakly Supervised Referring Expression Grounding. 489-502
- Sarwar Khan, Jun-Cheng Chen, Wen-Hung Liao, Chu-Song Chen:
Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning. 503-516
- Adriano Baratè, Luca Andrea Ludovico:
A Multidimensional Taxonomy Model for Music Tangible User Interfaces. 517-531