default search action
18th ECCV 2024: Milan, Italy - Part LXVIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXVIII. Lecture Notes in Computer Science 15126, Springer 2025, ISBN 978-3-031-73112-9 - Haobin Jiang, Junpeng Yue, Hao Luo, Ziluo Ding, Zongqing Lu:
Reinforcement Learning Friendly Vision-Language Model for Minecraft. 1-17 - Seonghoon Yu, Paul Hongsuck Seo, Jeany Son:
Pseudo-RIS: Distinctive Pseudo-Supervision Generation for Referring Image Segmentation. 18-36 - Jiaqi Liu, Tao Huang, Chang Xu:
Training-Free Composite Scene Generation for Layout-to-Image Synthesis. 37-53 - Guangrui Li, Rahul Duggal, Aaditya Singh, Kaustav Kundu, Bing Shuai, Jonathan Wu:
Robustness Preserving Fine-Tuning Using Neuron Importance. 54-69 - Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang:
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation. 70-88 - Jian Ma, Chen Chen, Qingsong Xie, Haonan Lu:
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in Non-english Text-to-Image Generation. 89-105 - Jaehui Hwang, Dongyoon Han, Byeongho Heo, Song Park, Sanghyuk Chun, Jong-Seok Lee:
Similarity of Neural Architectures Using Adversarial Attack Transferability. 106-126 - Tingting Chen, Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan:
Dual-Rain: Video Rain Removal Using Assertive and Gentle Teachers. 127-143 - Ning Gao, Sanping Zhou, Le Wang, Nanning Zheng:
PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation. 144-160 - Raghav Kapoor, Yash Parag Butala, Melisa Russak, Jing Yu Koh, Kiran Kamble, Waseem AlShikh, Ruslan Salakhutdinov:
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web. 161-178 - Xiuyuan Chen, Yuan Lin, Yuchen Zhang, Weiran Huang:
AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering. 179-195 - Jinrui Zhang, Teng Wang, Haigang Zhang, Ping Lu, Feng Zheng:
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models. 196-213 - Jiawei Wu, Zhi Jin:
Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks. 214-231 - Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi:
Diffusion Model for Robust Multi-sensor Fusion in 3D Object Detection and BEV Segmentation. 232-249 - Yushuo Chen, Zerong Zheng, Zhe Li, Chao Xu, Yebin Liu:
MeshAvatar: Learning High-Quality Triangular Human Avatars from Multi-view Videos. 250-269 - Hao Xu, Xi Zhang, Xiaolin Wu:
Fast Point Cloud Geometry Compression with Context-Based Residual Coding and INR-Based Refinement. 270-288 - Jinghao Zhou, Tomas Jakab, Philip Torr, Christian Rupprecht:
Scene-Conditional 3D Object Stylization and Composition. 289-305 - Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang:
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning. 306-325 - Kartik Garg, Sai Shubodh Puligilla, Shishir Kolathaya, K. Madhava Krishna, Sourav Garg:
Revisit Anything: Visual Place Recognition via Image Segment Retrieval. 326-343 - Peiqi Chen, Lei Yu, Yi Wan, Yongjun Zhang, Jian Wang, Liheng Zhong, Jingdong Chen, Ming Yang:
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-Free Image Matching. 344-360 - Isaac Labe, Noam Issachar, Itai Lang, Sagie Benaim:
DGD: Dynamic 3D Gaussians Distillation. 361-378 - Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon, Chanyoung Park:
Semantic Diversity-Aware Prototype-Based Learning for Unbiased Scene Graph Generation. 379-395 - Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, Zhengkai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Ji:
DiffuMatting: Synthesizing Arbitrary Objects with Matting-Level Annotation. 396-413 - Soobin Um, Jong Chul Ye:
Self-Guided Generation of Minority Samples Using Diffusion Models. 414-430 - Kyungho Bae, Geo Ahn, Youngrae Kim, Jinwoo Choi:
DEVIAS: Learning Disentangled Video Representations of Action and Scene. 431-448 - Jan Lehr, Jan Philipps, Alik Sargsyan, Martin Pape, Jörg Krüger:
AD3: Introducing a Score for Anomaly Detection Dataset Difficulty Assessment Using VIADUCT Dataset. 449-464 - Qi Wang, Ruijie Lu, Xudong Xu, Jingbo Wang, Michael Yu Wang, Bo Dai, Gang Zeng, Dan Xu:
RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting. 465-482
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.