Renrui Zhang
2020 – today
- 2024
- [j3] Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao: CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Int. J. Comput. Vis. 132(2): 581-595 (2024)
- [j2] Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao: Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. Int. J. Comput. Vis. 132(5): 1546-1556 (2024)
- [c50] Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Hao Dong, Zhongjiang He, Peng Gao: Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. AAAI 2024: 6449-6457
- [c49] Fuhao Li, Ziyang Gong, Yupeng Deng, Xianzheng Ma, Renrui Zhang, Zhenming Ji, Xiangwei Zhu, Hong Zhang: Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency. AAAI 2024: 13483-13491
- [c48] Dongmei Zhang, Chang Li, Renrui Zhang, Shenghao Xie, Wei Xue, Xiaodong Xie, Shanghang Zhang: FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection. AAAI 2024: 16723-16731
- [c47] Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao: No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation. CVPR 2024: 3838-3847
- [c46] Guanqun Wang, Jiaming Liu, Chenxuan Li, Yuan Zhang, Junpeng Ma, Xinyu Wei, Kevin Zhang, Maurice Chong, Renrui Zhang, Yijiang Liu, Shanghang Zhang: Cloud-Device Collaborative Learning for Multimodal Large Language Models. CVPR 2024: 12646-12655
- [c45] Xiaoqi Li, Mingxu Zhang, Yiran Geng, Haoran Geng, Yuxing Long, Yan Shen, Renrui Zhang, Jiaming Liu, Hao Dong: ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation. CVPR 2024: 18061-18070
- [c44] Lingyi Hong, Shilin Yan, Renrui Zhang, Wanyun Li, Xinyu Zhou, Pinxue Guo, Kaixun Jiang, Yiting Chen, Jinglun Li, Zhaoyu Chen, Wenqiang Zhang: OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning. CVPR 2024: 19079-19091
- [c43] Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang: NTO3D: Neural Target Object 3D Reconstruction with Segment Anything. CVPR 2024: 20352-20362
- [c42] Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang: Gradient-based Parameter Selection for Efficient Fine-Tuning. CVPR 2024: 28566-28577
- [c41] Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang: Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation. CVPR 2024: 28653-28663
- [c40] Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li: SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. ECCV (62) 2024: 36-55
- [c39] Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Yu Qiao, Peng Gao, Hongsheng Li: MATHVERSE: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? ECCV (8) 2024: 169-186
- [c38] Shilin Yan, Xiaohao Xu, Renrui Zhang, Lingyi Hong, Wenchao Chen, Wenqiang Zhang, Wei Zhang: PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation. ECCV (9) 2024: 346-365
- [c37] Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao: Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. EMNLP (Findings) 2024: 10152-10163
- [c36] Jiaming Liu, Senqiao Yang, Peidong Jia, Renrui Zhang, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang: ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation. ICLR 2024
- [c35] Ke Wang, Houxing Ren, Aojun Zhou, Zimu Lu, Sichun Luo, Weikang Shi, Renrui Zhang, Linqi Song, Mingjie Zhan, Hongsheng Li: MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning. ICLR 2024
- [c34] Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li: Personalize Segment Anything Model with One Shot. ICLR 2024
- [c33] Renrui Zhang, Jiaming Han, Chris Liu, Aojun Zhou, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao: LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention. ICLR 2024
- [c32] Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao: SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024
- [c31] Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li: SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. ICML 2024
- [c30] Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao: MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. ICML 2024
- [c29] Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Hongwei Xie, Bing Wang, Li Liu, Shanghang Zhang: RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. ICRA 2024: 12404-12411
- [i73] Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao: SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024)
- [i72] Lingyi Hong, Shilin Yan, Renrui Zhang, Wanyun Li, Xinyu Zhou, Pinxue Guo, Kaixun Jiang, Yiting Chen, Jinglun Li, Zhaoyu Chen, Wenqiang Zhang: OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning. CoRR abs/2403.09634 (2024)
- [i71] Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li: MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? CoRR abs/2403.14624 (2024)
- [i70] Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li: CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching. CoRR abs/2404.03653 (2024)
- [i69] Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao: No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation. CoRR abs/2404.04050 (2024)
- [i68] Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao: MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. CoRR abs/2404.16006 (2024)
- [i67] Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li: Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024)
- [i66] Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui Xu, Renrui Zhang, Yafei Wen, Shuai Ren, Peng Gao, Junchi Yan, Hongsheng Li: TerDiT: Ternary Diffusion Models with Transformers. CoRR abs/2405.14854 (2024)
- [i65] Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li: SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. CoRR abs/2405.16057 (2024)
- [i64] Jiaming Liu, Chenxuan Li, Guanqun Wang, Lily Lee, Kaichen Zhou, Sixiang Chen, Chuyan Xiong, Jiaxin Ge, Renrui Zhang, Shanghang Zhang: Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation. CoRR abs/2405.17418 (2024)
- [i63] Jiaze Wang, Yi Wang, Ziyu Guo, Renrui Zhang, Donghao Zhou, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng: TripletMix: Triplet Data Augmentation for 3D Understanding. CoRR abs/2405.18523 (2024)
- [i62] Chaoyou Fu, Yuhan Dai, Yondong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun: Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis. CoRR abs/2405.21075 (2024)
- [i61] Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang: RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation. CoRR abs/2406.04339 (2024)
- [i60] Feng Li, Renrui Zhang, Hao Zhang, Yuanhan Zhang, Bo Li, Wei Li, Zejun Ma, Chunyuan Li: LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models. CoRR abs/2407.07895 (2024)
- [i59] Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li: MAVIS: Mathematical Visual Instruction Tuning. CoRR abs/2407.08739 (2024)
- [i58] Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Yanwei Li, Ziwei Liu, Chunyuan Li: LLaVA-OneVision: Easy Visual Task Transfer. CoRR abs/2408.03326 (2024)
- [i57] Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Chengzhuo Tong, Peng Gao, Chunyuan Li, Pheng-Ann Heng: SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners. CoRR abs/2408.16768 (2024)
- [i56] Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanmin Wu, Jiayi Lei, Pengshuo Qiu, Pan Lu, Zehui Chen, Guanglu Song, Peng Gao, Yu Liu, Chunyuan Li, Hongsheng Li: MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines. CoRR abs/2409.12959 (2024)
- [i55] Weifeng Lin, Xinyu Wei, Renrui Zhang, Le Zhuo, Shitian Zhao, Siyuan Huang, Junlin Xi, Yu Qiao, Peng Gao, Hongsheng Li: PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions. CoRR abs/2409.15278 (2024)
- [i54] Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao: Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. CoRR abs/2410.00363 (2024)
- [i53] Guankun Wang, Han Xiao, Huxin Gao, Renrui Zhang, Long Bai, Xiaoxiao Yang, Zhen Li, Hongsheng Li, Hongliang Ren: CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection. CoRR abs/2410.07540 (2024)
- 2023
- [c28] Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui: CALIP: Zero-Shot Enhancement of CLIP with Parameter-Free Attention. AAAI 2023: 746-754
- [c27] Yulu Gan, Yan Bai, Yihang Lou, Xianzheng Ma, Renrui Zhang, Nian Shi, Lin Luo: Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation. AAAI 2023: 7595-7603
- [c26] Bowei He, Xu He, Renrui Zhang, Yingxue Zhang, Ruiming Tang, Chen Ma: Dynamic Embedding Size Search with Minimum Regret for Streaming Recommender System. CIKM 2023: 741-750
- [c25] Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang: PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection. CVPR 2023: 5291-5301
- [c24] Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CVPR 2023: 5344-5353
- [c23] Jiaben Chen, Renrui Zhang, Dongze Lian, Jiaqi Yang, Ziyao Zeng, Jianbo Shi: iQuery: Instruments as Queries for Audio-Visual Sound Separation. CVPR 2023: 14675-14686
- [c22] Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li: Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners. CVPR 2023: 15211-15222
- [c21] Yanmin Wu, Xinhua Cheng, Renrui Zhang, Zesen Cheng, Jian Zhang: EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding. CVPR 2023: 19231-19242
- [c20] Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li: Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders. CVPR 2023: 21769-21780
- [c19] Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao: Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. ICCV 2023: 2605-2615
- [c18] Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao: PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning. ICCV 2023: 2639-2650
- [c17] Renrui Zhang, Han Qiu, Tai Wang, Ziyu Guo, Ziteng Cui, Yu Qiao, Hongsheng Li, Peng Gao: MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection. ICCV 2023: 9121-9132
- [c16] Aojun Zhou, Yang Li, Zipeng Qin, Jianbo Liu, Junting Pan, Renrui Zhang, Rui Zhao, Peng Gao, Hongsheng Li: SparseMAE: Sparse Training Meets Masked Autoencoders. ICCV 2023: 16130-16140
- [c15] Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li: Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. ICCV (Workshops) 2023: 272-283
- [c14] Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzhi Li, Pheng-Ann Heng: Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training. IJCAI 2023: 791-799
- [c13] Jiaben Chen, Yichen Zhu, Dongze Lian, Jiaqi Yang, Yifu Wang, Renrui Zhang, Xinhang Liu, Shenhan Qian, Laurent Kneip, Shenghua Gao: Revisiting Event-Based Video Frame Interpolation. IROS 2023: 1292-1299
- [c12] Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li: JourneyDB: A Benchmark for Generative Image Understanding. NeurIPS 2023
- [c11] Renrui Zhang, Ziyao Zeng, Ziyu Guo, Borui Chen, Guangnan Zhang, Xilan Liu: DS-Point: A Dual-Scale 3D Framework for Point Cloud Understanding. SMC 2023: 5046-5051
- [c10] Renrui Zhang, Liuhui Wang, Ziyu Guo, Jianbo Shi: Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis. WACV 2023: 1246-1255
- [i52] Renrui Zhang, Liuhui Wang, Ziyu Guo, Jianbo Shi: Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis. CoRR abs/2303.00703 (2023)
- [i51] Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao: Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023)
- [i50] Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu: Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. CoRR abs/2303.05475 (2023)
- [i49] Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang: PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection. CoRR abs/2303.08129 (2023)
- [i48] Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi: Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CoRR abs/2303.08134 (2023)
- [i47] Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023)
- [i46] Ziyu Guo, Yiwen Tang, Renrui Zhang, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li: ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance. CoRR abs/2303.16894 (2023)
- [i45] Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao: Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. CoRR abs/2304.01195 (2023)
- [i44] Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao: LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023)
- [i43] Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li: Personalize Segment Anything Model with One Shot. CoRR abs/2305.03048 (2023)
- [i42] Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao: Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. CoRR abs/2305.16318 (2023)
- [i41] Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li: Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. CoRR abs/2306.11732 (2023)
- [i40] Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li: JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023)
- [i39] Jiaben Chen, Yichen Zhu, Dongze Lian, Jiaqi Yang, Yifu Wang, Renrui Zhang, Xinhang Liu, Shenhan Qian, Laurent Kneip, Shenghua Gao: Revisiting Event-based Video Frame Interpolation. CoRR abs/2307.12558 (2023)
- [i38] Bowei He, Xu He, Renrui Zhang, Yingxue Zhang, Ruiming Tang, Chen Ma: Dynamic Embedding Size Search with Minimum Regret for Streaming Recommender System. CoRR abs/2308.07760 (2023)
- [i37] Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao: Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks. CoRR abs/2308.12961 (2023)
- [i36] Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng: Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following. CoRR abs/2309.00615 (2023)
- [i35] Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao: ImageBind-LLM: Multi-modality Instruction Tuning. CoRR abs/2309.03905 (2023)
- [i34] Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Li Liu, Shanghang Zhang: RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. CoRR abs/2309.09502 (2023)
- [i33] Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang: NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything. CoRR abs/2309.12790 (2023)
- [i32] Ke Wang, Houxing Ren, Aojun Zhou, Zimu Lu, Sichun Luo, Weikang Shi, Renrui Zhang, Linqi Song, Mingjie Zhan, Hongsheng Li: MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning. CoRR abs/2310.03731 (2023)
- [i31] Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris N. Metaxas: Improving Compositional Text-to-image Generation with Large Vision-Language Models. CoRR abs/2310.06311 (2023)
- [i30] Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao: SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models. CoRR abs/2311.07575 (2023)
- [i29] Xiaowei Chi, Yijiang Liu, Zhengkai Jiang, Rongyu Zhang, Ziyi Lin, Renrui Zhang, Peng Gao, Chaoyou Fu, Shanghang Zhang, Qifeng Liu, Yike Guo: ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model. CoRR abs/2311.17963 (2023)
- [i28] Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang: 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V. CoRR abs/2312.09738 (2023)
- [i27] Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang: Gradient-based Parameter Selection for Efficient Fine-Tuning. CoRR abs/2312.10136 (2023)
- [i26] Yanmin Wu, Qiankun Gao, Renrui Zhang, Jian Zhang: Language-Assisted 3D Scene Understanding. CoRR abs/2312.11451 (2023)
- [i25] Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun: A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise. CoRR abs/2312.12436 (2023)
- [i24] Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation. CoRR abs/2312.12480 (2023)
- [i23] Xiaoqi Li, Mingxu Zhang, Yiran Geng, Haoran Geng, Yuxing Long, Yan Shen, Renrui Zhang, Jiaming Liu, Hao Dong: ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation. CoRR abs/2312.16217 (2023)
- 2022
- [c9] Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li: PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552
- [c8] Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li: Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404
- [c7] Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada: Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection. ECCV (9) 2022: 473-491
- [c6] Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li: Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510
- [c5] Renrui Zhang, Ziyao Zeng, Ziyu Guo, Yafeng Li: Can Language Understand Depth? ACM Multimedia 2022: 6868-6874
- [c4] Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li: Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. NeurIPS 2022
- [i22] Kexue Fu, Peng Gao, Renrui Zhang, Hongsheng Li, Yu Qiao, Manning Wang: Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning. CoRR abs/2202.04241 (2022)
- [i21] Renrui Zhang, Han Qiu, Tai Wang, Xuanzhuo Xu, Ziyu Guo, Yu Qiao, Peng Gao, Hongsheng Li: MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection. CoRR abs/2203.13310 (2022)
- [i20] Kexue Fu, Peng Gao, Shaolei Liu, Renrui Zhang, Yu Qiao, Manning Wang: POS-BERT: Point Cloud One-Stage BERT Pre-Training. CoRR abs/2204.00989 (2022)
- [i19] Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li: Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. CoRR abs/2205.14401 (2022)
- [i18] Renrui Zhang, Ziyao Zeng, Ziyu Guo: Can Language Understand Depth? CoRR abs/2207.01077 (2022)
- [i17] Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li: Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022)
- [i16] Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada: Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection. CoRR abs/2208.03062 (2022)
- [i15] Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li: Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022)
- [i14] Renrui Zhang, Hanqiu Deng, Bohao Li, Wei Zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao: Collaboration of Pre-trained Models Makes Better Few-shot Learner. CoRR abs/2209.12255 (2022)
- [i13] Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui: CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention. CoRR abs/2209.14169 (2022)
- [i12] Yanmin Wu, Xinhua Cheng, Renrui Zhang, Zesen Cheng, Jian Zhang: EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual and Language Learning. CoRR abs/2209.14941 (2022)
- [i11] Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyao Zeng, Shanghang Zhang, Peng Gao: PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning. CoRR abs/2211.11682 (2022)
- [i10] Jiaben Chen, Renrui Zhang, Dongze Lian, Jiaqi Yang, Ziyao Zeng, Jianbo Shi: iQuery: Instruments as Queries for Audio-Visual Sound Separation. CoRR abs/2212.03814 (2022)
- [i9] Yulu Gan, Xianzheng Ma, Yihang Lou, Yan Bai, Renrui Zhang, Nian Shi, Lin Luo: Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation. CoRR abs/2212.04145 (2022)
- [i8] Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li: Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders. CoRR abs/2212.06785 (2022)
- [i7] Peixiang Huang, Li Liu, Renrui Zhang, Song Zhang, Xinli Xu, Baichao Wang, Guoyi Liu: TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning. CoRR abs/2212.13979 (2022)
- 2021
- [c3] Minghang Zheng, Peng Gao, Renrui Zhang, Kunchang Li, Hongsheng Li, Hao Dong: End-to-End Object Detection with Adaptive Clustering Transformer. BMVC 2021: 226
- [c2] Mingyuan Mao, Peng Gao, Renrui Zhang, Honghui Zheng, Teli Ma, Yan Peng, Errui Ding, Baochang Zhang, Shumin Han: Dual-stream Network for Visual Recognition. NeurIPS 2021: 25346-25358
- [c1] Junming Zhang, Jingru Wang, Shigong Long, Lun Wang, Xuejiao Wang, Renrui Zhang: Differential Privacy Protection and Game Analysis of Intelligent Transportation Data. PAAP 2021: 102-109
- [i6] Mingyuan Mao, Renrui Zhang, Honghui Zheng, Peng Gao, Teli Ma, Yan Peng, Errui Ding, Shumin Han: Dual-stream Network for Visual Recognition. CoRR abs/2105.14734 (2021)
- [i5] Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao: CLIP-Adapter: Better Vision-Language Models with Feature Adapters. CoRR abs/2110.04544 (2021)
- [i4] Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li: Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021)
- [i3] Renrui Zhang, Ziyao Zeng, Ziyu Guo, Xinben Gao, Kexue Fu, Jianbo Shi: DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion. CoRR abs/2111.10332 (2021)
- [i2] Renrui Zhang, Longtian Qiu, Wei Zhang, Ziyao Zeng: VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts. CoRR abs/2112.02399 (2021)
- [i1] Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li: PointCLIP: Point Cloud Understanding by CLIP. CoRR abs/2112.02413 (2021)
2010 – 2019
- 2019
- [j1] Yafeng Li, Qijun Zhao, Xiangchu Feng, Weiwei Wang, Renrui Zhang, An Yan: A variational image segmentation method exploring both intensity means and texture patterns. Signal Process. Image Commun. 76: 214-230 (2019)
last updated on 2024-12-04 20:15 CET by the dblp team
all metadata released as open data under CC0 1.0 license