default search action
Yixiao Ge
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j3]Yixiao Ge, Pieter van Goor, Robert E. Mahony:
A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups. IEEE Control. Syst. Lett. 8: 844-849 (2024) - [j2]Chen Li, Yixiao Ge, Dian Li, Ying Shan:
Vision-Language Instruction Tuning: A Review and Analysis. Trans. Mach. Learn. Res. 2024 (2024) - [j1]Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, Xiaogang Wang, Hongsheng Li:
Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID. IEEE Trans. Neural Networks Learn. Syst. 35(1): 258-271 (2024) - [c51]Zhaoyang Zhang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, Ping Luo:
Cached Transformers: Improving Transformers with Differentiable Memory Cachde. AAAI 2024: 16935-16943 - [c50]Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ying Shan, Ping Luo:
LLaMA Pro: Progressive LLaMA with Block Expansion. ACL (1) 2024: 6518-6537 - [c49]Xiaohan Ding, Yiyuan Zhang, Yixiao Ge, Sijie Zhao, Lin Song, Xiangyu Yue, Ying Shan:
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition. CVPR 2024: 5513-5524 - [c48]Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue:
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities. CVPR 2024: 6108-6117 - [c47]Yuchao Gu, Xintao Wang, Yixiao Ge, Ying Shan, Mike Zheng Shou:
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis. CVPR 2024: 7631-7640 - [c46]Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan:
SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models. CVPR 2024: 8362-8371 - [c45]Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan:
SEED-Bench: Benchmarking Multimodal Large Language Models. CVPR 2024: 13299-13308 - [c44]Ruyang Liu, Chen Li, Yixiao Ge, Thomas H. Li, Ying Shan, Ge Li:
BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning. CVPR 2024: 13658-13667 - [c43]Lin Song, Yukang Chen, Shuai Yang, Xiaohan Ding, Yixiao Ge, Ying-Cong Chen, Ying Shan:
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs. CVPR 2024: 13763-13773 - [c42]Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan:
YOLO-World: Real-Time Open-Vocabulary Object Detection. CVPR 2024: 16901-16911 - [c41]Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou:
VIT-LENS: Towards Omni-modal Representations. CVPR 2024: 26637-26647 - [c40]Ruyang Liu, Chen Li, Haoran Tang, Yixiao Ge, Ying Shan, Ge Li:
ST-LLM: Large Language Models Are Effective Temporal Learners. ECCV (57) 2024: 1-18 - [c39]Yunpeng Bai, Xintao Wang, Yan-Pei Cao, Yixiao Ge, Chun Yuan, Ying Shan:
DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment. ECCV (31) 2024: 472-488 - [c38]Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan:
Making LLaMA SEE and Draw with SEED Tokenizer. ICLR 2024 - [c37]Alessandro Fornasier, Yixiao Ge, Pieter van Goor, Martin Scheiber, Andrew Tridgell, Robert E. Mahony, Stephan Weiss:
An Equivariant Approach to Robust State Estimation for the ArduPilot Autopilot System. ICRA 2024: 11956-11962 - [i79]Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan:
LLaMA Pro: Progressive LLaMA with Block Expansion. CoRR abs/2401.02415 (2024) - [i78]Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou:
Towards A Better Metric for Text-to-Video Generation. CoRR abs/2401.07781 (2024) - [i77]Xiaohu Jiang, Yixiao Ge, Yuying Ge, Chun Yuan, Ying Shan:
Supervised Fine-tuning in turn Improves Visual Foundation Models. CoRR abs/2401.10222 (2024) - [i76]Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue:
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities. CoRR abs/2401.14405 (2024) - [i75]Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan:
YOLO-World: Real-Time Open-Vocabulary Object Detection. CoRR abs/2401.17270 (2024) - [i74]Yixiao Ge, Pieter van Goor, Robert E. Mahony:
A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups. CoRR abs/2403.16411 (2024) - [i73]Ruyang Liu, Chen Li, Haoran Tang, Yixiao Ge, Ying Shan, Ge Li:
ST-LLM: Large Language Models Are Effective Temporal Learners. CoRR abs/2404.00308 (2024) - [i72]Yuying Ge, Sijie Zhao, Jinguo Zhu, Yixiao Ge, Kun Yi, Lin Song, Chen Li, Xiaohan Ding, Ying Shan:
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation. CoRR abs/2404.14396 (2024) - [i71]Bohao Li, Yuying Ge, Yi Chen, Yixiao Ge, Ruimao Zhang, Ying Shan:
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension. CoRR abs/2404.16790 (2024) - [i70]Yuying Ge, Sijie Zhao, Chen Li, Yixiao Ge, Ying Shan:
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing. CoRR abs/2405.04007 (2024) - [i69]Chengyue Wu, Yixiao Ge, Qiushan Guo, Jiahao Wang, Zhixuan Liang, Zeyu Lu, Ying Shan, Ping Luo:
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots. CoRR abs/2405.07990 (2024) - [i68]Yicheng Xiao, Lin Song, Shaoli Huang, Jiangshan Wang, Siyu Song, Yixiao Ge, Xiu Li, Ying Shan:
GrootVL: Tree Topology is All You Need in State Space Model. CoRR abs/2406.02395 (2024) - [i67]Xubing Ye, Yukang Gan, Xiaoke Huang, Yixiao Ge, Ying Shan, Yansong Tang:
VoCo-LLaMA: Towards Vision Compression with Large Language Models. CoRR abs/2406.12275 (2024) - [i66]Shuai Yang, Yuying Ge, Yang Li, Yukang Chen, Yixiao Ge, Ying Shan, Yingcong Chen:
SEED-Story: Multimodal Long Story Generation with Large Language Model. CoRR abs/2407.08683 (2024) - [i65]Yixiao Ge, Behzad Zamani, Pieter van Goor, Jochen Trumpf, Robert E. Mahony:
Geometric Data Fusion for Collaborative Attitude Estimation. CoRR abs/2407.13176 (2024) - [i64]Zhuoyan Luo, Fengyuan Shi, Yixiao Ge, Yujiu Yang, Limin Wang, Ying Shan:
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation. CoRR abs/2409.04410 (2024) - 2023
- [c36]Rui Yan, Mike Zheng Shou, Yixiao Ge, Jinpeng Wang, Xudong Lin, Guanyu Cai, Jinhui Tang:
Video-Text Pre-training with Learned Regions for Retrieval. AAAI 2023: 3100-3108 - [c35]Binjie Zhang, Shupeng Su, Yixiao Ge, Xuyuan Xu, Yexin Wang, Chun Yuan, Mike Zheng Shou, Ying Shan:
Darwinian Model Upgrades: Model Evolving with Selective Compatibility. AAAI 2023: 3393-3400 - [c34]Yixiao Ge, Pieter van Goor, Robert E. Mahony:
A Note on the Extended Kalman Filter on a Manifold. CDC 2023: 7687-7694 - [c33]Jinpeng Wang, Yixiao Ge, Rui Yan, Yuying Ge, Kevin Qinghong Lin, Satoshi Tsutsui, Xudong Lin, Guanyu Cai, Jianping Wu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
All in One: Exploring Unified Video-Language Pre-Training. CVPR 2023: 6598-6608 - [c32]Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge:
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge. CVPR 2023: 23079-23089 - [c31]Teng Wang, Yixiao Ge, Feng Zheng, Ran Cheng, Ying Shan, Xiaohu Qie, Ping Luo:
Accelerating Vision-Language Pretraining with Free Language Modeling. CVPR 2023: 23161-23170 - [c30]Shusheng Yang, Yixiao Ge, Kun Yi, Dian Li, Ying Shan, Xiaohu Qie, Xinggang Wang:
RILS: Masked Visual Reconstruction in Language Semantic Space. CVPR 2023: 23304-23314 - [c29]Rui Yang, Lin Song, Yixiao Ge, Xiu Li:
BoxSnake: Polygonal Instance Segmentation with Box Supervision. ICCV 2023: 766-776 - [c28]Xiaotong Li, Zixuan Hu, Yixiao Ge, Ying Shan, Ling-Yu Duan:
Exploring Model Transferability through the Lens of Potential Energy. ICCV 2023: 5406-5415 - [c27]Yuxin Fang, Shusheng Yang, Shijie Wang, Yixiao Ge, Ying Shan, Xinggang Wang:
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection. ICCV 2023: 6221-6230 - [c26]Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Stan Weixian Lei, Yuchao Gu, Yufei Shi, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation. ICCV 2023: 7589-7599 - [c25]Kun Yi, Yixiao Ge, Xiaotong Li, Shusheng Yang, Dian Li, Jianping Wu, Ying Shan, Xiaohu Qie:
Masked Image Modeling with Denoising Contrast. ICLR 2023 - [c24]Chengyue Wu, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo:
π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation. ICML 2023: 37713-37727 - [c23]Yukang Gan, Yixiao Ge, Chang Zhou, Shupeng Su, Zhouchuan Xu, Xuyuan Xu, Quanchao Hui, Xiang Chen, Yexin Wang, Ying Shan:
Binary Embedding-based Retrieval at Tencent. KDD 2023: 4056-4067 - [c22]Cheng Cheng, Lin Song, Ruoyi Xue, Hang Wang, Hongbin Sun, Yixiao Ge, Ying Shan:
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model. NeurIPS 2023 - [c21]Yuchao Gu, Xintao Wang, Jay Zhangjie Wu, Yujun Shi, Yunpeng Chen, Zihan Fan, Wuyou Xiao, Rui Zhao, Shuning Chang, Weijia Wu, Yixiao Ge, Ying Shan, Mike Zheng Shou:
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models. NeurIPS 2023 - [c20]Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan:
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction. NeurIPS 2023 - [i63]Xiaotong Li, Zixuan Hu, Jun Liu, Yixiao Ge, Yongxing Dai, Ling-Yu Duan:
Modeling Uncertain Feature Representation for Domain Generalization. CoRR abs/2301.06442 (2023) - [i62]Shusheng Yang, Yixiao Ge, Kun Yi, Dian Li, Ying Shan, Xiaohu Qie, Xinggang Wang:
Masked Visual Reconstruction in Language Semantic Space. CoRR abs/2301.06958 (2023) - [i61]Yukang Gan, Yixiao Ge, Chang Zhou, Shupeng Su, Zhouchuan Xu, Xuyuan Xu, Quanchao Hui, Xiang Chen, Yexin Wang, Ying Shan:
Binary Embedding-based Retrieval at Tencent. CoRR abs/2302.08714 (2023) - [i60]Rui Yang, Lin Song, Yixiao Ge, Xiu Li:
BoxSnake: Polygonal Instance Segmentation with Box Supervision. CoRR abs/2303.11630 (2023) - [i59]Teng Wang, Yixiao Ge, Feng Zheng, Ran Cheng, Ying Shan, Xiaohu Qie, Ping Luo:
Accelerating Vision-Language Pretraining with Free Language Modeling. CoRR abs/2303.14038 (2023) - [i58]Chen Li, Yixiao Ge, Jiayong Mao, Dian Li, Ying Shan:
TagGPT: Large Language Models are Zero-shot Multimodal Taggers. CoRR abs/2304.03022 (2023) - [i57]Binqian Xu, Xiangbo Shu, Rui Yan, Guo-Sen Xie, Yixiao Ge, Mike Zheng Shou:
Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning. CoRR abs/2304.04023 (2023) - [i56]Chengyue Wu, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo:
π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation. CoRR abs/2304.14381 (2023) - [i55]Guangzhi Wang, Yixiao Ge, Xiaohan Ding, Mohan S. Kankanhalli, Ying Shan:
What Makes for Good Visual Tokenizers for Large Language Models? CoRR abs/2305.12223 (2023) - [i54]Ziyun Zeng, Yixiao Ge, Zhan Tong, Xihui Liu, Shu-Tao Xia, Ying Shan:
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale. CoRR abs/2305.14173 (2023) - [i53]Yuchao Gu, Xintao Wang, Jay Zhangjie Wu, Yujun Shi, Yunpeng Chen, Zihan Fan, Wuyou Xiao, Rui Zhao, Shuning Chang, Weijia Wu, Yixiao Ge, Ying Shan, Mike Zheng Shou:
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models. CoRR abs/2305.18292 (2023) - [i52]Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan:
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction. CoRR abs/2305.18752 (2023) - [i51]Sijie Zhao, Yixiao Ge, Zhongang Qi, Lin Song, Xiaohan Ding, Zehua Xie, Ying Shan:
Sticker820K: Empowering Interactive Retrieval with Stickers. CoRR abs/2306.06870 (2023) - [i50]Binjie Zhang, Yixiao Ge, Xuyuan Xu, Ying Shan, Mike Zheng Shou:
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter. CoRR abs/2306.12642 (2023) - [i49]Chen Li, Xutan Peng, Teng Wang, Yixiao Ge, Mengyang Liu, Xuyuan Xu, Yexin Wang, Ying Shan:
PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas. CoRR abs/2306.14644 (2023) - [i48]Yunpeng Bai, Xintao Wang, Yan-Pei Cao, Yixiao Ge, Chun Yuan, Ying Shan:
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals. CoRR abs/2306.16934 (2023) - [i47]Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan:
Planting a SEED of Vision in Large Language Model. CoRR abs/2307.08041 (2023) - [i46]Bohao Li, Rui Wang, Guangzhi Wang, Yuying Ge, Yixiao Ge, Ying Shan:
SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension. CoRR abs/2307.16125 (2023) - [i45]Weixian Lei, Yixiao Ge, Jianfeng Zhang, Dylan Sun, Kun Yi, Ying Shan, Mike Zheng Shou:
ViT-Lens: Towards Omni-modal Representations. CoRR abs/2308.10185 (2023) - [i44]Xiaotong Li, Zixuan Hu, Yixiao Ge, Ying Shan, Ling-Yu Duan:
Exploring Model Transferability through the Lens of Potential Energy. CoRR abs/2308.15074 (2023) - [i43]Alessandro Fornasier, Yixiao Ge, Pieter van Goor, Robert E. Mahony, Stephan Weiss:
Equivariant Symmetries for Inertial Navigation Systems. CoRR abs/2309.03765 (2023) - [i42]Yixiao Ge, Pieter van Goor, Robert E. Mahony:
A Note on the Extended Kalman Filter on a Manifold. CoRR abs/2309.06008 (2023) - [i41]Ruyang Liu, Chen Li, Yixiao Ge, Ying Shan, Thomas H. Li, Ge Li:
One For All: Video Conversation is Feasible Without Video Instruction Tuning. CoRR abs/2309.15785 (2023) - [i40]Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan:
Making LLaMA SEE and Draw with SEED Tokenizer. CoRR abs/2310.01218 (2023) - [i39]Cheng Cheng, Lin Song, Ruoyi Xue, Hang Wang, Hongbin Sun, Yixiao Ge, Ying Shan:
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model. CoRR abs/2311.03774 (2023) - [i38]Chen Li, Yixiao Ge, Dian Li, Ying Shan:
Vision-Language Instruction Tuning: A Review and Analysis. CoRR abs/2311.08172 (2023) - [i37]Xiaohan Ding, Yiyuan Zhang, Yixiao Ge, Sijie Zhao, Lin Song, Xiangyu Yue, Ying Shan:
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition. CoRR abs/2311.15599 (2023) - [i36]Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou:
ViT-Lens-2: Gateway to Omni-modal Intelligence. CoRR abs/2311.16081 (2023) - [i35]Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan:
SEED-Bench-2: Benchmarking Multimodal Large Language Models. CoRR abs/2311.17092 (2023) - [i34]Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu:
EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models. CoRR abs/2312.06722 (2023) - [i33]Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan:
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models. CoRR abs/2312.06739 (2023) - [i32]Jinguo Zhu, Xiaohan Ding, Yixiao Ge, Yuying Ge, Sijie Zhao, Hengshuang Zhao, Xiaohua Wang, Ying Shan:
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation. CoRR abs/2312.09251 (2023) - [i31]Zhaoyang Zhang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, Ping Luo:
Cached Transformers: Improving Transformers with Differentiable Memory Cache. CoRR abs/2312.12742 (2023) - 2022
- [c19]Yixiao Ge, Pieter van Goor, Robert E. Mahony:
Equivariant Filter Design for Discrete-time Systems. CDC 2022: 1243-1250 - [c18]Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Object-aware Video-language Pre-training for Retrieval. CVPR 2022: 3303-3312 - [c17]Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo:
Bridging Video-text Retrieval with Multiple Choice Questions. CVPR 2022: 16146-16155 - [c16]Xiaotong Li, Yixiao Ge, Kun Yi, Zixuan Hu, Ying Shan, Ling-Yu Duan:
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training. ECCV (30) 2022: 231-246 - [c15]Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo:
Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space. ECCV (34) 2022: 286-302 - [c14]Yuying Ge, Yixiao Ge, Xihui Liu, Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo:
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval. ECCV (35) 2022: 691-708 - [c13]Xiaotong Li, Yongxing Dai, Yixiao Ge, Jun Liu, Ying Shan, Lingyu Duan:
Uncertainty Modeling for Out-of-Distribution Generalization. ICLR 2022 - [c12]Wenqi Shao, Yixiao Ge, Zhaoyang Zhang, Xuyuan Xu, Xiaogang Wang, Ying Shan, Ping Luo:
Dynamic Token Normalization improves Vision Transformers. ICLR 2022 - [c11]Binjie Zhang, Yixiao Ge, Yantao Shen, Yu Li, Chun Yuan, Xuyuan Xu, Yexin Wang, Ying Shan:
Hot-Refresh Model Upgrades with Regression-Free Compatible Training in Image Retrieval. ICLR 2022 - [c10]Binjie Zhang, Yixiao Ge, Yantao Shen, Shupeng Su, Fanzi Wu, Chun Yuan, Xuyuan Xu, Yexin Wang, Ying Shan:
Towards Universal Backward-Compatible Representation Learning. IJCAI 2022: 1615-1621 - [i30]Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo:
BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions. CoRR abs/2201.04850 (2022) - [i29]Binjie Zhang, Yixiao Ge, Yantao Shen, Yu Li, Chun Yuan, Xuyuan Xu, Yexin Wang, Ying Shan:
Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image Retrieval. CoRR abs/2201.09724 (2022) - [i28]Xiaotong Li, Yongxing Dai, Yixiao Ge, Jun Liu, Ying Shan, Ling-Yu Duan:
Uncertainty Modeling for Out-of-Distribution Generalization. CoRR abs/2202.03958 (2022) - [i27]Binjie Zhang, Yixiao Ge, Yantao Shen, Shupeng Su, Fanzi Wu, Chun Yuan, Xuyuan Xu, Yexin Wang, Ying Shan:
Towards Universal Backward-Compatible Representation Learning. CoRR abs/2203.01583 (2022) - [i26]Alex Jinpeng Wang, Yixiao Ge, Rui Yan, Yuying Ge, Xudong Lin, Guanyu Cai, Jianping Wu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
All in One: Exploring Unified Video-Language Pre-training. CoRR abs/2203.07303 (2022) - [i25]Guanyu Cai, Yixiao Ge, Alex Jinpeng Wang, Rui Yan, Xudong Lin, Ying Shan, Lianghua He, Xiaohu Qie, Jianping Wu, Mike Zheng Shou:
Revitalize Region Feature for Democratizing Video-Language Pre-training. CoRR abs/2203.07720 (2022) - [i24]Xiaotong Li, Yixiao Ge, Kun Yi, Zixuan Hu, Ying Shan, Ling-Yu Duan:
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training. CoRR abs/2203.15371 (2022) - [i23]Yuxin Fang, Shusheng Yang, Shijie Wang, Yixiao Ge, Ying Shan, Xinggang Wang:
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection. CoRR abs/2204.02964 (2022) - [i22]Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo:
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval. CoRR abs/2204.12408 (2022) - [i21]Shupeng Su, Binjie Zhang, Yixiao Ge, Xuyuan Xu, Yexin Wang, Chun Yuan, Ying Shan:
Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval. CoRR abs/2204.13919 (2022) - [i20]Kun Yi, Yixiao Ge, Xiaotong Li, Shusheng Yang, Dian Li, Jianping Wu, Ying Shan, Xiaohu Qie:
Masked Image Modeling with Denoising Contrast. CoRR abs/2205.09616 (2022) - [i19]Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo:
Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space. CoRR abs/2207.03036 (2022) - [i18]Yixiao Ge, Pieter van Goor, Robert E. Mahony:
Equivariant Filter Design for Discrete-time systems. CoRR abs/2209.04965 (2022) - [i17]Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge:
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge. CoRR abs/2209.15280 (2022) - [i16]Binjie Zhang, Shupeng Su, Yixiao Ge, Xuyuan Xu, Yexin Wang, Chun Yuan, Mike Zheng Shou, Ying Shan:
Darwinian Model Upgrades: Model Evolving with Selective Compatibility. CoRR abs/2210.06954 (2022) - [i15]Yuchao Gu, Xintao Wang, Yixiao Ge, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis. CoRR abs/2212.03185 (2022) - [i14]Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation. CoRR abs/2212.11565 (2022) - 2021
- [c9]Shixiang Tang, Dapeng Chen, Lei Bai, Kaijian Liu, Yixiao Ge, Wanli Ouyang:
Mutual CRF-GNN for Few-Shot Learning. CVPR 2021: 2329-2339 - [c8]Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification. CVPR 2021: 3436-3445 - [c7]Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li:
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network. CVPR 2021: 16377-16386 - [c6]Chen Zhao, Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li, Mathieu Salzmann:
Progressive Correspondence Pruning by Consensus Learning. ICCV 2021: 6444-6453 - [c5]Yi Zheng, Shixiang Tang, Guolong Teng, Yixiao Ge, Kaijian Liu, Jing Qin, Donglian Qi, Dapeng Chen:
Online Pseudo Label Generation by Hierarchical Cluster Dynamics for Adaptive Person Re-identification. ICCV 2021: 8351-8361 - [i13]Chen Zhao, Yixiao Ge, Jiaqi Yang, Feng Zhu, Rui Zhao, Hongsheng Li:
Consensus-Guided Correspondence Denoising. CoRR abs/2101.00591 (2021) - [i12]Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li:
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network. CoRR abs/2103.07893 (2021) - [i11]Yixiao Ge, Ching Lam Choi, Xiao Zhang, Peipei Zhao, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification. CoRR abs/2104.13298 (2021) - [i10]Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification. CoRR abs/2106.06133 (2021) - [i9]Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Object-aware Video-language Pre-training for Retrieval. CoRR abs/2112.00656 (2021) - [i8]Rui Yan, Mike Zheng Shou, Yixiao Ge, Alex Jinpeng Wang, Xudong Lin, Guanyu Cai, Jinhui Tang:
Video-Text Pre-training with Learned Regions. CoRR abs/2112.01194 (2021) - [i7]Wenqi Shao, Yixiao Ge, Zhaoyang Zhang, Xuyuan Xu, Xiaogang Wang, Ying Shan, Ping Luo:
Dynamic Token Normalization Improves Vision Transformer. CoRR abs/2112.02624 (2021) - 2020
- [c4]Yixiao Ge, Haibo Wang, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization. ECCV (4) 2020: 369-386 - [c3]Yixiao Ge, Dapeng Chen, Hongsheng Li:
Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification. ICLR 2020 - [c2]Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, Hongsheng Li:
Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. NeurIPS 2020 - [i6]Yixiao Ge, Dapeng Chen, Hongsheng Li:
Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification. CoRR abs/2001.01526 (2020) - [i5]Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li:
Structured Domain Adaptation for Unsupervised Person Re-identification. CoRR abs/2003.06650 (2020) - [i4]Yixiao Ge, Dapeng Chen, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. CoRR abs/2006.02713 (2020) - [i3]Yixiao Ge, Haibo Wang, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-supervising Fine-grained Region Similarities for Large-scale Image Localization. CoRR abs/2006.03926 (2020) - [i2]Yixiao Ge, Shijie Yu, Dapeng Chen:
Improved Mutual Mean-Teaching for Unsupervised Domain Adaptive Re-ID. CoRR abs/2008.10313 (2020)
2010 – 2019
- 2018
- [c1]Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, Hongsheng Li:
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification. NeurIPS 2018: 1230-1241 - [i1]Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, Hongsheng Li:
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification. CoRR abs/1810.02936 (2018)
Coauthor Index
aka: Mike Zheng Shou
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 21:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint