Vladimir2506

Zhuofan Xia Vladimir2506

PhD candidate at Tsinghua University.

88 followers · 126 following

Beijing
www.zhuofanxia.xyz

Highlights

Stars

microsoft / art-msra

[CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Jupyter Notebook 299 34 Updated Apr 29, 2025

LeapLabTHU / Segment3D

Python 85 7 Updated Dec 29, 2024

LeapLabTHU / Attention-Mediators

[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Python 44 2 Updated Sep 11, 2024

LeapLabTHU / GSVA

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 133 Updated Sep 12, 2024

roboflow / supervision

We write your reusable computer vision tools. 💜

Python 26,688 2,016 Updated May 31, 2025

LTH14 / rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 915 40 Updated Sep 27, 2024

Yangyi-Chen / Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

682 43 Updated May 30, 2025

LeapLabTHU / ExpeL

Python 135 16 Updated Dec 20, 2024

SHI-Labs / Smooth-Diffusion

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Python 340 7 Updated Sep 24, 2024

LeapLabTHU / Agent-Attention

Official repository of Agent Attention (ECCV2024)

Python 621 40 Updated Nov 17, 2024

linexjlin / GPTs

leaked prompts of GPTs

29,885 4,062 Updated Sep 27, 2024

toggle1995 / RIS-DMMI

Python 41 Updated Oct 3, 2023

palchenli / VL-Instruction-Tuning

91 4 Updated Nov 25, 2023

LeapLabTHU / FamO2O

Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)

Python 37 2 Updated Oct 30, 2023

yueyang130 / SEEM

Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

Python 23 1 Updated Oct 30, 2023

lichengunc / refer

Referring Expression Datasets API

Jupyter Notebook 516 82 Updated Aug 27, 2024

LeapLabTHU / Rank-DETR

[NeurIPS 2023] Rank-DETR for High Quality Object Detection

Python 91 8 Updated Oct 19, 2023

shanice-l / gdrnpp_bop2022

[T-PAMI'25] PyTorch Implementation of GDRNPP, winner (most of the awards) of the BOP Challenge 2022 at ECCV'22

C++ 271 57 Updated May 5, 2025

LeapLabTHU / ARC

[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection

Python 135 6 Updated Mar 15, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,404 997 Updated May 30, 2025

witnessai / Awesome-Open-Vocabulary-Object-Detection

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

324 19 Updated May 13, 2025

jianzongwu / Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

930 49 Updated Mar 23, 2025

baaivision / Emu

Emu Series: Generative Multimodal Models from BAAI

Python 1,723 85 Updated Sep 27, 2024

LeapLabTHU / DAT-Segmentation

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Python 23 2 Updated Sep 7, 2023

LeapLabTHU / DAT-Detection

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Python 19 2 Updated Apr 17, 2024

dvlab-research / LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,227 157 Updated Feb 16, 2025

LeapLabTHU / DAPrompt

Pytorch implementation of DAPrompt: https://arxiv.org/abs/2202.06687

Python 93 12 Updated Feb 12, 2023

LeapLabTHU / MOSS

Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning

Python 22 Updated Nov 16, 2022

LeapLabTHU / Cross-Modal-Adapter

[arXiv] Cross-Modal Adapter for Text-Video Retrieval

55 2 Updated Nov 21, 2022

LeapLabTHU / Text4Point

36 1 Updated Jan 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly