Vladimir2506 (Zhuofan Xia) / Starred · GitHub

[CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Jupyter Notebook 299 34 Updated Apr 29, 2025
Python 85 7 Updated Dec 29, 2024

[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Python 44 2 Updated Sep 11, 2024

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 133 Updated Sep 12, 2024

We write your reusable computer vision tools. 💜

Python 26,688 2,016 Updated May 31, 2025

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 915 40 Updated Sep 27, 2024

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

682 43 Updated May 30, 2025
Python 135 16 Updated Dec 20, 2024

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Python 340 7 Updated Sep 24, 2024

Official repository of Agent Attention (ECCV2024)

Python 621 40 Updated Nov 17, 2024

leaked prompts of GPTs

29,885 4,062 Updated Sep 27, 2024
Python 41 Updated Oct 3, 2023

Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)

Python 37 2 Updated Oct 30, 2023

Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

Python 23 1 Updated Oct 30, 2023

Referring Expression Datasets API

Jupyter Notebook 516 82 Updated Aug 27, 2024

[NeurIPS 2023] Rank-DETR for High Quality Object Detection

Python 91 8 Updated Oct 19, 2023

[T-PAMI'25] PyTorch Implementation of GDRNPP, winner (most of the awards) of the BOP Challenge 2022 at ECCV'22

C++ 271 57 Updated May 5, 2025

[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection

Python 135 6 Updated Mar 15, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,404 997 Updated May 30, 2025

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

324 19 Updated May 13, 2025

(TPAMI 2024) A Survey on Open Vocabulary Learning

930 49 Updated Mar 23, 2025

Emu Series: Generative Multimodal Models from BAAI

Python 1,723 85 Updated Sep 27, 2024

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Python 23 2 Updated Sep 7, 2023

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Python 19 2 Updated Apr 17, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,227 157 Updated Feb 16, 2025

PyTorch implementation of DAPrompt: https://arxiv.org/abs/2202.06687

Python 93 12 Updated Feb 12, 2023

Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning

Python 22 Updated Nov 16, 2022

[arXiv] Cross-Modal Adapter for Text-Video Retrieval

55 2 Updated Nov 21, 2022