Stars
🌲 Stanford CS 228 - Probabilistic Graphical Models
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥
[NeurIPS2024] Official Implementation of the paper [Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Segmentation]
[NeurIPS 2024] Mixture of Experts for Audio-Visual Learning
The official PyTorch implementation of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".
The official repository for the IJCV (2024) paper "Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks"
The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"
The official repository for ICLR2025 paper "HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts"
The official repository for ECCV2024 paper "RegionDrag: Fast Region-Based Image Editing with Diffusion Models"
[CVPR 2025 highlight] v-CLR: View-Consistent Learning for Open-World Instance Segmentation
[CVPR 2025] Mr. DETR: Instructive Multi-Route Training for Detection Transformers
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models
[CVPR 2025] Official PyTorch Code for "DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models"
Code for CVPR2025 "MMRL: Multi-Modal Representation Learning for Vision-Language Models" and its extension "MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Lang…
Official PyTorch implementation for the paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper was accepted at CVPR 2025.
Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)
[ICLR 2025] Official repository for “Efficient and Context-Aware Label Propagation for Zero-/Few-Shot Training-Free Adaptation of Vision-Language Model”
Official PyTorch Implementation of RA-TTA (ICLR25)
Curated list of recent visual autoregressive (VAR) modeling works
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.
[NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models