chaoleitan (Chaolei) / Starred · GitHub

Chaolei chaoleitan

1 follower · 4 following

Stars

Zhuo-Cao / FlashVTG

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)

Python 24 2 Updated Apr 17, 2025

QingyangZhang / Label-Free-RLVR

66 Updated Jun 1, 2025

iSEE-Laboratory / LLMDet

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 209 11 Updated Apr 7, 2025

Jayce1kk / SpaceVLLM

SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability

7 Updated May 8, 2025

Hanzy1996 / OpenSeg-R

OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning

Python 15 Updated May 24, 2025

dvlab-research / VisionReasoner

The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"

Python 143 9 Updated May 30, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 887 46 Updated Mar 20, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,043 138 Updated May 28, 2025

dome272 / MaskGIT-pytorch

PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Python 444 35 Updated Sep 3, 2023

Star-xing1 / SAUGE

(AAAI 2025) Official PyTorch implementation of paper "SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection".

Python 13 Updated May 11, 2025

hojonathanho / diffusion

Denoising Diffusion Probabilistic Models

Python 4,432 418 Updated Aug 29, 2023

junyanz / pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Python 24,068 6,454 Updated May 14, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,697 148 Updated May 28, 2025

luca-medeiros / lang-segment-anything

SAM with text prompt

Python 2,198 252 Updated May 10, 2025

ChocoWu / Awesome-Scene-Graph-Generation

This is a repository for listing papers on scene graph generation and application.

327 23 Updated May 25, 2025

EsYoon7 / UVQA

[ICLR'25] Official code for "Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models"

Python 5 Updated May 10, 2025

LunarShen / DsicoVLA

[CVPR 2025 🔥] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

5 Updated Mar 18, 2025

yuz1wan / video_distillation

Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.

Python 30 4 Updated Aug 27, 2024

V-STaR-Bench / V-STaR

Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

Python 23 1 Updated May 15, 2025

Kelvin-ywc / diff-prompt

Python 3 1 Updated Mar 16, 2025

jozhang97 / DETA

Detection Transformers with Assignment

Python 255 21 Updated Sep 16, 2023

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,175 60 Updated May 28, 2025

yingsen1 / UniMD

UniMD: Towards Unifying Moment retrieval and temporal action Detection

Python 48 1 Updated Jul 5, 2024

Kangningthu / SUM

Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).

Python 12 Updated Jan 9, 2025

paperswithcode / releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

2,794 734 Updated May 19, 2023

LeapLabTHU / GSVA

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 133 Updated Sep 12, 2024

983632847 / SAM-for-Videos

This repository is for the first survey on SAM & SAM2 for Videos.

49 4 Updated Apr 29, 2025

minjoong507 / MPGN

[EMNLP 2022] Official PyTorch code for "Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval"

Python 10 Updated May 28, 2024

Ranking-VMR / SPR

Python 9 Updated Jan 10, 2025

houzhijian / CONQUER

[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval

Python 42 7 Updated Sep 23, 2021
