chaoleitan (Chaolei) / Starred · GitHub

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)

Python 24 2 Updated Apr 17, 2025

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 209 11 Updated Apr 7, 2025

SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability

7 Updated May 8, 2025

OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning

Python 15 Updated May 24, 2025

The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"

Python 143 9 Updated May 30, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 887 46 Updated Mar 20, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,043 138 Updated May 28, 2025

PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Python 444 35 Updated Sep 3, 2023

(AAAI 2025) Official PyTorch implementation of paper "SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection".

Python 13 Updated May 11, 2025

Denoising Diffusion Probabilistic Models

Python 4,432 418 Updated Aug 29, 2023

Image-to-Image Translation in PyTorch

Python 24,068 6,454 Updated May 14, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,697 148 Updated May 28, 2025

SAM with text prompt

Python 2,198 252 Updated May 10, 2025

This is a repository for listing papers on scene graph generation and application.

327 23 Updated May 25, 2025

[ICLR'25] Official code for "Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models"

Python 5 Updated May 10, 2025

[CVPR 2025 🔥] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

5 Updated Mar 18, 2025

Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.

Python 30 4 Updated Aug 27, 2024

Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

Python 23 1 Updated May 15, 2025
Python 3 1 Updated Mar 16, 2025

Detection Transformers with Assignment

Python 255 21 Updated Sep 16, 2023

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,175 60 Updated May 28, 2025

UniMD: Towards Unifying Moment retrieval and temporal action Detection

Python 48 1 Updated Jul 5, 2024

Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).

Python 12 Updated Jan 9, 2025

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

2,794 734 Updated May 19, 2023

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 133 Updated Sep 12, 2024

This repository is for the first survey on SAM & SAM2 for Videos.

49 4 Updated Apr 29, 2025

[EMNLP 2022] Official PyTorch code for "Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval"

Python 10 Updated May 28, 2024
Python 9 Updated Jan 10, 2025

[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval

Python 42 7 Updated Sep 23, 2021