hnhbcc / Starred · GitHub

TPAMI:Frequency-aware Feature Fusion for Dense Image Prediction

Jupyter Notebook 408 20 Updated Feb 14, 2025

Official codebase for ICRA oral paper "Fine-Grained Open-Vocabulary Object Detection with Fine-Grained Prompts: Task, Dataset and Benchmark"

Python 1 1 Updated Jun 1, 2025

New generation of CLIP with fine grained discrimination capability, ICML2025

Python 201 8 Updated May 21, 2025

Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.

Python 35 2 Updated Jan 20, 2024

🔥🔥First-ever hour scale video understanding models

Python 437 27 Updated Jun 3, 2025

Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

185 7 Updated Apr 3, 2025

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 967 36 Updated Jan 21, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,281 836 Updated Aug 12, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 8,387 641 Updated May 29, 2025

Code for "Multi-view Reconstruction via SfM-guided Monocular Depth Estimation". CVPR 2025 (Oral Presentation)

Python 293 24 Updated Apr 29, 2025

The official repository for the RealSyn dataset

34 2 Updated Apr 28, 2025

CUHK-SYSU-TBPS&&PRW-TBPS

3 Updated Mar 23, 2022

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

Python 43 2 Updated Jun 10, 2025

[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch

Python 130 17 Updated Jun 17, 2025

A collection of awesome works on reasoning models like O1/R1 in the visual domain

29 1 Updated Jun 16, 2025

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

Python 51 3 Updated May 7, 2024

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

284 21 Updated Jun 6, 2025

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

Python 127 7 Updated Mar 20, 2024

The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models".

Python 270 18 Updated May 28, 2025

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

712 36 Updated Nov 4, 2024
Python 24 4 Updated Feb 14, 2025

High-Resolution 3D Human Digitization from A Single Image.

Python 9,695 1,476 Updated Aug 19, 2024

The official code for "ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations" presented at CVPR 2022, along with its extended version ImFace++.

Python 174 9 Updated Jul 10, 2024

3D version of the MNIST database of handwritten digits

Jupyter Notebook 14 13 Updated Nov 4, 2016

[MICCAI 2024] Easy diffusion models (optionally with segmentation guidance) for medical images and beyond.

Python 168 12 Updated Jun 18, 2025
Python 19 1 Updated Apr 14, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,806 297 Updated Mar 10, 2025

Automatic segmentation of CBCT scans with a 3D Unet

Python 45 10 Updated Jul 29, 2022
Python 12 2 Updated Oct 1, 2024