Stars
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
VMamba: Visual State Space Models,code is based on mamba
the official pytorch implementation of “Mamba-YOLO:SSMs-based for Object Detection”
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
Оценка Позы головы Python onnx runtime
[ECCV2022] Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance
Implementation for Describe Anything: Detailed Localized Image and Video Captioning
Joint Video Multi-Frame Interpolation and Deblurring under Unknown Exposure Time (CVPR2023)
[CVPR2023] Blur Interpolation Transformer for Real-World Motion from Blur
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Python scripts form performing stereo depth estimation using the HITNET model in ONNX.
[CVPR 2023 Highlight] The Official Code for High-Frequency Stereo Matching Network
Image Restoration with Mean-Reverting Stochastic Differential Equations, ICML 2023. Winning solution of the NTIRE 2023 Image Shadow Removal Challenge.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
SwinIR: Image Restoration Using Swin Transformer (official repository)
VRT: A Video Restoration Transformer (official repository)
[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer
[ICLR2025] Official Implementations "InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration"
A simple face restoration TensorRT deployment solution.
当下热门的模糊人脸修复模型的部署,分别是:Codeformer,GFPGAN,GPEN,Restoreformer
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
A C++ implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano