qubvel

Pavel Iakubovskii qubvel

2k followers · 7 following

Achievements

x3 x3 x4

Achievements

x3 x3 x4

Organizations

Lists (1)

Sort

CVPR-2025

2 repositories

Stars

facebookresearch / vjepa2

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 1,459 108 Updated Jun 20, 2025

antonibigata / keyface_cvpr

[CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation

Python 48 5 Updated Apr 8, 2025

felixtaubner / cap4d

Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"

174 10 Updated Dec 15, 2024

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,426 291 Updated Jun 18, 2025

roboflow / trackers

A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms

Python 1,765 153 Updated Jun 19, 2025

EmmanuelleB985 / UK_BOB

Python 24 Updated Jun 16, 2025

MMMU-Benchmark / MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Python 447 34 Updated May 19, 2025

thuml / depyf

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 690 26 Updated Apr 20, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 8,648 846 Updated Jun 18, 2025

openai / preparedness

Releases from OpenAI Preparedness

Python 782 77 Updated May 30, 2025

DepthAnything / PromptDA

[CVPR 2025] Prompt Depth Anything

Python 836 50 Updated Mar 4, 2025

tue-mps / eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 184 12 Updated Jun 19, 2025

TianHuiLab / Falcon

Falcon: A Remote Sensing Vision-Language Foundation Model

Python 291 28 Updated Apr 10, 2025

gradio-app / fastrtc

The python library for real-time communication

JavaScript 4,044 366 Updated Jun 13, 2025

PatrickJS / awesome-cursorrules

📄 A curated list of awesome .cursorrules files

28,766 2,335 Updated Mar 20, 2025

casey / just

🤖 Just a command runner

Rust 26,058 554 Updated Jun 17, 2025

qubvel / rt-pose

Real-time pose estimation pipeline with 🤗 Transformers

Python 60 8 Updated Feb 7, 2025

qubvel / transformers-notebooks

Inference and fine-tuning examples for vision models from 🤗 Transformers

Jupyter Notebook 151 26 Updated May 5, 2025

iSEE-Laboratory / LLMDet

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 236 15 Updated Jun 6, 2025

mbzuai-oryx / GeoPixel

GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image analysis, offering advanced multi-target pixel grounding cap…

Python 96 9 Updated May 28, 2025

mikel-brostrom / boxmot

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Python 7,429 1,808 Updated Jun 21, 2025

qubvel-org / segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 10,581 1,757 Updated Jun 19, 2025

yformer / EfficientTAM

Efficient Track Anything

Python 566 19 Updated Jan 6, 2025

IDEA-Research / X-Pose

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python 693 34 Updated Aug 16, 2024

opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。

Python 35,605 2,904 Updated Jun 21, 2025

HVision-NKU / SRFormer

Official code for "SRFormer: Permuted Self-Attention for Single Image Super-Resolution" (ICCV 2023) and SRFormerV2

Python 273 23 Updated Aug 18, 2024

facebookresearch / videoseal

Open and efficient video watermarking

Python 413 48 Updated Jun 21, 2025

CVMI-Lab / CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Python 116 7 Updated Apr 26, 2024

ShihuaHuang95 / DEIM

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 878 132 Updated Mar 12, 2025

microsoft / torchgeo

TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data

Python 3,475 444 Updated Jun 20, 2025