8000 ZlodeiBaal (Anton) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ZlodeiBaal's full-sized avatar

Block or report ZlodeiBaal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching

Python 1,639 98 Updated May 17, 2025

SoTA open-source TTS

Python 5,972 663 Updated Jun 4, 2025

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Python 1,220 75 Updated Jun 7, 2025

Training a transformer to generate cursive handwriting

Jupyter Notebook 23 3 Updated Apr 2, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,571 207 Updated Jun 5, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,481 555 Updated Jun 6, 2025

Depth Any Video with Scalable Synthetic Data (ICLR 2025)

Python 483 29 Updated Dec 4, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,822 133 Updated Dec 6, 2024

Python Computer Vision & Video Analytics Framework With Batteries Included

Python 647 58 Updated Jun 3, 2025

rtop, a performance monitor for the Rockchips RK3566/68/88

C++ 13 1 Updated Mar 17, 2025

rtop, a performance monitor for the Rockchips RK3566/68/88

C++ 4 Updated Sep 16, 2024

Light-weight framework for Objects AI-detection with Live-Cameras (USB/IP) and Telegram-bot notifications. Use Yolo or adjust for you own AI-models support and catch the best shot!

Python 4 Updated Oct 19, 2024

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

Python 232 9 Updated Apr 3, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,739 1,811 Updated Dec 25, 2024

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,971 1,690 Updated Jun 6, 2025

Benchmarking Generalized Out-of-Distribution Detection

Python 962 140 Updated May 19, 2025

[ECCV2024] Video Foundation A491 Models & Data for Multimodal Understanding

Python 1,901 112 Updated May 25, 2025

DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)

Python 489 28 Updated Mar 28, 2025

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything

Python 1,298 87 Updated May 1, 2025
C++ 816 94 Updated May 19, 2025

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

Jupyter Notebook 3,253 618 Updated Jun 6, 2025

A simple demo of yolov5s running on rk3588/3588s using Python (about 72 frames). / 一个使用Python在rk3588/3588s上运行的yolov5s简单demo(大约72帧/s)。

Python 306 53 Updated May 7, 2023

A simple demo of yolov5s running on rk3588/3588s using c++ (about 142 frames). / 一个使用c++在rk3588/3588s上运行的yolov5s简单demo(142帧/s)。

C 591 110 Updated Apr 9, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,913 3,442 Updated May 18, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 3,011 275 Updated Jun 4, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,094 2,348 Updated Mar 13, 2025

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook 4,780 543 Updated Sep 17, 2024

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,064 386 Updated May 25, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 79,079 8,733 Updated Jun 7, 2025
Next
0