Stars
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Training a transformer to generate cursive handwriting
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Python Computer Vision & Video Analytics Framework With Batteries Included
rtop, a performance monitor for the Rockchip RK3566/68/88
Lightweight framework for AI-based object detection with live cameras (USB/IP) and Telegram-bot notifications. Use YOLO or adapt it for your own AI models and catch the best shot!
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Benchmarking Generalized Out-of-Distribution Detection
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
A simple demo of yolov5s running on rk3588/3588s using Python (about 72 FPS).
A simple demo of yolov5s running on rk3588/3588s using C++ (about 142 FPS).
Official Code for DragGAN (SIGGRAPH 2023)
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Easily train or fine-tune SOTA computer vision models with one open-source training library. The home of YOLO-NAS.
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.