xcarson

3 followers · 3 following

Starred repositories

ali-vilab / ACE_plus

Python 1,090 64 Updated Apr 21, 2025

ali-vilab / In-Context-LoRA

Official repository of In-Context LoRA for Diffusion Transformers

1,850 90 Updated Dec 20, 2024

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 16,292 1,489 Updated Sep 5, 2024

open-mmlab / PowerPaint

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 899 58 Updated Sep 8, 2024

TencentARC / BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,591 131 Updated Dec 17, 2024

juliangarnier / anime

JavaScript animation engine

JavaScript 60,367 4,052 Updated Apr 25, 2025

software-mansion / react-native-reanimated

React Native's Animated library reimplemented

TypeScript 9,660 1,362 Updated May 14, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 8,075 611 Updated Apr 27, 2025

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,588 1,284 Updated Aug 14, 2024

OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,233 262 Updated Jan 18, 2025

rese1f / MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 619 41 Updated Jan 29, 2025

Vision-CAIR / MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 615 68 Updated Dec 10, 2024

chn-lee-yumi / MaterialSearch

AI semantic search for local media: search by image, find local media, match frames to a text description, search video frames, and find videos by scene description. Search local photos and videos through natural language.

Python 1,448 168 Updated May 10, 2025

HKUDS / VideoRAG

"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"

Python 650 72 Updated May 6, 2025

NVlabs / describe-anything

Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,054 52 Updated May 6, 2025

web-infra-dev / midscene

Your AI Operator for Web, Android, Automation & Testing.

TypeScript 8,812 524 Updated May 14, 2025

mifi / lossless-cut

The swiss army knife of lossless video/audio editing

TypeScript 31,134 1,463 Updated May 8, 2025

skeskinen / smartcut

Cut video files with minimal recoding

Python 155 11 Updated Feb 27, 2025

SamurAIGPT / AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

Python 2,193 307 Updated Feb 13, 2025

RayVentura / ShortGPT

🚀🎬 ShortGPT - Experimental AI framework for YouTube Shorts / TikTok channel automation

Python 6,481 863 Updated Feb 10, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 12,984 1,095 Updated May 4, 2025

WyattBlue / auto-editor

Auto-Editor: Efficient media analysis and rendering

Python 3,292 452 Updated May 13, 2025

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,470 254 Updated Mar 25, 2025

readbeyond / aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Python 2,645 249 Updated Jun 22, 2024

bytedance / InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Python 2,232 226 Updated Apr 16, 2025

Xiaojiu-z / EasyControl

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"

Python 1,463 115 Updated Apr 14, 2025

qinghew / CharacterFactory

[TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥

Python 212 18 Updated Apr 18, 2025

WebAV-Tech / WebAV

A web-based Video Editing SDK built on WebCodecs.

TypeScript 1,597 179 Updated May 12, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 1,701 150 Updated May 14, 2025

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 10,580 958 Updated Aug 7, 2024
