xcarson / Starred · GitHub

Starred repositories

Python 1,090 64 Updated Apr 21, 2025

Official repository of In-Context LoRA for Diffusion Transformers

1,850 90 Updated Dec 20, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 16,292 1,489 Updated Sep 5, 2024

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 899 58 Updated Sep 8, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,591 131 Updated Dec 17, 2024

JavaScript animation engine

JavaScript 60,367 4,052 Updated Apr 25, 2025

React Native's Animated library reimplemented

TypeScript 9,660 1,362 Updated May 14, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 8,075 611 Updated Apr 27, 2025

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,588 1,284 Updated Aug 14, 2024

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,233 262 Updated Jan 18, 2025

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 619 41 Updated Jan 29, 2025

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 615 68 Updated Dec 10, 2024

AI semantic search for local media: search by image, match footage to text descriptions, search video frames, and find videos by scene description. Search local photos and videos through natural language.

Python 1,448 168 Updated May 10, 2025

"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"

Python 650 72 Updated May 6, 2025

Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,054 52 Updated May 6, 2025

Your AI Operator for Web, Android, Automation & Testing.

TypeScript 8,812 524 Updated May 14, 2025

The swiss army knife of lossless video/audio editing

TypeScript 31,134 1,463 Updated May 8, 2025

Cut video files with minimal recoding

Python 155 11 Updated Feb 27, 2025

A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

Python 2,193 307 Updated Feb 13, 2025

🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation

Python 6,481 863 Updated Feb 10, 2025

Let's make video diffusion practical!

Python 12,984 1,095 Updated May 4, 2025

Auto-Editor: Efficient media analysis and rendering

Python 3,292 452 Updated May 13, 2025

Command line utility for forced alignment using Kaldi

Python 1,470 254 Updated Mar 25, 2025

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Python 2,645 249 Updated Jun 22, 2024

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Python 2,232 226 Updated Apr 16, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"

Python 1,463 115 Updated Apr 14, 2025

[TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥

Python 212 18 Updated Apr 18, 2025

A web-based Video Editing SDK built on WebCodecs.

TypeScript 1,597 179 Updated May 12, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 1,701 150 Updated May 14, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 10,580 958 Updated Aug 7, 2024