brainhome / Starred · GitHub
Python 572 60 Updated Jul 2, 2025

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 1,029 109 Updated Jul 1, 2025

OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.

Python 4,871 590 Updated Jun 18, 2025

Self-Adapting Language Models

Python 644 110 Updated Jun 18, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 1,500 91 Updated Jul 2, 2025

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 3,447 258 Updated Jun 24, 2025

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 2,805 190 Updated May 15, 2025

🔥🔥 First-ever hour-scale video understanding models

Python 475 28 Updated Jul 1, 2025

A cross-platform framework for deploying LLMs, VLMs, Embedding Models, TTS models and more locally on smartphones.

C++ 1,125 60 Updated Jul 2, 2025

SoTA open-source TTS

Python 9,042 1,038 Updated Jun 13, 2025

[CVPR 2025] HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

Python 258 30 Updated Jun 6, 2025

unsloth-5090-multiple

Python 22 9 Updated May 21, 2025

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,403 111 Updated Jul 1, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,515 1,109 Updated Sep 14, 2024

Official Repository of Absolute Zero Reasoner

Python 1,577 267 Updated Jul 1, 2025
Python 419 39 Updated May 6, 2025

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.

Python 79 2 Updated May 9, 2025

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Python 756 112 Updated Jul 2, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 107 16 Updated May 19, 2025

Let's make video diffusion practical!

Python 14,835 1,344 Updated Jun 27, 2025

Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]

Python 98 15 Updated Mar 20, 2025
Python 5,590 418 Updated May 11, 2025

SkyReels-A2: Compose anything in video diffusion transformers

Python 621 58 Updated Jun 3, 2025

Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait

Python 267 37 Updated May 26, 2025

Free speech dataset consisting of 24018 short audio clips of a single speaker reading sentences in Polish

9 Updated Dec 29, 2023

This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest resea…

Python 98 8 Updated Jun 26, 2025

Towards Human-Sounding Speech

Python 5,134 419 Updated May 6, 2025

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.

C++ 274 20 Updated May 22, 2025

A Conversational Speech Generation Model

Python 13,650 1,327 Updated May 27, 2025