-
Home Intelligent System
- Poland
-
08:42
(UTC -12:00) - https://gadzety360.pl
- @Gadzety360pl
Stars
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Official implementations for paper: VACE: All-in-One Video Creation and Editing
🔥🔥First-ever hour scale video understanding models
A cross-platform framework for deploying LLMs, VLMs, Embedding Models, TTS models and more locally on smartphones.
[CVPR 2025] HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Official Repository of Absolute Zero Reasoner
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
stlohrey / dia-finetuning
Forked from nari-labs/diaA TTS model capable of generating ultra-realistic dialogue in one pass.
Lets make video diffusion practical!
Saganaki22 / OrpheusTTS-WebUI
Forked from canopyai/Orpheus-TTSLightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]
SkyReels-A2: Compose anything in video diffusion transformers
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
Free speech dataset consisting of 24018 short audio clips of a single speaker reading sentences in Polish
This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest resea…
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.
A Conversational Speech Generation Model