Stars
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
High-speed Large Language Model Serving for Local Deployment
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.
Official implementations for paper: Anydoor: zero-shot object-level image customization
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Anime Face Detector using mmdet and mmpose
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.
Stable Diffusion-based image manipulation method with a sketch and reference image
Official implementation of "AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment" (ECCV 2022)
Auto detecting, masking and inpainting with detection model.
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Generative Models by Stability AI
[SIGGRAPH Asia '23] FLARE: Fast Learning of Animatable and Relightable Mesh Avatars
The official code of our ICCV 2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head Video Generation
[NeurIPS 2023] Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
Bark Voice Cloning and Voice Cloning for Chinese Speech
Command & Conquer: Remastered Collection
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
Python Library for Accessing the Cohere API