8000 wy3406 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View wy3406's full-sized avatar

Block or report wy3406

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 760 15 Updated May 21, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 911 69 Updated May 15, 2025

DreamO: A Unified Framework for Image Customization

Python 1,243 76 Updated May 13, 2025

KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

Jupyter Notebook 286 28 Updated May 20, 2025

Lets make video diffusion practical!

Python 13,426 1,160 Updated May 4, 2025

Pythonic AI generation of images and videos

Python 8,106 465 Updated Sep 22, 2024

Coherent Video Inpainting Using Optical Flow-Guided Efficient Diffusion

Python 285 3 Updated May 17, 2025

[Support 0.49.x](Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor Ai ,自动重置机器ID , 免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machi…

Python 26,485 3,314 Updated May 21, 2025

Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers

Python 206 6 Updated Apr 16, 2025

Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 576 32 Updated Apr 8, 2025

PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask

Python 122 22 Updated Jan 11, 2025

ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., audio, expression).

264 17 Updated Apr 19, 2025

Awesome Instruction Editing. Image and Media Editing with Human Instructions. Instruction-Guided Image and Media Editing.

59 2 Updated May 21, 2025

Enjoy the magic of Diffusion models!

Python 8,668 776 Updated May 19, 2025

High quality training free inpaint for every stable diffusion model.

Python 267 7 Updated Apr 22, 2025

[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"

Python 368 20 Updated Apr 8, 2025

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 435 24 Updated May 21, 2025

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

Python 1,967 341 Updated Jun 4, 2023

[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

Python 229 6 Updated May 18, 2025

JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.

Python 558 27 Updated May 15, 2025

Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait

Python 256 37 Updated Apr 20, 2025

OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting

215 7 Updated Apr 22, 2025

LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨

Python 523 26 Updated May 14, 2025

introduce video face restoration method

16 Updated Aug 28, 2024

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 1,400 70 Updated Mar 29, 2025

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 1,992 105 Updated May 15, 2025

Spark-TTS Inference Code

Python 9,484 993 Updated Apr 9, 2025

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Python 258 22 Updated May 21, 2025

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 4,167 527 Updated Apr 22, 2025

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Python 1,066 187 Updated Sep 25, 2023
Next
0