eavven / Starred · GitHub

ComfyUI-ReduxFineTune is a custom node for ComfyUI that enables advanced style fine-tuning using the Flux Redux approach. It offers multiple unified fusion modes for precise and consistent control …

Python 41 3 Updated May 5, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,281 987 Updated May 15, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,002 229 Updated May 19, 2025

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,060 56 Updated Apr 17, 2025

Collection of diffusion model papers categorized by their subareas

1,727 80 Updated May 23, 2025

Implementation of ColorizeDiffusion

Python 61 5 Updated May 11, 2025

[CVPR 2025] DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles

22 1 Updated May 13, 2025

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

Python 336 8 Updated Jul 26, 2024

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Python 700 46 Updated May 11, 2025

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Python 1,177 150 Updated Jan 26, 2025

UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer

Jupyter Notebook 100 10 Updated May 5, 2024

[CVPR 2025] Attention Distillation: A Unified Approach to Visual Characteristics Transfer

Python 171 14 Updated Mar 8, 2025

M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment

Python 9 2 Updated Feb 26, 2025

Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"

Python 387 26 Updated Apr 23, 2025

An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional variability in sampling steps

Python 128 5 Updated Feb 17, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,167 222 Updated Mar 10, 2025

[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Python 185 11 Updated Feb 19, 2025

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Python 263 32 Updated Apr 23, 2025
Python 18 Updated Mar 3, 2025

https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.

Python 1,023 43 Updated Mar 27, 2025

Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control

185 3 Updated Dec 31, 2024
Python 1,120 66 Updated Apr 21, 2025

A Training-free Iterative Framework for Long Story Visualization

Python 889 125 Updated Jan 18, 2025

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 245 6 Updated Mar 26, 2025
Python 519 29 Updated Jan 20, 2025

[NeurIPS 2024] Generalizable Implicit Motion Modeling for Video Frame Interpolation

Python 301 12 Updated Nov 18, 2024

Official repository for LTX-Video

Python 5,926 475 Updated May 21, 2025