Stars
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
A Python package to stabilize videos using OpenCV
Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit comparable hash-value for any video.
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
DSNet: A Flexible Detect-to-Summa 8000 rize Network for Video Summarization
HunyuanVideo: A Systematic Framework For Large Video Generation Model
SkyReels V1: The first and most advanced open-source human-centric video foundation model
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
Open-source and strong foundation image recognition models.
Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)
An open-source toolbox for fast sampling of diffusion models. Official implementations of our works published in ICML, NeurIPS, CVPR.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Online Multi-Granularity Distillation for GAN Compression (ICCV2021)
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
[WACV 2023] Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…
This node was designed to help AI image creators to generate prompts for human portraits.
肖像大师 中文版 comfyui-portrait-master
ComfyUI's ControlNet Auxiliary Preprocessors
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation