emacs622 (Ben) / Starred · GitHub

LTX-Video Support for ComfyUI

Python 2,086 182 Updated Jun 20, 2025

Let's make video diffusion practical!

Python 14,655 1,316 Updated May 4, 2025

Start and end frames video generation nodes based on the modified Kijai version Wan2.1 nodes

Python 353 20 Updated Mar 22, 2025

A collection of MCP servers.

57,251 4,389 Updated Jun 22, 2025

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

Python 10,546 1,419 Updated Jun 25, 2025

Kolors Team

Python 4,467 330 Updated Nov 13, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,853 175 Updated May 26, 2025

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,309 913 Updated Jul 6, 2023

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…

C++ 14,699 958 Updated Jun 22, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,171 349 Updated Jan 13, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 8,410 647 Updated May 29, 2025
Python 6,323 1,045 Updated Jun 15, 2025

SD-Trainer: LoRA & Dreambooth training scripts and GUI for diffusion models, using kohya-ss's trainer.

Python 5,440 647 Updated Jun 11, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 83,823 10,193 Updated May 13, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 83,915 61,089 Updated Jun 23, 2025

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,117 191 Updated Oct 31, 2024

🔀 Convert small PNG images to SVG Tiny 1.2

Go 337 40 Updated Jul 17, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 80,693 8,928 Updated Jun 24, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 48,045 5,276 Updated Jun 19, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 40,941 5,293 Updated Aug 16, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,288 282 Updated May 4, 2024

tiny vision language model

Python 8,103 639 Updated Jun 20, 2025
Jupyter Notebook 901 114 Updated Sep 13, 2024

NLTK Source

Python 14,135 2,928 Updated Jun 15, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,370 128 Updated Apr 24, 2024

Static Type Checker for Python

Python 14,471 1,705 Updated Jun 16, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,636 937 Updated Aug 21, 2024

Windows system utilities to maximize productivity

C# 120,224 7,116 Updated Jun 25, 2025

"Hung-yi Lee Deep Learning Tutorial" (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 15,322 3,043 Updated Jun 13, 2025

Inference code for Llama models

Python 58,415 9,776 Updated Jan 26, 2025