8000 CoderJackZhu (Jack Zhu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View CoderJackZhu's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report CoderJackZhu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 482 24 Updated May 9, 2025

Image editing is worth a single LoRA! 0.1% training data and 1% training parameters for fantastic image editing! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM…

Python 1,018 67 Updated May 11, 2025
Python 291 21 Updated May 11, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,329 158 Updated May 9, 2025

✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Python 107 6 Updated May 9, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 1,550 119 Updated May 11, 2025
Python 258 19 Updated May 6, 2025

Ping, but with a graph

Rust 11,560 328 Updated Apr 18, 2025

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Python 19,302 1,255 Updated Mar 5, 2025
Python 788 85 Updated Apr 30, 2025

一个免费的公式智能识别软件

Python 180 15 Updated May 11, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 865 47 Updated Apr 23, 2025

Have a natural, spoken conversation with AI!

Python 1,969 141 Updated May 8, 2025

Lightweight Knowledge Base and Feed Reader.

TypeScript 705 44 Updated Feb 1, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 39,921 5,088 Updated Aug 16, 2024

开源文本转语音工具,支持超长文本,多角色配音

TypeScript 913 92 Updated May 8, 2025

MPC-BE – универсальный проигрыватель аудио и видеофайлов для операционной системы Windows.

C 2,878 106 Updated May 10, 2025

Mini website for testing both general CS knowledge and enforce coding practice and common algorithm/data structure memorization.

HTML 8,669 2,053 Updated Jan 13, 2025

研究生毕业设计源码

Python 4 Updated May 7, 2025

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 499 65 Updated Mar 29, 2025
Python 69 4 Updated May 4, 2025

AI-Powered Photos App for the Decentralized Web 🌈💎✨

Go 37,288 2,066 Updated May 10, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,275 52 Updated May 8, 2025

ACI.dev is the open source platform that connects your AI agents to 600+ tool integrations with multi-tenant auth, granular permissions, and access through direct function calling or a unified MCP …

Python 3,200 233 Updated May 10, 2025

100% open source dev kit for EOS S3 MCU+eFPGA SoC supported by fully open source SDK and FPGA Toolchain

40 3 Updated Mar 24, 2021

造”派“计划,一起设计一块属于自己的”树莓派“吧

Shell 248 40 Updated Apr 19, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,486 206 Updated May 8, 2025

A feature-rich command-line audio/video downloader

Python 111,391 8,753 Updated May 11, 2025

Interface for OuteTTS models.

Python 1,214 101 Updated Apr 29, 2025

zero-shot voice conversion & singing voice conversion, with real-time support

Python 2,431 273 Updated Apr 20, 2025
Next
0