CoderJackZhu

🎯

Focusing

Jack Zhu CoderJackZhu

I am currently a master's student at Xidian University, majoring in artificial intelligence.

30 followers · 65 following

Xidian University
Xi’an
https://jackzhu.top/

Stars

Tencent / HunyuanCustom

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 482 24 Updated May 9, 2025

River-Zhang / ICEdit

Image editing is worth a single LoRA! 0.1% training data and 1% training parameters for fantastic image editing! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM…

Python 1,018 67 Updated May 11, 2025

MYZY-AI / Muyan-TTS

Python 291 21 Updated May 11, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,329 158 Updated May 9, 2025

VITA-MLLM / VITA-Audio

✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Python 107 6 Updated May 9, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python 1,550 119 Updated May 11, 2025

maitrix-org / Voila

Python 258 19 Updated May 6, 2025

orf / gping

Ping, but with a graph

Rust 11,560 328 Updated Apr 18, 2025

bee-san / Ciphey

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Python 19,302 1,255 Updated Mar 5, 2025

ZeyueT / AudioX

Python 788 85 Updated Apr 30, 2025

zstar1003 / FreeTex

A free tool for intelligent formula recognition

Python 180 15 Updated May 11, 2025

Phantom-video / Phantom

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 865 47 Updated Apr 23, 2025

KoljaB / RealtimeVoiceChat

Have a natural, spoken conversation with AI!

Python 1,969 141 Updated May 8, 2025

mdSilo / mdSilo-app

Lightweight Knowledge Base and Feed Reader.

TypeScript 705 44 Updated Feb 1, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 39,921 5,088 Updated Aug 16, 2024

cosin2077 / easyVoice

An open-source text-to-speech tool that supports very long texts and multi-character voiceover

TypeScript 913 92 Updated May 8, 2025

Aleksoid1978 / MPC-BE

MPC-BE is a universal player for audio and video files on the Windows operating system.

C 2,878 106 Updated May 10, 2025

jwasham / computer-science-flash-cards

Mini website for testing general CS knowledge and reinforcing coding practice and memorization of common algorithms and data structures.

HTML 8,669 2,053 Updated Jan 13, 2025

zrr1999 / emotion-recognition

Source code for a graduate thesis project

Python 4 Updated May 7, 2025

SpeechColab / Leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 499 65 Updated Mar 29, 2025

EvolvingLMMs-Lab / Aero-1

Python 69 4 Updated May 4, 2025

photoprism / photoprism

AI-Powered Photos App for the Decentralized Web 🌈💎✨

Go 37,288 2,066 Updated May 10, 2025

XiaomiMiMo / MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,275 52 Updated May 8, 2025

aipotheosis-labs / aci

ACI.dev is the open source platform that connects your AI agents to 600+ tool integrations with multi-tenant auth, granular permissions, and access through direct function calling or a unified MCP …

Python 3,200 233 Updated May 10, 2025