zousaisai (Nick) / Starred · GitHub

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 51,305 4,860 Updated May 6, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,204 2,232 Updated Feb 1, 2025
Python 5,078 344 Updated Apr 12, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 38,163 2,988 Updated May 6, 2025

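Practical Python tutorials, covering Python basics, advanced Python features, object-oriented programming, multithreading, databases, data science, Flask, and web crawler development.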

Jupyter Notebook 2,152 425 Updated May 1, 2023

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python 3,864 564 Updated Apr 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 48,305 5,885 Updated May 3, 2025

This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3D scenes represented using a mesh.

Python 89 12 Updated Jul 24, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 695 56 Updated Apr 15, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,538 733 Updated May 6, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,868 302 Updated Apr 21, 2025

Muon is Scalable for LLM Training

1,040 46 Updated Mar 28, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,516 829 Updated Apr 29, 2025

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 362 23 Updated Apr 16, 2025

Making large AI models cheaper, faster and more accessible

Python 40,848 4,500 Updated May 6, 2025
Python 4,249 345 Updated Mar 12, 2025

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,171 2,027 Updated May 1, 2025

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 34,620 3,300 Updated Apr 27, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,329 154 Updated Mar 20, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 940 73 Updated Mar 27, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 13,587 1,379 Updated May 6, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,676 325 Updated Jan 4, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,911 328 Updated Jul 14, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch

Python 1,319 105 Updated Sep 24, 2023

Neural Generalized Cross Correlations https://arxiv.org/abs/2208.04654

Jupyter Notebook 29 11 Updated Feb 11, 2025
Python 163 23 Updated Dec 5, 2024

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,011 567 Updated Oct 27, 2023

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Python 107 12 Updated Dec 9, 2024

SoundTouch library compiled for iOS http://www.surina.net/soundtouch/index.html

C++ 323 87 Updated May 5, 2016