This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3D scenes represented using a mesh.

Python 89 12 Updated Jul 24, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 695 56 Updated Apr 15, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,538 733 Updated May 6, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,868 302 Updated Apr 21, 2025

MoonshotAI / Moonlight

Muon is Scalable for LLM Training

1,040 46 Updated Mar 28, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,516 829 Updated Apr 29, 2025

ASLP-lab / OSUM

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 362 23 Updated Apr 16, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 40,848 4,500 Updated May 6, 2025

haochengxi / Train_Transformers_with_INT4

Python 147 4 Updated Jun 22, 2023

stepfun-ai / Step-Audio

Python 4,249 345 Updated Mar 12, 2025

datawhalechina / easy-rl

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at: https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,171 2,027 Updated May 1, 2025

chatboxai / chatbox

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 34,620 3,300 Updated Apr 27, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,329 154 Updated Mar 20, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 940 73 Updated Mar 27, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing full-stack inference, training and deployment capabilities.

Python 13,587 1,379 Updated May 6, 2025

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,676 325 Updated Jan 4, 2024

yang-song / score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,911 328 Updated Jul 14, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,319 105 Updated Sep 24, 2023

axeber01 / ngcc

Neural Generalized Cross Correlations https://arxiv.org/abs/2208.04654

Jupyter Notebook 29 11 Updated Feb 11, 2025

JusperLee / SPMamba

Python 163 23 Updated Dec 5, 2024

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,011 567 Updated Oct 27, 2023

Audio-WestlakeU / FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Python 107 12 Updated Dec 9, 2024

rspeyer / soundtouch

SoundTouch library compiled for iOS http://www.surina.net/soundtouch/index.html

C++ 323 87 Updated May 5, 2016

Nick zousaisai

Stars

infiniflow / ragflow

deepseek-ai / Janus

bytedance / MegaTTS3

unslothai / unsloth

shibing624 / python-tutorial

shibing624 / MedicalGPT

hiyouga / LLaMA-Factory

anton-jeran / MESH2IR