makabakas / Starred · GitHub

makabakas

2 followers · 47 following

Stars

zhuangweiji / wfst-mkgraph

wfst make graph learning

Shell 1 Updated Oct 8, 2018

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,609 226 Updated May 8, 2025

FrancoisGrondin / smpphat

C 16 4 Updated Mar 29, 2022

Beilong-Tang / lauraTSE_code

Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.

Python 15 2 Updated May 6, 2025

skywind3000 / awesome-cheatsheets

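Super cheat sheets: quick references for programming languages, frameworks, and development tools, where a single file contains everything you need to know ⚡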

Shell 12,051 2,111 Updated Mar 12, 2025

X-LANCE / KWStreamingSearch

Python 55 3 Updated Mar 28, 2025

e13000 / directional_sparse_filtering

Directional sparse filtering for blind speech separation

MATLAB 10 4 Updated Jun 8, 2021

Kevin-naticl / LLaSE-G1

LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement

Python 72 16 Updated Apr 1, 2025

lmxue / Audio-FLAN

Audio-FLAN

153 4 Updated Mar 6, 2025

yaoxunji / gen-se

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Python 132 19 Updated Feb 28, 2025

clearlab-sustech / rl-start

RL Start

Jupyter Notebook 9 Updated Dec 18, 2024

chenzomi12 / AIInfra

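AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the software stack, that supports training and inference of large AI models.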

Python 2,516 333 Updated May 18, 2025

Ola-Omni / Ola

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 335 15 Updated Feb 28, 2025

qiuqiangkong / audio_understanding

Python 109 5 Updated Feb 6, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,540 7,456 Updated May 18, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,765 1,486 Updated Apr 24, 2025

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

994 60 Updated Apr 25, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,116 5,977 Updated May 16, 2025

introlab / egonoise

Python 4 Updated Apr 14, 2023

breizhn / DTLN

TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-Lite, ONNX, and real-time audio processing support.

Python 617 161 Updated Jul 28, 2023

AdiCohen501 / ExNet-BF-PF

Python 8 Updated Jul 23, 2024

seorim0 / DCCRN-with-various-loss-functions

DCCRN with various loss functions

Python 95 23 Updated Sep 29, 2022

ShigekiKarita / pytorch-distributed-slurm-example

Python 43 9 Updated Apr 25, 2019

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,776 216 Updated Apr 30, 2025

xuchenglin28 / WSCM-MUSIC

Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source

MATLAB 77 23 Updated Jan 24, 2021

ZuodaoTech / everyone-can-use-english

Everyone Can Use English

TypeScript 26,180 3,885 Updated Apr 13, 2025

ASAP-Group / Multichannel-Enhancement

Block-Online Multi-Channel Speech Enhancement Using DNN-Supported Relative Transfer Function Estimates

MATLAB 32 11 Updated May 26, 2020

sarulab-speech / spatial_voice_conversion

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

Python 17 1 Updated Aug 8, 2024

huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,725 1,105 Updated May 15, 2025

avcodecs / DTLNtfliteC

C 28 12 Updated Jun 10, 2021
