8000 makabakas / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View makabakas's full-sized avatar

Block or report makabakas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

wfst make graph learning

Shell 1 Updated Oct 8, 2018

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,609 226 Updated May 8, 2025

Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.

Python 15 2 Updated May 6, 2025

超级速查表 - 编程语言、框架和开发工具的速查表,单个文件包含一切你需要知道的东西 ⚡

Shell 12,051 2,111 Updated Mar 12, 2025
Python 55 3 Updated Mar 28, 2025

Directional sparse filtering for blind speech separation

MATLAB 10 4 Updated Jun 8, 2021

LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement

Python 72 16 Updated Apr 1, 2025

Audio-FLAN

153 4 Updated Mar 6, 2025

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Python 132 19 Updated Feb 28, 2025

RL Start

Jupyter Notebook 9 Updated Dec 18, 2024

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Python 2,516 333 Updated May 18, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 335 15 Updated Feb 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,540 7,456 Updated May 18, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,765 1,486 Updated Apr 24, 2025

Awesome speech/audio LLMs, representation learning, and codec models

994 60 Updated Apr 25, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,116 5,977 Updated May 16, 2025
Python 4 Updated Apr 14, 2023

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Python 617 161 Updated Jul 28, 2023
Python 8 Updated Jul 23, 2024

DCCRN with various loss functions

Python 95 23 Updated Sep 29, 2022

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,776 216 Updated Apr 30, 2025

Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source

MATLAB 77 23 Updated Jan 24, 2021

人人都能用英语

TypeScript 26,180 3,885 Updated Apr 13, 2025

Block-Online Multi-Channel Speech Enhancement Using DNN-Supported Relative Transfer Function Estimates

MATLAB 32 11 Updated May 26, 2020

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

Python 17 1 Updated Aug 8, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,725 1,105 Updated May 15, 2025
Next
0