8000 haoqizhenhao / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View haoqizhenhao's full-sized avatar

Block or report haoqizhenhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,874 260 Updated Jun 21, 2025

SincNet is a neural architecture for efficiently processing raw audio samples.

Python 1,183 266 Updated Apr 28, 2021

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,925 2,954 Updated Jun 26, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,351 283 Updated Nov 5, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,141 771 Updated Dec 17, 2024

Noise suppression plugin based on Xiph's RNNoise

C++ 5,710 257 Updated May 18, 2024

SOTA Open Source TTS

Python 22,062 1,803 Updated Jun 12, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 83,927 10,214 Updated Jun 26, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,096 4,435 Updated Jun 25, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,345 406 Updated Jun 26, 2025

Simple Speech Keyword Detecting with Depthwise Separable Convolutions | DLology

C 42 14 Updated Jun 27, 2018

A PyTorch Library for Multi-Task Learning

Python 2,342 221 Updated May 14, 2025

Speech Restoration

Python 2 Updated Apr 3, 2023

General Speech Restoration

Python 280 56 Updated Jan 13, 2024

QbE Keyword Spotting System based on ASR

Python 7 1 Updated Jun 22, 2021

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,165 862 Updated Jul 6, 2024

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Python 574 121 Updated Feb 24, 2025

Inference code for Llama models

Python 58,422 9,776 Updated Jan 26, 2025

LLM inference in C/C++

C++ 82,214 12,200 Updated Jun 26, 2025

Large, modern dataset for speech recognition

Shell 678 62 Updated Feb 26, 2024

List of Large Lanugage Model Papers

59 3 Updated Jun 5, 2023

List of speech synthesis papers.

1,045 120 Updated Jul 24, 2023

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 32,747 3,422 Updated Apr 19, 2025

On-device wake word detection powered by deep learning

Python 4,209 533 Updated Jun 23, 2025

Conferencing Speech Challenge

Python 96 33 Updated Apr 6, 2021

Generating room impulse responses

C++ 453 148 Updated Jun 25, 2025

为音频加混响的代码

C++ 27 1 Updated Jul 6, 2023

Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.

Python 333 66 Updated Oct 4, 2022

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Python 315 73 Updated Apr 26, 2022

notes of machine learning algorithm derivation

776 225 Updated Oct 9, 2019
Next
0