jianganbai (Anbai Jiang) / Starred · GitHub

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,129 92 Updated Mar 2, 2025

Machine Learning applied to sound

Jupyter Notebook 270 48 Updated May 11, 2024

Unified automatic quality assessment for speech, music, and sound.

Python 485 31 Updated May 1, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 48,907 5,951 Updated May 15, 2025

Open rotating machinery fault datasets (a curated collection of open-source rotating machinery fault datasets)

1,024 290 Updated Aug 10, 2020

A benchmark fault diagnosis dataset comprising vibration data collected from a gearbox under variable working conditions with intentionally induced faults, encompassing diverse fault severities and …

MATLAB 48 3 Updated Mar 2, 2025

Benchmark popular audio i/o packages

Python 140 11 Updated Dec 19, 2023

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,876 480 Updated Mar 22, 2025

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 380 30 Updated Feb 21, 2024

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Python 12 Updated Dec 20, 2024

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Jupyter Notebook 99 5 Updated Aug 1, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,435 139 Updated Jul 11, 2024

Multilingual Voice Understanding Model

Python 5,623 500 Updated Mar 23, 2025

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 65 3 Updated Apr 22, 2025
Python 694 63 Updated Jun 7, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,723 129 Updated Apr 21, 2025

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis and music generation.

Python 400 32 Updated Jan 25, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,683 325 Updated Jan 4, 2024

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 17,301 2,476 Updated May 14, 2025

Speech, Language, Audio, Music Processing with Large Language Model

Python 806 76 Updated Apr 24, 2025
Python 504 85 Updated Aug 12, 2024

A PyTorch Implementation of Federated Learning

Python 1,391 381 Updated Jul 25, 2024

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 896 136 Updated Feb 26, 2025

Paper list and searchable dataset collection for industrial image anomaly/defect detection (continuously updated).

2,241 197 Updated May 13, 2025

Audio Codec Speech processing Universal PERformance Benchmark

Python 253 24 Updated Apr 14, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,603 605 Updated May 14, 2025

A library built for easier audio self-supervised training and downstream task evaluation

Python 117 10 Updated Aug 27, 2024

Mamba SSM architecture

Python 14,865 1,301 Updated May 9, 2025