jianganbai

Anbai Jiang jianganbai

PhD student at EE, Tsinghua. Anomaly Detection | Audio Processing

23 followers · 17 following

Tsinghua University
Beijing
https://scholar.google.com/citations?user=w68g1qkAAAAJ&hl=zh-CN&oi=ao

Lists (12)

NSFW

Speech

6 repositories

Toolkit

2 repositories

Vibration

2 repositories

Stars

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,129 92 Updated Mar 2, 2025

jonnor / machinehearing

Machine Learning applied to sound

Jupyter Notebook 270 48 Updated May 11, 2024

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 485 31 Updated May 1, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 48,907 5,951 Updated May 15, 2025

hustcxl / Rotating-machine-fault-data-set

Open rotating mechanical fault datasets (a compilation of open-source rotating machinery fault datasets)

1,024 290 Updated Aug 10, 2020

liuzy0708 / MCC5-THU-Gearbox-Benchmark-Datasets

A benchmark fault diagnosis dataset comprising vibration data collected from a gearbox under variable working conditions with intentionally induced faults, encompassing diverse fault severities and …

MATLAB 48 3 Updated Mar 2, 2025

faroit / python_audio_loading_benchmark

Benchmark popular audio i/o packages

Python 140 11 Updated Dec 19, 2023

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,876 480 Updated Mar 22, 2025

YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 380 30 Updated Feb 21, 2024

Jinbo-Hu / PSELDNets

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Python 12 Updated Dec 20, 2024

nttcslab / m2d

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Jupyter Notebook 99 5 Updated Aug 1, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,435 139 Updated Jul 11, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 5,623 500 Updated Mar 23, 2025

RicherMans / Dasheng

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 65 3 Updated Apr 22, 2025

nttcslab / dcase2024_task2_evaluator

Python 9 1 Updated Sep 10, 2024

Tele-AI / TeleSpeech-ASR

Python 694 63 Updated Jun 7, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,723 129 Updated Apr 21, 2025

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.

Python 400 32 Updated Jan 25, 2024

nttcslab / dcase2023_task2_evaluator

Python 13 2 Updated Aug 10, 2023

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,683 325 Updated Jan 4, 2024

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 17,301 2,476 Updated May 14, 2025