yuexianghubit

YUE XIANGHU yuexianghubit

Ph.D Student - Automatic Speech Recognition, Self-Supervised Learning, Multi-Modal Learning

2 followers · 0 following

National University of Singapore

Stars

QiShanZhang / SLSforASVspoof-2021-DF

Code for paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier

Python 36 3 Updated Feb 7, 2025

xieyuankun / Codecfake

This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".

Python 59 4 Updated Dec 13, 2024

Daisy-Zhang / Awesome-Deepfakes-Detection

A list of tools, papers and code related to Deepfake Detection.

1,346 126 Updated Jan 8, 2025

LetterLiGo / SafeEar

[ACM CCS'24] SafeEar: Content Privacy-Preserving Audio Deepfake Detection

Python 141 16 Updated Mar 24, 2025

media-sec-lab / Audio-Deepfake-Detection

Research progress on speech deepfake detection: Relevant datasets aggregated from the review literature and publicly available codes

198 13 Updated Feb 21, 2025

jhairgallardo / awesome-continual-self-supervised-learning

List of papers that combine self-supervision and continual learning

65 2 Updated Mar 12, 2025

Jiang-Yidi / UniCodec

UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound

118 2 Updated Feb 28, 2025

slp-rl / slamkit

SlamKit is an open source tool kit for E8F2 efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Python 210 10 Updated May 18, 2025

hkchengrex / MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,510 173 Updated May 8, 2025

colaudiolab / AudioCIL

Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods.

Python 32 4 Updated Dec 19, 2024

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 3,950 297 Updated Feb 14, 2025

soupdtag / speak-tool

A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github.mit.edu)

Python 12 3 Updated Dec 19, 2022

MatthewCYM / VoiceBench

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 201 11 Updated May 7, 2025

XLearning-SCU / 2021-NeurIPS-NCR

Python 74 7 Updated Nov 6, 2023

liyongqi67 / MINDER

Python 57 3 Updated Jan 11, 2025

catherine-qian / TASLP2022-AVRI

Python 6 2 Updated Mar 5, 2024

xinyu1205 / IDEA-pytorch

Code for paper: IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training [ACM MM2022]

Python 9 Updated Dec 1, 2022

xialeiliu / Awesome-Incremental-Learning

Awesome Incremental Learning

4,064 598 Updated Apr 28, 2025

stoneMo / AVGN

Official implementation for AVGN

Python 34 3 Updated Mar 24, 2023

witdsl / KRT-MLCIL

Python 13 2 Updated Apr 16, 2024

ZHUANGHP / Analytic-federated-learning

This repo will be continually updating analytic federated learning methods.

Python 55 1 Updated Mar 27, 2025

Tree-Shu-Zhao / RebQ.pytorch

This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration."

Python 10 Updated Aug 13, 2024

LAMDA-CL / LAMDA-PILOT

🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox

Python 434 46 Updated Apr 19, 2025

NeurAI-Lab / CLS-ER

The official PyTorch code for ICLR'22 Paper "Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System""

Python 49 9 Updated Aug 7, 2023

artelab / Image-and-Text-fusion-for-UPMC-Food-101-using-BERT-and-CNNs

Jupyter Notebook 59 20 Updated Jun 25, 2021

ZHUANGHP / Analytic-continual-learning

This repository will be posting analytic continual learning series, including Analytic Class-Incremental Learning (ACIL), Gaussian Kernel Embedded Analytic Learning (GKEAL), Dual-Stream Analytic Le…

Python 243 22 Updated Dec 9, 2024

sungwon23 / BSRNN

Python 107 18 Updated Apr 24, 2023

Sato-Kunihiko / audio-SNR

Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)

Python 219 73 Updated Jul 31, 2023

swasun / VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Python 269 54 Updated Aug 13, 2019

harritaylor / torchvggish

Pytorch port of Google Research's VGGish model used for extracting audio features.

Python 388 70 Updated Nov 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly