victorbcyang (Victor Yang) / Starred · GitHub

Victor Yang victorbcyang

5 followers · 12 following

HP Inc.
Boston, MA

Stars

wavlab-speech / versa

Versatile Evaluation of Speech and Audio

Python 252 19 Updated May 20, 2025

reuk / wayverb

This project is not under active development. Hybrid waveguide and raytracer for room acoustics on the GPU

C++ 179 24 Updated Aug 10, 2017

abraunegg / onedrive

OneDrive Client for Linux

D 10,947 887 Updated May 20, 2025

LCAV / pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,575 450 Updated May 18, 2025

marl / SpatialScaper

Jupyter Notebook 47 4 Updated Apr 28, 2025

apple / ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

105 9 Updated Oct 25, 2023

mrDIMAS / hrtf

Head-Related Transfer Function (HRTF) audio signal processor.

Rust 84 10 Updated Oct 17, 2023

partha2409 / DCASE2024_seld_baseline

Python 37 10 Updated Jun 18, 2024

kaistmm / SSLalignment

Python 32 3 Updated May 12, 2025

msaddler / phaselocknet

Code to accompany "Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing" by Mark R. Saddler and Josh H. McDermott (2024, Nature Communicat…

Jupyter Notebook 4 Updated Feb 17, 2025

ShengKuangCN / BAST

Python 11 1 Updated Jul 11, 2022

katspaugh / wavesurfer.js

Audio waveform player

TypeScript 9,310 1,683 Updated May 6, 2025

macosforge / alac

The Apple Lossless Audio Codec (ALAC) is a lossless audio codec developed by Apple and deployed on all of its platforms and devices.

C++ 382 68 Updated Jul 29, 2020

mcdermottLab / pycochleagram

Generate cochleagrams natively in Python. Ported from Josh McDermott's MATLAB code.

Python 56 17 Updated Jun 28, 2023

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 783 132 Updated Dec 1, 2024

facebookresearch / ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

Python 166 10 Updated Jun 25, 2024

torvalds / linux

Linux kernel source tree

C 193,893 55,905 Updated May 20, 2025

afrancl / BinauralLocalizationCNN

Code to create networks that localize sound sources in 3D environments

Python 49 13 Updated Jan 27, 2024

Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning

Python 215 14 Updated May 5, 2025

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 28,183 1,762 Updated Mar 21, 2025

Xiaobin-Rong / deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

Python 87 22 Updated Mar 24, 2025

tonarino / webrtc-audio-processing

Rust bindings for the webrtc-audio-processing library

Rust 276 31 Updated Apr 9, 2025

breizhn / sms_wsj

Forked from fgnt/sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

Python 2 Updated Mar 25, 2021

koute / bytehound

A memory profiler for Linux.

C 4,629 196 Updated Jul 28, 2023

echocatzh / GFTNN

Gated Convolutional F-T-LSTM Neural Network

HTML 34 14 Updated Jun 15, 2022

unilight / seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

Python 98 13 Updated Jul 5, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 81,965 9,876 Updated May 13, 2025

xai-org / grok-1

Grok open release

Python 50,279 8,354 Updated Aug 30, 2024

GAMMA-UMD / pygsound

Impulse response generation based on state-of-the-art geometric sound propagation engine.

C++ 160 22 Updated Jan 17, 2023

Okrio / CRUSE

a lightweight network for monaural speech enhancement

Python 52 10 Updated Oct 12, 2023
