-
HP Inc.
- Boston, MA
Stars
This project is not under active development. Hybrid waveguide and raytracer for room acoustics on the GPU
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
A large synthetic dataset of spatial audio with multiple labels
Head-Related Transfer Function (HRTF) audio signal processor.
Code to accompany "Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing" by Mark R. Saddler and Josh H. McDermott (2024, Nature Communicat…
The Apple Lossless Audio Codec (ALAC) is a lossless audio codec developed by Apple and deployed on all of its platforms and devices.
Generate cochleagrams natively in Python. Ported from Josh McDermott's MATLAB code.
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Expressive Anechoic Recordings of Speech (EARS)
Code to create networks that localize sounds sources in 3D environments
On-device voice activity detection (VAD) powered by deep learning
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
Rust bindings for the webrtc-audio-processing library
breizhn / sms_wsj
Forked from fgnt/sms_wsjSMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
A sequence-to-sequence voice conversion toolkit.
Robust Speech Recognition via Large-Scale Weak Supervision
Impulse response generation based on state-of-the-art geometric sound propagation engine.
a lightweight network for monaural speech enhancement