eesungkim (Aventura) / Starred · GitHub
  • New York


Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 6,018 570 Updated Mar 24, 2025

JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech

Python 110 12 Updated Jun 6, 2022

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,972 1,917 Updated May 26, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,674 914 Updated Jun 5, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,345 2,630 Updated Jun 3, 2025

A state-of-the-art semi-supervised method for image recognition

Python 1,623 342 Updated Oct 8, 2020

PyTorch CTC Decoder bindings

C++ 840 250 Updated Apr 4, 2024

Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode

Python 111 17 Updated Aug 31, 2022

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Python 446 93 Updated Jul 13, 2023

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Python 1,688 153 Updated Oct 23, 2024

Python library for downloading, loading & working with sound datasets

Python 342 27 Updated May 20, 2025

Convert images of LaTeX math equations into LaTeX code.

Python 2,124 318 Updated Oct 4, 2022

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,430 1,067 Updated May 26, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,408 501 Updated Mar 11, 2025

G2P with TensorFlow

Python 674 192 Updated Jul 29, 2024

A PyTorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

Python 631 60 Updated Mar 1, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,761 4,411 Updated Jun 7, 2025

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,331 145 Updated Jun 6, 2024

FFTW3 binding for Rust

Rust 57 28 Updated Apr 29, 2023

An open-source NLP research library, built on PyTorch.

Python 11,851 2,243 Updated Nov 22, 2022

Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

C++ 347 94 Updated Jan 22, 2023

torch-optimizer -- collection of optimizers for PyTorch

Python 3,115 305 Updated Mar 22, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,549 1,134 Updated Jun 1, 2025

A curated list of awesome self-supervised methods

6,287 835 Updated Jul 3, 2024

On-device wake word detection powered by deep learning

Python 4,165 530 Updated Jun 6, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,410 6,359 Updated Jun 9, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,929 6,150 Updated Aug 24, 2024

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 566 155 Updated Aug 19, 2023

Golang audio/video library and streaming server

Go 178 22 Updated Oct 13, 2020

A Python Interpreter written in Rust

Rust 20,134 1,316 Updated Jun 6, 2025