- Hannam University, Republic of Korea
- Daejeon, Korea
Starred repositories
Intelligent Component Registry web service for managing and using SNN, DNN, and ML models, which are stored in ONNX format.
Super simple fit method for PyTorch Modules
Train AI models efficiently on medical images using any framework
A pure Unix shell script implementing ACME client protocol
This program calculates the word error rate of an ASR hypothesis and prints the aligned result.
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.
A repository for boosting sound-related deep learning tasks with PyTorch
Time delay neural network (TDNN) implementation in PyTorch using the unfold method
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholding.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
A collection of various deep learning architectures, models, and tips
A drop-in replacement for GPGTools libmacgpg that disables the paywall (free MacGPG)
PyTorch deep learning projects made easy.
This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.
Paper: https://arxiv.org/abs/1702.02285
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Probabilistic Linear Discriminant Analysis & classification, written in Python.
Speaker diarization based on Kaldi x-vectors, tuned for 16 kHz microphone data