- Moscow
- https://vk.com/staskanochek
Stars
1st Place solution to the Cornell Birdcall Identification competition.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
🚀 Your next Python package needs a bleeding-edge project structure.
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
Toolbox of models, callbacks, and datasets for AI/ML researchers.
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Kaggle | 1st place solution for Freesound Audio Tagging 2019
Code for the 3rd place solution to Freesound Audio Tagging 2019 Challenge
Mongolian speech recognition with PyTorch
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
HIT-SCIR / ELMoForManyLangs
Forked from bozheng-hit/ELMoPre-trained ELMo Representations for Many Languages
YSDA course in Natural Language Processing
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm