8000 stas6626 (Stanislav) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View stas6626's full-sized avatar

Block or report stas6626

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

1st Place solution to the Cornell Birdcall Identification competition.

Python 153 29 Updated Sep 19, 2020

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 5,295 336 Updated Oct 18, 2023

some useless python stuff

Python 11 1 Updated Jul 30, 2020

🚀 Your next Python package needs a bleeding-edge project structure.

Python 1,094 123 Updated Sep 13, 2023

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

Python 119 15 Updated Mar 15, 2021

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Python 1,727 320 Updated May 5, 2025

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 6,145 662 Updated Apr 20, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,200 4,916 Updated May 21, 2025

Accelerated deep learning R&D

Python 3,350 395 Updated Mar 20, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,629 29,025 Updated May 21, 2025

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Python 1,561 369 Updated May 11, 2021
Dockerfile 20 1 Updated Jul 28, 2020
Jupyter Notebook 44 6 Updated Aug 30, 2019

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,501 3,492 Updated May 20, 2025

Just another DL library

Python 182 18 Updated Mar 9, 2021

Kaggle | 1st place solution for Freesound Audio Tagging 2019

Python 314 55 Updated Jun 22, 2022

Code for the 3rd place solution to Freesound Audio Tagging 2019 Challenge

Python 55 8 Updated Dec 8, 2022

Mongolian speech recognition with PyTorch

Python 134 52 Updated Mar 22, 2021

PyTorch CTC Decoder bindings

C++ 837 251 Updated Apr 4, 2024

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Python 828 183 Updated Jul 26, 2021

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Python 9,088 1,830 Updated Apr 22, 2022

Open STT

Python 796 84 Updated Mar 11, 2022

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

1,862 225 Updated Jun 27, 2022

Pre-trained ELMo Representations for Many Languages

Python 1,461 241 Updated May 19, 2021

YSDA course in Natural Language Processing

Jupyter Notebook 10,114 2,641 Updated Dec 25, 2024

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 9,121 324 Updated Apr 19, 2025
Jupyter Notebook 3 Updated Dec 19, 2018

ML Training website

HTML 43 14 Updated Oct 1, 2019
Next
0