Nikita Krasnytskyi Nik-Kras
- LinkedIn: /nikitakrasnytskyi
- https://medium.com/@nkrasnytskyi
Stars
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
A Bulletproof Way to Generate Structured JSON from Language Models
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
High-quality datasets, tools, and concepts for LLM fine-tuning.
DSPy: The framework for programming—not prompting—language models
A fast inference library for running LLMs locally on modern consumer-class GPUs
CoreNet: A library for training deep neural networks
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
We have made you a wrapper you can't refuse
aiogram is a modern and fully asynchronous framework for Telegram Bot API written in Python using asyncio
Blazing fast whisper turbo for ASR (speech-to-text) tasks
A simple client and utils for interacting with OpenAI's Realtime API in Python
OpenAI Realtime API Voice Agent with RAG, Function Calling, and Caller History
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
🌏🌍🌎Translators🌎🌍🌏 is a library that aims to bring free, multiple, enjoyable translations to individuals and students in Python. Translators是一个旨在用Python为个人和学生带来免费、多样、愉快翻译的库。
Official implementation of Half-Quadratic Quantization (HQQ)
(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models