A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
-
Updated
Nov 4, 2024 - Python
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
an editor for spoken-word audio with automatic transcription
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Instant, controllable, local pre-trained AI models in Rust
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加入吧!
Simple GUI for ByteDance's Piano Transcription with Pedals
A python package to build AI-powered real-time audio applications
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
Generate subtitles, summaries, and chapters from videos in seconds
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
turnkey self-hosted offline transcription and diarization service with llm summary
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
🎤 The easiest way to transcribe audio in Swift
On-device streaming speech-to-text engine powered by deep learning
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
OBS plugin for local speech recognition and captioning using AI
Add a description, image, and links to the transcription topic page so that developers can more easily learn about it.
To associate your repository with the transcription topic, visit your repo's landing page and select "manage topics."