Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
-
Updated
Jun 17, 2025 - Python
8000
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Free speech to text
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time audio transcription using OPENAI's Whisper Ai.
Generate subtitles for long movies / podcasts with OpenAI Whisper API.
Speakscribe is a web application that allows users to transcribe audios using OpenAI and also interact with a chat bot. The web application is created in Python using NiceGUI.
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
🎬 A tool with a UI that transcribes audio files into subtitles in SRT format using OpenAI's Whisper and runs completely on your local machine.
[Russian] This script will split audio file on silence, transcript it with google recognition and save it in LJSpeech-1.1 dataset manner.
Python package to scrape webpages and transcribe video content from a video sharing platform.
A cross-platform, fully functional, full-featured GUI implementation of the OpenAI API.
WhisperForge is a Python tool that leverages OpenAI's Whisper model to transcribe large audio files. It automatically splits files into manageable chunks, processes them, and combines the transcriptions into a single document. Ideal for handling lengthy recordings and generating clear, organized transcriptions.
ChatGPT API based video game audio translator application
WhisperingWizard is a user-friendly tool designed for anyone wanting to transcribe audio and video files using OpenAI's Whisper, without needing any Python or coding knowledge. With a simple setup, it offers an accessible way to leverage Whisper's capabilities for high-quality transcription effortlessly.
ClearSpeak is a real-time audio transcription application using Google's Speech-to-Text API. It features a Tkinter-based GUI, filtering background noise, and providing clear speech transcription.
Deepgram Transcription Processor is a Python program designed to process transcription output obtained from Deepgram's transcription service. It extracts key information such as conversation, summary, and paragraphs from the transcription output JSON and writes them to separate text files for further analysis and reference.
Add a description, image, and links to the audio-transcription topic page so that developers can more easily learn about it.
To associate your repository with the audio-transcription topic, visit your repo's landing page and select "manage topics."