Audio Transcription Web App

This is a web application that transcribes audio files to text using OpenAI's Whisper model. Upload any audio file and get its text transcription.

Features

Simple and modern web interface
Supports various audio file formats
Real-time transcription status updates
Error handling and user feedback
Responsive design

Prerequisites

Python 3.8 or higher
FFmpeg (required for audio processing)

Installation

Install FFmpeg (if not already installed):

sudo apt update
sudo apt install ffmpeg

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate

Install the required Python packages:

pip install -r requirements.txt

Usage

Start the Flask application:

python app.py

Open your web browser and navigate to http://localhost:5000
Upload an audio file using the drag-and-drop interface or file selector
Click "Transcribe Audio" and wait for the transcription to complete

Supported Audio Formats

MP3
WAV
M4A
OGG
WMA
And more...

Notes

The maximum file size limit is set to 16MB
Transcription time depends on the length of the audio file and your computer's processing power
The application uses the "base" model from Whisper, which provides a good balance between accuracy and speed

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
templates		templates
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Audio Transcription Web App

Features

Prerequisites

Installation

Usage

Supported Audio Formats

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Pf0rd/Voice2Text

Folders and files

Latest commit

History

Repository files navigation

Audio Transcription Web App

Features

Prerequisites

Installation

Usage

Supported Audio Formats

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages