Pf0rd/Voice2Text: A simple Voice to text web app that translates audio files into text.

Audio Transcription Web App

This is a web application that transcribes audio files to text using OpenAI's Whisper model. Upload any audio file and get its text transcription.

Features

  • Simple and modern web interface
  • Supports various audio file formats
  • Real-time transcription status updates
  • Error handling and user feedback
  • Responsive design

Prerequisites

  • Python 3.8 or higher
  • FFmpeg (required for audio processing)
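Both prerequisites can be checked before installing anything else. The following is a minimal sketch using only the Python standard library; the function name `check_prerequisites` and its injectable parameters are hypothetical and not part of this repository.

```python
import shutil
import sys

MIN_PYTHON = (3, 8)  # minimum version stated in the prerequisites


def check_prerequisites(version_info=sys.version_info, which=shutil.which):
    """Return a list of human-readable problems; an empty list means all good.

    Hypothetical helper: `version_info` and `which` are parameters only so the
    check can be exercised without changing the real environment.
    """
    problems = []
    if tuple(version_info[:2]) < MIN_PYTHON:
        problems.append("Python 3.8 or higher is required")
    # shutil.which returns None when the executable is not on PATH.
    if which("ffmpeg") is None:
        problems.append("FFmpeg was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("Missing prerequisite:", problem)
```

Running the script prints one line per missing prerequisite and nothing when both are present.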

Installation

  1. Install FFmpeg (if not already installed). On Debian or Ubuntu:
       sudo apt update
       sudo apt install ffmpeg
  2. Create a virtual environment and activate it:
       python -m venv venv
       source venv/bin/activate
  3. Install the required Python packages:
       pip install -r requirements.txt

Usage

  1. Start the Flask application:
       python app.py
  2. Open your web browser and navigate to http://localhost:5000
  3. Upload an audio file using the drag-and-drop interface or file selector
  4. Click "Transcribe Audio" and wait for the transcription to complete

Supported Audio Formats

  • MP3
  • WAV
  • M4A
  • OGG
  • WMA
  • And more...
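A script that prepares files for upload could screen them by extension first. The sketch below covers only the formats named in the list above; the set `LISTED_EXTENSIONS` and the function `has_listed_extension` are hypothetical, and the app itself may accept other formats that FFmpeg can decode.

```python
from pathlib import Path

# Extensions named in the list above. The list is open-ended ("And more..."),
# so this set is an assumption and not the app's actual allowlist.
LISTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".ogg", ".wma"}


def has_listed_extension(filename: str) -> bool:
    """Return True if the file's final extension is one of the listed formats."""
    # Path.suffix yields only the last extension, e.g. ".ogg" for "a.tar.ogg".
    return Path(filename).suffix.lower() in LISTED_EXTENSIONS
```

The comparison is case-insensitive, so `talk.MP3` and `talk.mp3` are treated the same way.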

Notes

  • The maximum upload size is 16 MB
  • Transcription time depends on the length of the audio file and your computer's processing power
  • The application uses the "base" model from Whisper, which provides a good balance between accuracy and speed
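The size limit and model choice above might be expressed roughly as follows. This is a sketch under stated assumptions, not the contents of this repository's app.py: it assumes the limit is enforced through Flask's `MAX_CONTENT_LENGTH` setting, that 16 MB means 16 * 1024 * 1024 bytes, and that the model is loaded through the `openai-whisper` package. The names `create_app`, `transcribe_file`, `MAX_UPLOAD_BYTES`, and `WHISPER_MODEL_NAME` are hypothetical.

```python
MAX_UPLOAD_BYTES = 16 * 1024 * 1024  # assumed meaning of the 16 MB limit
WHISPER_MODEL_NAME = "base"  # model named in the notes above


def create_app():
    """Build a Flask app that rejects uploads larger than the limit."""
    # Imported lazily so the constants can be reused without Flask installed.
    from flask import Flask

    app = Flask(__name__)
    # Flask answers larger request bodies with HTTP 413 (Request Entity Too Large).
    app.config["MAX_CONTENT_LENGTH"] = MAX_UPLOAD_BYTES
    return app


def transcribe_file(path: str) -> str:
    """Transcribe one audio file and return the recognized text."""
    import whisper  # provided by the openai-whisper package; needs FFmpeg

    # Loading is slow and downloads the weights on first use, so a real app
    # would likely load the model once at startup instead of per request.
    model = whisper.load_model(WHISPER_MODEL_NAME)
    result = model.transcribe(path)
    return result["text"]
```

Whisper also ships other model sizes, such as "tiny" and "small", which trade accuracy against speed in either direction if "base" does not suit your hardware.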
