🤗 Model Hub

WhisperPlus: Advancing Speech-to-Text Processing 🚀

🛠️ Installation

pip install whisperplus

🤗 Model Hub

You can find the models on the HuggingFace Spaces or on the HuggingFace Model Hub

🎙️ Usage

To use the whisperplus library, follow the steps below for different tasks:

🎵 Youtube URL to Audio

from whisperplus import SpeechToTextPipeline, download_and_convert_to_mp3

url = "https://www.youtube.com/watch?v=di3rHkEZuUw"
video_path = download_and_convert_to_mp3(url)
pipeline = SpeechToTextPipeline(model_id="openai/whisper-large-v3")
transcript = pipeline(
    audio_path=video_path, model_id="openai/whisper-large-v3", language="english
)

return transcript

### Contributing

pip install -r dev-requirements.txt
pre-commit install
pre-commit run --all-files

📜 License

This project is licensed under the terms of the Apache License 2.0.

🤗 Acknowledgments

This project is based on the HuggingFace Transformers library.

🤗 Citation

@misc{radford2022whisper,
  doi = {10.48550/ARXIV.2212.04356},
  url = {https://arxiv.org/abs/2212.04356},
  author = {Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
  title = {Robust Speech Recognition via Large-Scale Weak Supervision},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
doc		doc
scripts		scripts
whisperplus		whisperplus
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
dev-requirements.txt		dev-requirements.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WhisperPlus: Advancing Speech-to-Text Processing 🚀

🛠️ Installation

🤗 Model Hub

🎙️ Usage

🎵 Youtube URL to Audio

📜 License

🤗 Acknowledgments

🤗 Citation

About

Uh oh!

Releases

Packages

Languages

License

brkgyln/whisper-plus

Folders and files

Latest commit

History

Repository files navigation

WhisperPlus: Advancing Speech-to-Text Processing 🚀

🛠️ Installation

🤗 Model Hub

🎙️ Usage

🎵 Youtube URL to Audio

📜 License

🤗 Acknowledgments

🤗 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages