8000 [pull] main from rsxdalv:main by pull[bot] · Pull Request #67 · leftomelas/tts-generation-webui · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[pull] main from rsxdalv:main #67

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 6, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -40,6 +40,7 @@ extensions.external.json
/data/models/ap_bwe/weights/
/data/models/openvoice/
/data/models/openvoice_v2/
/data/models/kimi-audio/

# Ignore temporary files
temp/
Expand Down
7 changes: 7 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -80,6 +80,13 @@

## Changelog

May 6:
* Add Kimi Audio 7B Instruct extension
* Fix React-Gradio file proxy missing slash
* Add Kokoro TTS API extension

## April 2025

Apr 25:
* Add OpenVoice V2 extension

Expand Down
30 changes: 30 additions & 0 deletions extensions.json
Original file line number Diff line number Diff line change
Expand Up @@ -597,6 +597,36 @@
"website": "https://github.com/myshell-ai/OpenVoice",
"extension_website": "https://github.com/rsxdalv/extension_openvoice_v2",
"extension_platform_version": "0.0.1"
},
{
"package_name": "extension_kimi_audio",
"name": "Kimi Audio",
"version": "0.0.1",
"requirements": "git+https://github.com/rsxdalv/extension_kimi_audio@main",
"description": "Kimi Audio is a powerful text-to-speech and speech-to-text model by Moonshot AI",
"extension_type": "interface",
"extension_class": "tools",
"author": "Moonshot AI",
"extension_author": "rsxdalv",
"license": "MIT",
"website": "https://github.com/moonshotai/Kimi-Audio",
"extension_website": "https://github.com/rsxdalv/extension_kimi_audio",
"extension_platform_version": "0.0.1"
},
{
"package_name": "extension_kokoro_tts_api",
"name": "Kokoro TTS API",
"version": "0.0.1",
"requirements": "git+https://github.com/rsxdalv/extension_kokoro_tts_api@main",
"description": "Kokoro TTS API is a text-to-speech model by hexgrad",
"extension_type": "interface",
"extension_class": "tools",
"author": "hexgrad",
"extension_author": "rsxdalv",
"license": "MIT",
"website": "https://huggingface.co/hexgrad/Kokoro-82M",
"extension_website": "https://github.com/rsxdalv/extension_kokoro_tts_api",
"extension_platform_version": "0.0.1"
}
],
"decorators": [
Expand Down
5 changes: 5 additions & 0 deletions requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -15,8 +15,13 @@ fairseq @ https://github.com/rsxdalv/fairseq/releases/download/v0.12.3/fairseq-0
fairseq @ https://github.com/rsxdalv/fairseq/releases/download/v0.12.3/fairseq-0.12.13-cp310-cp310-macosx_11_0_universal2.whl ; sys_platform == 'darwin' # MIT License
accelerate>=0.33.0
# optimum-quanto==0.2.6

# experimental
# triton>=3.2.0 ; sys_platform == 'linux',
# triton-windows>=3.2.0.post18 ; sys_platform == 'win32',
# flash-attn @ https://github.com/kingbri1/flash-attention/releases/download/v2.7.4.post1/flash_attn-2.7.4.post1+cu124torch2.6.0cxx11abiFALSE-cp310-cp310-win_amd64.whl ; sys_platform == 'win32'
# flash-attn @ https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.4.post1/flash_attn-2.7.4.post1+cu12torch2.6cxx11abiFALSE-cp310-cp310-linux_x86_64.whl ; sys_platform == 'linux'


extension_bark @ git+https://github.com/rsxdalv/extension_bark@main
extension_tortoise @ git+https://github.com/rsxdalv/extension_tortoise@main
Expand Down
2 changes: 1 addition & 1 deletion server.py
Original file line number Diff line number Diff line change
Expand Up @@ -257,7 +257,7 @@ def signal_handler(signal, frame, postgres_process):
"npm start --prefix react-ui",
env={
**os.environ,
"GRADIO_BACKEND_AUTOMATIC": f"http://127.0.0.1:{gradio_interface_options['server_port']}",
"GRADIO_BACKEND_AUTOMATIC": f"http://127.0.0.1:{gradio_interface_options['server_port']}/",
},
shell=True,
)
Expand Down
Loading
0