8000 ishine (ishine) / Repositories · GitHub

Address: [go: up one dir, main page]

Include Form Remove Scripts Accept Cookies Show Images Show Referer Rotate13 Base64 Strip Meta Strip Title Session Cookies

More Web Proxy on the site http://driver.im/

Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

Sign up

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

ishine Follow

Overview Repositories 3.8k Projects 0 Packages 0 Stars 733

More

Overview
Repositories
Projects
Packages
Stars

ishine

Follow

ishine

Follow

speech asr/speech-recognition tts/text-to-speech vc/voice-conversion ac/accent-conversion

150 followers · 206 following

gerzz.inc
shanghai
dubbing-ai.com dubbingai.io

Achievements

Achievements

Block or report ishine

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 3.8k Projects 0 Packages 0 Stars 733

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python C C++ Jupyter Notebook Cuda JavaScript HTML Rust

Sort Last updated

Select order

Last updated Name Stars

Sonic1 Public
Forked from jixiaozhong/Sonic

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python Other Updated May 6, 2025
sonic Public
Forked from waywardgeek/sonic

Simple library to speed up or slow down speech

C Apache License 2.0 Updated May 6, 2025
TTS.cpp Public
Forked from mmwillet/TTS.cpp

TTS support with GGML

C++ MIT License Updated May 6, 2025
Orpheus-TTS Public
Forked from canopyai/Orpheus-TTS

TTS Towards Human-Sounding Speech

Python Apache License 2.0 Updated May 6, 2025
CosyVoice Public
Forked from FunAudioLLM/CosyVoice

LLM based TTS model, providing inference/training/deployment full-stack ability.

Python Apache License 2.0 Updated May 6, 2025
stylish-tts Public
Forked from Stylish-TTS/stylish-tts

Python 1 MIT License Updated May 6, 2025
10000 ACE-Step Public
Forked from ace-step/ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python Apache License 2.0 Updated May 6, 2025
VoxBox Public
Forked from SparkAudio/VoxBox

A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.

Python Other Updated May 5, 2025
Voila Public
Forked from maitrix-org/Voila

Python MIT License Updated May 3, 2025
InspireMusic Public
Forked from FunAudioLLM/InspireMusic

InspireMusic: A fundamental toolkit for music, song and audio generation.

Python Apache License 2.0 Updated May 2, 2025
Whisper-Sidecar Public
Forked from LingweiMeng/Whisper-Sidecar

The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".

Python MIT License Updated May 2, 2025
Qwen2.5-Omni Public
Forked from QwenLM/Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook Apache License 2.0 Updated May 1, 2025
EmoVoice Public
Forked from yanghaha0908/EmoVoice

Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"

Python Updated May 1, 2025
OSUM Public
Forked from ASLP-lab/OSUM

西北工业大学ASLP实验室OSUM项目官方库

Python Apache License 2.0 Updated Apr 30, 2025
kokoro-rust Public
Forked from mzdk100/kokoro

Kokoro TTS的Rust推理实现

C Apache License 2.0 Updated Apr 30, 2025
Muyan-TTS Public
Forked from MYZY-AI/Muyan-TTS

Python Apache License 2.0 Updated Apr 30, 2025
dia Public
Forked from nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python Apache License 2.0 Updated Apr 29, 2025
CycleDiffusion Public
Forked from hpjang/CycleDiffusion

This repository provides the source code associated with the paper "CycleDiffusion: Voice Conversion Using Cycle-Consistent Diffusion Models."

Python MIT License Updated Apr 29, 2025
TASTE-SpokenLM Public
Forked from mtkresearch/TASTE-SpokenLM

Python Updated Apr 29, 2025
NeMo Public
Forked from NVIDIA/NeMo

NeMo: a toolkit for conversational AI

Python 1 1 Apache License 2.0 Updated Apr 29, 2025
Marco-o1 Public
Forked from AIDC-AI/Marco-o1

An Open Large Reasoning Model for Real-World Solutions

Python Other Updated Apr 28, 2025
ClearerVoice-Studio Public
Forked from modelscope/ClearerVoice-Studio

ClearVoice

Python Apache License 2.0 Updated Apr 28, 2025
NeMo_VoiceTextBlender Public
Forked from pyf98/NeMo_VoiceTextBlender

Code for our NAACL 2025 paper: "VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning"

Python Apache License 2.0 Updated Apr 28, 2025
tts_impl Public
Forked from uthree/tts_impl

implementation of text to speech models

Python MIT License Updated Apr 28, 2025
transformer-vocos Public
Forked from Mddct/transformer-vocos

Python Updated Apr 28, 2025
Kimi-Audio Public
Forked from MoonshotAI/Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python Updated Apr 28, 2025
onnxruntime Public
Forked from microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance scoring engine for ML models

C++ MIT License Updated Apr 27, 2025
DASS Public
Forked from Saurabhbhati/DASS

Python Other Updated Apr 26, 2025
StyleTTS2-lite Public
Forked from dangtr0408/StyleTTS2-lite

A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.

Python MIT License Updated Apr 26, 2025
Dual-channel-mvdr Public
Forked from William1617/Dual-channel-mvdr

C 1 Updated Apr 26, 2025

Previous Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.

0