Stars
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
Open Source framework for voice and multimodal conversational AI
Easily train a good VC model with voice data <= 10 mins!
An Implementation of NTQQ Protocol, with Pure C#, Derived from Konata.Core
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Official PyTorch implementation of BigVGAN (ICLR 2023)
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…
The official implementation of HierSpeech++
vits2 backbone with multilingual-bert
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
A fork version of Remote Party Finder from https://git.anna.lgbt/ascclemens/remote-party-finder
Reverse engineered API of Microsoft's Bing Chat AI
跨平台 Python 异步聊天机器人框架 / Asynchronous multi-platform chatbot framework written in Python
A professional cross-platform SSH/Sftp/Shell/Telnet/Tmux/Serial terminal.
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
Chinese text normalization for speech processing