AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

738 60 Updated Feb 25, 2025

facebookresearch / AnimatedDrawings

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,443 1,076 Updated Apr 28, 2025

LHRUN / paint-board

🎨 A powerful multi-end drawing board that brings together a lot of creative brushes to experience a whole new range of drawing effects!

TypeScript 2,352 258 Updated Mar 20, 2025

vipstone / drawingboard

Advanced drawing board: freehand drawing, straight/dashed lines, arrows, and all geometric shapes

JavaScript 711 186 Updated Jun 27, 2018

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,301 649 Updated May 31, 2024

andrewsilva9 / tune_tortoise_autoregressor

Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.

Python 15 Updated Nov 25, 2023

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 197 17 Updated Apr 20, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,239 984 Updated May 15, 2025

dengxiuqi / ChineseLyrics

Database of 100,000 Chinese song lyrics

472 81 Updated Jun 13, 2021

NLPBLCU / Chinese-Celebrities-Names

Database of Chinese celebrity names

8 3 Updated Sep 16, 2020

fighting41love / funNLP

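Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …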

Python 73,445 14,858 Updated May 10, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,531 877 Updated May 20, 2025

shibing624 / pycorrector

pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.

Python 5,982 1,137 Updated Dec 26, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,692 121 Updated Jul 5, 2024

Text-to-Audio / Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Python 647 90 Updated May 22, 2024

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,325 144 Updated Jun 6, 2024

asteroid-team / asteroid

The PyTorch-based audio source separation toolkit for researchers

Python 2,380 433 Updated Jan 11, 2025

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,982 688 Updated Aug 13, 2024

OpenGVLab / LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,873 381 Updated Mar 14, 2024

bytedance / SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,228 100 Updated Mar 4, 2025

SkyworkAI / Skywork

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,310 119 Updated Mar 7, 2025