snsun (Sining Sun) / Starred · GitHub
  • soundraw.top
  • Beijing

Starred repositories

Official repository for LTX-Video

Python 5,857 469 Updated May 15, 2025

Build your own AI friend

C++ 13,281 2,577 Updated May 20, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,916 241 Updated Jun 6, 2024

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Python 242 23 Updated Jun 10, 2024

a curated list of speech datasets (110+ datasets, 75+ easy to download)

131 4 Updated Feb 15, 2023

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

738 60 Updated Feb 25, 2025

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,443 1,076 Updated Apr 28, 2025

🎨 A powerful multi-end drawing board that brings together a lot of creative brushes to experience a whole new range of drawing effects!

TypeScript 2,352 258 Updated Mar 20, 2025

Advanced drawing board: freehand drawing, solid/dashed lines, arrows, all geometric shapes

JavaScript 711 186 Updated Jun 27, 2018

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,301 649 Updated May 31, 2024

Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.

Python 15 Updated Nov 25, 2023

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 197 17 Updated Apr 20, 2024

✨✨Latest Advances on Multimodal Large Language Models

15,239 984 Updated May 15, 2025

Database of 100,000 Chinese song lyrics

472 81 Updated Jun 13, 2021

Database of Chinese celebrity names

8 3 Updated Sep 16, 2020

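Chinese and English sensitive words, language detection, domestic and international mobile/phone number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English simulation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, comprehensive company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …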

Python 73,445 14,858 Updated May 10, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,531 877 Updated May 20, 2025

pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.

Python 5,982 1,137 Updated Dec 26, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,692 121 Updated Jul 5, 2024

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Python 647 90 Updated May 22, 2024

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,325 144 Updated Jun 6, 2024

The PyTorch-based audio source separation toolkit for researchers

Python 2,380 433 Updated Jan 11, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,982 688 Updated Aug 13, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,873 381 Updated Mar 14, 2024

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,228 100 Updated Mar 4, 2025

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,310 119 Updated Mar 7, 2025

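The first open-source, commercially usable dialogue model to support bilingual Chinese-English speech-text multimodal conversation. Convenient speech input greatly improves the experience of using text-input large models, while avoiding the cumbersome pipeline of ASR-based solutions and the errors it may introduce.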

Python 553 54 Updated Sep 11, 2023

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

Python 270 19 Updated Oct 30, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,689 505 Updated Jul 18, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,850 1,617 Updated Feb 29, 2024