8000 980202006 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 980202006's full-sized avatar

Block or report 980202006

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,862 435 Updated May 3, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 343 16 Updated May 10, 2025

Train high-quality text-to-image diffusion models in a data & compute efficient manner

Python 493 36 Updated Mar 27, 2025

Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis"

Python 6 1 Updated Mar 13, 2025

Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。

648 118 Updated Jun 12, 2020

在不同城市要过上同等生活水平的我到底需要多少钱?

TypeScript 97 7 Updated Apr 6, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 1,518 113 Updated May 11, 2025

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,882 124 Updated May 8, 2025

[ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling

Python 17 Updated May 1, 2025

applying audio FX with text descriptors

Jupyter Notebook 20 Updated Apr 16, 2025
Python 290 21 Updated May 11, 2025

Variable Bitrate Residual Vector Quantization for Audio Coding

Python 41 2 Updated May 1, 2025
Python 56 9 Updated Jun 8, 2022

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,274 52 Updated May 8, 2025
Python 85 4 Updated Apr 28, 2025

Implementation of SoundStorm built upon SpeechTokenizer.

Python 112 14 Updated Nov 2, 2023

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,484 204 Updated May 8, 2025

Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.

38 2 Updated May 2, 2025

High-performance Image Tokenizers for VAR and AR

Python 257 5 Updated Apr 25, 2025

Official repository of SepReformer for speech separation

Python 197 23 Updated Jan 13, 2025

Wavelet Learned Lossy Compression

Jupyter Notebook 6 Updated Dec 29, 2024
Python 64 7 Updated May 7, 2025
Python 17 2 Updated May 4, 2025

Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"

Python 14 2 Updated Apr 22, 2025

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python 93 10 Updated Apr 22, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 15,250 1,179 Updated May 9, 2025

Self-supervised Generative LM-based Voice Conversion

Python 35 6 Updated Apr 24, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 2,973 158 Updated May 8, 2025
Next
0