8000 signofthefour (Nguyen Tan Dat) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View signofthefour's full-sized avatar
🥰
seeking
🥰
seeking

Highlights

  • Pro

Block or report signofthefour

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization (IROS 2024)

Python 11 1 Updated Jun 19, 2025

Predictive Coding for Decision Transformer (IROS 2024)

Python 8 1 Updated Jun 19, 2025

[CVPR'25] SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video

Python 41 4 Updated Jun 6, 2025
Python 34 3 Updated May 28, 2025

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 711 41 Updated Jun 28, 2025

Official implementation of Inductive Moment Matching

Python 494 12 Updated Mar 12, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 14,840 1,571 Updated Jun 25, 2025

LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation with Spoken Language Models" (arXiv 2024).

67 1 Updated Dec 28, 2024

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,141 327 Updated Jun 23, 2025

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,137 158 Updated May 19, 2025

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Python 195 11 Updated Sep 10, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 576 53 Updated Jun 9, 2024

The open source code for LLM-Codec

Python 135 9 Updated Aug 18, 2024

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)

Python 147 10 Updated Sep 14, 2023

The Art of Debugging

C 894 41 Updated Aug 3, 2024
Python 290 15 Updated Oct 10, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 83 20 Updated May 23, 2023

Code repository for FreGrad

Python 52 4 Updated May 19, 2024
Python 33 4 Updated May 13, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,876 1,140 Updated Jun 26, 2025

A TTS model that makes a speaker speak new languages

Roff 76 7 Updated Jun 18, 2024

VoiceLDM: Text-to-Speech with Environmental Context

Python 179 9 Updated Aug 9, 2024
Python 12 5 Updated Dec 13, 2023

Resumes generated using the GitHub informations

JavaScript 62,522 1,357 Updated Feb 15, 2023

Architecture decision record (ADR) examples for software planning, IT leadership, and template documentation

13,541 2,556 Updated May 29, 2025
TypeScript 1 Updated Mar 1, 2022

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Python 194 24 Updated Feb 10, 2022

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Python 57 13 Updated Oct 15, 2021
Next
0