8000 isjwdu / Starred · GitHub

More Web Proxy on the site http://driver.im/

isjwdu

Follow

isjwdu

Follow

14 followers · 30 following

National Taiwan University
Taipei, Taiwan
09:19 (UTC +08:00)
www.jiaweidu.top

Achievements

Achievements

Highlights

Pro

Lists (3)

Sort

Audio Generation

Audio Neural Codec

38 repositories

DeepFake

Audio/Visual/Multimodel Deepfake Detection

18 repositories

Stars

tommymsw / vscode-1

70 88 Updated Dec 16, 2018

aliutkus / speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 965 164 Updated Jul 5, 2023

bfs18 / rfwave

Python 135 9 Updated Apr 25, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,086 193 Updated May 18, 2025

ictnlp / LLaMA-Omni2

Python 165 19 Updated May 6, 2025

AuroraZengfh / MambaIC

[CVPR'25] Official Implementation of MambaIC: State Space Models for High-Performance Learned Image Compression

36 2 Updated Mar 18, 2025

yzhao062 / cs-paper-checklist

A final sanity checklist to help your CS paper get accepted, not desk rejected.

927 105 Updated May 7, 2025

MYZY-AI / Muyan-TTS

Python 377 35 Updated May 13, 2025

DanielLin94144 / Full-Duplex-Bench

A benchmark to evaluate full-duplex spoken dialogue models on pause handling, backchanneling, turn-taking, and user interruptions.

Python 30 Updated May 15, 2025

sunnyyoung / WeChatTweak-macOS

A dynamic library tweak for WeChat macOS - 首款微信 macOS 客户端撤回拦截与多开 🔨

Objective-C 12,079 1,451 Updated Aug 1, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,295 1,403 Updated May 16, 2025

jujumilk3 / leaked-system-prompts

Collection of leaked system prompts

7,743 945 Updated May 10, 2025

anan235 / dia-multilingual

Forked from nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 154 9 Updated Apr 23, 2025

pytorch / torchcodec

PyTorch video decoding

Python 547 35 Updated May 18, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,609 226 Updated May 8, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,036 164 Updated May 14, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 15,844 1,246 Updated May 15, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,339 51 Updated Apr 18, 2025

ensemble-core / NdLinear

NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to de…

Python 292 18 Updated May 13, 2025

ourongxing / newsnow

Elegant reading of real-time and hottest news

TypeScript 9,957 2,865 Updated May 10, 2025

openai / codex

Lightweight coding agent that runs in your terminal

TypeScript 24,672 2,451 Updated May 18, 2025

Mddct / transformer-vocos

Python 28 3 Updated May 13, 2025

yangdongchao / ALMTokenizer

The demo page for ALMTokenizer

Python 47 2 Updated Apr 14, 2025

xming521 / WeClone

🚀从聊天记录创造数字分身的一站式解决方案💡 使用聊天记录微调大语言模型，让大模型有“那味儿”，并绑定到聊天机器人，实现自己的数字分身。数字克隆/数字分身/数字永生/LLM/聊天机器人/LoRA

Python 10,365 786 Updated May 18, 2025

koudounasalkis / voc2vec

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Python 29 Updated Apr 14, 2025

Liu-Tianchi / Nes2Net

Python 33 4 Updated Apr 29, 2025

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,049 86 Updated Apr 3, 2025

zhai-lw / SQCodec

A lightweight audio codec based on a single quantizer

Python 58 3 Updated Apr 9, 2025

kyutai-labs / moshi-finetune

Python 215 13 Updated Apr 3, 2025

bytedance / MegaTTS3

Python 5,323 374 Updated May 11, 2025

0