8000 isjwdu / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View isjwdu's full-sized avatar

Highlights

  • Pro

Block or report isjwdu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 965 164 Updated Jul 5, 2023
Python 135 9 Updated Apr 25, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,086 193 Updated May 18, 2025
Python 165 19 Updated May 6, 2025

[CVPR'25] Official Implementation of MambaIC: State Space Models for High-Performance Learned Image Compression

36 2 Updated Mar 18, 2025

A final sanity checklist to help your CS paper get accepted, not desk rejected.

927 105 Updated May 7, 2025
Python 377 35 Updated May 13, 2025

A benchmark to evaluate full-duplex spoken dialogue models on pause handling, backchanneling, turn-taking, and user interruptions.

Python 30 Updated May 15, 2025

A dynamic library tweak for WeChat macOS - 首款微信 macOS 客户端撤回拦截与多开 🔨

Objective-C 12,079 1,451 Updated Aug 1, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,295 1,403 Updated May 16, 2025

Collection of leaked system prompts

7,743 945 Updated May 10, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 154 9 Updated Apr 23, 2025

PyTorch video decoding

Python 547 35 Updated May 18, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,609 226 Updated May 8, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,036 164 Updated May 14, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 15,844 1,246 Updated May 15, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,339 51 Updated Apr 18, 2025

NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to de…

Python 292 18 Updated May 13, 2025

Elegant reading of real-time and hottest news

TypeScript 9,957 2,865 Updated May 10, 2025

Lightweight coding agent that runs in your terminal

TypeScript 24,672 2,451 Updated May 18, 2025
Python 28 3 Updated May 13, 2025

The demo page for ALMTokenizer

Python 47 2 Updated Apr 14, 2025

🚀从聊天记录创造数字分身的一站式解决方案💡 使用聊天记录微调大语言模型,让大模型有“那味儿”,并绑定到聊天机器人,实现自己的数字分身。 数字克隆/数字分身/数字永生/LLM/聊天机器人/LoRA

Python 10,365 786 Updated May 18, 2025

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Python 29 Updated Apr 14, 2025
Python 33 4 Updated Apr 29, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,049 86 Updated Apr 3, 2025

A lightweight audio codec based on a single quantizer

Python 58 3 Updated Apr 9, 2025
Python 5,323 374 Updated May 11, 2025
Next
0