8000 iris2c (iris2c) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View iris2c's full-sized avatar

Block or report iris2c

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.11102

Python 517 5 Updated May 22, 2025

A song aesthetic evaluation toolkit trained on SongEval.

Python 184 14 Updated May 23, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,388 231 Updated Jun 4, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,899 1,458 Updated May 29, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,10 8000 4 1,173 Updated Jun 7, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,166 3,183 Updated Jun 6, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,580 185 Updated Jun 6, 2025
Python 80 13 Updated Apr 29, 2025

official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification

Python 10 Updated Mar 14, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,468 1,002 Updated Jun 6, 2025
Python 895 94 Updated Apr 30, 2025

[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound

Python 119 2 Updated May 30, 2025
Python 4,330 351 Updated Mar 12, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,620 2,579 Updated Apr 30, 2025

VideoSys: An easy and efficient system for video generation

Python 1,974 130 Updated Mar 9, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 4,668 310 Updated Jun 7, 2025

InspireMusic: Music, Song, Audio Generation.

Python 1,114 104 Updated May 20, 2025

More relighting!

Python 8,062 497 Updated Feb 20, 2025

C++ builds C++

C++ 23 Updated Apr 30, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,896 229 Updated May 23, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,454 1,511 Updated Apr 29, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,785 6,259 Updated Jun 7, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,394 708 Updated Jun 5, 2025

Multilingual Voice Understanding Model

Python 5,838 512 Updated Mar 23, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 14,379 1,504 Updated Jun 2, 2025

Speech, Language, Audio, Music Processing with Large Language Model

Python 823 80 Updated Apr 24, 2025

Audio synthesis, processing, & analysis platform for iOS, macOS and tvOS

Swift 11,010 1,580 Updated May 13, 2025

INSPIRE: Instruction-based Multi-Task Speech and Audio Processing Benchmark

3 Updated May 14, 2024
Next
0