tae-su-kim (Taesu Kim) / Starred · GitHub

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,138 394 Updated Jun 1, 2025

Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.

Python 41 3 Updated May 29, 2025
HTML 3 3 Updated Jan 16, 2025

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Python 118 4 Updated Mar 6, 2024

OwLite is a low-code compression toolkit for AI models.

Python 45 5 Updated May 16, 2025