8000 Panchovix / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Panchovix's full-sized avatar

Block or report Panchovix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs

Python 426 31 Updated Jun 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)

Python 86 5 Updated Jun 21, 2025

An Extension for Forge Webui that implements Attention Couple

Python 367 18 Updated Apr 11, 2025
Python 26 3 Updated Jul 1, 2025

A fast inference library for running LLMs locally on modern consumer-class G 6095 PUs

Python 4,223 318 Updated Jun 4, 2025

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,885 220 Updated Sep 30, 2023

LLM UI with advanced features, easy setup, and multiple backend support.

Python 44,220 5,686 Updated Jul 5, 2025

Windows compile of bitsandbytes for use in text-generation-webui.

HTML 355 40 Updated Nov 18, 2023

Karras et al. (2022) diffusion models for PyTorch

Python 2,480 385 Updated Jan 7, 2025
Python 64 27 Updated Jul 4, 2025
0