shamio / Starred · GitHub

Lord of Large Language and Multi modal Systems Web User Interface

CSS 4,688 573 Updated Jun 20, 2025

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 7,626 483 Updated Jun 23, 2025

LLM UI with advanced features, easy setup, and multiple backend support.

Python 44,051 5,676 Updated Jun 23, 2025

Official implementation for 'Extending LLMs’ Context Window with 100 Samples'

Python 78 3 Updated Jan 18, 2024
Jupyter Notebook 58 8 Updated Jul 24, 2024

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 250 20 Updated Dec 16, 2024

Modeling, training, eval, and inference code for OLMo

Python 5,706 621 Updated Jun 23, 2025

An innovative library for efficient LLM inference via low-bit quantization

C++ 349 38 Updated Aug 30, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,168 214 Updated Oct 8, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,913 386 Updated Jul 11, 2024

High-speed Large Language Model Serving for Local Deployment

C++ 8,223 434 Updated Feb 19, 2025

Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.

Python 17,011 1,764 Updated Jun 21, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,998 3,264 Updated Jun 24, 2025

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval cap…

Python 589 44 Updated Nov 17, 2023

Web UI for ExLlamaV2

JavaScript 502 47 Updated Feb 5, 2025

Tools for merging pretrained large language models.

Python 5,838 566 Updated Jun 19, 2025

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

Python 277 22 Updated Nov 3, 2023

LLM inference in C/C++

C++ 82,128 12,179 Updated Jun 24, 2025

Explore large language models in 512MB of RAM

HTML 1,197 80 Updated Feb 28, 2025