10000 WashSwang (ShenGuan) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View WashSwang's full-sized avatar

Block or report WashSwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-speed and easy-use LLM serving framework for local deployment

C++ 110 9 Updated Mar 18, 2025

💻 A better and friendly vi(vim) mode plugin for ZSH.

Shell 3,777 132 Updated Apr 22, 2025

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 251 17 Updated Jun 5, 2025

Apple GPU microarchitecture

Metal 526 26 Updated Sep 22, 2024

⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks

TypeScript 27,171 2,981 Updated Jun 18, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 144,003 12,117 Updated Jun 18, 2025

MLX: An array framework for Apple silicon

C++ 21,111 1,233 Updated Jun 18, 2025

Swift Package to implement a transformers-like API in Swift

Swift 1,000 130 Updated Jun 5, 2025

Everything we actually know about the Apple Neural Engine (ANE)

2,225 81 Updated Mar 7, 2025

llm deploy project based mnn. This project has merged into MNN.

C++ 1,593 172 Updated Jan 20, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,770 1,505 Updated Jun 18, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,548 178 Updated Jun 25, 2024

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 550 33 Updated Oct 28, 2023

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,546 559 Updated Jun 18, 2025

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,340 476 Updated Nov 18, 2024

LLM inference in C/C++

C++ 81,916 12,121 Updated Jun 18, 2025

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 65 3 Updated Apr 18, 2023

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,724 66 Updated Apr 1, 2025

[EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data

17 Updated May 17, 2023

Training and serving large-scale neural networks with auto parallelization.

Python 3,138 355 Updated Dec 9, 2023

Multithreaded Python without the GIL

Python 2,915 104 Updated May 20, 2025

pytorch-profiler

Python 51 8 Updated Jun 1, 2023

A co-design architecture on sparse attention

Python 52 4 Updated Aug 23, 2021

Showcase for rankit http://github.com/wattlebird/ranking/

C# 15 Updated Jul 16, 2020

A python library for fractional fixed-point (base 2) arithmetic and binary manipulation with Numpy compatibility.

Python 194 24 Updated Feb 12, 2024

🚀 A very efficient Texas Holdem GTO solver ♠️♥️♣️♦️

C++ 2,072 351 Updated Nov 5, 2024

简单、轻量、好用的划词翻译软件

Rust 1,290 109 Updated Feb 20, 2023

Rust parser combinator framework

Rust 9,951 835 Updated Feb 8, 2025

Raytracer tutorial for PPCA 2021, written in Rust.

Rust 117 5 Updated Aug 10, 2021
Next
0