8000 aksarben09 (aksarben) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View aksarben09's full-sized avatar

Highlights

  • Pro

Block or report aksarben09

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,143 1,004 Updated May 23, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 11,083 1,865 Updated May 23, 2025

A micro Vulkan compute pipeline and a collection of benchmarking compute shaders

C++ 236 40 Updated Mar 27, 2025

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,411 51 Updated Jan 12, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 141,609 11,868 Updated May 25, 2025

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Rust 2,852 164 Updated Oct 23, 2024

Extract Metal functions from .metallib files.

Swift 152 9 Updated May 24, 2023

Apple G13 GPU architecture docs and tools

HTML 587 42 Updated May 16, 2025

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Cuda 249 17 Updated Oct 28, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,167 1,863 Updated Mar 26, 2025

An app that brings language models directly to your phone.

TypeScript 3,613 344 Updated May 24, 2025

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 281 21 Updated Mar 19, 2025

This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.

Python 102 5 Updated Jul 1, 2024

The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.

Swift 1,488 83 Updated May 22, 2025

Reverse Engineering: Decompiling Binary Code with Large Language Models

Python 5,601 379 Updated May 23, 2025

Official inference framework for 1-bit LLMs

Python 19,813 1,479 Updated May 23, 2025
Jupyter Notebook 90 16 Updated Dec 23, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,583 249 Updated May 20, 2025

A tool which profiles OpenCL devices to find their peak capacities

C++ 446 120 Updated Dec 24, 2024

FlashAttention (Metal Port)

Swift 485 27 Updated Sep 22, 2024

2 books and related source codes for algorithmic trading.

Python 516 219 Updated Jun 17, 2024

Demonstration of various hardware effects on CUDA GPUs.

C++ 379 31 Updated Nov 22, 2023

Python library for portfolio optimization built on top of scikit-learn

Python 1,472 132 Updated May 15, 2025

Investment Research for Everyone, Everywhere.

Python 41,733 3,740 Updated May 24, 2025

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 107 15 Updated Oct 15, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,968 1,267 Updated May 23, 2024

talipp - incremental technical analysis library for python

Python 457 70 Updated Mar 16, 2025

STUMPY is a powerful and scalable Python library for modern time series analysis

Python 3,925 332 Updated Apr 8, 2025

An LLVM/Clang/LLD based mingw-w64 toolchain

C 2,268 213 Updated May 25, 2025
Next
0