Stars
Create Ethereum-powered apps with one command
Open source forkable Ethereum dev stack
Scrapy, a fast high-level web crawling & scraping framework for Python.
PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-world processing-in-memory (PIM) architecture. Described in the …
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
Awesome LLM compression research papers and tools.
Awesome-LLM: a curated list of Large Language Model
A Easy-to-understand TensorOp Matmul Tutorial
TinyChatEngine: On-Device LLM Inference Library
An Open-source Toolkit for LLM Development
fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
Examples for using ONNX Runtime for machine learning inferencing.
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox.
A curated list for Efficient Large Language Models
A model compression and acceleration toolbox based on pytorch.
A simple, modular, and fast framework for writing MEV bots in Rust.
ChatGPT in command line with OpenAI API (gpt-3.5-turbo/gpt-4/gpt-4-32k)
Universal LLM Deployment Engine with ML Compilation