More
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Me patching up the `stress` tool to build properly on school computers
Machine Learning Journal for Intermediate to Advanced Topics.
A natural language interface for computers
SGLang is a fast serving framework for large language models and vision language models.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
A high-throughput and memory-efficient inference and serving engine for LLMs
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
A simple project (without tests) to get started with (modern) CMake and CMakePresets.json
A collection of resources on modern C++
CMake Tools provides a robust, convenient workflow for CMake projects in VS Code. It simplifies configurations with CMake presets, supports IntelliSense and built-in debugging for CMake scripts, an…
Vundle, the plug-in manager for Vim
A General-purpose Task-parallel Programming System using Modern C++
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.
A library for efficient similarity search and clustering of dense vectors.
Computer Architecture Written Node in Chinese | 计算机系统结构的学习笔记
Reference code for the paper Auto White-Balance Correction for Mixed-Illuminant Scenes.