Stars
Triton
7 repositories
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
A performance library for machine learning applications.
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Hackable and optimized Transformers building blocks, supporting a composable construction.
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.