Efficient Triton Kernels for LLM Training
-
Updated
Feb 24, 2025 - Python
8000
Efficient Triton Kernels for LLM Training
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
FlagGems is an operator library for large language models implemented in Triton Language.
Automatic ROPChain Generation
SymGDB - symbolic execution plugin for gdb
A performance library for machine learning applications.
ClearML - Model-Serving Orchestration and Repository Solution
(WIP)The deployment framework aims to provide a simple, lightweight, fast integrated, pipelined deployment framework for algorithm service that ensures reliability, high concurrency and scalability of services.
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
Triton implementation of FlashAttention2 that adds Custom Masks.
Deploy DL/ ML inference pipelines with minimal extra code.
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
Transformers components but in Triton
Symbolic debugging tool using JonathanSalwan/Triton
🦎Triton_RAT is free and easy to use, one of the best remote administration tools written in Python, fully integrated with Telegram🦎
Increase the inference speed of the model
삼각형의 실전! Triton
Add a description, image, and links to the triton topic page so that developers can more easily learn about it.
To associate your repository with the triton topic, visit your repo's landing page and select "manage topics."