-
birentech
- 中国
-
19:34
(UTC -12:00)
8000
More
Stars
Distributed Compiler Based on Triton for Parallel Systems
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA.
A category wise collection of 200+ LLM survey papers.
A data augmentations library for audio, image, text, and video.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A high-throughput and memory-efficient inference and serving engine for LLMs
Multilingual Voice Understanding Model
Fine-Grained Open Domain Image Animation with Motion Guidance
A generative speech model for daily dialogue.
Effortless data labeling with AI support from Segment Anything and other awesome models.
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
SGLang is a fast serving framework for large language models and vision language models.
Efficiently computes derivatives of NumPy code.
High-speed Large Language Model Serving for Local Deployment
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Minimalistic large language model 3D-parallelism training
Instant voice cloning by MIT and MyShell. Audio foundation model.
You like pytorch? You like micrograd? You love tinygrad! ❤️
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
DLRover: An Automatic Distributed Deep Learning System