Stars
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
A simple Flash Attention v2 implementation with ROCm (RDNA3 GPU, rocWMMA), mainly used for Stable Diffusion (ComfyUI) in Windows ZLUDA environments.
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
HunyuanVideo: A Systematic Framework for Large Video Generation Models
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Fast and memory-efficient exact attention
Wan: Open and Advanced Large-Scale Video Generative Models
SGLang is a fast serving framework for large language models and vision language models.
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
JundaLi07 / ktransformers
Forked from kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient MLA decoding kernels
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
NVIDIA Linux open GPU with P2P support
Computer science books Recommended by AzatAI. (Education ONLY)
IIMS College AI class of batch 2022
Machine Learning Resources, Practice and Research
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
🦜🔗 Build context-aware reasoning applications
Accelerate inference without tears
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training