8000 richardburleigh (Richard Burleigh) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View richardburleigh's full-sized avatar

Block or report richardburleigh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Prompts for our Grok chat assistant and the `@grok` bot on X.

Jinja 2,483 231 Updated May 16, 2025

Transformer-Mamba Diffusion Models

Python 107 8 Updated Jun 30, 2024

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 365 21 Updated May 23, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,119 169 Updated May 24, 2025

Adding timestamped prompts and general quality of life features to FramePack https://discord.gg/MtuM7gFJ3V

Python 358 32 Updated May 22, 2025

Official inference framework for 1-bit LLMs

Python 19,803 1,477 Updated May 23, 2025

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python 102 6 Updated Apr 24, 2025

Auto generate MindMap with ChatGPT

Vue 244 53 Updated Feb 28, 2024
Python 19 2 Updated Mar 3, 2025

Only implemented through torch: "bi - mamba2" , "vision- mamba2 -torch". support 1d/2d/3d/nd and support export by jit.script/onnx;

Python 327 12 Updated Dec 11, 2024

Simple go utility to download HuggingFace Models and Datasets

Go 674 80 Updated Oct 26, 2024

Command and Conquer: Generals - Zero Hour

C++ 4,172 1,366 Updated Feb 27, 2025

Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)

Python 51 Updated Mar 25, 2025

Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"

Python 298 17 Updated Dec 23, 2024

🤗 AutoTrain Advanced

Python 4,394 574 Updated Jan 21, 2025

This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.

105 10 Updated Mar 27, 2025
50 2 Updated Feb 17, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,778 277 Updated May 15, 2025

A Conversational Speech Generation Model

Python 13,333 1,270 Updated Mar 27, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,922 886 Updated May 21, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,639 415 Updated Mar 5, 2025

Expert Parallelism Load Balancer

Python 1,196 190 Updated Mar 24, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,782 295 Updated Mar 10, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,371 599 Updated May 20, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,689 773 Updated May 23, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,570 835 Updated Apr 29, 2025

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 628 35 Updated May 16, 2025

Audiobook Creator is an open-source tool that converts books (EPUB, PDF, TXT) into fully voiced audiobooks with intelligent character voice attribution. It uses NLP, LLMs, and Kokoro TTS to generat…

Python 256 21 Updated May 4, 2025

Explorations into adversarial losses on top of autoregressive loss for language modeling

Python 36 1 Updated Feb 22, 2025
Next
0