8000 tdh-archive repositories · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Change the repository type filter

All

    Repositories list

    • Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
      C++
      MIT License
      146000Updated Apr 26, 2025Apr 26, 2025
    • A concise but complete full-attention transformer with a set of promising experimental features from various papers
      Python
      MIT License
      456000Updated Apr 18, 2025Apr 18, 2025
    • Python
      MIT License
      16k000Updated Apr 18, 2025Apr 18, 2025
    • prima.cpp

      Public
      prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters
      C++
      MIT License
      46000Updated Apr 17, 2025Apr 17, 2025
    • burn

      Public
      Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
      Rust
      Apache License 2.0
      569000Updated Mar 25, 2025Mar 25, 2025
    • Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
      Python
      MIT License
      31000Updated Mar 15, 2025Mar 15, 2025
    • FlashMLA

      Public
      C++
      MIT License
      831000Updated Mar 15, 2025Mar 15, 2025
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      MIT License
      585000Updated Mar 15, 2025Mar 15, 2025
    • zstd

      Public
      Zstandard - Fast real-time compression algorithm
      C
      Other
      2.2k000Updated Mar 15, 2025Mar 15, 2025
    • exo

      Public
      Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
      Python
      GNU General Public License v3.0
      1.8k000Updated Mar 15, 2025Mar 15, 2025
    • A cross-platform, reimplementation of Notepad++
      C++
      GNU General Public License v3.0
      598000Updated Mar 15, 2025Mar 15, 2025
    • mflux

      Public
      A MLX port of FLUX based on the Huggingface Diffusers implementation.
      Python
      MIT License
      81000Updated Mar 15, 2025Mar 15, 2025
    • picotron

      Public
      Minimalistic 4D-parallelism distributed training framework for education purpose
      Python
      Apache License 2.0
      98000Updated Mar 7, 2025Mar 7, 2025
    • Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
      Python
      MIT License
      3.3k000Updated Mar 5, 2025Mar 5, 2025
    • mamba.py

      Public
      A simple and efficient Mamba implementation in pure PyTorch and MLX.
      Python
      MIT License
      104000Updated Dec 4, 2024Dec 4, 2024
    • Lime3DS

      Public
      A 3DS emulator based on Citra
      C++
      GNU General Public License v2.0
      261000Updated Oct 30, 2024Oct 30, 2024
    • cake

      Public
      Distributed LLM and StableDiffusion inference for mobile, desktop and server.
      Rust
      Other
      164000Updated Oct 23, 2024Oct 23, 2024
    • grok-1

      Public
      Grok open release
      Python
      Apache License 2.0
      8.4k000Updated Aug 30, 2024Aug 30, 2024
    • A pure and fast NumPy implementation of Mamba with cache support.
      Python
      MIT License
      1000Updated Jun 16, 2024Jun 16, 2024
    • 1.58 Bit LLM on Apple Silicon using MLX
      Python
      24000Updated May 10, 2024May 10, 2024
    • candle

      Public
      Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
      Jupyter Notebook
      4000Updated Apr 12, 2024Apr 12, 2024
    0