vectorch-ai repositories · GitHub

    Repositories list

    • ScaleLLM

      Public
      A high-performance inference system for large language models, designed for production environments.
      C++
      Apache License 2.0
      35439489 Updated May 15, 2025
    • flux

      Public
      A fast communication-overlapping library for tensor/expert parallelism on GPUs.
      C++
      Apache License 2.0
      59000 Updated Apr 15, 2025
    • whl

      Public
      Repository to host Python whl packages.
      HTML
      0000 Updated Mar 2, 2025
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      MIT License
      881000 Updated Feb 28, 2025
    • FlashInfer: Kernel Library for LLM Serving
      Cuda
      Apache License 2.0
      306000 Updated Feb 27, 2025
    • FlashMLA

      Public
      C++
      MIT License
      834000 Updated Feb 26, 2025
    • vcpkg

      Public
      C++ Library Manager for Windows, Linux, and macOS
      CMake
      MIT License
      6.8k000 Updated Feb 24, 2025
    • 0000 Updated Jun 5, 2024
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.7k000 Updated Oct 15, 2023
    • 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
      Rust
      Apache License 2.0
      896000 Updated Aug 4, 2023
    • xformers

      Public
      Hackable and optimized Transformers building blocks, supporting a composable construction.
      Python
      Other
      671000 Updated Aug 1, 2023
    • Transformer related optimization, including BERT, GPT
      C++
      Apache License 2.0
      904000 Updated Jul 28, 2023
    • Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
      C++
      Apache License 2.0
      37000 Updated Jul 24, 2023