roastduck (roastduck) / Repositories · GitHub

Highlights

  • Pro

  • FlashMLA Public

    Forked from deepseek-ai/FlashMLA
    C++ MIT License Updated Feb 26, 2025
  • Python MIT License Updated Feb 7, 2025
  • FreeTensor Public

    A language and compiler for irregular tensor programs.

    C++ 138 10 Apache License 2.0 Updated Nov 29, 2024
  • PyTorch extensions for high performance and large scale training.

    Python Other Updated Nov 26, 2024
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 1 Apache License 2.0 Updated Oct 17, 2024
  • cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++ Other Updated Sep 10, 2024
  • YAUJ Public

    Yet Another Universal Judge

    C 7 5 Updated Aug 17, 2024
  • benchmark Public

    Forked from google/benchmark

    A microbenchmark support library

    C++ Apache License 2.0 Updated Aug 13, 2024
  • taskflow Public

    Forked from taskflow/taskflow

    A General-purpose Task-parallel Programming System using Modern C++

    C++ Other Updated Aug 13, 2024
  • AutoGPTQ Public

    Forked from AutoGPTQ/AutoGPTQ

    An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

    Python MIT License Updated Aug 9, 2024
  • pybind11 Public

    Forked from pybind/pybind11

    Seamless operability between C++11 and Python

    C++ Other Updated Aug 9, 2024
  • googletest Public

    Forked from google/googletest

    GoogleTest - Google Testing and Mocking Framework

    C++ BSD 3-Clause "New" or "Revised" License Updated Aug 7, 2024
  • spdlog Public

    Forked from gabime/spdlog

    Fast C++ logging library.

    C++ Other Updated Jul 22, 2024
  • llm-awq Public

    Forked from mit-han-lab/llm-awq

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python MIT License Updated Jul 16, 2024
  • EETQ Public

    Forked from NetEase-FuXi/EETQ

    Easy and Efficient Quantization for Transformers

    C++ Apache License 2.0 Updated Jul 15, 2024
  • fastmoe Public

    Forked from laekov/fastmoe

    A fast MoE impl for PyTorch

    Python Apache License 2.0 Updated Jun 3, 2024
  • TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

    Python BSD 3-Clause "New" or "Revised" License Updated May 2, 2024
  • ADBench Public

    Forked from microsoft/ADBench

    Benchmarking various AD tools.

    C++ 1 3 MIT License Updated Mar 15, 2024
  • checkout Public

    Forked from actions/checkout

    Action for checking out a repo

    TypeScript MIT License Updated Jan 12, 2024
  • Experiments on FreeTensor

    Jupyter Notebook 3 1 Apache License 2.0 Updated Nov 14, 2023
  • Enzyme Public

    Forked from EnzymeAD/Enzyme

    High-performance automatic differentiation of LLVM and MLIR.

    LLVM Other Updated Jul 6, 2023
  • onnx Public

    Forked from onnx/onnx

    Open standard for machine learning interoperability

    C++ Apache License 2.0 Updated Oct 8, 2021
  • taco Public

    Forked from tensor-compiler/taco

    The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

    C++ Other Updated Dec 16, 2020
  • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

    Python Apache License 2.0 Updated Dec 7, 2020
  • incubator-tvm Public

    Forked from apache/tvm

    Open deep learning compiler stack for cpu, gpu and specialized accelerators

    Python Apache License 2.0 Updated Sep 17, 2020
  • longformer Public

    Forked from allenai/longformer

    Longformer: The Long-Document Transformer

    Python Apache License 2.0 Updated Jun 6, 2020
  • Userspace for roastduck/linux:async. Work in progress.

    C++ Updated May 26, 2020
  • linux Public

    Forked from torvalds/linux

    Linux kernel source tree

    C 1 Other Updated May 23, 2020
  • capsule Public archive

    Capsule network implemented with TVM

    Python Updated Oct 23, 2019
  • rfs Public

    A simple FS as a practice of Rust

    Rust Updated Oct 13, 2019