aiha-lab repositories · GitHub

    Repositories list

    • 0100 Updated Jun 23, 2025
    • sqil

      Public
      HTML
      0000 Updated May 28, 2025
    • RILQ

      Public
      Python
      0110 Updated Mar 19, 2025
    • Quantization Framework for LLM Inferences
      Python
      0200 Updated Mar 11, 2025
    • MX-QLLM

      Public
      LLM Inference with Microscaling Format
      Python
      22310 Updated Nov 12, 2024
    • MapCoder

      Public
      MapCoder: Multi-Agent Code Generation for Competitive Problem Solving
      Python
      MIT License
      29000 Updated Oct 7, 2024
    • pim-iree

      Public
      Compiler and runtime implementation for PIM device.
      C++
      Apache License 2.0
      718200 Updated Dec 15, 2023
    • serpim

      Public
      👻
      C++
      Apache License 2.0
      718000 Updated Dec 14, 2023
    • iree

      Public
      👻
      C++
      Apache License 2.0
      718000 Updated Dec 14, 2023
    • TSLD

      Public
      [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
      Python
      11700 Updated Dec 6, 2023
    • TVM-VTA

      Public
      setting
      CMake
      0000 Updated Apr 28, 2023
    • tpu-mlir

      Public
      Machine learning compiler based on MLIR for Sophgo TPU.
      C++
      Other
      182000 Updated Jan 16, 2023
    • AI System Design - Final Project
      0000 Updated Dec 20, 2022
    • Python
      Apache License 2.0
      2901 Updated Nov 4, 2022
    • Inference code for AI Challenge (Dec 2020)
      Jupyter Notebook
      GNU General Public License v3.0
      0600 Updated Feb 22, 2022
    • TernGEMM

      Public
      TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference
      C++
      GNU General Public License v3.0
      11310 Updated Feb 22, 2022
    • Layer-wise Pruning of Transformer Heads for Efficient Language Modeling
      Python
      GNU General Public License v3.0
      02100 Updated Feb 22, 2022
    • Python
      GNU General Public License v3.0
      0800 Updated Feb 22, 2022
    • Python
      Apache License 2.0
      0000 Updated Aug 31, 2021
    • Cuda
      0000 Updated Aug 12, 2021
    • lsq-lab

      Public
      Python
      MIT License
      0000 Updated Aug 9, 2021
    • Samsung 2021 QPyTorch Lab
      Jupyter Notebook
      1000 Updated Aug 9, 2021
    • optimus + timeloop implementation
      Python
      MIT License
      6000 Updated May 10, 2021
    • [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
      Python
      Apache License 2.0
      287000 Updated Mar 12, 2020
    • DNN

      Public
      C++
      25000 Updated Jun 26, 2017
    • cuMat

      Public
      Cuda
      15000 Updated Jun 22, 2017