Release v0.3.0 · AutoGPTQ/AutoGPTQ · GitHub
This repository was archived by the owner on Apr 11, 2025. It is now read-only.

v0.3.0

@PanQiWei PanQiWei released this 16 Jul 08:11
· 291 commits to main since this release

Overview

  • CUDA kernel improvements: support models whose hidden_size is divisible only by 32 or 64, rather than requiring divisibility by 256.
  • PEFT integration: support training and inference using LoRA, AdaLoRA, AdaptionPrompt, etc.
  • New models: BaiChuan, InternLM.
  • Other updates: see 'Full Change Log' below for details.
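
As a rough illustration of the hidden_size constraint in the first item, the sketch below checks which of the divisors named in these notes (256, 64, 32) a given hidden_size satisfies. The helper name `largest_kernel_divisor` and the constant `KERNEL_DIVISORS` are hypothetical and are not part of the AutoGPTQ API; the real kernels may apply further conditions that this release note does not describe.

```python
from typing import Optional

# Divisors mentioned in the release notes, largest first.
# Hypothetical constant for illustration only, not an AutoGPTQ name.
KERNEL_DIVISORS = (256, 64, 32)


def largest_kernel_divisor(hidden_size: int) -> Optional[int]:
    """Return the largest listed divisor that evenly divides hidden_size.

    Returns None when hidden_size is not divisible by 32, 64, or 256.
    This is an assumed reading of the release note, not the library's
    actual dispatch logic.
    """
    if hidden_size <= 0:
        raise ValueError("hidden_size must be a positive integer")
    for divisor in KERNEL_DIVISORS:
        if hidden_size % divisor == 0:
            return divisor
    return None


if __name__ == "__main__":
    # Example sizes chosen only to exercise each branch.
    for size in (4096, 4544, 2080, 1000):
        print(size, largest_kernel_divisor(size))
```

Under this reading, a hidden_size for which the helper returns 64 or 32 is the case this release newly covers, while a result of None falls outside what the note describes.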

Full Change Log

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.3.0
