8000 [LLM] Support block_attention/cachekv quant for llama by RichardWooSJTU · Pull Request #7649 · PaddlePaddle/PaddleNLP · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[LLM] Support block_attention/cachekv quant for llama#7649

Merged
wawltor merged 15 commits intoPaddlePaddle:developfrom
RichardWooSJTU:restruct_52_dev
Jan 10, 2024

Commits

Commits on Dec 20, 2023

Commits on Dec 29, 2023

Commits on Jan 5, 2024

Commits on Jan 8, 2024

Commits on Jan 9, 2024

Commits on Jan 10, 2024

0