8000 Release v0.3.1 · thu-pacman/chitu · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

v0.3.1

Latest
Compare
Choose a tag to compare
@roastduck roastduck released this 30 Apr 07:21
· 13 commits to public-main since this release

Better support for MetaX (沐曦) GPUs:

  • Support of both Llama-like models and DeepSeek models. Tested with DeepSeek-R1-Distill-Llama-70B and DeepSeek-R1-671B using bf16, fp16, and soft fp8 precision.
  • New infer.op_impl=muxi_custom_kernel mode optimized for small batches.
0