Release 0.0.2 -- inference server & other improvements! #108
francoishernandez
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
We just released 0.0.2, which has been quite an iteration since 0.0.1.
Some long awaited changes are finally making it to the main branch!
🌟 Key Features
mapped_tokens
to efficiently handle specific prompt tokens.⚙️ Notable Improvements
bfloat16
Support: Unlock better performance on specialized hardware with our new bfloat16 support, balancing memory use and precision.💬 Drop a comment or get in touch if you have any feedback or question!
Beta Was this translation helpful? Give feedback.
All reactions