use fancy-regex instead of onig as tokenizers regex library by Slowki · Pull Request #172 · guidance-ai/llguidance · GitHub

use fancy-regex instead of onig as tokenizers regex library #172


Merged on May 6, 2025 (1 commit)


@Slowki (Contributor) commented May 6, 2025

The version of Oniguruma used in onig_sys doesn't build on GCC 15 and the oniguruma project itself got archived last week, so this PR switches tokenizers to the fancy-regex backend.

fancy-regex also requires flipping on the unstable_wasm feature until huggingface/tokenizers#1772 lands. That flag doesn't have any ill effects, though, since everything WASM-related downstream is behind target_arch checks.

tl;dr: This fixes builds on Linux distros with newer GCC versions like Arch Linux and Fedora.
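
For readers unfamiliar with the `target_arch` gating mentioned above, here is a minimal sketch of the pattern. It is not code from tokenizers or llguidance, and the function name `wasm_paths_enabled` is hypothetical. The point is that a `#[cfg(target_arch = ...)]` attribute selects code by compilation target, independently of which Cargo features are enabled, which is why turning on a WASM-named feature for a native Linux build would not pull in WASM-only code guarded this way.

```rust
// Illustrative sketch only: `wasm_paths_enabled` is a hypothetical name,
// not a function from tokenizers or llguidance.

/// Compiled only when the build target is wasm32.
#[cfg(target_arch = "wasm32")]
pub fn wasm_paths_enabled() -> bool {
    true
}

/// Compiled for every other target (for example x86_64 Linux), regardless
/// of which Cargo features are switched on.
#[cfg(not(target_arch = "wasm32"))]
pub fn wasm_paths_enabled() -> bool {
    false
}
```

Exactly one of the two definitions exists in any given build, so callers can use `wasm_paths_enabled()` without any conditional compilation of their own.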

Oniguruma doesn't build on GCC 15 and the project got archived last week,
so this PR switches to the fancy-regex backend.

`fancy-regex` also requires flipping on the `unstable_wasm` feature until
huggingface/tokenizers#1772 lands. Right now that flag doesn't have any
ill effects since everything WASM related downstream is behind `target_arch`
checks.
@mmoskal (Member) left a comment


Thank you, looks good!

@mmoskal mmoskal merged commit b59f98f into guidance-ai:main May 6, 2025
2 checks passed