10000 Building mildly deep automatons takes a long time · Issue #10 · scoder/acora · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Building mildly deep automatons takes a long time #10

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
pombredanne opened this issue Jun 24, 2016 · 2 comments
Open

Building mildly deep automatons takes a long time #10

pombredanne opened this issue Jun 24, 2016 · 2 comments

Comments

@pombredanne
Copy link

With this snippet and the latest 2.0, which creates an automaton with 1000 strings of 2000 characters each build() takes forever to complete, I eventually killed it:

>>> from array import array
>>> from acora import AcoraBuilder
>>> tks =[array('h', range(x, x+1000)).tostring() for x in range(1000)]
>>> builder = AcoraBuilder(*tks)
>>> ac=builder.build()
@pombredanne
Copy link
Author
pombredanne commented Jun 24, 2016

note this is a follow up on #6

@pombredanne
Copy link
Author

FWIW, the building of an automaton in @WojciechMula 's https://github.com/WojciechMula/pyahocorasick/blob/master/py/pyahocorasick.py (not even the C implementation) is much much faster.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant
0