Closed
Description
While converting MediaWiki markup to plain text, pandoc freezes at some point, seemingly non-deterministically. I am using the Ruby gem pandoc-ruby as a wrapper for pandoc 1.12. I parse the text in 6000-character chunks, because a whole Wikipedia page can be several hundred thousand characters long.
I can't tell you which input it is processing when it freezes: the script has been running for 12 hours and I don't know where in the file it currently is. Could this be some kind of bug, given that it occurs randomly?
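One way to find out which chunk triggers the freeze is to run each chunk through pandoc with a timeout and save the chunk that exceeds it. The following is only a minimal sketch in Python, not the pandoc-ruby setup described above. It assumes pandoc is on the PATH and is invoked as `pandoc -f mediawiki -t plain`. The function names, the 60-second limit, and the `stuck_chunk.txt` file name are all hypothetical.

```python
import subprocess
import sys

# Assumed pandoc invocation: read MediaWiki markup on stdin, write plain text.
PANDOC_CMD = ["pandoc", "-f", "mediawiki", "-t", "plain"]


def split_chunks(text, size=6000):
    """Split text into consecutive pieces of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def convert_chunk(chunk, cmd=PANDOC_CMD, timeout=60):
    """Return the command's output, or None if it ran longer than `timeout` seconds."""
    try:
        # subprocess.run kills the child process when the timeout expires.
        result = subprocess.run(
            cmd, input=chunk, capture_output=True, text=True, timeout=timeout
        )
    except subprocess.TimeoutExpired:
        return None
    return result.stdout


def convert_page(text, cmd=PANDOC_CMD, timeout=60, dump_path="stuck_chunk.txt"):
    """Convert a page chunk by chunk; save the first chunk that times out."""
    parts = []
    for index, chunk in enumerate(split_chunks(text)):
        out = convert_chunk(chunk, cmd, timeout)
        if out is None:
            # Keep the offending input so it can be attached to a bug report.
            with open(dump_path, "w", encoding="utf-8") as fh:
                fh.write(chunk)
            print(f"chunk {index} timed out; saved to {dump_path}", file=sys.stderr)
            break
        parts.append(out)
    return "".join(parts)
```

If a chunk is saved this way, running pandoc on that file alone should show whether the freeze is reproducible for that input or really random.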