Closed
Description
When trying to parse certain malformed unicode characters, Sedlexing calls Uchar.of_int
without sufficient verification that the codepoint is well-formed. This results in Uchar.of_int
triggering a Stdlib.Invalid_argument s
where s
is some error message string. Instead, MalFormed
should be raised, which would allow clean handling of this particular exception from the sedlex user side instead of a catch-all treatment of Invalid_argument
.
An example file that contains such a malformed codepoint triggering this error can be found at
https://github.com/Andromedans/andromeda/blob/c45d500b3650a6df30d5a1ef09e2d677aeacc190/tests/malformed-unicode.m31
Metadata
Metadata
Assignees
Labels
No labels