Links in this format have an issue:
"MD link <https://heise.de> example."
Code:
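# `somajo` is assumed to be an already constructed SoMaJo tokenizer (see the full code linked below).
text = "MD link <https://heise.de> example."
sentences = somajo.tokenize_text([text])
for sentence in sentences:
    for token in sentence:
        print(f"{token.text}\t{token.token_class}\t{token.extra_info}")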
Returns:
MD                  regular
link                regular
<                   symbol   SpaceAfter=No
https://heise.de/>  URL
example             regular  SpaceAfter=No
.                   symbol
Should return something like this:
MD                  regular
link                regular
<                   symbol   SpaceAfter=No
https://heise.de/   URL
>                   symbol
example             regular  SpaceAfter=No
.                   symbol
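To make the expected behaviour concrete, here is a minimal standalone sketch of the splitting asked for above: a URL wrapped in angle brackets becomes three tokens instead of the closing bracket staying glued to the URL. This is not SoMaJo's implementation. The names `ANGLE_URL`, `split_angle_url` and `naive_tokenize` are hypothetical, and the whitespace-based tokenizer is deliberately naive (it does not split the final period from "example").

```python
import re

# Hypothetical pattern, not taken from SoMaJo: a whole chunk of the form <http(s)://...>
ANGLE_URL = re.compile(r"^<(https?://[^\s<>]+)>$")


def split_angle_url(chunk):
    """Return ['<', url, '>'] for a chunk like '<url>', otherwise [chunk]."""
    match = ANGLE_URL.match(chunk)
    if match:
        return ["<", match.group(1), ">"]
    return [chunk]


def naive_tokenize(text):
    """Split on whitespace, then separate angle brackets from enclosed URLs."""
    tokens = []
    for chunk in text.split():
        tokens.extend(split_angle_url(chunk))
    return tokens


print(naive_tokenize("MD link <https://heise.de> example."))
# prints ['MD', 'link', '<', 'https://heise.de', '>', 'example.']
```

A check like this could serve as a quick regression test for the bracket handling, independent of the tokenizer's other rules.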
Full code: https://colab.research.google.com/drive/16-CKdzp20Gin02emrLVeHfFFir2veK8M?usp=sharing
This should be closely related to #26 and can probably be fixed in the same manner. I’ll see if I can take a closer look tomorrow.
Correctly tokenize URLs in angle brackets (#27)
fb788e6
It should be fixed in version 2.4.1!
It is fixed. Many thanks.