-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Issues: nltk/nltk
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
word_tokenize fails due to punkt_tab reference in multiple environments
#3394
opened May 29, 2025 by
daniel-mehta
Punkt Sentence Tokenizer incorrectly splits on multi-period abbreviations
#3370
opened Feb 24, 2025 by
alexcannan
ccg_semantics’ parser has undesirable side effects on the lexicon
8000
#3345
opened Dec 9, 2024 by
ShadokDuBas
duplicated short option name for tokenize's
--language
and --preserve-line
#3342
opened Nov 25, 2024 by
stanislavlevin
KneserNeyInterpolated taking an unreasonable amount of time to generate text
#3317
opened Sep 3, 2024 by
owo
Replacing black with ruff in CI/CD precommit hook
CI
good first issue
nice idea
#3278
opened Jul 8, 2024 by
alvations
Summing Ngram LM probabilities requires math.fsum
bug
language-model
#3275
opened Jul 6, 2024 by
alvations
Add functionality to return the lemmas of words used in a corpus.
#3256
opened May 21, 2024 by
Sion1225
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-05-02.