Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
-
Updated
Jan 8, 2025 - Python
8000
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Automatic Speech Recognition (ASR) - German
Calculate your taxes from cryptocurrency gains
A tokenizer and sentence splitter for German and English web and social media texts.
A lemmatizer for German language text
Ten Thousand German News Articles Dataset for Topic Classification
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!
An easy to use python package for deep learning-based german sentiment classification.
Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages
📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF
Finetuning instruct-LLaMA on german datasets.
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
Vocab-based profanity checking tool for English, Spanish, Portuguese, German, and Turkish.
a decent German Diceware word list to generate memorable passphrases
product recommendation text generation using OpenCCG
Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German
Add a description, image, and links to the german topic page so that developers can more easily learn about it.
To associate your repository with the german topic, visit your repo's landing page and select "manage topics."