Stars
Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including CPU, AMD, and NVIDIA GPUs.
A lexical normalizer for historical spelling variants using a transformer architecture.
MetaQuRe - code and data for rethinking and performing AutoML and meta-learning in a resource-aware way
Python wrapper for obtaining synonyms in the German language from OpenThesaurus
optimagic is a Python package for numerical optimization. It is a unified interface to optimizers from SciPy, NlOpt and other packages. optimagic's minimize function works just like SciPy's, so you…
State-of-the-art language models and classifier for code-mixed Manglish (Malayalam and English), spoken in the Indian subcontinent.
A collaborative catalog of NLP resources for Indic languages
Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.
Code from Jia and Liang, "Adversarial Examples for Evaluating Reading Comprehension Systems" (EMNLP 2017)
A Python package to simulate typographical errors.
Track emissions from compute and recommend ways to reduce their impact on the environment.
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
🇩🇪 Preprocess German texts to do some serious natural-language processing.
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
OpenAI Whisper ASR Webservice API
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
All Algorithms implemented in Python
Curated list of project-based tutorials
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
SapiMouse - a new dataset for Mouse Dynamics
Project code used in "The Docker Handbook"