Highlights
- Pro
Stars
Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"
A toolkit for identifying pretrained language models from potentially AI-generated text
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Ensembling Hugging Face transformers made easy
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
OSLO: Open Source framework for Large-scale model Optimization
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
PPETrackr is a web app that provides local governments with PPE inventory analytics from healthcare facilities.