Stars
A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GRPC services and as a standalone library, providing highly ef…
The Unofficial TikTok API Wrapper In Python
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Huggingface cloth segmentation using U2NET
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
Stable diffusion for real-time music generation
This is a horoscope generating code
Robust Speech Recognition via Large-Scale Weak Supervision
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
Soft speech units for voice conversion
High-Resolution Image Synthesis with Latent Diffusion Models
Repo accompanying the blog post "How to Deploy PyTorch Models with Core ML Conversion Issues"
This library augments road images to introduce various real world scenarios that pose challenges for training neural networks of Autonomous vehicles. Automold is created to train CNNs in specific w…
ModaNet: A large-scale street fashion dataset with polygon annotations
A library for efficient similarity search and clustering of dense vectors.
Taming Transformers for High-Resolution Image Synthesis
Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch
Python/numpy/pandas convenience wrapper for the TIMIT database.
Improving the Goodness of Pronunciation with DNNs and RNNs
Phone-level evaluation of L2 speakers (GOP algorithm)