Stars
An arbitrary face-swapping framework on images and videos with one single trained model!
🎨 Semantic segmentation models, datasets and losses implemented in PyTorch.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP
An open source implementation of CLIP.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
deep learning for image processing including classification and object-detection etc.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…