Stars
🤗 smolagents: a barebones library for agents that think in code.
This repository contains the Hugging Face Agents Course.
Vector (and Scalar) Quantization, in Pytorch
Transformer based on a variant of attention that is linear complexity in respect to sequence length
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Implementation of Alphafold 3 from Google Deepmind in Pytorch
To eventually become an unofficial Pytorch implementation / replication of Alphafold2, as details of the architecture get released
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
CDAN: Convolutional Dense Attention-guided Network for Low-light Image Enhancement
Semantic segmentation for aerial urban understanding using an attention-guided U-Net model.
Adversarial attacks againsts Large Language Models
Curated list of datasets and tools for post-training.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.