8000 safinal (Ali Nafisi) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View safinal's full-sized avatar
💻
💻

Organizations

@rivlab

Block or report safinal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤗 smolagents: a barebones library for agents that think in code.

Python 19,459 1,691 Updated May 30, 2025

This repository contains the Hugging Face Agents Course.

MDX 19,153 1,280 Updated May 28, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,270 264 Updated May 3, 2025

Transformer based on a variant of attention that is linear complexity in respect to sequence length

Python 768 71 Updated May 5, 2024

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 758 60 Updated Sep 13, 2023

Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways

Python 822 82 Updated Nov 9, 2022

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,128 52 Updated May 13, 2025

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Python 1,145 136 Updated Aug 22, 2023

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Python 1,317 135 Updated May 3, 2024

Implementation of Alphafold 3 from Google Deepmind in Pytorch

Python 1,440 188 Updated Jan 22, 2025

To eventually become an unofficial Pytorch implementation / replication of Alphafold2, as details of the architecture get released

Python 1,604 264 Updated Oct 29, 2022

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,254 264 Updated Sep 6, 2023

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

Python 3,763 590 Updated Jan 12, 2025

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,337 459 Updated May 30, 2025

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Python 5,617 639 Updated Feb 17, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,836 677 Updated Apr 23, 2025

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,273 783 Updated Oct 7, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,403 1,139 Updated Oct 9, 2024

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,281 1,095 Updated May 11, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 22,945 3,292 Updated Mar 5, 2025

CDAN: Convolutional Dense Attention-guided Network for Low-light Image Enhancement

Python 37 3 Updated Mar 9, 2025

Semantic segmentation for aerial urban understanding using an attention-guided U-Net model.

Python 5 Updated Aug 30, 2023

Adversarial attacks againsts Large Language Models

Python 7 Updated Sep 29, 2024

Curated list of datasets and tools for post-training.

3,096 266 Updated Jan 29, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,832 6,146 Updated Aug 24, 2024

Mamba SSM architecture

Python 14,981 1,314 Updated May 25, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 53,576 5,688 Updated May 12, 2025
0