Highlights
- Pro
Stars
A High-efficiency Open-source Toolkit for Table-to-Latex Task
A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).
🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper
Reinforcement learning assisted analog layout design flow.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
BME-X: A foundation model for enhancing magnetic resonance images and downstream segmentation, registration and diagnostic tasks
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020
Codes accompanying the paper "Bayesian Design Principles for Offline-to-Online Reinforcement Learning" (ICML 2024)
Secrets of RLHF in Large Language Models Part I: PPO
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Deep Reinforcement Learning of Analog Circuit Designs
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
This repository contains demos I made with the Transformers library by HuggingFace.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Awesome Papers About Performing Prompting On Graphs
Must-read papers on Graph Neural Networks (GNNs) for Integrated Circuits (ICs) design, security and reliability.