8000 HaoningJiang-space / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View HaoningJiang-space's full-sized avatar

Highlights

  • Pro

Block or report HaoningJiang-space

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A High-efficiency Open-source Toolkit for Table-to-Latex Task

Python 242 21 Updated Dec 12, 2024

A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).

Python 7 1 Updated Feb 9, 2025

🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper

Python 30 1 Updated Feb 7, 2025

Reinforcement learning assisted analog layout design flow.

Python 23 13 Updated Jun 17, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 7,813 565 Updated Jan 3, 2025
Python 4 Updated Jan 12, 2025
HTML 2 Updated Jan 1, 2025

BME-X: A foundation model for enhancing magnetic resonance images and downstream segmentation, registration and diagnostic tasks

Python 37 3 Updated Apr 20, 2025

😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.

313 13 Updated Jun 3, 2025

Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020

Python 45 8 Updated Jul 19, 2023

Codes accompanying the paper "Bayesian Design Principles for Offline-to-Online Reinforcement Learning" (ICML 2024)

Python 7 Updated May 30, 2024

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,368 101 Updated Mar 3, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,664 478 Updated Jan 8, 2024

Deep Reinforcement Learning of Analog Circuit Designs

Python 111 42 Updated Jun 12, 2023

Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch

Python 825 68 Updated Dec 24, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,963 1,630 Updated May 26, 2025

SQLi detection on DPU

Jupyter Notebook 2 1 Updated Dec 11, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,400 6,355 Updated Jun 7, 2025

Awesome Papers About Performing Prompting On Graphs

392 26 Updated May 15, 2025

Must-read papers on Graph Neural Networks (GNNs) for Integrated Circuits (ICs) design, security and reliability.

54 5 Updated Mar 13, 2025
0