sofan110 (Fan Wu) / Starred · GitHub

Fan Wu sofan110

11 followers · 18 following

UCAS

Achievements

Highlights

Pro

Stars

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI Community] Embodied AI Technical Guide (Embodied-AI-Guide)

5,473 356 Updated May 29, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,962 242 Updated Apr 30, 2025

qiwang067 / awesome-visual-rl

A curated list of visual reinforcement learning resources

282 12 Updated May 19, 2025

MoreanP / CSRO

Python 10 2 Updated May 6, 2024

123penny123 / Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

368 20 Updated Apr 24, 2024

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,724 291 Updated May 27, 2025

OFA-Sys / InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

261 7 Updated Aug 20, 2023

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

982 90 Updated May 23, 2024

opendilab / awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

1,120 67 Updated May 16, 2025

htyao89 / KgCoOp

Python 96 10 Updated Dec 7, 2023

Stanford-ILIAD / PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Python 148 22 Updated Nov 6, 2023

RElbers / info-nce-pytorch

PyTorch implementation of the InfoNCE loss for self-supervised learning.

Python 557 42 Updated Nov 17, 2023

Linear95 / CLUB

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Jupyter Notebook 334 41 Updated May 10, 2024

martius-lab / EQL

Equation Learner, a neural network approach to symbolic regression

Jupyter Notebook 78 19 Updated Sep 24, 2024

CR-Gjx / RIA

TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning" (ICLR 2022).

Python 16 4 Updated Jul 2, 2022

dennisl88 / rand_param_envs

Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7

Python 20 16 Updated Feb 14, 2019

metaopt / torchopt

TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

Python 592 36 Updated May 5, 2025

Pengwei-Jin / AICS-homework

Intelligent Computing Systems homework (2021)

C++ 54 11 Updated May 25, 2021

rraileanu / idaac

Python 53 13 Updated Feb 28, 2024

google-research / rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 832 49 Updated Aug 12, 2024

Anduin2017 / HowToCook

Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 85,789 9,992 Updated May 19, 2025

google-deepmind / bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Python 1,522 186 Updated Apr 13, 2024

sparisi / cbet

Change-Based Exploration Transfer

Python 36 5 Updated Apr 24, 2022

iclavera / learning_to_adapt

Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning

Python 212 50 Updated Dec 27, 2022

instillai / TensorFlow-Course

📡 Simple and ready-to-use tutorials for TensorFlow

Jupyter Notebook 16,382 3,182 Updated Nov 28, 2022

katerakelly / oyster

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

Python 491 131 Updated Dec 1, 2022

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,692 562 Updated Jun 17, 2024

younggyoseo / CaDM

CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning

Python 63 9 Updated May 20, 2020

p-christ / Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,818 1,200 Updated Jul 25, 2024

guojm14 / HRL

An HRL framework based on stable-baselines3

Python 7 1 Updated Jul 23, 2021
