sofan110 (Fan Wu) / Starred · GitHub

[Lumina Embodied AI Community] Embodied AI Technical Guide (Embodied-AI-Guide)

5,473 356 Updated May 29, 2025

A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)

3,962 242 Updated Apr 30, 2025

A curated list of visual reinforcement learning resources

282 12 Updated May 19, 2025

Python 10 2 Updated May 6, 2024

A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models, including LLMs and VLMs.

368 20 Updated Apr 24, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites

3,724 291 Updated May 27, 2025

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

261 7 Updated Aug 20, 2023

An index of algorithms for offline reinforcement learning (offline-rl)

982 90 Updated May 23, 2024

A curated list of awesome model-based RL resources (continually updated)

1,120 67 Updated May 16, 2025

Python 96 10 Updated Dec 7, 2023

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Python 148 22 Updated Nov 6, 2023

PyTorch implementation of the InfoNCE loss for self-supervised learning.

Python 557 42 Updated Nov 17, 2023

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Jupyter Notebook 334 41 Updated May 10, 2024

Equation Learner, a neural network approach to symbolic regression

Jupyter Notebook 78 19 Updated Sep 24, 2024

TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning" (ICLR 2022).

Python 16 4 Updated Jul 2, 2022

Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7

Python 20 16 Updated Feb 14, 2019

TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

Python 592 36 Updated May 5, 2025

Intelligent Computing Systems coursework (2021)

C++ 54 11 Updated May 25, 2021

Python 53 13 Updated Feb 28, 2024

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 832 49 Updated Aug 12, 2024

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 85,789 9,992 Updated May 19, 2025

bsuite is a collection of carefully designed experiments that investigate core capabilities of a reinforcement learning (RL) agent.

Python 1,522 186 Updated Apr 13, 2024

Change-Based Exploration Transfer

Python 36 5 Updated Apr 24, 2022

Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning

Python 212 50 Updated Dec 27, 2022

📡 Simple and ready-to-use tutorials for TensorFlow

Jupyter Notebook 16,382 3,182 Updated Nov 28, 2022

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

Python 491 131 Updated Dec 1, 2022

Collection of reinforcement learning algorithms

Python 2,692 562 Updated Jun 17, 2024

CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning

Python 63 9 Updated May 20, 2020

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,818 1,200 Updated Jul 25, 2024

An HRL framework based on Stable-Baselines3

Python 7 1 Updated Jul 23, 2021