cuijiaxun

Touching Fish

Jiaxun Cui cuijiaxun

Touching Fish

72 followers · 53 following

The University of Texas at Austin
Austin, TX
04:26 (UTC -05:00)
cuijiaxun.github.io
https://orcid.org/0009-0009-1987-9549
@cuijiaxun

Stars

multi-agent-systems-failure-taxonomy / MAST

Python 189 13 Updated May 21, 2025

pmariglia / poke-engine

A Pokemon battle engine that can search through Pokemon states

Rust 21 3 Updated Apr 21, 2025

PyO3 / pyo3

Rust bindings for the Python interpreter

Rust 13,666 831 Updated May 21, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 14,579 1,822 Updated May 23, 2025

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,235 295 Updated May 23, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,367 1,028 Updated May 23, 2025

facebookresearch / MLGym

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 495 45 Updated May 13, 2025

sethkarten / pokechamp

Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.

Python 58 6 Updated Mar 28, 2025

TauricResearch / TradingAgents

TradingAgents: Multi-Agents LLM Financial Trading Framework

JavaScript 306 31 Updated Feb 2, 2025

Princeton-RL / contrastive-successor-features

Python 8 2 Updated Dec 14, 2024

seohongpark / fql

The official implementation of flow Q-learning (FQL)

Python 151 12 Updated Mar 12, 2025

google-deepmind / simulation_streams

Simulation Streams is a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynamic simulations and agentic workflows.

Python 20 5 Updated Mar 13, 2025

Stanford-ILIAD / PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Python 147 22 Updated Nov 6, 2023

ucl-dark / paired

PAIRED in PyTorch 🔥

Python 60 20 Updated Mar 8, 2023

allenzren / open-pi-zero

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 904 60 Updated Jan 31, 2025

lqiang67 / rectified-flow

code based for rectified flow

Python 148 10 Updated May 20, 2025

FanGShiYuu / CoDrivingLLM

[IEEE-TVT(2025)]Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-making Framework

Python 46 5 Updated Mar 21, 2025

UM-ARM-Lab / pytorch_mppi

Model Predictive Path Integral (MPPI) with approximate dynamics implemented in pytorch

Python 553 65 Updated Nov 14, 2024

k4ntz / OC_Atari

Object Centric Atari games

Python 78 13 Updated May 14, 2025

denisyarats / exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

Python 114 9 Updated Feb 8, 2022

interaction-dataset / interaction-dataset

Interaction Dataset Python Scripts

Python 223 67 Updated Jul 26, 2022

NVIDIA / Cosmos

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,987 512 Updated Apr 29, 2025

brendenlake / SCAN

Simple language-driven navigation tasks for studying compositional learning

195 26 Updated Nov 5, 2020

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 44,871 6,797 Updated May 23, 2025

BoosterRobotics / booster_gym

Booster Gym is a reinforcement learning (RL) framework designed for humanoid robot locomotion developed by Booster Robotics.

Python 90 14 Updated Jan 9, 2025

BoosterRobotics / robocup_demo

The Booster T1 Robocup official demo allows the robot to make autonomous decisions to kick the ball and complete the full Robocup match. It includes three programs: vision, brain, and game_controller.

C++ 27 Updated May 23, 2025

isaac-sim / IsaacLab

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 3,674 1,756 Updated May 23, 2025

SMPLOlympics / SMPLOlympics

Python 185 9 Updated Oct 2, 2024

yuandong-tian / arXiv_recbot

A Telegram bot to recommend arXiv papers

Python 270 23 Updated Apr 12, 2025

carolinewang01 / naht

Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., Neurips 2024).

Jupyter Notebook 19 4 Updated Jan 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly